Integer sorting on multicores and GPUs can be realized by a variety of approaches that include variants of distribution-based methods such as radix-sort, comparison-oriented algorithms such as deterministic regular sampling and random sampling parallel sorting, and network-based algorithms such as Batcher's bitonic sorting algorithm.In this work we present an experimental study of integer sorting on multicore processors.We have implemented serial and parallel radix-sort for various radixes, deterministic regular oversampling and random oversampling parallel sorting, and also some previously little explored or unexplored variants of bitonic-sort and odd-even transposition sort.The study uses multithreading and multiprocessing parallel programming libraries with the C language implementations working under Open MPI, MulticoreBSP, and BSPlib utilizing the same source code.A secondary objective is to attempt to model the performance of these algorithm implementations under the MBSP (Multi-memory BSP) model. We first provide some general high-level observations on the performance of these implementations. If we can conclude anything is that accurate prediction of performance by taking into consideration architecture dependent features such as the structure and characteristics of multiple memory hierarchies is difficult and more often than not untenable. To some degree this is affected by the overhead imposed by the high-level library used in the programming effort. We can still draw however some reliable conclusions and reason about the performance of these implementations using the MBSP model, thus making MBSP useful and usable.Integer sorting on multicores and GPUs can be realized by traditional distribution-specific algorithms such as radix-sort [3,12,25,28], or variants of it that use fewer rounds of the baseline count-sort implementation provided additional information about key values is available [6,39].Other approaches include algorithms that use specialized hardware or software features of a particular multicore architecture [4,6,22,25]. Comparison-based algorithms have also been used with some obvious tweaks: use of deterministic regular sampling sorting [34] that utilizes serial radix-sort for local sorting [8,9,10] or use other methods for local sorting [38,3,5,6,22]. Network-based algorithms such as Batcher's [1] bitonic sorting [23,3,30,31,5] have also been utilized. In particualar, bitonic sorting is a low programming overhead algorithm and thus more suitable for GPU and few-core architectures, is simple to implement, and quite fast when few keys are to be sorted, even if its theoretical performance is suboptimal.In this work we perform an experimental study of integer sorting on multicore processors using multithreading and multiprocessing based libraries that facilitate parallel programming. Our implementations need only recompilation of the same C language source to work under Open MPI [29], MulticoreBSP [36], and a multi-processing and out of maintenace library, BSPlib [19].Towards this we have impl...