“…These data-transfer functions must be used together with one of three synchronization mechanisms (fence, post-start-complete-wait, and lock-unlock), as shown in Figure 1. Many MPI implementations, including all vendor MPIs, support one-sided communication, with varying levels of optimization [1,2,4,6,7,9,10,12,14,15]. Nonetheless, Gabriel et al. [3] found that, because of the synchronization overhead in one-sided communication, regular point-to-point communication performs better than one-sided communication in five MPI implementations: NEC, Hitachi, IBM, Sun, and LAM.…”