“…This allows to overlap the update of one partition while the dependencies of others are being resolved. We executed the program five times on 8 processors, each time dividing the vector in respectively, 8,16,32,64, and 128 partitions. Each processor was thus allocated, respectively, 1, 2, 4, 8, and 16 partitions.…”