“…Previous work on parallelism has concentrated, among other topics, on compilation techniques for multicomputers [5,8,51,24], for multiprocessors [47,7], and for automatic discovery of parallelism [21,48,39,18,36,26]. Since neither data layout transformations nor cache locality was the central issue in any of these papers, we do not discuss them further here.…”