“…Specialized cache hierarchies. These trends have been widely recognized, and there are many proposals to accelerate data movement, e.g., in machine learning [2,50], graph analytics [92,95,150], data structures [54,58,154], memoization [8,40,153,154], compression [9,36,90,106,107,118,136,146], data layout [7,23,155], prefetching [6,131,149], coherence and synchronization [34,75,151,152], memory management [85,135], and system software [67,108,127]. While highly effective, they share the drawback of requiring custom hardware.…”