“…Naively employing PIM to accelerate data-intensive workloads can lead to sub-optimal performance due to the many design constraints PIM substrates impose (e.g., limited area and power budget available inside 3D-stacked memories [6] or manufacturing limitations of combining memory and logic elements [6,13]). Therefore, many recent works co-design specialized PIM accelerators and algorithms to improve performance and reduce the energy consumption of (i) applications from various application domains, such as graph processing, machine learning [1, …], bioinformatics, high-performance computing [95,101–112], databases [18,19,29,46,60,113–130], security [131][132][133][134][135...…”