“…In modern datacenters, commodity networking devices, especially programmable switches and Smart NICs, have been leveraged to accelerate datacenter applications. Such approaches include caching [73, 94, 102, 148], compute offloading [7, 16, 56, 93, 94, 109, 144, 145, 150], protocol offloading [34, 35, 72, 84, 91, 95-97, 138, 147, 161, 174, 181], and load balancing [81, 104], among others. However, because these devices have limited memory capacity (e.g., O(10 MB) of on-chip SRAM for programmable switches [1, 83] and O(10 GB) of on-board DRAM for Smart NICs [101, 122]), they may fall short of efficiently handling datacenter applications that require large memory footprints.…”