“…Like previous works (Zhang et al, 2014;Cohen et al, 2016;de Queiroz and Chou, 2016;Thanou et al, 2016;de Queiroz and Chou, 2017;Pavez et al, 2018;Chou et al, 2020;Krivokuća et al, 2020), both V-PCC and G-PCC compress geometry first, then compress attributes conditioned on geometry. Neural networks have been applied with some success to geometry compression (Yan et al, 2019;Quach et al, 2019;Guarda et al, 2019a,b;Guarda et al, 2020;Tang et al, 2020;Quach et al, 2020a;Milani, 2020Milani, , 2021Lazzarotto et al, 2021), but not to lossy attribute compression. Exceptions may include (Quach et al, 2020b), which uses learned neural 3D → 2D folding but compresses with conventional image coding, and Deep-PCAC (Sheng et al, 2021), which compresses attributes using a PointNet-style architecture, which is not volumetric and underperforms our framework by 2-5 dB (see Figure 12B and Supplementary Material).…”