“…Today's AI models use Internet-scraped data, and thus unwittingly train on synthetic data (Figure 2). Moreover, AI-synthesized data is increasingly popular [5][6][7][8][9][10] because it is convenient [11,12], anonymous [13][14][15][16], can augment real data [17,18], and can match AI models' ever-increasing sizes [19][20][21].…”