“…Preprocessing and postprocessing datasets (XLSX) ( 108 ) as well as Jupyter Notebooks (IPYNB) containing Python codes ( 109 ) that were used for data preparation, statistical and unsupervised clustering analyses, and data visualization are deposited at cited Figshare repositories ( 108 , 109 ) and available at https://github.com/PaleoLipidRR/marine-AOA-GDGT-distribution/ or upon request from the corresponding author. Previously published data were used for this work, including GDGTs derived from 1) cultured Thermoproteia ( 39 ), 2) cultured thermophilic AOA ( 40 – 42 ), 3) environmental hot spring samples ( 24 , 43 – 48 ), 4) cultured shallow AOA ( 25 , 28 , 41 ), 5) SPM ( 49 , 50 , 52 , 63 , 66 , 72 , 73 , 77 , 78 , 107 – 111 ), 6) core top sediments ( 62 – 65 , 69 , 72 , 74 , 77 , 78 , 109 , 112 – 115 ), and 7) paleo-marine sediments ( 17 , 21 , 31 – 33 , 51 , 116 …”