“…Viral sequences were identified based on VirSorter2's SOP with (Guo et al, 2021b) and P>=0.95 using DeepVriFinder. We collected metagenomics data from various sources, including SRP188615/PRJNA526405 (Gaio et al, 2022;Gaio et al, 2021), CNP0000824 (Chen et al, 2021b), PRJEB11755 (Xiao et al, 2016), PRJNA788462 (Gaire et al, 2022), PRJNA775062 (Tao et al, 2022), PRJEB22062 (Luiken et al, 2020), PRJCA009609 (Wu et al, 2022a), PRJEB44118 (Zhang et al, 2022), and 70 samples collected in our lab. In total, 4,650 samples were used in our study to extract the viral contigs.…”