51Genomes are an integral component of the biological information about an organism and, logically, the more 52 complete the genome, the more informative it is. Historically, bacterial and archaeal genomes were 53 reconstructed from pure (monoclonal) cultures and the first reported sequences were manually curated to 54 completion. However, the bottleneck imposed by the requirement for isolates precluded genomic insights 55 for the vast majority of microbial life. Shotgun sequencing of microbial communities, referred to initially as 56 community genomics and subsequently as genome-resolved metagenomics, can circumvent this limitation 57 by obtaining metagenome-assembled genomes (MAGs), but gaps, local assembly errors, chimeras and 58 contamination by fragments from other genomes limit the value of these genomes. Here, we discuss genome 59 curation to improve and in some cases achieve complete (circularized, no gaps) MAGs (CMAGs). To date, few 60 CMAGs have been generated, although notably some are from very complex systems such as soil and 61 sediment. Through analysis of ~7000 published complete bacterial isolate genomes, we verify the value of 62 cumulative GC skew in combination with other metrics to establish bacterial genome sequence accuracy.
63Interestingly, analysis of cumulative GC skew identified potential mis-assemblies in some reference genomes 64 of isolated bacteria and the repeat sequences that likely gave rise to them. We discuss methods that could 65 be implemented in bioinformatic approaches for curation to ensure that metabolic and evolutionary 66 analyses can be based on very high-quality genomes. 67 68 69 70 2017), and even human blood (Moustafa et al. 2017) amenable to shotgun metagenomic surveys and recovery 87 of MAGs. Although incomplete, draft MAGs represent a major advance over knowing nothing about the genes 88 and pathways present in an organism, and led to the discovery of new metabolisms. For example, the complete 89 oxidation of ammonia to nitrate via nitrite (i.e., comammox) was determined by the detection of necessary 90 genes in a single MAG (Daims et al. 2015; van Kessel et al. 2015). MAGs are often derived from uncultivated 91 organisms that can be quite distantly related to any isolated species, which is a clear advantage of MAGs 92 (Becraft et al. 2017; Garg et al. 2019). For this reason, genome-resolved metagenomics has been critical for 93 more comprehensive descriptions of bacterial and archaeal diversity and the overall topology of the Tree of 94