Much knowledge about bacteriophages has been obtained via genomics and metagenomics over the last decades. However, most studies dealing with prophage diversity have rarely conducted phage species delimitation (aspect 1) and have hardly integrated the population structure of the host (aspect 2). Yet, these two aspects are essential in assessing phage diversity. Here, we implemented an operational definition of phage species (clustering at 95% identity, 90% coverage) and integrated the host’s population structure to understand prophage diversity better. Gathering the most extensive data set of
Acinetobacter baumannii
phages (4,152 prophages + 122 virulent phages, distributed in 46 countries in the world), we show that 91% (875 out of 963) of the prophage species have four or fewer prophages per species, and just five prophage species have more than 100 prophages. Most prophage species have a narrow host range and are geographically restricted; yet, very few have a broad host range being well spread in distant lineages of
A. baumannii
. These few broad host range prophage species are not only cosmopolitan but also the most abundant species. We also noted that polylysogens had very divergent prophages, belonging to different prophage species, and prophages can easily be gained and lost within the bacterial lineages. Finally, even with this extensive data set, the prophage diversity has not been fully grasped. Our study highlights how integrating the host population structure and a solid operational definition of phage species allows us to better appreciate phage diversity and its transmission dynamics.
IMPORTANCE
Much knowledge about bacteriophages has been obtained via genomics and metagenomics over the last decades. However, most studies dealing with prophage diversity have rarely conducted phage species delimitation (aspect 1) and have hardly integrated the population structure of the host (aspect 2). Yet, these two aspects are essential in assessing phage diversity. Here, we implemented an operational definition of phage species (clustering at 95% identity, 90% coverage) and integrated the host’s population structure to understand prophage diversity better. Gathering the most extensive data set of
Acinetobacter baumannii
phages, we show that most prophage species have four or fewer prophages per species, and just five prophage species have more than 100 prophages. Most prophage species have a narrow host range and are geographically restricted; yet, very few have a broad host range being well spread in distant lineages of
A. baumannii
. These few broad host range prophage species are cosmopolitan and the most abundant species. Prophages in the same bacterial genome are very divergent, and prophages can easily be gained and lost within the bacterial lineages. Finally, even with this extensive data set, the prophage diversity has not been fully grasped. This study shows how integrating the host population structure and clustering at the species level allows us to better appreciate phage diversity and its transmission dynamics.