26Discovery of polymorphisms under co-selective pressure or epistasis has received considerable 27 recent attention in population genomics. Both statistical modeling of the population level co-variation 28 of alleles across the chromosome and model-free testing of dependencies between pairs of 29 polymorphisms have been shown to successfully uncover patterns of selection in bacterial 30 populations. Here we introduce a model-free method, SpydrPick, whose computational efficiency 31 enables analysis at the scale of pan-genomes of many bacteria. SpydrPick incorporates an efficient 32 correction for population structure, which is demonstrated to maintain a very low rate of false positive 33 findings among those SNP pairs highlighted to deviate significantly from the null hypothesis of neutral 34 co-evolution in simulated data. We also introduce a new type of visualization of the results similar to 35 the Manhattan plots used in genome-wide association studies, which enables rapid exploration of the 36 identified signals of co-evolution. Application of the method to large population genomic data sets of 37 two major human pathogens, Streptococcus pneumoniae and Neisseria meningitidis, revealed both 38 previously identified and novel putative targets of co-selection related to virulence and antibiotic 39 resistance, highlighting the potential of this approach to drive molecular discoveries, even in the 40 absence of phenotypic data. 41 42 43 44 45 Statistical analysis of co-variation between non-adjacent sites in large protein alignments has matured 46 since its inception, over 20 years ago (1-7). More recently, attention has also been directed towards 47 performing a similar type of exploratory analysis of genome-wide nucleotide alignments for bacterial 48 populations to reveal putative sites evolving under co-selective pressures and possibly being involved 49 in epistatic interactions (8-10). Genome-scale analysis of co-variation at single-nucleotide resolution, 50 here termed as genome-wide epistasis and co-selection study (GWES), poses considerable 51 computational challenges as the number of pairs to be considered increases quadratically with the 52 number of sites. Previous GWES approaches have been based on either straightforward pairwise 53 tests (8), which do not distinguish between indirect and direct interactions, or a more elaborate model-54 based technique known as direct coupling analysis (DCA) (9, 10). 55 The main motivation behind pairwise methods has typically been scalability, however, a 56 recent simulation study on high-dimensional structure learning of synthetic network models showed 57 that a family of pairwise methods based on mutual information (MI) may be as accurate as and even 58 outperform model-based methods in the small sample regime (arXiv:1901.04345), which is the typical 59 setting for most bacterial population genomic data. While MI has been proposed for the analysis of 60 protein alignments (1, 11), it has not yet been systematically applied to bacterial population genomi...