45 Plasmodium vivax malaria is much less common in Africa than the rest of the world 46 because the parasite relies primarily on the Duffy antigen/chemokine receptor (DARC) 47 to invade human erythrocytes, and the majority of Africans are Duffy negative. Recently, 48 there has been a dramatic increase in the reporting of P. vivax cases in Africa, with a 49 high number of them being in Duffy negative individuals, potentially indicating P. vivax 50 has evolved an alternative invasion mechanism that can overcome Duffy negativity.51 Here, we analyzed single nucleotide polymorphism (SNP) and copy number variation 52 (CNV) in Whole Genome Sequence (WGS) data from 44 P. vivax samples isolated from 53 symptomatic malaria patients in southwestern Ethiopia, where both Duffy positive and 54 Duffy negative individuals are found. A total of 236,351 SNPs were detected, of which 55 21.9% was nonsynonymous and 78.1% was synonymous mutations. The largest 56 number of SNPs were detected on chromosomes 9 (33,478 SNPs; 14% of total) and 10 57 (28,133 SNPs; 11.9%). There were particularly high levels of polymorphism in 58 erythrocyte binding gene candidates including reticulocyte binding protein 2c (RBP2c), 59 merozoite surface protein 1 (MSP1), and merozoite surface protein 3 (MSP3.5, 60 MSP3.85 and MSP3.9). Thirteen genes related to immunogenicity and erythrocyte 61 binding function were detected with significant signals of positive selection. Variation in 62 gene copy number was also concentrated in genes involved in host-parasite 63 interactions, including the expansion of the Duffy binding protein gene (PvDBP) on 64 chromosome 6 and several PIR genes. Based on the phylogeny constructed from the 65 whole genome sequences, the expansion of these genes was an independent process 66 among the P. vivax lineages in Ethiopia. We further inferred transmission patterns of P. 67 vivax infections among study sites and showed various levels of gene flow at a small 68 geographical scale. The genomic features of P. vivax provided baseline data for future 69 comparison with those in Duffy-negative individuals, and allowed us to develop a panel 70 of informative Single Nucleotide Polymorphic markers diagnostic at a micro-71 geographical scale. 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 90 Vivax malaria is the most geographically widespread human malaria, causing over 130 91 million clinical cases per year worldwide [1]. Plasmodium vivax can produce dormant 92 liver-stage hypnozoites within infected hosts, giving rise to relapse infections from 93 months to years. This unique feature of P. vivax contributes to an increase in 94 transmission potential and increases the challenge of elimination [2]. Understanding P.95 vivax genome variation will advance our knowledge of parasite biology and host-96 parasite interactions, as well as identify potential drug resistance mechanisms [3, 4].97 Such data will also help identify molecular targets for vaccine development [5][6][7], and 98 provide new means to track the transmission and spread of ...