Throughout the last 3 decades, Ebola virus (EBOV) outbreaks have been confined to isolated areas within Central Africa; however, the 2014 variant reached unprecedented transmission and mortality rates. While the outbreak was still under way, it was reported that the variant leading up to this outbreak evolved faster than previous EBOV variants, but evidence for diversifying selection was undetermined. Here, we test this selection hypothesis and show that while previous EBOV outbreaks were preceded by bursts of diversification, evidence for site-specific diversifying selection during the emergence of the 2014 EBOV clade is weak. However, we show strong evidence supporting an interplay between selection and correlated evolution (epistasis), particularly in the mucin-like domain (MLD) of the EBOV glycoprotein. By reconstructing ancestral structures of the MLD, we further propose a structural mechanism explaining how the substitutions that accumulated between 1918 and 1969 distorted the MLD, while more recent epistatic substitutions restored part of the structure, with the most recent substitution being adaptive. We suggest that it is this complex interplay between weak selection, epistasis, and structural constraints that has shaped the evolution of the 2014 EBOV variant.
IMPORTANCEThe role that selection plays in the emergence of viral epidemics remains debated, particularly in the context of the 2014 EBOV outbreak. Most critically, should such evidence exist, it is generally unclear how this relates to function and increased virulence. Here, we show that the viral lineage leading up to the 2014 outbreak underwent a complex interplay between selection and correlated evolution (epistasis) in a protein region that is critical for immune evasion. We then reconstructed the three-dimensional structure of this domain and showed that the initial mutations in this lineage deformed the structure, while subsequent mutations restored part of the structure. Along this mutational path, the first and last mutations were adaptive, while the intervening ones were epistatic. Altogether, we provide a mechanistic model that explains how selection and epistasis acted on the structural constraints that materialized during the 2014 EBOV outbreak.
The five viruses that constitute the genus Ebolavirus (Zaire ebolavirus [EBOV], Sudan ebolavirus, Bundibugyo ebolavirus, Reston ebolavirus, and Tai Forest ebolavirus) have been the cause of a major public health concern in sub-Saharan Africa for over 3 decades (1). Historically, outbreaks have been confined to isolated areas within Central Africa; however, the 2014 outbreak reached an unprecedented level, making this the largest outbreak since the discovery of the virus in 1976 (2).The primary analysis of EBOV isolates conducted by Gire and colleagues found a large number of nonsynonymous mutations between the 2014 EBOV sequences and all previously published EBOV sequences (3). In particular, 50 fixed nonsynonymous changes were observed, all of which were distinct to the 2014 EBOV vari...