RNA interference (RNAi) requires RNA-dependent RNA polymerases (RdRPs) in many eukaryotes, and RNAi amplification constitutes the only known function for eukaryotic RdRPs. Yet in animals, classical model organisms can elicit RNAi without possessing RdRPs, and only nematode RNAi has been shown to require RdRPs. Here we show that RdRP genes are much more common in animals than previously thought, even in insects, where they had been assumed not to exist. RdRP genes were present in the ancestors of numerous clades, and they were subsequently lost at a high frequency. To probe the function of RdRPs in a deuterostome (the cephalochordate Branchiostoma lanceolatum), we performed high-throughput analyses of small RNAs from various Branchiostoma developmental stages. Branchiostoma RdRPs do not appear to participate in RNAi: we did not detect any candidate small RNA population exhibiting classical siRNA length or sequence features. Overall, our results show that RdRPs have been independently lost in dozens of animal clades, and that even in a clade where they have been conserved (cephalochordates), their function in RNAi amplification is not preserved. Such dramatic functional variability reveals an unexpected plasticity in RNA silencing pathways.
Author summary

RNA interference (RNAi) is a conserved gene regulation system in eukaryotes. In non-animal eukaryotes, it requires RNA-dependent RNA polymerases ("RdRPs"). Among animals, only nematodes appear to require RdRPs for RNAi. Yet additional animal clades have RdRPs, and it is assumed that they participate in RNAi. Here, we find that RdRPs are much more common in animals than previously thought, but their genes were independently lost in many lineages. Focusing on a species with RdRP genes (a cephalochordate), we found that it does not use them for RNAi. While RNAi is the only known function for eukaryotic RdRPs, our results suggest additional roles. Eukaryotic RdRPs thus have a complex evolutionary history in animals, with frequent independent losses and apparent functional diversification.

November 28, 2018

Phylogenetic tree reconstruction

Amino acid sequences of the eukaryotic RdRP domain (Pfam #PF05183) were retrieved from Pfam [35] and supplemented with the RdRP domains of the proteins identified in the 538 animal proteomes (see above). Sequences were aligned with hmmalign [36], using the HMM profile of the PF05183 RdRP domain. Sequences for which the domain was incomplete were deleted from the alignment. Sites used to reconstruct the phylogenetic tree were selected using trimAl [37] on the Phylemon 2.0 webserver [38]. A Bayesian inference (BI) tree was inferred using MrBayes 3.2.6 [39], with the model recommended by ProtTest 1.4 [40] under the Akaike information criterion (LG+Γ), at the CIPRES Science Gateway portal [41]. Two independent runs were performed, each with 4 chains and one million generations. A burn-in of 25% was used and a 50% majority-rule consensus tree was calculated for the remaining ...