An essential step toward reconstructing pathogen transmission and answering epidemiologically relevant questions from genomic data is obtaining pairwise genetic distance between infections. For recombining organisms such as malaria parasites, relatedness measures quantifying recent shared ancestry would provide a meaningful distance, suggesting methods based on identity by descent (IBD). While the concept of relatedness and consequently an IBD approach is fairly straightforward for individual parasites, the distance between polyclonal infections, which are prevalent in malaria, presents specific challenges and awaits a general solution that could be applied to infections of any clonality and accommodate multiallelic (e.g. microsatellite or microhaplotype) and biallelic (SNP) data. Filling this methodological gap, we present Dcifer (Distance for complex infections: fast estimation of relatedness), a method for calculating genetic distance between polyclonal infections, which is designed for unphased data, explicitly accounts for population allele frequencies and complexity of infection, and provides reliable inference. Dcifer’s IBD-based framework allows us to define model parameters that represent interhost relatedness and to propose corresponding estimators with attractive statistical properties. By using combinatorics to account for unobserved phased haplotypes, Dcifer is able to quickly process large datasets and estimate pairwise relatedness along with measures of uncertainty. We show that Dcifer delivers accurate and interpretable results and detects related infections with statistical power that is 2-4 times greater than that of approaches based on identity by state. Applications to real data indicate that relatedness structure aligns with geographic locations. Dcifer is implemented in a comprehensive publicly available software package.
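For context on the identity-by-state (IBS) approaches that the abstract uses as a comparison, the following is a minimal sketch of one simple IBS-style similarity between two unphased, possibly polyclonal infections: the mean per-locus Jaccard index of the observed allele sets. This is not Dcifer's estimator and not any package's API; the function name `ibs_similarity`, the locus labels, and the toy data are all hypothetical. Unlike the IBD-based method described above, this measure ignores population allele frequencies and complexity of infection.

```python
from typing import Dict, Set

# An infection is represented as: locus name -> set of alleles observed (unphased).
# This representation is an assumption made for illustration only.
Infection = Dict[str, Set[str]]


def ibs_similarity(a: Infection, b: Infection) -> float:
    """Mean per-locus Jaccard index over loci genotyped in both infections."""
    shared = [locus for locus in a if locus in b and a[locus] and b[locus]]
    if not shared:
        raise ValueError("no loci genotyped in both infections")
    total = 0.0
    for locus in shared:
        # Alleles seen in both infections, relative to alleles seen in either.
        total += len(a[locus] & b[locus]) / len(a[locus] | b[locus])
    return total / len(shared)


# Hypothetical toy data: three microhaplotype loci.
sample_x = {"mh1": {"A", "B"}, "mh2": {"C"}, "mh3": {"D", "E"}}
sample_y = {"mh1": {"A"}, "mh2": {"C"}, "mh3": {"F"}}

print(ibs_similarity(sample_x, sample_y))
```

In the toy data, sharing the allele at `mh2` counts the same whether that allele is rare or common in the population, which is one reason an IBS score is harder to interpret than a relatedness estimate that conditions on allele frequencies.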
Zanzibar has made significant progress toward malaria elimination, but recent stagnation requires novel approaches. We developed a highly multiplexed droplet digital PCR (ddPCR)-based amplicon sequencing method targeting 35 microhaplotypes and drug-resistance loci, and successfully sequenced 290 samples from five districts covering both main islands. Here, we elucidate fine-scale Plasmodium falciparum population structure and infer relatedness and connectivity of infections using an identity-by-descent (IBD) approach. Despite high genetic diversity, we observe pronounced fine-scale spatial and temporal parasite genetic structure. Clusters of near-clonal infections on Pemba indicate persistent local transmission with limited parasite importation, presenting an opportunity for local elimination efforts. Furthermore, we observe an admixed parasite population on Unguja and detect a substantial fraction (2.9%) of significantly related infection pairs between Zanzibar and the mainland, suggesting recent importation. Our study provides a high-resolution view of parasite genetic structure across the Zanzibar archipelago and offers actionable insights for prioritizing malaria elimination efforts.
Over the past 15 years, Zanzibar has made great strides toward malaria elimination; yet progress has stalled. Parasite genetic data of Plasmodium falciparum may inform strategies for malaria elimination by helping to identify factors contributing to parasite persistence. Here we elucidate fine-scale parasite population structure and infer relatedness and connectivity of infections using an identity-by-descent (IBD) approach. We sequenced 518 P. falciparum samples from 5 districts covering both main islands using a novel, highly multiplexed droplet digital PCR (ddPCR)-based amplicon deep sequencing method targeting 35 microhaplotypes and drug-resistance loci. Despite high genetic diversity, we observe strong fine-scale spatial and temporal structure of local parasite populations, including isolated populations on Pemba Island and genetically admixed populations on Unguja Island, providing evidence of ongoing local transmission. We observe a high proportion of highly related parasites in individuals living closer together, including between clinical index cases and the mostly asymptomatic cases surrounding them, consistent with isolation-by-distance. We identify a substantial fraction (2.9%) of related parasite pairs between Zanzibar and mainland Tanzania and Kenya, consistent with recent importation. We identify haplotypes known to confer resistance to antimalarials in all districts, including multidrug-resistant parasites, but most parasites remain sensitive to current first-line treatments. Our study provides a high-resolution view of parasite genetic structure across the Zanzibar archipelago and reveals actionable patterns, including isolated parasite populations, which may be prioritized for malaria elimination.
Malaria cases are frequently recorded in the Ethiopian highlands even at altitudes above 2,000 m. The epidemiology of malaria in the Ethiopian highlands, and in particular the role of importation by human migration from the highly endemic lowlands, is not well understood. We characterized the parasite population structure and genetic relatedness by sequencing 159 P. falciparum samples from Gondar and an additional 28 samples from Ziway using a highly multiplexed droplet digital PCR (ddPCR)-based amplicon deep sequencing method targeting 35 microhaplotypes and drug resistance loci. Diversity was moderate (mean HE: 0.54), and infection complexity was low (74.9% single clone infections). A significant percentage of infections shared genomic haplotypes, even across transmission seasons, indicating persistent local and focal transmission. Multiple clusters of clonal or near-clonal infections were identified, highlighting the overall high genetic relatedness. Frequently, infections from travelers were the earliest observed cases, suggesting that parasites may have been imported and then transmitted locally. We observed population structure between Gondar and Ziway, although some haplotypes were shared between sites. Overall, 31.1% of infections carried pfhrp2 deletions, 84.4% carried pfhrp3 deletions, and 28.7% carried pfhrp2/pfhrp3 double deletions. Parasites with pfhrp2/3 deletions and wild-type parasites were genetically distinct. Mutations associated with resistance to sulfadoxine-pyrimethamine and lumefantrine were observed at near-fixation, but no mutations in pfk13 were found. In conclusion, genomic data corroborate local transmission and the importance of intensified control in the Ethiopian highlands.
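The diversity figure quoted above (mean HE) is an expected heterozygosity averaged over loci. As a rough sketch of how such a value is commonly computed, the code below uses the standard sample-size-corrected formula H_E = n/(n-1) * (1 - sum of squared allele frequencies) per locus and then takes the mean. Whether the study used exactly this estimator is an assumption; the function names and the toy allele calls are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, List


def expected_heterozygosity(alleles: Iterable[str]) -> float:
    """Sample-size-corrected expected heterozygosity for one locus."""
    counts = Counter(alleles)
    n = sum(counts.values())
    if n < 2:
        raise ValueError("need at least two allele observations")
    # Sum of squared allele frequencies: chance of drawing the same allele twice.
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


def mean_he(loci: Dict[str, List[str]]) -> float:
    """Average the per-locus values across all loci."""
    values = [expected_heterozygosity(calls) for calls in loci.values()]
    return sum(values) / len(values)


# Hypothetical toy data: allele observations pooled across samples, per locus.
toy_loci = {
    "locus1": ["A", "A", "B", "C"],
    "locus2": ["X", "X", "X", "X"],
}

print(expected_heterozygosity(toy_loci["locus1"]))
print(mean_he(toy_loci))
```

A locus with a single allele contributes 0, and a locus with many alleles at similar frequencies approaches 1, so a mean near 0.5 sits between those extremes.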