Gavin Mackenzie scite author profile

Gavin Mackenzie

2Publications

6Citation Statements Received

18Citation Statements Given

How they've been cited

How they cite others

Affiliations

University of Glasgow, Institute of Infection and Immunity

Publications

Order By: Most citations

Varia: a tool for prediction, analysis and visualisation of variable genes

et al. 2022

View full text Add to dashboard Cite

Background Parasites use polymorphic gene families to evade the immune system or interact with the host. Assessing the diversity and expression of such gene families in pathogens can inform on the repertoire or host interaction phenotypes of clinical relevance. However, obtaining the sequences and quantifying their expression is a challenge. In Plasmodium falciparum, the highly polymorphic var genes encode the major virulence protein, PfEMP1, which bind a range of human receptors through varying combinations of DBL and CIDR domains. Here we present a tool, Varia, to predict near full-length gene sequences and domain compositions of query genes from database genes sharing short sequence tags. Varia generates output through two complementary pipelines. Varia_VIP returns all putative gene sequences and domain compositions of the query gene from any partial sequence provided, thereby enabling experimental validation of specific genes of interest and detailed assessment of their putative domain structure. Varia_GEM accommodates rapid profiling of var gene expression in complex patient samples from DBLα expression sequence tags (EST), by computing a sample overall transcript profile stratified by PfEMP1 domain types. Results Varia_VIP was tested querying sequence tags from all DBL domain types using different search criteria. On average 92% of query tags had one or more 99% identical database hits, resulting in the full-length query gene sequence being identified (> 99% identical DNA > 80% of query gene) among the five most prominent database hits, for ~ 33% of the query genes. Optimized Varia_GEM settings allowed correct prediction of > 90% of domains placed among the four most N-terminal domains, including the DBLα domain, and > 70% of C-terminal domains. With this accuracy, N-terminal domains could be predicted for > 80% of queries, whereas prediction rates of C-terminal domains dropped with the distance from the DBLα from 70 to 40%. Conclusion Prediction of var sequence and domain composition is possible from short sequence tags. Varia can be used to guide experimental validation of PfEMP1 sequences of interest and conduct high-throughput analysis of var type expression in patient samples.

show abstract

Varia: Prediction, analysis and visualisation of variable genes

Mackenzie

Jensen

Lavstsen

et al. 2020

Preprint

View full text Add to dashboard Cite

Assessing the diversity or expression of variable gene families in pathogens can inform about immune escape mechanisms or host interaction phenotypes of clinical relevance. However, obtaining the sequences and quantifying their expression is a challenge. Here, we present a tool, which based on unique sequence tag similarity between members of a gene family, predicts the domains encoded by the queried gene. As an example, we are using the var gene family, encoding the major virulence proteins (PfEMP1) of the human malaria parasite, Plasmodium falciparum. We developed Varia, which predicts the likely var gene sequence and encoded protein domain composition of a gene from short sequence tags. We provide a new extended annotated var genome database, in which Varia identifies genes with identical tag sequences and compares these to return the most probable domain composition of the query gene. Varia's ability to predict correct PfEMP1 domain compositions from short var sequence tags was tested in two complementary pipelines to (a) return the putative gene sequences and domain compositions of the query gene from any partial sequence provided, thereby enabling detailed assessment of specific genes putative function and experimental validation of these (b) to accommodate rapid profiling of var gene expression in complex patient samples, by compiling the overall domain prevalence among var transcripts predicted identified and quantified by next generation sequencing of so-called var DBLα-sequence tags. Availability and implementation: Varia is available on GitHub (https://github.com/GCJMackenzie/Varia) under the MIT license.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Gavin Mackenzie

Varia: a tool for prediction, analysis and visualisation of variable genes

Varia: Prediction, analysis and visualisation of variable genes

Contact Info

Product

Resources

About