Background: Global life expectancy has been increasing without a corresponding increase in health span and with greater risk for aging-associated diseases such as Alzheimer′s disease (AD). An urgent need to delay the onset of aging-associated diseases has arisen and a dramatic increase in the number of potential molecular targets has led to the challenge of prioritizing targets to promote successful aging. Here, we developed a pipeline to prioritize aging-related genes which integrates the plethora of publicly available genomic, transcriptomic, proteomic and morphological data of C. elegans by applying a supervised machine learning approach. Additionally, a unique biological post-processing analysis of the computational output was performed to better reveal the prioritized gene′s function within the context of pathways and processes involved in aging across the lifespan of C. elegans. Results: Four known aging-related genes, daf-2, involved in insulin signaling; let-363 and rsks-1, involved in mTOR signaling; age-1, involved in PI3 kinase signaling, were present in the top 10% of 4380 ranked genes related to different markers of cellular dysfunction, validating the computational output. Further, our ranked output showed that 91% of the top 438 ranked genes consisted of known genes on GenAge, while the remaining genes had thus far not yet been associated with aging-related processes. Conclusion: These ranked genes can be translated to known human orthologs potentially uncovering previously unknown information about the basic aging processes in humans. These genes (and their downstream pathways) could also serve as targets against aging-related diseases, such as AD.
Introduction The analysis of signaling pathways is a cornerstone in clarifying the biological mechanisms involved in complex genetic disorders. These pathways have intricate topologies, and the existing methods that are used for the interpretation of these pathways, remain limited. We have therefore developed the Adaptable Large-Scale Causal Analysis (ALaSCA) computational platform, which uses causal analysis and counterfactual simulation techniques. ALaSCA offers the ability to simulate the outcome of a number of different hypotheses to gain insight into the complex dynamics of biological mechanisms prior to, or even without, wet lab experimentation. ALaSCA is offered as a proprietary Python library for bioinformaticians and data scientists to use in their life sciences workflows. Here we demonstrate the ability of ALaSCA to untangle the pivots and redundancies within biological pathways of various drivers of a specific phenotypic process. This is achieved by studying a major disease of global relevance, namely Type 1 Diabetes (T1D), and quantifying causal relationships between antioxidant proteins and T1D progression. ALaSCA is also benchmarked against standard associative analysis methods. Methods We use our in silico simulation platform, ALaSCA, to apply both a number of machine learning (ML) and data imputation techniques, and perform causal inference and counterfactual simulation. ALaSCA uses standard ML and causal analysis libraries as well as custom code developed for data imputation and counterfactual simulation. Counterfactual simulation is a method for simulating potential or hypothetical model outcomes in the field of causal analysis (Glymour, Pearl and Jewell, 2016). We apply ALaSCA to T1D by using proteomic data from Liu et al. (2018), as the patients were selected based on the presence of T1D susceptible HLA (human leukocyte antigen)-DR/DQ alleles through genotyping at birth and followed prospectively. The genetic cause of T1D in this cohort is therefore known and the mechanism and proteins through which it causes T1D are well-characterized. This biological mechanism was converted into a directed acyclic graph (DAG) for the subsequent causal analyses. The dataset was used to benchmark the causal inference and counterfactual simulation capabilities of ALaSCA. Results and discussion After data imputation of the Liu et al. (2018) dataset, causal inference and counterfactual simulation were completed. The causal inference output of the HLA, antioxidant, and non-causal proteins showed that the HLA proteins had the overall strongest causal effects on T1D, with antioxidant proteins having the overall second largest causal effects on T1D. The non-causal proteins showed negligibly small effects on T1D in comparison with the HLA and antioxidant proteins. With counterfactual simulation we were able to replicate evidence for and gain understanding into the protective effect that antioxidant proteins, specifically Superoxide dismutase 1 (SOD1), have in T1D, a trend which is seen in literature. We were also able to replicate an unusual case from literature where antioxidant proteins, specifically Catalase, do not have a protective effect on T1D. Conclusion By analyzing the disease mechanism, with the inferred causal effects and counterfactual simulation, we identified the upstream HLA proteins, specifically the DR alpha chain and DR beta 4 chain proteins as causes of the protective effect of the antioxidant proteins on T1D. In contrast, through counterfactual simulation of the unusual case, in which the DR alpha chain and DR beta 4 chain proteins are not present in the model, we saw that the adverse effect which the antioxidant proteins have on T1D is due to the HLA protein, DQ beta 1 chain, and not the antioxidant proteins themselves. Future work would entail the application of the ALaSCA platform on various other diseases, and to integrate it into wet lab experimental design in a number of different biological study areas and topics.
Introduction An urgent need to delay the onset of aging-associated diseases has arisen due to increasing human lifespan. A dramatic surge in the number of identified potential molecular targets that could promote successful aging, has led to the challenge of prioritizing these targets for further research and drug development. In our previous work, we prioritized genes associated with aging processes based on their similarity to known aging-related genes and dysfunction marker genes in C. elegans. The goal of this study was to demonstrate the ability of our computational platform to identify molecular drivers of neuronal aging using specialized causal inference techniques. S6K was highly ranked in the previous study and here the nearby neighbors in its protein interaction network were selected to explore ALaSCA′s (Adaptable Large-Scale Causal Analysis) ability to identify possible drivers of Alzheimer′s disease. Methods Utilizing head and brain proteome data, two of ALaSCA′s capabilities were used to understand how protein changes over the lifespan of Drosophila melanogaster affect a feature of neuronal aging, namely climbing ability: ● Pearson correlation analysis was used to assess the relationship between the changes in abundance of specific proteins associated (through protein-protein interactions) with S6K and climbing ability. ● Pearlian causal inference, required to achieve formal causal analysis, was used to determine which pathway, associated with proteins linked to S6K, has the largest effect on climbing ability and therefore to what degree these specific proteins are driving neuronal aging. Results and discussion Based on the correlation results, the proteins associated with fz, a gene encoding for the fz family of receptors that are involved in Wnt signaling, display an increase in abundance as climbing ability declines over time. When viewed together with the fz proteins′ strong negative causal value, it seems that their increased abundance over the lifespan of Drosophila is an important driver of the observed decrease in climbing ability. Additionally, expression of the genes FZD1 and FZD7 (fz orthologs) is altered in the hippocampus early on in Alzheimer′s disease human samples and in an amyloid precursor protein mouse model. Conclusion We have demonstrated the potential of the ALaSCA platform to identify and provide evidence behind molecular mechanisms. This capability enables identification of possible drivers of Alzheimer′s disease - as the human orthologs of the proteins identified here, through its Pearlian causal inference capability, have been linked to Alzheimer′s disease progression.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.