Even with identification of multiple causal genetic variants for common human diseases, understanding the molecular processes mediating the causal variants’ effect on the disease remains a challenge. This understanding is crucial for the development of therapeutic strategies to prevent and treat disease. While static profiling of gene expression is primarily used to get insights into the biological bases of diseases, it makes differentiating the causative from the correlative effects difficult, as the dynamics of the underlying biological processes are not monitored. Using yeast as a model, we studied genome-wide gene expression dynamics in the presence of a causal variant as the sole genetic determinant, and performed allele-specific functional validation to delineate the causal effects of the genetic variant on the phenotype. Here, we characterized the precise genetic effects of a functional MKT1 allelic variant in sporulation efficiency variation. A mathematical model describing meiotic landmark events and conditional activation of MKT1 expression during sporulation specified an early meiotic role of this variant. By analyzing the early meiotic genome-wide transcriptional response, we demonstrate an MKT1-dependent role of novel modulators, namely, RTG1/3, regulators of mitochondrial retrograde signaling, and DAL82, regulator of nitrogen starvation, in additively effecting sporulation efficiency. In the presence of functional MKT1 allele, better respiration during early sporulation was observed, which was dependent on the mitochondrial retrograde regulator, RTG3. Furthermore, our approach showed that MKT1 contributes to sporulation independent of Puf3, an RNA-binding protein that steady-state transcription profiling studies have suggested to mediate MKT1-pleiotropic effects during mitotic growth. These results uncover interesting regulatory links between meiosis and mitochondrial retrograde signaling. In this study, we highlight the advantage of analyzing allele-specific transcriptional dynamics of mediating genes. Applications in higher eukaryotes can be valuable for inferring causal molecular pathways underlying complex dynamic processes, such as development, physiology and disease progression.
BackgroundThe subcellular localization of a protein is an important aspect of its function. However, the experimental annotation of locations is not even complete for well-studied model organisms. Text mining might aid database curators to add experimental annotations from the scientific literature. Existing extraction methods have difficulties to distinguish relationships between proteins and cellular locations co-mentioned in the same sentence.ResultsLocText was created as a new method to extract protein locations from abstracts and full texts. LocText learned patterns from syntax parse trees and was trained and evaluated on a newly improved LocTextCorpus. Combined with an automatic named-entity recognizer, LocText achieved high precision (P = 86%±4). After completing development, we mined the latest research publications for three organisms: human (Homo sapiens), budding yeast (Saccharomyces cerevisiae), and thale cress (Arabidopsis thaliana). Examining 60 novel, text-mined annotations, we found that 65% (human), 85% (yeast), and 80% (cress) were correct. Of all validated annotations, 40% were completely novel, i.e. did neither appear in the annotations nor the text descriptions of Swiss-Prot.ConclusionsLocText provides a cost-effective, semi-automated workflow to assist database curators in identifying novel protein localization annotations. The annotations suggested through text-mining would be verified by experts to guarantee high-quality standards of manually-curated databases such as Swiss-Prot.Electronic supplementary materialThe online version of this article (doi:10.1186/s12859-018-2021-9) contains supplementary material, which is available to authorized users.
We assessed the pan-cancer predictability of multi-omic biomarkers from haematoxylin and eosin (H&E)-stained whole slide image (WSI) using deep learning and standard evaluation measures throughout a systematic study. A total of 13,443 deep learning (DL) models predicting 4,481 multi-omic biomarkers across 32 cancer types were trained and validated. The investigated biomarkers included genetic mutations, transcriptomic (mRNA) and proteomic under- and over-expression status, metabolomic pathways, established markers relevant for prognosis, including gene expression signatures, molecular subtypes, clinical outcomes and response to treatment. Overall, we established the general feasibility of predicting multi-omic markers across solid cancer types, where 50% of the models could predict biomarkers with the area under the curve (AUC) of more than 0.633 (with 25% of the models having AUC larger than 0.711). Aggregating across the omic types, our deep learning models achieved the following performance: mean AUC of 0.634 ±0.117 in predicting driver SNV mutations; 0.637 ±0.108 for over-/under-expression of transcriptomic genes; 0.666 ±0.108 for over-/under-expression of proteomes; 0.564 ±0.081 for metabolomic pathways; 0.653 ±0.097 for gene signatures and molecular subtypes; 0.742 ±0.120 for standard of care biomarkers; and 0.671 ±0.120 for clinical outcomes and treatment responses. The biomarkers were shown to be detectable from routine histology images across all investigated cancer types, with aggregate mean AUC exceeding 0.62 in almost all cancers. In addition, we observed that predictability is reproducible within-marker and less dependent on sample size and positivity ratio, indicating a degree of true predictability inherent to the biomarker itself.
Studying the molecular consequences of rare genetic variants has the potential to identify novel and hitherto uncharacterized pathways causally contributing to phenotypic variation. Here, we characterize the functional consequences of a rare coding variant of TAO3, previously reported to contribute significantly to sporulation efficiency variation in Saccharomyces cerevisiae. During mitosis, the common TAO3 allele interacts with CBK1—a conserved NDR kinase. Both TAO3 and CBK1 are components of the RAM signaling network that regulates cell separation and polarization during mitosis. We demonstrate that the role of the rare allele TAO3(4477C) in meiosis is distinct from its role in mitosis by being independent of ACE2—a RAM network target gene. By quantitatively measuring cell morphological dynamics, and expressing the TAO3(4477C) allele conditionally during sporulation, we show that TAO3 has an early role in meiosis. This early role of TAO3 coincides with entry of cells into meiotic division. Time-resolved transcriptome analyses during early sporulation identified regulators of carbon and lipid metabolic pathways as candidate mediators. We show experimentally that, during sporulation, the TAO3(4477C) allele interacts genetically with ERT1 and PIP2, regulators of the tricarboxylic acid cycle and gluconeogenesis metabolic pathways, respectively. We thus uncover a meiotic functional role for TAO3, and identify ERT1 and PIP2 as novel regulators of sporulation efficiency. Our results demonstrate that studying the causal effects of genetic variation on the underlying molecular network has the potential to provide a more extensive understanding of the pathways driving a complex trait.
We present a public validation of PANProfiler (ER, PR, HER2), an in-vitro medical device (IVD) that predicts the qualitative status of estrogen receptor (ER), progesterone receptor (PR) and human epidermal growth factor receptor 2 (HER2) by analysing the hematoxylin and eosin (H&E)-stained tissue scan. In public validation on 648 (ER), 648 (PR) and 560 (HER2) unseen cases with known biomarker status, the device achieves an accuracy of 87% (ER), 83% (PR) and 87% (HER2). The validation offers early evidence of the ability to predict clinically relevant breast biomarkers from an H&E slide in a relevant clinical setting.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.