Nuclear copies of mitochondrial genes (numts) are a well-known feature of eukaryotic genomes and a concern in systematics, as they can mislead phylogenetic inferences when inadvertently used. Studies on avian numts initially based on the chicken genome suggest that numts may be uncommon and relatively short among birds. Here we ask how common numts are in falcons, based on recently sequenced genomes of the Saker falcon (Falco cherrug) and Peregrine falcon (F. peregrinus). We identified numts by BLASTN searches and then extracted CYTB, ND2 and COI sequences from them, which were then used for phylogeny inference along with several sequences from other species in Falconiformes. Our results indicate that avian numts may be much more frequent and longer than previously thought. Phylogenetic inferences revealed multiple independent nuclear insertions throughout the history of the Falconiformes, including cases of sequences available in public databases and wrongly identified as authentic mtDNA. New sequencing technologies and ongoing efforts for whole genome sequencing will provide exciting opportunities for avian numt research in the near future.
Gene-expression profiling can be used to classify human tumors into molecular subtypes or risk groups, representing potential future clinical tools for treatment prediction and prognostication. However, it is less well known how prognostic gene signatures derived in one malignancy perform in a pan-cancer context. In this study, a gene-rule-based single sample predictor (SSP) called classifier for lung adenocarcinoma molecular subtypes (CLAMS), which is associated with proliferation, was tested in almost 15 000 samples from 32 cancer types to classify samples into better or worse prognosis. Of the 14 malignancies that presented both CLAMS classes in sufficient numbers, survival outcomes differed significantly between the classes in breast, brain, kidney and liver cancer. Patients with samples classified as better prognosis by CLAMS were generally of lower tumor grade and disease stage, and had improved prognosis according to other type-specific classifications (e.g. PAM50 for breast cancer). In all, 99.1% of non-lung cancer cases classified as better outcome by CLAMS fell within the range of proliferation scores of lung adenocarcinoma cases with a predicted better prognosis by CLAMS. This finding demonstrates the potential of tuning SSPs, through predictor training, to identify specific levels of, for instance, tumor proliferation or other transcriptional programs. Together, pan-cancer studies such as this may take us one step closer to understanding how gene-expression-based SSPs act, which gene-expression programs might be important in different malignancies, and how to derive tools useful for prognostication that are efficient across organs.
Background: Pathogenic germline variants (PGVs) in certain genes are linked to higher lifetime risk of developing breast cancer and can influence preventive surgery decisions and therapy choices. Public health programs offer genetic screening based on criteria designed to assess personal risk and identify individuals more likely to carry PGVs, dividing patients into screened and non-screened groups. How tumor biology and clinicopathological characteristics differ between these groups is understudied and could guide refinement of screening criteria. Methods: Six thousand six hundred sixty breast cancer patients diagnosed in South Sweden during 2010–2018 with available clinicopathological and RNA sequencing data were included, 900 (13.5%) of whom had genes screened for PGVs through routine clinical screening programs. We compared characteristics of screened patients and tumors to those of non-screened patients, as well as between screened patients with (n = 124) and without (n = 776) PGVs. Results: Broadly, breast tumors in screened patients showed features of a more aggressive disease. However, few differences related to tumor biology or patient outcome remained significant after stratification by clinical subgroups or PAM50 subtypes. Triple-negative breast cancer (TNBC), the subgroup most enriched for PGVs, showed the most differences between screening subpopulations (e.g., higher tumor proliferation in screened cases). Significant differences in PGV prevalence were found between clinical subgroups/molecular subtypes, e.g., TNBC cases were enriched for BRCA1 PGVs. In general, clinicopathological differences between screened and non-screened patients mimicked those between patients with and without PGVs, e.g., younger age at diagnosis for positive cases. However, differences in tumor biology/microenvironment, such as immune cell composition, were additionally seen between PGV carriers and non-carriers in ER+/HER2− cases, but not between screening subpopulations in this subgroup.
Conclusions: Characterization of molecular tumor features in patients clinically screened and not screened for PGVs represents a relevant read-out of guideline criteria. The general lack of molecular differences between screened and non-screened patients after stratification by relevant breast cancer subsets calls into question the ability to improve the identification of screening candidates based on currently used patient and tumor characteristics, pointing us towards universal screening. Nevertheless, until that is attained, molecular differences identified between PGV carriers and non-carriers suggest the possibility of further refining patient selection within certain patient subsets using RNA-seq through, e.g., gene signatures. Trial registration: The Sweden Cancerome Analysis Network – Breast (SCAN-B) was prospectively registered at ClinicalTrials.gov under the identifier NCT02306096.
Lung cancer is primarily a disease of the elderly, with a median age at diagnosis of around 70 years. In our study we sought to address whether and how clinical characteristics, molecular alterations and molecular phenotypes differ with age at diagnosis among patients with early-stage lung adenocarcinoma (AC). Patients were stratified based on age at diagnosis into five systematic age bins (<50, 50-60, 60-70, 70-80 and ≥80 years). To assess clinicopathological variables on a population-based level, we accessed data from the national quality registry for lung cancer in Sweden. In parallel, we used compiled datasets from public cohorts to investigate focal and genome-wide DNA alterations, epigenetic alterations, immune composition and transcriptional patterns in relation to age at diagnosis. Gender, stage, WHO performance status and likelihood of receiving chemotherapy as adjuvant treatment were linked to age at diagnosis. Associations between younger patient age and likelihood of harboring certain driver mutations (e.g., in EGFR and ALK) were confirmed. We also found an association between age at diagnosis and certain mutational signatures. However, age did not seem to drive transcriptional, copy number, or epigenetic variation in the tumors. Based on our findings, age at diagnosis alone does not appear to provide an additional layer of biological complexity above that of proposed genetic and transcriptional phenotypes of AC.
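The age stratification described in this abstract can be sketched in a few lines of Python. This is only an illustration: the abstract does not say which bin a boundary age (e.g., exactly 60) belongs to, so the left-closed intervals below are an assumption, and the function name `age_bin` is hypothetical.

```python
from bisect import bisect_right

# Bin edges and labels follow the abstract. Treating each bin as
# left-closed (so an age of 60 lands in "60-70") is an assumption;
# the abstract does not state how boundary ages were handled.
EDGES = [50, 60, 70, 80]
LABELS = ["<50", "50-60", "60-70", "70-80", "≥80"]

def age_bin(age: float) -> str:
    """Return the label of the age bin that `age` falls into."""
    # bisect_right counts how many edges are <= age, which is the bin index.
    return LABELS[bisect_right(EDGES, age)]

if __name__ == "__main__":
    for age in (42, 50, 69.5, 80):
        print(age, age_bin(age))
```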
PAM50 gene expression subtypes represent a cornerstone in molecular classification of breast cancer and are included in risk prediction models to guide therapy. We aimed to illustrate the impact of included genes and biological processes on subtyping while considering a tumor’s underlying clinical subgroup defined by ER, PR and HER2 status. To do this we used a population-representative and clinically annotated primary breast tumor cohort of 6233 samples profiled by RNA sequencing and applied a perturbation strategy of excluding co-expressed genes (gene sets). We demonstrate how PAM50 nearest centroid classification depends on biological processes present across, but also within, ER/PR/HER2 subgroups and PAM50 subtypes themselves. Our analysis highlights several key aspects of PAM50 classification. Firstly, we observed a tight connection between a tumor’s nearest and second nearest PAM50 centroid. Additionally, we show that the second-best subtype is associated with overall survival in ER-positive, HER2-negative, node-negative disease. We also note that ERBB2 has little impact on PAM50 classification in HER2-positive disease regardless of ER status, and that the Basal subtype is highly stable in contrast to the Normal subtype. Improved awareness of how the commonly used PAM50 subtyping scheme behaves will aid our understanding and interpretation of breast tumors whose PAM50 classification seemingly conflicts with clinical biomarkers. Finally, our study adds further support in challenging the common misconception that PAM50 subtypes are distinct classes by illustrating that PAM50 subtypes in tumors represent a continuum that may have clinical implications.
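As a rough illustration of what "nearest" and "second nearest" centroid mean in a nearest centroid classifier, the following Python sketch ranks centroids by their similarity to a sample. Everything in it is an assumption for illustration: the abstract does not state the similarity measure (Spearman rank correlation is used here), the centroid names and values are toy data rather than real PAM50 centroids, and the function names are hypothetical.

```python
from statistics import mean

def ranks(values):
    # Rank positions (1 = smallest). Assumes no tied values, for simplicity.
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, idx in enumerate(order, start=1):
        result[idx] = rank
    return result

def pearson(x, y):
    # Plain Pearson correlation; inputs must not be constant.
    mx, my = mean(x), mean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = (sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y)) ** 0.5
    return num / den

def spearman(x, y):
    # Spearman correlation = Pearson correlation of the ranks.
    return pearson(ranks(x), ranks(y))

def two_nearest(sample, centroids):
    """Return (nearest, second nearest) centroid names for one sample."""
    ordered = sorted(
        centroids,
        key=lambda name: spearman(sample, centroids[name]),
        reverse=True,  # highest correlation first
    )
    return ordered[0], ordered[1]

# Toy centroids over four hypothetical genes (not real PAM50 values).
TOY_CENTROIDS = {
    "SubtypeA": [1.0, 2.0, 3.0, 4.0],
    "SubtypeB": [4.0, 3.0, 2.0, 1.0],
    "SubtypeC": [1.0, 3.0, 2.0, 4.0],
}

if __name__ == "__main__":
    print(two_nearest([0.1, 0.5, 0.9, 1.2], TOY_CENTROIDS))
```

Keeping the second-ranked centroid alongside the first is what makes it possible to treat subtype assignment as a position on a continuum rather than a hard class label.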