BackgroundThe Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function.ResultsHere, we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole-genome mutation screening in Candida albicans and Pseudomonas aureginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory.ConclusionWe conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens.
PURPOSE Variation in risk of adverse clinical outcomes in patients with cancer and COVID-19 has been reported from relatively small cohorts. The NCATS’ National COVID Cohort Collaborative (N3C) is a centralized data resource representing the largest multicenter cohort of COVID-19 cases and controls nationwide. We aimed to construct and characterize the cancer cohort within N3C and identify risk factors for all-cause mortality from COVID-19. METHODS We used 4,382,085 patients from 50 US medical centers to construct a cohort of patients with cancer. We restricted analyses to adults ≥ 18 years old with a COVID-19–positive or COVID-19–negative diagnosis between January 1, 2020, and March 25, 2021. We followed N3C selection of an index encounter per patient for analyses. All analyses were performed in the N3C Data Enclave Palantir platform. RESULTS A total of 398,579 adult patients with cancer were identified from the N3C cohort; 63,413 (15.9%) were COVID-19–positive. Most common represented cancers were skin (13.8%), breast (13.7%), prostate (10.6%), hematologic (10.5%), and GI cancers (10%). COVID-19 positivity was significantly associated with increased risk of all-cause mortality (hazard ratio, 1.20; 95% CI, 1.15 to 1.24). Among COVID-19–positive patients, age ≥ 65 years, male gender, Southern or Western US residence, an adjusted Charlson Comorbidity Index score ≥ 4, hematologic malignancy, multitumor sites, and recent cytotoxic therapy were associated with increased risk of all-cause mortality. Patients who received recent immunotherapies or targeted therapies did not have higher risk of overall mortality. CONCLUSION Using N3C, we assembled the largest nationally representative cohort of patients with cancer and COVID-19 to date. We identified demographic and clinical factors associated with increased all-cause mortality in patients with cancer. Full characterization of the cohort will provide further insights into the effects of COVID-19 on cancer outcomes and the ability to continue specific cancer treatments.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.