2021
DOI: 10.1038/s41467-021-21254-9

Uniform genomic data analysis in the NCI Genomic Data Commons

Abstract: The goal of the National Cancer Institute’s (NCI’s) Genomic Data Commons (GDC) is to provide the cancer research community with a data repository of uniformly processed genomic and associated clinical data that enables data sharing and collaborative analysis in support of precision medicine. The initial GDC dataset includes genomic, epigenomic, proteomic, clinical and other data from the NCI TCGA and TARGET programs. Data production for the GDC started in June 2015 using an OpenStack-based private cloud. B…

Citations: cited by 88 publications (54 citation statements)
References: 43 publications
“…The discovery cohort included patients from Therapeutically Applicable Research to Generate Effective Treatments (TARGET) program (n=149; 123 high-risk) (dbGAP accession ID phs000218.v22.p8) (online supplemental table 1). RNAseq paired-end (PE) FastQ files, whole exome sequencing (WES) alignment BAM files, somatic mutation MAF (Mutation Annotation Format) files, and clinical data were downloaded from Genomic Data Commons (GDC) [30] (https://portal.gdc.…”
Section: Study Cohorts and Datasets (mentioning)
confidence: 99%
“…For the discovery cohort, the somatic mutations were harmonized using four somatic variant callers (MuTect2, VarScan2, SomaticSniper, and MuSE). [30] After rigorous filtering following GDC's guidelines (https://docs.gdc.…”
Section: Somatic Mutation Detection, HLA Genotyping, and Neoantigen Prediction (mentioning)
confidence: 99%
“…It is now possible to compare miRNA levels between tumors and normal adjacent tissues on a large collection of human cases [Zhang et al, 2021], allowing a rigorous assessment of miR-34a expression in tumorigenesis. Selecting every cancer type where miRNA expression is available for primary tumor and normal adjacent tissue, in at least 10 studied cases (n=20 cancer types), we did not find any cancer type where miR-34a was significantly down-regulated (Figure 1A).…”
Section: Results (mentioning)
confidence: 99%
“…The package contains the TCGA consortium-provided level 3 data, generated by the HiSeq and GenomeAnalyzer platforms, from 450 primary colorectal cancer patient samples [53]. For a more comprehensive and up-to-date phenotype information, associated patients' clinical data were retrieved from the genomic data commons [54][55][56][57].…”
Section: Biomedical Significance Experiments Data (mentioning)
confidence: 99%