Background
Precision medicine has become a mainstay of cancer care in recent years. The National Cancer Institute (NCI) Surveillance, Epidemiology, and End Results (SEER) Program has been an authoritative source of cancer statistics and data since 1973. However, tumor genomic information has not been adequately captured in the cancer surveillance data, which impedes population-based research on molecular subtypes. To address this, the SEER Program has developed and implemented a centralized process to link SEER registries’ tumor cases with genomic test results that are provided by molecular laboratories to the registries.
Methods
Data linkages were carried out following operating procedures for centralized linkages established by the SEER Program. The linkages used Match*Pro, a probabilistic linkage software, and were facilitated by the registries’ trusted third party (an honest broker). The SEER registries provide to NCI limited datasets that undergo preliminary evaluation prior to their release to the research community.
Results
Recently conducted genomic linkages included OncotypeDX Breast Recurrence Score, OncotypeDX Breast Ductal Carcinoma in Situ, OncotypeDX Genomic Prostate Score, Decipher Prostate Genomic Classifier, DecisionDX Uveal Melanoma, DecisionDX Preferentially Expressed Antigen in Melanoma, DecisionDX Melanoma, and germline tests results in Georgia and California SEER registries.
Conclusions
The linkages of cancer cases from SEER registries with genomic test results obtained from molecular laboratories offer an effective approach for data collection in cancer surveillance. By providing de-identified data to the research community, the NCI’s SEER Program enables scientists to investigate numerous research inquiries.