2017
DOI: 10.1158/0008-5472.can-17-0615
|View full text |Cite
|
Sign up to set email alerts
|

DeepPhe: A Natural Language Processing System for Extracting Cancer Phenotypes from Clinical Records

Abstract: Precise phenotype information is needed to understand the effects of genetic and epigenetic changes on tumor behavior and responsiveness. Extraction and representation of cancer phenotypes is currently mostly performed manually making it difficult to correlate phenotypic data to genomic data. In addition, genomic data is being produced at an increasingly faster pace, exacerbating the problem. The DeepPhe software enables automated extraction of detailed phenotype information from Electronic Medical Records of … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
66
0
1

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 78 publications
(67 citation statements)
references
References 4 publications
0
66
0
1
Order By: Relevance
“…Frame Elements References CANCER DIAGNOSIS NAME: cancer type [44], [45], [42], [46], [47], [48], [49], [50], [51] ANATOMICAL SITE: the location description of the finding (including primary and metastatic sites) [45], [52], [42], [53], [54], [55], [25], [27], [56], [57] HISTOLOGY: histological description (e.g. carcinoma) [44], [52], [58], [55], [53], [54], [4], [27], [59], [43], [57] GRADE: appearance of the cancerous cells, can be frame with further information (GRADING VALUE) [44], [52], [54], [4], [48], [27], [59], [60], [43], [61], [62] INVASION TYPE: the stage or level of invasion [52] TUMOR BLOCK: tissue cores removed from regions of interest in paraffinembedded tissues (e.g.…”
Section: Framementioning
confidence: 99%
See 3 more Smart Citations
“…Frame Elements References CANCER DIAGNOSIS NAME: cancer type [44], [45], [42], [46], [47], [48], [49], [50], [51] ANATOMICAL SITE: the location description of the finding (including primary and metastatic sites) [45], [52], [42], [53], [54], [55], [25], [27], [56], [57] HISTOLOGY: histological description (e.g. carcinoma) [44], [52], [58], [55], [53], [54], [4], [27], [59], [43], [57] GRADE: appearance of the cancerous cells, can be frame with further information (GRADING VALUE) [44], [52], [54], [4], [48], [27], [59], [60], [43], [61], [62] INVASION TYPE: the stage or level of invasion [52] TUMOR BLOCK: tissue cores removed from regions of interest in paraffinembedded tissues (e.g.…”
Section: Framementioning
confidence: 99%
“…0.6 mm in diameter) [52] TISSUE BANK: identifiers about location of tissue samples within an institution [52] STATUS: whether confirmed, suspected and there is no evidence of finding (e.g. probable, definite, without) [42], [57] RECURRENT STATUS: the value of recurrent status [42], [63], [57] TEMPORAL INFORMATION: refers to information about time (e.g., year, month, and date, 2007-08-04) [42], [57] SPECIMEN TYPE: the type of specimen involved in diagnosis [53] LATERALITY: describes the side of a paired organ associated with origin of the primary cancer [53], [54], [25], [48], [64], [27], [51] TUMOR SIZE: how large across the tumor is at its widest point (part of cancer staging) [52], [53], [54], [25], [48], [65], [59], [60], [62], [57] TNM STAGE: cancer staging system, can be a separate frame with further information (TNM CLASSIFICATION) [55], [53], [2], [3], [25], [66], [67], [60], [50], [40], [61] EXTENSION: direct extension of tumor [53] UNCERTAINTY: used to differentiate clinical suspicions from conclusive findings (e.g., possible, likely) [68],…”
Section: Framementioning
confidence: 99%
See 2 more Smart Citations
“…Existing cTAKES 12 pipelines were extended to extract cancer information, to use rules to infer higherlevel summaries, and to store results in a Neo4j graph database (www.neo4j.com). The initial DeepPhe architecture is described in detail by Savova, et al 7 .…”
Section: Information Model and Natural Language Processing Tool Develmentioning
confidence: 99%