2007
DOI: 10.1197/jamia.m2130
|View full text |Cite
|
Sign up to set email alerts
|

Collection of Cancer Stage Data by Classifying Free-text Medical Reports

Abstract: Cancer staging provides a basis for planning clinical management, but also allows for meaningful analysis of cancer outcomes and evaluation of cancer care services. Despite this, stage data in cancer registries is often incomplete, inaccurate, or simply not collected. This article describes a prototype software system (Cancer Stage Interpretation System, CSIS) that automatically extracts cancer staging information from medical reports. The system uses text classification techniques to train support vector mach… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
38
0

Year Published

2011
2011
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 61 publications
(39 citation statements)
references
References 22 publications
0
38
0
Order By: Relevance
“…The most common approach for entity identification is to map tokens from segmented text into a target dictionaries such as SNOMED-CT, ICD, UMLS etc. Tools for mapping concepts in biomedical literature such as MetaMap can also process tokenized clinical texts [49, 50], but other clinically customized tools exist [51, 52] such as the HITex system [53] and the Knowledge Map Concept Identifier [53, 54]. …”
Section: 3 Phenotype Extraction From Ehr Textmentioning
confidence: 99%
“…The most common approach for entity identification is to map tokens from segmented text into a target dictionaries such as SNOMED-CT, ICD, UMLS etc. Tools for mapping concepts in biomedical literature such as MetaMap can also process tokenized clinical texts [49, 50], but other clinically customized tools exist [51, 52] such as the HITex system [53] and the Knowledge Map Concept Identifier [53, 54]. …”
Section: 3 Phenotype Extraction From Ehr Textmentioning
confidence: 99%
“…The concepts in the Metathesaurus originate from terminologies used in different areas 6 , and due to the diversity of this database, we also explored using a subset that comprises a medical terminology: SNOMED 7 . In this case we also rely on MetaMap to extract the concepts, but we filter out those that are not present in SNOMED.…”
Section: Knowledge Sources and Feature Representationmentioning
confidence: 99%
“…Their initial experiments showed the difficulty of the primary tumour stage detection (T), with a top accuracy of 64%. In a follow-up paper they explored richer annotation, and a combination of ML and rulebased post-processing [7]. They performed fine-grained annotation of stage details for each sentence in order to build their system, and they observed improvements over a coarse-grained (documentlevel) multiclass classifier.…”
Section: Introductionmentioning
confidence: 96%
See 1 more Smart Citation
“…In a previous study, McCowan et al. extracted cancer staging information from pathology reports using support vector machines (SVMs) . Dublin et al.…”
Section: Introductionmentioning
confidence: 99%