2008
DOI: 10.1197/jamia.m2437
|View full text |Cite
|
Sign up to set email alerts
|

Mayo Clinic NLP System for Patient Smoking Status Identification

Abstract: This article describes our system entry for the 2006 I2B2 contest "Challenges in Natural Language Processing for Clinical Data" for the task of identifying the smoking status of patients. Our system makes the simplifying assumption that patient-level smoking status determination can be achieved by accurately classifying individual sentences from a patient's record. We created our system with reusable text analysis components built on the Unstructured Information Management Architecture and Weka. This reuse of … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
65
0
1

Year Published

2011
2011
2018
2018

Publication Types

Select...
3
3
3

Relationship

0
9

Authors

Journals

citations
Cited by 95 publications
(66 citation statements)
references
References 5 publications
0
65
0
1
Order By: Relevance
“…of patients [25]. Recently, more general medical text processing tools have been developed and made available [4,24].…”
Section: Related Workmentioning
confidence: 99%
“…of patients [25]. Recently, more general medical text processing tools have been developed and made available [4,24].…”
Section: Related Workmentioning
confidence: 99%
“…Indicators of socio-economic status, such as measures of occupational prestige, unemployment, education, and homelessness (Hollister et al 2016), along with country-of-origin (Farber-Eger et al 2017) have been extracted from clinical freetext. Smoking status and alcohol use have also been extracted using natural language processing (Chen and Garcia-Webb 2014;Savova et al). While text extraction has limitations, the use of structured elements (billing codes, etc.)…”
Section: Information On the Individual Levelmentioning
confidence: 99%
“…cTAKES was first applied to phenotype extraction studies [43] and then was extended to identify document-level patient smoking status [45] and patient level summarization in the first i2b2 challenge [46]. The system was used to generate features for a state-of-the-art system in the 2010 i2b2 challenge on relation extraction of medical problems, tests, and treatments [47].…”
Section: Pipelinementioning
confidence: 99%