2021
DOI: 10.3389/fmed.2021.748168
|View full text |Cite
|
Sign up to set email alerts
|

Development of a Structured Query Language and Natural Language Processing Algorithm to Identify Lung Nodules in a Cancer Centre

Abstract: Importance: The stratification of indeterminate lung nodules is a growing problem, but the burden of lung nodules on healthcare services is not well-described. Manual service evaluation and research cohort curation can be time-consuming and potentially improved by automation.Objective: To automate lung nodule identification in a tertiary cancer centre.Methods: This retrospective cohort study used Electronic Healthcare Records to identify CT reports generated between 31st October 2011 and 24th July 2020. A stru… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 18 publications
0
2
0
Order By: Relevance
“…Overall the F1 scores ranged from 0.71 to 0.89, Sensitivity ranged from 0.70–0.90, and PPV ranged from 0.64–0.89. The best performing algorithm was linear support vector machine with F1 0.89, sensitivity 0.90, and PPV 0.88 for predicting concerning lung nodules [ 24 ]. It is important to note, the machine learning technique was used after a pre-processing step using a rule-based algorithm to identify CT reports that actually had nodules.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Overall the F1 scores ranged from 0.71 to 0.89, Sensitivity ranged from 0.70–0.90, and PPV ranged from 0.64–0.89. The best performing algorithm was linear support vector machine with F1 0.89, sensitivity 0.90, and PPV 0.88 for predicting concerning lung nodules [ 24 ]. It is important to note, the machine learning technique was used after a pre-processing step using a rule-based algorithm to identify CT reports that actually had nodules.…”
Section: Discussionmentioning
confidence: 99%
“…Another study also examined CT chest imaging reports using a rule-based natural language processing algorithm to identify lung nodules and more complex machine learning algorithms to determine the presence of concerning features. However, again, the authors only examined one type of imaging modality and did not focus on post-treatment surveillance imaging [ 24 ]. Though CT chest is recommended by NCCN guidelines for post-treatment surveillance [ 16 ], in clinical practice, multiple different imaging types are utilized routinely to rule out or diagnose recurrence in these patients.…”
Section: Introductionmentioning
confidence: 99%
“…The latter can be analysed using NLP approaches. An overview of NLP in oncology is provided by Yim et al [12], and example early diagnosis uses include identifying abnormal cancer screening results [31], auditing colonoscopy or cystoscopy standards [32,33] and identifying or risk-stratifying pre-malignant lesions [34][35][36][37][38]. NLP has also been used to automate patient identification for clinical trials, reducing the burden of eligibility checks [39].…”
Section: Data Types: Electronic Healthcare Recordsmentioning
confidence: 99%