2021
DOI: 10.2196/27955
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Extraction of Lung Cancer Staging Information From Computed Tomography Reports: Deep Learning Approach

Abstract: Background Lung cancer is the leading cause of cancer deaths worldwide. Clinical staging of lung cancer plays a crucial role in making treatment decisions and evaluating prognosis. However, in clinical practice, approximately one-half of the clinical stages of lung cancer patients are inconsistent with their pathological stages. As one of the most important diagnostic modalities for staging, chest computed tomography (CT) provides a wealth of information about cancer staging, but the free-text natu… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 25 publications
(12 citation statements)
references
References 43 publications
0
12
0
Order By: Relevance
“…The outcome of the subtasks could be used as an input for a rule-based system. Several ML approaches extract named entities from radiology reports [ 21 , 22 ]. Named entity recognition could be seen as a subtask for staging, although errors made by our approach in extracting concepts (entities) are limited (see Table 4 ).…”
Section: Discussionmentioning
confidence: 99%
“…The outcome of the subtasks could be used as an input for a rule-based system. Several ML approaches extract named entities from radiology reports [ 21 , 22 ]. Named entity recognition could be seen as a subtask for staging, although errors made by our approach in extracting concepts (entities) are limited (see Table 4 ).…”
Section: Discussionmentioning
confidence: 99%
“…However, the free-text nature of CT reports makes it difficult to understand and analyze them using computer programs. In our previous work [ 27 ], we developed an information extraction system composed of named entity recognition, relation classification, and postprocessing modules to extract valuable information in a pipeline manner. However, in this pipeline, the subsequent tasks would be influenced by the outputs of former tasks, which may affect the performance of the whole system.…”
Section: Methodsmentioning
confidence: 99%
“…A case of the MTQA application is shown in Figure 1 . Finally, the extracted head and tail entities are organized as triples, and a rule-based postprocessing algorithm proposed in the previous work [ 27 ] is used to process the triples to obtain the standardized NLP-extracted features. Furthermore, the NLP-extracted features were manually reviewed and corrected by a clinician based on the report contents as the gold standard features.…”
Section: Methodsmentioning
confidence: 99%
“…There are 2 main entity relation extraction techniques; that is, template rule-based methods and eigenvectorbased methods. Under the template rule-based method, the language features of entity relations are first organized by linguists, after which the corresponding entity relation rules are compiled, and finally, the entity relations are extracted through rule-based matching (1)(2)(3)(4)(5). The eigenvectorbased methods can be divided into the following two types: traditional machine learning and deep learning.…”
Section: Relevant Workmentioning
confidence: 99%