RANLP 2017 - Recent Advances in Natural Language Processing Meet Deep Learning 2017
DOI: 10.26615/978-954-452-049-6_100
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Encoding of Pathology Reports Using Natural Language Processing

Abstract: In this article we present a system that extracts information from pathology reports. The reports are written in Norwegian and contain free text describing prostate biopsies. Currently, these reports are manually coded for research and statistical purposes by trained experts at the Cancer Registry of Norway where the coders extract values for a set of predefined fields that are specific for prostate cancer. The presented system is rule based and achieves an average F-score of 0.91 for the fields Gleason grade,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
6
0
1

Year Published

2018
2018
2021
2021

Publication Types

Select...
2
2
2

Relationship

2
4

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 6 publications
0
6
0
1
Order By: Relevance
“…Today, this is a manual and time‐consuming process that involves reading the pathology report and then reporting the numbers and data found. This could easily be automated using NLP techniques, specifically machine learning based mining, since there are lots of older manually coded pathology reports (Spasic et al, 2014; Weegar et al, 2017; Weegar & Dalianis, 2015). Weegar and Dalianis (2015) used rule‐based algorithms since the pathology reports were well structured.…”
Section: Unstructured Data Analysismentioning
confidence: 99%
“…Today, this is a manual and time‐consuming process that involves reading the pathology report and then reporting the numbers and data found. This could easily be automated using NLP techniques, specifically machine learning based mining, since there are lots of older manually coded pathology reports (Spasic et al, 2014; Weegar et al, 2017; Weegar & Dalianis, 2015). Weegar and Dalianis (2015) used rule‐based algorithms since the pathology reports were well structured.…”
Section: Unstructured Data Analysismentioning
confidence: 99%
“…The third study is by Weegar et al (2017) that used a much larger subset of pathology reports for prostate cancer than used by both Singh et al (2015) and Dahl et al (2016). Weegar et al (2017) built a rule-based system which extracted structured information from over 554 pathology reports for prostate cancer written in Norwegian. The authors divided the 554 pathology reports in 388 documents for the development set and 176 documents for the test set.…”
Section: The Case Of the Cancer Registry Of Norwaymentioning
confidence: 99%
“…The text contains descriptions of 9 biopsies, four from the left side and five from the right side. Figure published inWeegar et al (2017)…”
mentioning
confidence: 99%
“…With the continued growth in the number of cancer patients, and the increase in treatment complexity, cancer registries face a significant challenge in manually reviewing the large quantity of reports [1], [2]. In this situation, Natural Language Processing (NLP) systems can offer a unique opportunity to automatically encode the unstructured reports into structured data.…”
Section: Introductionmentioning
confidence: 99%
“…NLP approaches for information extraction within the biomedical research areas range from rule-based systems [4], to domain-specific systems using feature-based classification [2], to the recent deep networks for end-to-end feature extraction and classification [1]. NLP has had varied degree of success with free-text pathology reports [5].…”
Section: Introductionmentioning
confidence: 99%