2020
DOI: 10.1109/jbhi.2020.3005016
|View full text |Cite
|
Sign up to set email alerts
|

Classification of Cancer Pathology Reports: A Large-Scale Comparative Study

Abstract: We report about the application of state-ofthe-art deep learning techniques to the automatic and interpretable assignment of ICD-O3 topography and morphology codes to free-text cancer reports. We present results on a large dataset (more than 80 000 labeled and 1 500 000 unlabeled anonymized reports written in Italian and collected from hospitals in Tuscany over more than a decade) and with a large number of classes (134 morphological classes and 61 topographical classes). We compare alternative architectures i… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 12 publications
(2 citation statements)
references
References 41 publications
(53 reference statements)
0
2
0
Order By: Relevance
“…where τ r defines the number of elements of the r-th sequence. RNNs can be used in tasks regarding Natural Language Processing (NLP) [48,[77][78][79], time series analysis [80] and, in general, all the tasks involving ordered set of data [81]. Note that, in general, for sequence-to-sequence problems also y can be a sequence of elements, as for example in machine translation where the inputs and outputs of the RNN are sentences in different languages [82].…”
Section: A5 Recurrent Neural Networkmentioning
confidence: 99%
“…where τ r defines the number of elements of the r-th sequence. RNNs can be used in tasks regarding Natural Language Processing (NLP) [48,[77][78][79], time series analysis [80] and, in general, all the tasks involving ordered set of data [81]. Note that, in general, for sequence-to-sequence problems also y can be a sequence of elements, as for example in machine translation where the inputs and outputs of the RNN are sentences in different languages [82].…”
Section: A5 Recurrent Neural Networkmentioning
confidence: 99%
“…Most of these proposals have focused on the English language [4], [33] and Chinese [16]. Recently there is also growing interest in extracting cancer-related information in other languages such as Italian [34], French [25] and Bulgarian [35].…”
Section: Related Workmentioning
confidence: 99%