2019
DOI: 10.1136/bmjopen-2018-023232
|View full text |Cite
|
Sign up to set email alerts
|

Using natural language processing to extract structured epilepsy data from unstructured clinic letters: development and validation of the ExECT (extraction of epilepsy clinical text) system

Abstract: ObjectiveRoutinely collected healthcare data are a powerful research resource but often lack detailed disease-specific information that is collected in clinical free text, for example, clinic letters. We aim to use natural language processing techniques to extract detailed clinical information from epilepsy clinic letters to enrich routinely collected data.DesignWe used the general architecture for text engineering (GATE) framework to build an information extraction system, ExECT (extraction of epilepsy clinic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
46
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 48 publications
(46 citation statements)
references
References 16 publications
0
46
0
Order By: Relevance
“…SAIL Databank took this approach because of the risks of introducing insufficiently deidentified information into the databank. Working with free-text data at source is by means of an NHS honorary contract [ 8 ], and all proposals to use SAIL data must have received approval from an independent IGRP before access can be granted via the data safe haven [ 77 ].…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…SAIL Databank took this approach because of the risks of introducing insufficiently deidentified information into the databank. Working with free-text data at source is by means of an NHS honorary contract [ 8 ], and all proposals to use SAIL data must have received approval from an independent IGRP before access can be granted via the data safe haven [ 77 ].…”
Section: Resultsmentioning
confidence: 99%
“…Alternative methods focus only on isolating the relevant clinical information from personal identifiers via extraction of specified variables such as medication dosage instructions or diagnoses, which are whitelisted and preserved in text. Whitelisting can be thought of as the converse of blacklisting in that it extracts clinically informative data rather than excluding disallowed pieces of information [ 6 - 8 ].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…[27] The most comprehensive pipeline specific to the field of epilepsy is ExECT (extraction of epilepsy clinical text). [28] This extracts a diagnosis of epilepsy as a binary field (88% precision, 89% recall), focal seizures as a binary field (96% precision, 70% recall), generalized seizures as a binary field (89% precision, 52% recall), and epilepsy type as a trinary field defined as either focal, generalized, or absence (90% precision, 80% recall).…”
Section: Epilepsy Type and Seizure Typementioning
confidence: 99%
“…14 Another example is the extraction of epilepsy variables from clinical reports using ExECT (extraction of epilepsy clinical text), a system based on GATE. 15 Another approach to extract information from texts is the use of machine learning, which is often done with classical approaches, such as support vector machines or logistic regression. 8,9,11 In recent years, newer deep learning approaches have also been evaluated for information extraction.…”
Section: Related Workmentioning
confidence: 99%