Statistical Machine Translation has come a long way in improving translation quality across a range of linguistic phenomena. With negation, however, the techniques proposed and implemented for improving translation performance have simply followed from developers' beliefs about why performance is worse. These beliefs have never been validated by an error analysis of the translation output. In contrast, the current paper shows that an informative empirical error analysis can be formulated in terms of (1) the set of semantic elements involved in the meaning of negation, and (2) a small set of string-based operations that can characterise errors in the translation of those elements. Results on a Chinese-to-English translation task confirm the cross-linguistic robustness of our analysis, and its basic assumptions can inform an automated investigation into the causes of translation errors. Conclusions drawn from this analysis should guide future work on improving the translation of negative sentences.
1 Introduction

In recent years, there has been increasing interest in improving the quality of SMT systems over a wide range of linguistic phenomena, including coreference resolution (Hardmeier et al., 2014) and modality (Baker et al., 2012). Among these, however, translating negation is a problem that has not yet been researched thoroughly.

This paper takes an empirical approach towards understanding why negation is a problem in SMT. More specifically, we try to answer two main questions:

1. What kinds of errors are involved in translating negation?

2. What are the causes of these errors during decoding?

While previous work (section 2) has shown that translating negation is a problem, it has not addressed either of these questions. The present paper focuses on the first one: we show that tailoring to a semantic task the string-based error categories standardly used to evaluate the quality of machine translation output allows us to cover the wide range of errors that occur when translating negative sentences (section 3). We report the results of this analysis for a Hierarchical Phrase-Based Model (Chiang, 2007) on a Chinese-to-English translation task (section 4), where we show that all error categories occur to some extent, with scope reordering being the most frequent (section 5).

Addressing question (2) requires connecting the assumptions behind this manual error analysis to errors occurring along the translation pipeline. We therefore complete the analysis by briefly introducing an automatic method to investigate the causes of the errors at decoding time (section 6). Conclusions and future work are reported in sections 7 and 8.
2 Previous Work

In recent years, the automatic recognition of negation has been the focus of considerable work. Following Blanco and Moldovan (2011) and Morante and Blanco (2012), detecting negation is a task of unraveling its structure, i.e., locating its four main components in a text:

• Cue: the word or multi-word unit inherently expressing negation (e.g. 'He is not d...