2005
DOI: 10.1197/jamia.m1733

Agreement, the F-Measure, and Reliability in Information Retrieval

Abstract: Information retrieval studies that involve searching the Internet or marking phrases usually lack a well-defined number of negative cases. This prevents the use of traditional interrater reliability metrics like the kappa statistic to assess the quality of expert-generated gold standards. Such studies often quantify system performance as precision, recall, and F-measure, or as agreement. It can be shown that the average F-measure among pairs of experts is numerically identical to the average positive specific …
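A quick restatement of the identity the abstract refers to (the symbols a, b, and c below are my own shorthand for the 2x2 agreement counts, not notation taken from the paper): let a be the number of cases both experts mark positive and b, c the two kinds of disagreement. Treating one expert as the reference gives precision P = a/(a+b) and recall R = a/(a+c), so

\[
F \;=\; \frac{2PR}{P+R}
  \;=\; \frac{2a}{2a + b + c}
  \;=\; \text{positive specific agreement},
\]

which is the quantity the paper shows the average pairwise F-measure to equal.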

Cited by 738 publications (440 citation statements)
References 5 publications
“…However, it is not suitable for entity recognition tasks [26]. We adopt the F-measure proposed by [13], which allows computing pairwise inter-annotator agreement with the standard precision, recall, and harmonic F-measure from information retrieval, treating one annotator as the gold standard and the other as the predictions. Table 1 shows the pairwise agreement for each entity class.…”
Section: Cost of the Process (mentioning)
confidence: 99%
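A minimal sketch of the pairwise agreement computation described in the excerpt above, assuming exact-match comparison of entity spans; the function name, span tuples, and annotator labels are illustrative only, not taken from the cited work.

def pairwise_f1(gold, pred):
    """Pairwise inter-annotator F1: gold and pred are sets of
    (start, end, label) entity spans produced by two annotators, one of
    which is arbitrarily treated as the gold standard."""
    tp = len(gold & pred)                       # spans both annotators marked identically
    precision = tp / len(pred) if pred else 0.0
    recall = tp / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

annotator_a = {(0, 5, "DRUG"), (12, 20, "ADR"), (30, 34, "DRUG")}
annotator_b = {(0, 5, "DRUG"), (12, 19, "ADR"), (30, 34, "DRUG")}

# Under exact matching, swapping the annotators swaps precision and recall but
# leaves their harmonic mean unchanged, so the choice of "gold" side is arbitrary.
print(f"pairwise F1 = {pairwise_f1(annotator_a, annotator_b):.3f}")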
“…This metric approximates the kappa coefficient (Cohen, 1960) when the number of true negatives (TN) is very large (Hripcsak and Rothschild, 2005). In our case, we can state that the number of TN is very high, since the TN are all the terms that are neither true positives, false positives, nor false negatives.…” (footnote 8 in the original points to http://labda.inf.uc3m.es/SpanishADRCorpus)
Section: Corpus Creation (mentioning)
confidence: 91%
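A small numeric illustration, with made-up agreement counts, of the approximation invoked in the excerpt above: Cohen's kappa converges to the positive specific agreement (equivalently, the pairwise F-measure) as the number of true negatives grows.

def kappa(a, b, c, d):
    """Cohen's kappa from a 2x2 agreement table: a = both raters positive,
    d = both negative, b and c = the two kinds of disagreement."""
    n = a + b + c + d
    p_o = (a + d) / n
    p_e = ((a + b) * (a + c) + (c + d) * (b + d)) / n ** 2
    return (p_o - p_e) / (1 - p_e)

def psa(a, b, c):
    """Positive specific agreement, identical to the pairwise F-measure."""
    return 2 * a / (2 * a + b + c)

a, b, c = 40, 5, 7                      # fixed positives and disagreements (illustrative)
print(f"PSA = {psa(a, b, c):.4f}")
for d in (10, 100, 1_000, 100_000):     # increasingly many true negatives
    print(f"TN = {d:>7}: kappa = {kappa(a, b, c, d):.4f}")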
“…The current version of Inforex enables simultaneous and independent annotation of the same text sample by more than one annotator. Moreover, the annotation process coordinator can track inter-annotator agreement between two raters through the Agreement module, which uses the Positive Specific Agreement (PSA) measure (Hripcsak and Rothschild, 2005) to calculate reliability (see Figure 5). The view configuration allows defining annotation layers, subsets or categories, users, and the set of documents to be analysed.…”
Section: Annotation Agreement (mentioning)
confidence: 99%