This shared task focuses on identifying unusual, previously-unseen entities in the context of emerging discussions. Named entities form the basis of many modern approaches to other tasks (like event clustering and summarization), but recall on them is a real problem in noisy text, even among annotators. This drop is largely due to novel entities and surface forms. Take for example the tweet "so.. kktny in 30 mins?!": even human experts find the entity kktny hard to detect and resolve. The goal of this task is to provide a definition of emerging and rare entities and, based on that definition, datasets for detecting these entities. The task as described in this paper evaluated the ability of participating entries to detect and classify novel and emerging named entities in noisy text.
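To make the detect-and-classify framing concrete, below is an illustrative BIO-style labelling of the example tweet, assuming a creative-work category for the TV-show abbreviation kktny. The token segmentation and tag inventory here are our illustration, not necessarily the task's official annotation scheme.

```python
# Illustrative BIO-style labels for the example tweet above.
# "B-creative-work" is an assumed category: kktny abbreviates a TV show title.
tokens = ["so..", "kktny", "in", "30", "mins?!"]
labels = ["O", "B-creative-work", "O", "O", "O"]

for tok, lab in zip(tokens, labels):
    print(f"{tok}\t{lab}")
```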
Applying natural language processing for mining and intelligent information access to tweets (a form of microblog) is a challenging, emerging research area. Unlike carefully authored news text and other longer content, tweets pose a number of new challenges, due to their short, noisy, context-dependent, and dynamic nature. Information extraction from tweets is typically performed in a pipeline, comprising consecutive stages of language identification, tokenisation, part-of-speech tagging, named entity recognition and entity disambiguation (e.g. with respect to DBpedia). In this work, we describe a new Twitter entity disambiguation dataset, and conduct an empirical analysis of named entity recognition and disambiguation, investigating how robust a number of state-of-the-art systems are on such noisy texts, what the main sources of error are, and which problems should be further investigated to improve the state of the art.
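The pipeline stages named above can be made concrete with a minimal sketch. The stage functions below (detect_language, tokenise, tag_pos, recognise_entities, link_to_dbpedia) are hypothetical stand-ins written for illustration, not the API of any system evaluated here; each comment notes what a real component would do.

```python
from dataclasses import dataclass
from typing import List, Optional

@dataclass
class Token:
    text: str
    pos: Optional[str] = None          # part-of-speech tag, filled in later

@dataclass
class Entity:
    surface: str                        # mention as it appears in the tweet
    dbpedia_uri: Optional[str] = None   # disambiguation target, if resolved

def detect_language(tweet: str) -> str:
    # Stand-in: a real system would use a language identifier trained on tweets.
    return "en"

def tokenise(tweet: str) -> List[Token]:
    # Stand-in: real tweet tokenisers handle hashtags, @-mentions, and emoticons.
    return [Token(t) for t in tweet.split()]

def tag_pos(tokens: List[Token]) -> List[Token]:
    # Stand-in: crude heuristic in place of a trained POS tagger.
    for tok in tokens:
        tok.pos = "PROPN" if tok.text[:1].isupper() else "X"
    return tokens

def recognise_entities(tokens: List[Token]) -> List[Entity]:
    # Stand-in: capitalised tokens as candidate entities; real NER uses
    # sequence models, and noisy tweets are exactly where its recall drops.
    return [Entity(tok.text) for tok in tokens if tok.pos == "PROPN"]

def link_to_dbpedia(entity: Entity) -> Entity:
    # Stand-in: a real linker ranks DBpedia candidates by context similarity.
    entity.dbpedia_uri = f"http://dbpedia.org/resource/{entity.surface}"
    return entity

def pipeline(tweet: str) -> List[Entity]:
    # Consecutive stages, as described in the text.
    if detect_language(tweet) != "en":
        return []
    tokens = tag_pos(tokenise(tweet))
    return [link_to_dbpedia(e) for e in recognise_entities(tokens)]

print(pipeline("Watching Kourtney in NYC tonight"))
```

Each stage here is deliberately trivial; the point of the sketch is the pipeline shape, in which an error at any early stage (e.g. a missed token boundary) propagates into recognition and disambiguation downstream.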
This paper describes the outcomes of the TimeLine task (Cross-Document Event Ordering), that was organised within the Time and Space track of SemEval-2015. Given a set of documents and a set of target entities, the task consisted of building a timeline for each entity, by detecting, anchoring in time and ordering the events involving that entity. The TimeLine task goes a step further than previous evaluation challenges by requiring participant systems to perform both event coreference and temporal relation extraction across documents. Four teams submitted the output of their systems to the four proposed subtracks for a total of 13 runs, the best of which obtained an F1-score of 7.85 in the main track (timeline creation from raw text).
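As a toy illustration of the kind of output the task asks for, the sketch below orders a target entity's events by their time anchors and lets cross-document events anchored to the same time share a timeline position, which is where event coreference matters. The event tuples and the build_timeline helper are invented for illustration and do not reproduce the task's official format or data.

```python
from collections import namedtuple

# An event mention: source document, trigger word, and ISO date anchor.
Event = namedtuple("Event", ["doc_id", "mention", "anchor"])

def build_timeline(entity, events):
    """Order an entity's events chronologically; events with the same
    anchor share a position, mirroring coreferent or simultaneous events."""
    ordered = sorted(events, key=lambda e: e.anchor)  # ISO dates sort lexically
    timeline, rank, prev = [], 0, None
    for ev in ordered:
        if ev.anchor != prev:
            rank += 1
            prev = ev.anchor
        timeline.append((rank, ev))
    return timeline

# Invented cross-document events for one target entity.
apple_events = [
    Event("doc3", "unveiled", "2010-01-27"),
    Event("doc1", "announced", "2010-01-27"),  # same time slot as doc3's event
    Event("doc2", "shipped", "2010-04-03"),
]
for rank, ev in build_timeline("Apple Inc.", apple_events):
    print(rank, ev.anchor, ev.doc_id, ev.mention)
```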