2009
DOI: 10.1007/978-3-642-04930-9_17
|View full text |Cite
|
Sign up to set email alerts
|

Context and Domain Knowledge Enhanced Entity Spotting in Informal Text

Abstract: Abstract. This paper explores the application of restricted relationship graphs (RDF) and statistical NLP techniques to improve named entity annotation in challenging Informal English domains. We validate our approach using on-line forums discussing popular music. Named entity annotation is particularly difficult in this domain because it is characterized by a large number of ambiguous entities, such as the Madonna album "Music" or Lilly Allen's pop hit "Smile". We evaluate improvements in annotation accuracy … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0

Year Published

2010
2010
2017
2017

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 30 publications
(18 citation statements)
references
References 11 publications
0
18
0
Order By: Relevance
“…Our collection module uses the Twitter Streaming API 5 . The Twitter Streaming API allows near-realtime access to various subsets of Twitter public statuses.…”
Section: Architecturementioning
confidence: 99%
See 1 more Smart Citation
“…Our collection module uses the Twitter Streaming API 5 . The Twitter Streaming API allows near-realtime access to various subsets of Twitter public statuses.…”
Section: Architecturementioning
confidence: 99%
“…Entities such as Obama, Senate and Health Care Bill are mentioned within the text in microposts and represent finer grained semantic units that can be extracted. The task of Named Entity Recognition has been studied in casual text [5] and in more general form following both unsupervised and supervised machine learning approaches [9]. The best performing systems achieve up to 90.8 F 1 score [12] through supervised approaches, i.e.…”
Section: A Extracting Semantic Descriptorsmentioning
confidence: 99%
“…Kleb et al [14] used concept-dependant text patterns for the disambiguation of text information. Gruhl et al [9] trained an SVM classifier in order to spot ontology entities. Here, many common ideas from information retrieval (IR) have been transferred to this domain.…”
Section: Related Workmentioning
confidence: 99%
“…Also, many try to transfer NLP approaches to this domain [26,14,9], mostly focusing on a specific domain, using domain-specific measures. So far, in the field of ontology-based entity disambiguation, domain-independent complex structures in semantic graphs have not been exploited.…”
Section: Introductionmentioning
confidence: 99%
“…However when dealing with social media sites, performing NLP can be particularly difficult due to the typically informal nature of user posts, which tend to contain a lot of slang and contextdependant terms, with little attention given to spelling and grammar (Gruhl et al, 2009). Thus, while NLP algorithms are potentially very useful tools for investigating SNSs, there are challenges particular to user-generated content which must be handled.…”
Section: Current Approaches For Data Mining and Analysismentioning
confidence: 99%