2018
DOI: 10.1007/978-3-030-00671-6_8
|View full text |Cite
|
Sign up to set email alerts
|

TSE-NER: An Iterative Approach for Long-Tail Entity Extraction in Scientific Publications

Abstract: Named Entity Recognition and Typing (NER/NET) is a challenging task, especially with long-tail entities such as the ones found in scientific publications. These entities (e.g. "WebKB","StatSnowball") are rare, often relevant only in specific knowledge domains, yet important for retrieval and exploration purposes. State-of-the-art NER approaches employ supervised machine learning models, trained on expensive typelabeled data laboriously produced by human annotators. A common workaround is the generation of labe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0
1

Year Published

2018
2018
2022
2022

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 16 publications
(27 citation statements)
references
References 23 publications
0
26
0
1
Order By: Relevance
“…As outlined in the previous section, a core design feature of TSE-NER is the heuristic filter step in each iteration, which is designed to filter out named entities which are most likely misrecognized (this can easily happen as the used training data is noisy due to the strong reliance on heuristics). While it was shown in [12] that this filter step indeed increases the precision of the overall approach, it does also impact the recall negatively. For example, this could happen by filtering out true positives, i.e.…”
Section: Collaborative Crowd Feedbackmentioning
confidence: 99%
See 4 more Smart Citations
“…As outlined in the previous section, a core design feature of TSE-NER is the heuristic filter step in each iteration, which is designed to filter out named entities which are most likely misrecognized (this can easily happen as the used training data is noisy due to the strong reliance on heuristics). While it was shown in [12] that this filter step indeed increases the precision of the overall approach, it does also impact the recall negatively. For example, this could happen by filtering out true positives, i.e.…”
Section: Collaborative Crowd Feedbackmentioning
confidence: 99%
“…In this section we will summarize TSE-NER, an iterative five-step low-cost approach for training NER/NET classifiers for long-tail entity types. For more detailed information on this approach, refer to [12]. The approach is summarized in the following five steps:…”
Section: Tse-ner: An Iterative Approach For Long-tail Entity Extractionmentioning
confidence: 99%
See 3 more Smart Citations