2023
DOI: 10.1002/pra2.839
|View full text |Cite
|
Sign up to set email alerts
|

Tuning Out the Noise: Benchmarking Entity Extraction for Digitized Native American Literature

Nikolaus Nova Parulian,
Ryan Dubnicek,
Daniel J. Evans
et al.

Abstract: Named Entity Recognition (NER), the automated identification and tagging of entities in text, is a popular natural language processing task, and has the power to transform restricted data into open datasets of entities for further research. This project benchmarks four NER models–Stanford NER, BookNLP, spaCy‐trf and RoBERTa–to identify the most accurate approach and generate an open‐access, gold‐standard dataset of human annotated entities. To meet a real‐world use case, we benchmark these models on a sample d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 18 publications
(16 reference statements)
0
0
0
Order By: Relevance