Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop 2019
DOI: 10.18653/v1/p19-2025
|View full text |Cite
|
Sign up to set email alerts
|

Corpus Creation and Analysis for Named Entity Recognition in Telugu-English Code-Mixed Social Media Data

Abstract: Named Entity Recognition(NER) is one of the important tasks in Natural Language Processing(NLP) and also is a sub task of Information Extraction. In this paper we present our work on NER in Telugu-English code-mixed social media data. Code-Mixing, a progeny of multilingualism is a way in which multilingual people express themselves on social media by using linguistics units from different languages within a sentence or speech context. Entity Extraction from social media data such as tweets(twitter) 1 is in gen… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
2
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(1 citation statement)
references
References 12 publications
0
1
0
Order By: Relevance
“…The annotation of the Named Entities and Parts of Speech was performed by four native speakers of Tigrinya. To evaluate the quality of the task, we conducted Interannotator agreement (IAA) tests computed by Fleiss' kappa to measure the harmony in our decisions Srirangam et al (2019). Fleiss' kappa is the better choice than Cohen's kappa in this situation as the number of annotators is greater than two.…”
Section: Dataset Analysismentioning
confidence: 99%
“…The annotation of the Named Entities and Parts of Speech was performed by four native speakers of Tigrinya. To evaluate the quality of the task, we conducted Interannotator agreement (IAA) tests computed by Fleiss' kappa to measure the harmony in our decisions Srirangam et al (2019). Fleiss' kappa is the better choice than Cohen's kappa in this situation as the number of annotators is greater than two.…”
Section: Dataset Analysismentioning
confidence: 99%