2021
DOI: 10.21467/proceedings.115.12

Bodo Resources for NLP - An Overview of Existing Primary Resources for Bodo

Abstract: With over 1.4 million Bodo speakers, there is a need for automated language processing systems such as machine translation, part-of-speech tagging, speech recognition, and named entity recognition. Developing such systems requires a sufficient amount of data. In this paper we present a detailed description of the primary resources available for the Bodo language that can be used as datasets to study Natural Language Processing and its applications. We have listed out different resources avai…

Citations: cited by 5 publications (7 citation statements)
References: 0 publications
“…As a result, we've established specific strict guidelines. These regulations are based on the community standards of ™Facebook 10 and ™YouTube 11 . Comments with the following aims should be marked as hate.…”
Section: Dataset Annotation (mentioning)
confidence: 99%
“…Indian government law is also introduced against hate speech [6]. Several social media platforms revised their community guidelines to eradicate hate, automatically detecting hate comments and posts and giving users access to report posts and comments 12 . English and other popular languages benefit from their global popularity.…”
Section: Introduction (mentioning)
confidence: 99%
“…Under this wave a new dawn began with the growing socio-political consciousness among a handful of enlighten Bodos. This came at a time, when formidable size of the Bodos had already renounced their ancestral traditions and ethnicity by adopting Assamese identity to avoid social discrimination (Narzary B., 2007). Furthermore, the British policy to open the doors of Assam for outsiders to fill the needs of colonial administration have allowed free flow of people from rest of India and neighboring countries.…”
Section: Rise Of Bodo Identity and Political Renaissance Under Gurude... (mentioning)
confidence: 99%
“…The text of the raw corpus is from different domains such as Aesthetics (Culture, Cinema, Literature, Biographies, and Folklore), Commerce, Mass media (Classified, Discussion, Editorial, Sports, General news, Health, Weather, and Social), Science and Technology (Agriculture, Environmental Science, Textbook, Astrology, Mechanical Engineering, and Environmental Science) and Social Sciences (Economics, Education, Political Science, Linguistics, Health and Family Welfare, History, Text Book, Law, etc). We also acquired another corpus from the work (Narzary et al 2022). The final consolidated corpus has 1.6 million tokens and 191k sentences.…”
Section: Introduction (mentioning)
confidence: 99%