Yemane Tedla scite author profile

Yemane Tedla

5 Publications

30 Citation Statements Received

27 Citation Statements Given


Affiliations

Nagaoka University of Technology

Publications


Tigrinya Part-of-Speech Tagging with Morphological Patterns and the New Nagaoka Tigrinya Corpus

Tedla, Yamamoto, Marasinghe

2016

IJCA


This paper presents the first part-of-speech (POS) tagging research for Tigrinya, a Semitic language, using the newly constructed Nagaoka Tigrinya Corpus. The raw text was extracted from a newspaper published in Eritrea in the Tigrinya language. This initial corpus was cleaned and formatted in plain text and in the Text Encoding Initiative (TEI) XML format. A tagset of 73 tags was designed, and the corpus was manually annotated with POS tags. The tagset encompasses three levels of grammatical information: the main POS categories, subcategories, and POS clitics. The POS-tagged corpus contains 72,080 tokens. Tigrinya has a unique pattern of root-template morphology that can be used to infer POS categories. A supervised learning approach based on conditional random fields (CRFs) and support vector machines (SVMs) was therefore applied, trained on contextual features of words and POS tags, morphological patterns, and affixes. Rigorous parameter optimization was performed, and different combinations of features, data sizes, and tagsets were tested to boost overall accuracy, particularly the prediction of POS for unknown words. For a reduced tagset of 20 tags, an overall accuracy of 90.89% was obtained with stratified 10-fold cross-validation. Enriching contextual features with morphological and affix features improved performance by up to 41.01 percentage points, a significant gain.

General Terms: natural language processing, part-of-speech tagging
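As a rough illustration of the kind of feature set the abstract describes, the following Python sketch builds contextual and affix features for each token, in the dictionary form that CRF or SVM toolkits commonly accept. It is not the authors' implementation: the function names, the one-word context window, the boundary symbols, and the affix length limit are all assumptions, and the morphological pattern features are left out because the abstract does not specify how they are derived.

```python
from typing import Dict, List


def token_features(tokens: List[str], i: int, max_affix: int = 3) -> Dict[str, str]:
    """Build a feature dict for tokens[i] from its neighbours and affixes.

    Hypothetical helper: the window size and max_affix are illustrative
    choices, not values taken from the paper.
    """
    word = tokens[i]
    feats = {
        "word": word,
        # Neighbouring words; sentence boundaries get placeholder symbols.
        "prev_word": tokens[i - 1] if i > 0 else "<BOS>",
        "next_word": tokens[i + 1] if i < len(tokens) - 1 else "<EOS>",
    }
    # Character prefixes and suffixes of 1..max_affix characters,
    # skipping any affix that would cover the whole word.
    for n in range(1, max_affix + 1):
        if len(word) > n:
            feats[f"prefix_{n}"] = word[:n]
            feats[f"suffix_{n}"] = word[-n:]
    return feats


def sentence_features(tokens: List[str]) -> List[Dict[str, str]]:
    """Feature dicts for every token in one sentence."""
    return [token_features(tokens, i) for i in range(len(tokens))]
```

A sequence model such as a CRF would consume one list of feature dicts per sentence alongside the gold tag sequence; the affix features are what give the model a signal for words it has not seen in training.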


Analyzing word embeddings and improving POS tagger of Tigrinya

Tedla, Yamamoto

2017


The effect of shallow segmentation on English-Tigrinya statistical machine translation

Tedla, Yamamoto

2016


Morphological Segmentation with LSTM Neural Networks for Tigrinya

Tedla, Yamamoto

2018

IJNLC


Morphological Segmentation With LSTM Neural Networks for Tigrinya

Tedla

2019


