N. Satya Krishna scite author profile

N. Satya Krishna

4Publications

7Citation Statements Received

0Citation Statements Given

How they've been cited

How they cite others

Affiliations

Birla Institute of Technology and Science - Hyderabad Campus, Institute for Development and Research in Banking Technology, Siddhartha Medical College

Publications

Order By: Most citations

Improving Code-mixed POS Tagging Using Code-mixed Embeddings

Bhattu

Krishna

Somayajulu

et al. 2020

ACM Trans. Asian Low-Resour. Lang. Inf. Process.

View full text Add to dashboard Cite

Social media data has become invaluable component of business analytics. A multitude of nuances of social media text make the job of conventional text analytical tools difficult. Code-mixing of text is a phenomenon prevalent among social media users, wherein words used are borrowed from multiple languages, though written in the commonly understood roman script. All the existing supervised learning methods for tasks such as Parts Of Speech (POS) tagging for code-mixed social media (CMSM) text typically depend on a large amount of training data. Preparation of such large training data is resource-intensive, requiring expertise in multiple languages. Though the preparation of small dataset is possible, the out of vocabulary (OOV) words pose major difficulty, while learning models from CMSM text as the number of different ways of writing non-native words in roman script is huge. POS tagging for code-mixed text is non-trivial, as tagging should deal with syntactic rules of multiple languages. The important research question addressed by this article is whether abundantly available unlabeled data can help in resolving the difficulties posed by code-mixed text for POS tagging. We develop an approach for scraping and building word embeddings for code-mixed text illustrating it for Bengali-English, Hindi-English, and Telugu-English code-mixing scenarios. We used a hierarchical deep recurrent neural network with linear-chain CRF layer on top of it to improve the performance of POS tagging in CMSM text by capturing contextual word features and character-sequence–based information. We prepared a labeled resource for POS tagging of CMSM text by correcting 19% of labels from an existing resource. A detailed analysis of the performance of our approach with varying levels of code-mixing is provided. The results indicate that the F1-score of our approach with custom embeddings is better than the CRF-based baseline by 5.81%, 5.69%, and 6.3% in Bengali, Hindi , and Telugu languages, respectively.

show abstract

Language Identification in Mixed Script

Sristy

Krishna

et al. 2017

View full text Add to dashboard Cite

Design and Development of a Knowledge-based Framework for Trouser Procurement: Bid Evaluation Software Tool (BEST); Volume II: Research Methodology

Jayaraman¹,

Narayanan²,

Krishna³

et al. 1996

View full text Add to dashboard Cite

Sentiment Analysis in Telugu–English CMSM Text

Saini¹,

Prathyusha²,

Mahitha³

et al. 2022

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

N. Satya Krishna

Improving Code-mixed POS Tagging Using Code-mixed Embeddings

Language Identification in Mixed Script

Design and Development of a Knowledge-based Framework for Trouser Procurement: Bid Evaluation Software Tool (BEST); Volume II: Research Methodology

Sentiment Analysis in Telugu–English CMSM Text

Contact Info

Product

Resources

About