We introduce a composite deep neural network architecture for supervised, language-independent, context-sensitive lemmatization. The proposed method frames the task as identifying the correct edit tree representing the transformation between a word-lemma pair. To find the lemma of a surface word, we exploit two successive bidirectional gated recurrent structures: the first extracts character-level dependencies, and the second captures the contextual information of the given word. The key advantages of our model over state-of-the-art lemmatizers such as Lemming and Morfette are: (i) it is independent of hand-crafted features, and (ii) apart from the gold lemma, no other expensive morphological attribute is required for joint learning. We evaluate the lemmatizer on nine languages: Bengali, Catalan, Dutch, Hindi, Hungarian, Italian, Latin, Romanian, and Spanish. We find that the proposed method outperforms Lemming and Morfette on all of these languages except Bengali. To train the model on Bengali, we develop a gold-lemma-annotated dataset (1,702 sentences with a total of 20,257 word tokens), which is an additional contribution of this work.
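A minimal PyTorch sketch of the two-level bidirectional GRU architecture described in this abstract: a character-level bi-GRU builds a representation of each word, a sentence-level bi-GRU adds context, and a softmax classifier selects an edit-tree class per token. All class names, layer sizes, and the single-sentence batching are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn as nn

class EditTreeLemmatizer(nn.Module):
    """Sketch: classify each token into an edit-tree class (assumed dimensions)."""
    def __init__(self, n_chars, n_edit_trees, char_dim=64, word_dim=128, ctx_dim=128):
        super().__init__()
        self.char_emb = nn.Embedding(n_chars, char_dim, padding_idx=0)
        # first bi-GRU: character-level dependencies within each word
        self.char_gru = nn.GRU(char_dim, word_dim, bidirectional=True, batch_first=True)
        # second bi-GRU: contextual information across the sentence
        self.ctx_gru = nn.GRU(2 * word_dim, ctx_dim, bidirectional=True, batch_first=True)
        self.classifier = nn.Linear(2 * ctx_dim, n_edit_trees)

    def forward(self, char_ids):
        # char_ids: (sent_len, max_word_len) character indices for one sentence
        char_vecs = self.char_emb(char_ids)                 # (sent_len, max_word_len, char_dim)
        _, h = self.char_gru(char_vecs)                     # h: (2, sent_len, word_dim)
        word_vecs = torch.cat([h[0], h[1]], dim=-1)         # per-word vectors from both directions
        ctx_out, _ = self.ctx_gru(word_vecs.unsqueeze(0))   # (1, sent_len, 2*ctx_dim)
        return self.classifier(ctx_out.squeeze(0))          # edit-tree scores per token
```

At prediction time, the highest-scoring edit tree applicable to the surface word would be applied to produce the lemma.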
The task of Question Answering is at the very core of machine comprehension. In this paper, we propose a Convolutional Neural Network (CNN) model for text-based multiple-choice question answering, where questions are based on a particular article. Given an article and a multiple-choice question, our model assigns a score to each question-option tuple and chooses the final option accordingly. We test our model on the Textbook Question Answering (TQA) and SciQ datasets. Our model outperforms several LSTM-based baseline models on both datasets.
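A hedged sketch of the scoring scheme described above: each question-option pair is encoded with a 1-D convolution over word embeddings, max-pooled, and mapped to a scalar score, and the highest-scoring option is chosen. Vocabulary size, filter widths, and the way question and option are concatenated are assumptions for illustration, not the paper's exact design.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class OptionScorer(nn.Module):
    """Sketch: CNN scorer over question-option tuples (assumed hyperparameters)."""
    def __init__(self, vocab_size, emb_dim=100, n_filters=64, kernel_size=3):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
        self.conv = nn.Conv1d(emb_dim, n_filters, kernel_size, padding=1)
        self.score = nn.Linear(n_filters, 1)

    def forward(self, token_ids):
        # token_ids: (n_options, seq_len) -- question tokens concatenated with each option
        x = self.emb(token_ids).transpose(1, 2)   # (n_options, emb_dim, seq_len)
        x = F.relu(self.conv(x))                  # convolutional feature maps
        x = x.max(dim=2).values                   # global max-pooling over time
        return self.score(x).squeeze(-1)          # one score per candidate option

# answer selection: predicted_option = scores.argmax() over the candidates
```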
We probe pre-trained transformer language models for bridging inference. We first investigate individual attention heads in BERT and observe that attention heads at higher layers focus on bridging relations more prominently than those at the lower and middle layers; moreover, a few specific attention heads concentrate consistently on bridging. More importantly, we consider language models as a whole in our second approach, where bridging anaphora resolution is formulated as a masked token prediction task (Of-Cloze test). Our formulation produces promising results without any fine-tuning, which indicates that pre-trained language models substantially capture bridging inference. Our further investigation shows that the distance between the anaphor and the antecedent and the context provided to the language model play important roles in the inference.
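A minimal sketch of an Of-Cloze-style query using the Hugging Face transformers fill-mask pipeline: the anaphor is turned into an "<anaphor> of [MASK]" query appended to its context, and the masked-token predictions of a pre-trained BERT are read off without any fine-tuning. The example sentence, prompt wording, and ranking step are assumptions made for illustration; the paper's exact prompt construction and candidate scoring may differ.

```python
from transformers import pipeline

# pre-trained BERT, no fine-tuning
fill_mask = pipeline("fill-mask", model="bert-base-uncased")

context = "I walked into the room. The door was painted white."
query = "the door of [MASK]"          # Of-Cloze query for the anaphor "the door"

# the language model ranks fillers for the mask; these can be compared
# against the candidate antecedents in the context
for p in fill_mask(f"{context} {query} .", top_k=5):
    print(p["token_str"], round(p["score"], 3))
```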