Rakesh Gosangi scite author profile

In this paper, we formulate keyphrase extraction from scholarly articles as a sequence labeling task solved using a BiLSTM-CRF, where the words in the input text are represented using deep contextualized embeddings. We evaluate the proposed architecture using both contextualized and fixed word embedding models on three different benchmark datasets (Inspec, SemEval 2010, SemEval 2017), and compare with existing popular unsupervised and supervised techniques. Our results quantify the benefits of: (a) using contextualized embeddings (e.g. BERT) over fixed word embeddings (e.g. Glove); (b) using a BiLSTM-CRF architecture with contextualized word embeddings over fine-tuning the contextualized word embedding model directly; and (c) using genre-specific contextualized embeddings (SciBERT). Through error analysis, we also provide some insights into why particular models work better than the others. Lastly, we present a case study where we analyze different self-attention layers of the two best models (BERT and SciBERT) to better understand the predictions made by each for the task of keyphrase extraction.

show abstract

Active temperature modulation of metal-oxide sensors for quantitative analysis of gas mixtures

Gosangi

Gutierrez‐Osuna

2013

Sensors and Actuators B: Chemical

View full text Add to dashboard Cite

Active Temperature Programming for Metal-Oxide Chemoresistors

Gosangi

Gutierrez‐Osuna

2010

IEEE Sensors J.

View full text Add to dashboard Cite

A Preliminary Exploration of GANs for Keyphrase Generation

Swaminathan¹,

Zhang²,

Mahata³

et al. 2020

View full text Add to dashboard Cite

We introduce a new keyphrase generation approach using Generative Adversarial Networks (GANs). For a given document, the generator produces a sequence of keyphrases, and the discriminator distinguishes between human-curated and machinegenerated keyphrases. We evaluated this approach on standard benchmark datasets. We observed that our model achieves state-of-theart performance in the generation of abstractive keyphrases and is comparable to the best performing extractive techniques. Although we achieve promising results using GANs, they are not significantly better than the stateof-the-art generative models. To our knowledge, this is one of the first works that use GANs for keyphrase generation. We present a detailed analysis of our observations and expect that these findings would help other researchers to further study the use of GANs for the task of keyphrase generation.

show abstract

#MeTooMA: Multi-Aspect Annotations of Tweets Related to the MeToo Movement

Gautam

Mathur

Gosangi

et al. 2020

ICWSM

View full text Add to dashboard Cite

In this paper, we present a dataset containing 9,973 tweets related to the MeToo movement that were manually annotated for five different linguistic aspects: relevance, stance, hate speech, sarcasm, and dialogue acts. We present a detailed account of the data collection and annotation processes. The annotations have a very high inter-annotator agreement (0.79 to 0.93 k-alpha) due to the domain expertise of the annotators and clear annotation instructions. We analyze the data in terms of geographical distribution, label correlations, and keywords. Lastly, we present some potential use cases of this dataset. We expect this dataset would be of great interest to psycholinguists, socio-linguists, and computational linguists to study the discursive space of digitally mobilized social movements on sensitive issues like sexual harassment.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Rakesh Gosangi

Keyphrase Extraction as Sequence Labeling Using Contextualized Embeddings

Active temperature modulation of metal-oxide sensors for quantitative analysis of gas mixtures

Active Temperature Programming for Metal-Oxide Chemoresistors

A Preliminary Exploration of GANs for Keyphrase Generation

#MeTooMA: Multi-Aspect Annotations of Tweets Related to the MeToo Movement

Contact Info

Product

Resources

About