Khushboo Thaker scite author profile

Different texts shall by nature correspond to different number of keyphrases. This desideratum is largely missing from existing neural keyphrase generation models. In this study, we address this problem from both modeling and evaluation perspectives.We first propose a recurrent generative model that generates multiple keyphrases as delimiter-separated sequences. Generation diversity is further enhanced with two novel techniques by manipulating decoder hidden states. In contrast to previous approaches, our model is capable of generating diverse keyphrases and controlling number of outputs.We further propose two evaluation metrics tailored towards the variable-number generation. We also introduce a new dataset (ST A C KEX) that expands beyond the only existing genre (i.e., academic writing) in keyphrase generation tasks. With both previous and new evaluation metrics, our model outperforms strong baselines on all datasets.

show abstract

One Size Does Not Fit All: Generating and Evaluating Variable Number of Keyphrases

Yuan

Wang

Meng

et al. 2018

Preprint

View full text Add to dashboard Cite

Different texts shall by nature correspond to different number of keyphrases. This desideratum is largely missing from existing neural keyphrase generation models. In this study, we address this problem from both modeling and evaluation perspectives.We first propose a recurrent-generative model that generates multiple keyphrases as delimiter-separated sequences. Generation diversity is further enhanced with two novel techniques by manipulating decoder hidden states. In contrast to previous approaches, our model is capable of generating variable number of diverse keyphrases.We further propose two evaluation metrics tailored towards variable-number generation. We also introduce a new dataset (ST A C KEX) that expand beyond the only existing genre (i.e., academic writing) in keyphrase generation tasks. With both previous and new evaluation metrics, our model outperforms strong baselines on all datasets.

show abstract

Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents

Meng¹,

Thaker²,

Zhang³

et al. 2021

View full text Add to dashboard Cite

Faceted summarization provides briefings of a document from different perspectives. Readers can quickly comprehend the main points of a long document with the help of a structured outline. However, little research has been conducted on this subject, partially due to the lack of large-scale faceted summarization datasets. In this study, we present FacetSum, a faceted summarization benchmark built on Emerald journal articles, covering a diverse range of domains. Different from traditional documentsummary pairs, FacetSum provides multiple summaries, each targeted at specific sections of a long document, including the purpose, method, findings, and value. Analyses and empirical results on our dataset reveal the importance of bringing structure into summaries. We believe FacetSum will spur further advances in summarization research and foster the development of NLP systems that can leverage the structured information in both long texts and summaries.

show abstract

Automatic Concept Extraction for Domain and Student Modeling in Adaptive Textbooks

Chau

Labutov

Thaker

et al. 2020

Int J Artif Intell Educ

View full text Add to dashboard Cite

Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents

Meng

Thaker

Zhang³

et al. 2021

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Khushboo Thaker

One Size Does Not Fit All: Generating and Evaluating Variable Number of Keyphrases

One Size Does Not Fit All: Generating and Evaluating Variable Number of Keyphrases

Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents

Automatic Concept Extraction for Domain and Student Modeling in Adaptive Textbooks

Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents

Contact Info

Product

Resources

About