Namit Katariya scite author profile

Namit Katariya

5 Publications

125 Citation Statements Received

65 Citation Statements Given

Affiliations

Indian Institute of Technology Bombay

Publications

Dr. Summarize: Global Summarization of Medical Dialogue by Exploiting Local Structures.

Joshi, Katariya, Amatriain et al. 2020

Understanding a medical conversation between a patient and a physician poses a unique natural language understanding challenge since it combines elements of standard open-ended conversation with very domain-specific elements that require expertise and medical knowledge. Summarization of medical conversations is a particularly important aspect of medical conversation understanding since it addresses a very real need in medical practice: capturing the most important aspects of a medical encounter so that they can be used for medical decision making and subsequent follow-ups. In this paper we present a novel approach to medical conversation summarization that leverages the unique and independent local structures created when gathering a patient's medical history. Our approach is a variation of the pointer generator network where we introduce a penalty on the generator distribution, and we explicitly model negations. The model also captures important properties of medical conversations such as medical knowledge coming from standardized medical ontologies better than when those concepts are introduced explicitly. Through evaluation by doctors, we show that our approach is preferred on twice the number of summaries to the baseline pointer generator model and captures most or all of the information in 80% of the conversations, making it a realistic alternative to costly manual summarization by medical experts.

Medically Aware GPT-3 as a Data Generator for Medical Dialogue Summarization

Chintagunta, Katariya, Amatriain et al. 2021

In medical dialogue summarization, summaries must be coherent and must capture all the medically relevant information in the dialogue. However, learning effective models for summarization requires large amounts of labeled data, which is especially hard to obtain. We present an algorithm to create synthetic training data with an explicit focus on capturing medically relevant information. We utilize GPT-3 as the backbone of our algorithm and scale 210 human labeled examples to yield results comparable to using 6400 human labeled examples (∼30x) leveraging low-shot learning and an ensemble method. In detailed experiments, we show that this approach produces high quality training data that can further be combined with human labeled data to get summaries that are strongly preferable to those produced by models trained on human data alone, both in terms of medical accuracy and coherency.

Dr. Summarize: Global Summarization of Medical Dialogue by Exploiting Local Structures

Joshi, Katariya, Amatriain et al. 2020

Preprint

Active Evaluation of Classifiers on Large Datasets

Katariya, Iyer, Sarawagi 2012

Effective Transfer Learning for Identifying Similar Questions

McCreery, Katariya, Kannan et al. 2020

People increasingly search online for answers to their medical questions but the rate at which medical questions are asked online significantly exceeds the capacity of qualified people to answer them. This leaves many questions unanswered or inadequately answered. Many of these questions are not unique, and reliable identification of similar questions would enable more efficient and effective question answering schema. COVID-19 has only exacerbated this problem. Almost every government agency and healthcare organization has tried to meet the informational need of users by building online FAQs, but there is no way for people to ask their question and know if it is answered on one of these pages. While many research efforts have focused on the problem of general question similarity, these approaches do not generalize well to domains that require expert knowledge to determine semantic similarity, such as the medical domain. In this paper, we show how a double fine-tuning approach of pretraining a neural network on medical question-answer pairs followed by fine-tuning on medical question-question pairs is a particularly useful intermediate task for the ultimate goal of determining medical question similarity. While other pretraining tasks yield an accuracy below 78.7% on this task, our model achieves an accuracy of 82.6% with the same number of training examples, an accuracy of 80.0% with a much smaller training set, and an accuracy of 84.5% when the full corpus of medical question-answer data is used. We also describe a currently live system that uses the trained model to match user questions to COVID-related FAQs.

CCS Concepts • Applied computing → Consumer health; • Computing methodologies → Semi-supervised learning settings; Neural networks.
