Ruchi Deshpande scite author profile

Ruchi Deshpande

5 Publications

48 Citation Statements Received

111 Citation Statements Given

Affiliations

Adobe Systems (United States), University of Southern California

Publications

Towards Interpreting and Mitigating Shortcut Learning Behavior of NLU models

Du, Manjunatha, Jain et al. 2021

Recent studies indicate that NLU models are prone to rely on shortcut features for prediction, without achieving true language understanding. As a result, these models fail to generalize to real-world out-of-distribution (OOD) data. In this work, we show that the words in the NLU training set can be modeled as a long-tailed distribution. There are two findings: 1) NLU models have a strong preference for features located at the head of the long-tailed distribution, and 2) shortcut features are picked up during the first few iterations of model training. These two observations are further employed to formulate a measurement that quantifies the shortcut degree of each training sample. Based on this shortcut measurement, we propose a shortcut mitigation framework, LTGR, which discourages the model from making overconfident predictions for samples with a large shortcut degree. Experimental results on three NLU benchmarks demonstrate that our long-tailed distribution explanation accurately reflects the shortcut learning behavior of NLU models. Experimental analysis further indicates that LTGR can improve the generalization accuracy on OOD data, while preserving the accuracy on in-distribution data.

[Figure: (a) long-tailed observation; (b) mitigation framework.]

An Observer Study for a Computer-Aided Reading Protocol (CARP) in the Screening Environment for Digital Mammography

et al. 2011

Knowledge-driven decision support for assessing dose distributions in radiation therapy of head and neck cancer

et al. 2016

Towards Interpreting and Mitigating Shortcut Learning Behavior of NLU Models

Manjunatha, Jain et al. 2021

Preprint

Recent studies indicate that NLU models are prone to rely on shortcut features for prediction, without achieving true language understanding. As a result, these models fail to generalize to real-world out-of-distribution (OOD) data. In this work, we show that the words in the NLU training set can be modeled as a long-tailed distribution. There are two findings: 1) NLU models have a strong preference for features located at the head of the long-tailed distribution, and 2) shortcut features are picked up during the first few iterations of model training. These two observations are further employed to formulate a measurement that quantifies the shortcut degree of each training sample. Based on this shortcut measurement, we propose a shortcut mitigation framework, LTGR, which discourages the model from making overconfident predictions for samples with a large shortcut degree. Experimental results on three NLU benchmarks demonstrate that our long-tailed distribution explanation accurately reflects the shortcut learning behavior of NLU models. Experimental analysis further indicates that LTGR can improve the generalization accuracy on OOD data, while preserving the accuracy on in-distribution data.

A collaborative framework for contributing DICOM RT PHI (Protected Health Information) to augment data mining in clinical decision support

Deshpande, Thuptimdang, DeMarco et al. 2014
