Even though deep neural networks have shown tremendous success in countless applications, explaining model behaviour or predictions remains an open research problem. In this paper, we address this issue with a simple yet effective method that analyses the learning dynamics of deep neural networks in semantic segmentation tasks. Specifically, we visualize the learning behaviour during training by tracking how often samples are learned and forgotten in subsequent training epochs. This further allows us to derive important information about the proximity to the class decision boundary and to identify regions that pose a particular challenge to the model. Building on these observations, we present a novel segmentation method that actively uses this information to alter the data representation within the model by increasing the variety of difficult regions. Finally, we show that our method consistently reduces the number of regions that are forgotten frequently, and we further evaluate it with respect to segmentation performance.
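As a concrete illustration of the tracking described above, the sketch below counts per-example forgetting events across epochs, where an example is "forgotten" when its prediction flips from correct to incorrect between consecutive epochs. This is a minimal sketch under that assumption; the class name `ForgettingTracker` and its interface are illustrative and not taken from the paper, and for segmentation the same count would be kept per pixel or region rather than per image.

```python
import numpy as np

class ForgettingTracker:
    """Counts how often each example is 'forgotten' across training epochs.

    A forgetting event is a correct -> incorrect transition of the model's
    prediction for an example between two consecutive epochs.
    """

    def __init__(self, num_examples: int):
        self.prev_correct = np.zeros(num_examples, dtype=bool)
        self.forget_counts = np.zeros(num_examples, dtype=int)

    def update(self, example_ids: np.ndarray, correct: np.ndarray) -> None:
        # Examples that were correct after the last epoch but are wrong now.
        forgotten = self.prev_correct[example_ids] & ~correct
        self.forget_counts[example_ids] += forgotten.astype(int)
        self.prev_correct[example_ids] = correct

# Usage after every epoch: tracker.update(ids, predictions == labels).
# Examples with large forget_counts are the frequently forgotten ones,
# i.e. candidates for lying close to a class decision boundary.
```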
This paper considers deep out-of-distribution active learning. In practice, fully trained neural networks respond unpredictably to out-of-distribution (OOD) inputs and map aberrant samples to arbitrary locations within the model representation space. Since data representations are direct manifestations of the training distribution, the data selection process plays a crucial role in outlier robustness. For paradigms such as active learning, this is especially challenging since protocols must not only improve performance on the training distribution most effectively but also render a robust representation space. However, existing strategies base the data selection directly on the data representation of the unlabeled data, which is random for OOD samples by definition. To address this issue, we introduce forgetful active learning with switch events (FALSE), a novel active learning protocol for out-of-distribution active learning. Instead of defining sample importance on the data representation directly, we formulate "informativeness" in terms of learning difficulty during training. Specifically, we approximate how often the network "forgets" unlabeled samples and query the most "forgotten" samples for annotation. We report accuracy improvements of up to 4.5% across over 270 experiments, including four commonly used protocols, two OOD benchmarks, one in-distribution benchmark, and three different architectures.
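Because unlabeled samples carry no ground-truth labels, the "forgetting" of unlabeled data described above has to be approximated; a natural label-free proxy, consistent with the "switch events" in the protocol's name, is to count changes of the predicted class between consecutive epochs. The following is a minimal sketch of such a query step under that assumption; the function names are illustrative and this is not the reference FALSE implementation.

```python
import numpy as np

def count_switch_events(pred_history: np.ndarray) -> np.ndarray:
    """Label-free proxy for forgetting on unlabeled data.

    pred_history has shape (num_epochs, num_unlabeled) and stores the
    predicted class of every unlabeled example after each epoch.  A switch
    event is any change of the predicted class between consecutive epochs.
    """
    return (pred_history[1:] != pred_history[:-1]).sum(axis=0)

def query_most_forgotten(pred_history: np.ndarray, budget: int) -> np.ndarray:
    """Returns indices of the `budget` most frequently switched examples."""
    switches = count_switch_events(pred_history)
    return np.argsort(-switches)[:budget]
```

The queried indices would then be sent for annotation and moved into the labeled pool before the next training round.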
In active learning, acquisition functions define informativeness directly on the representation position within the model manifold. However, for most machine learning models (in particular neural networks) this representation is not fixed due to training pool fluctuations between active learning rounds. Therefore, several popular strategies are sensitive to experiment parameters (e.g. architecture) and do not consider model robustness in out-of-distribution settings. To alleviate this issue, we propose a grounded second-order definition of information content and sample importance within the context of active learning. Specifically, we define importance by how often a neural network "forgets" a sample during training, treating forgetting events as artifacts of second-order representation shifts. We show that our definition produces highly accurate importance scores even when the model representations are constrained by the lack of training data. Motivated by our analysis, we develop Gaussian Switch Sampling (GauSS). We show that GauSS is setup-agnostic and robust to anomalous distributions through exhaustive experiments on three in-distribution benchmarks, three out-of-distribution benchmarks, and three different architectures. We report an improvement of up to 5% when compared against four popular query strategies. Our code is available at https://github.com/olivesgatech/gauss.

Impact Statement: With the ever-increasing demand for deep learning products in safety-critical applications, the acquisition of suitable training data has significantly increased in complexity. In several instances, labeling large quantities of data is associated with insurmountable costs (e.g. medical applications), while other instances require data diversity at scale with numerous edge cases (e.g. autonomous vehicles). Active learning offers a promising solution to both of these problems by selecting data that improves both annotation efficiency and data quality. For practical deployment, these algorithms must select robust datasets and further function in a wide variety of training setups in order to guarantee design requirements. However, existing algorithms base their acquisition functions on model representations, which results in high performance fluctuations across training setups and robustness metrics. For instance, an algorithm might perform exceedingly well with one neural network architecture but underperform with another. Our work introduces a second-order active learning approach for robustness and portability. We see our work as the first step of many in bringing theoretical active learning algorithms to real-world deployment.
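To show how switch-event scores slot into a full active learning loop, the self-contained toy example below trains for several epochs per round, scores the unlabeled pool by prediction switches, and moves the highest-scoring examples into the labeled pool. It only sketches the general switch-based querying idea and is not the GauSS implementation (which additionally models switch behaviour with a Gaussian; see the linked repository). The synthetic data, model choice, and budget are placeholder assumptions.

```python
import numpy as np
from sklearn.linear_model import SGDClassifier

rng = np.random.default_rng(0)

# Toy two-class pool (purely illustrative synthetic data).
X = np.vstack([rng.normal(-1.0, 1.0, (500, 2)), rng.normal(1.0, 1.0, (500, 2))])
y = np.repeat([0, 1], 500)
labeled = list(rng.choice(len(X), 20, replace=False))
unlabeled = [i for i in range(len(X)) if i not in labeled]

model = SGDClassifier(loss="log_loss", random_state=0)
budget, num_rounds, num_epochs = 20, 5, 10

for _ in range(num_rounds):
    history = []
    for _ in range(num_epochs):
        model.partial_fit(X[labeled], y[labeled], classes=[0, 1])
        history.append(model.predict(X[unlabeled]))
    # Switch events per unlabeled example across consecutive epochs.
    switches = (np.diff(np.stack(history), axis=0) != 0).sum(axis=0)
    # Query the most frequently switched examples for annotation.
    for idx in sorted(np.argsort(-switches)[:budget], reverse=True):
        labeled.append(unlabeled.pop(idx))
```

In practice the model would typically be re-initialized at the start of each round and the queried examples labeled by an oracle; here the labels are simply reused from the synthetic pool for brevity.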