Stevie Bergman scite author profile

Stevie Bergman

4Publications

26Citation Statements Received

79Citation Statements Given

How they've been cited

How they cite others

101

Affiliations

Publications

Order By: Most citations

Fairness On The Ground: Applying Algorithmic Fairness Approaches to Production Systems

Bakalar¹,

Barreto²,

Bergman³

et al. 2021

Preprint

View full text Add to dashboard Cite

Many technical approaches have been proposed for ensuring that decisions made by machine learning systems are fair, but few of these proposals have been stress-tested in real-world systems. This paper presents an example of one team's approach to the challenge of applying algorithmic fairness approaches to complex production systems within the context of a large technology company. We discuss how we disentangle normative questions of product and policy design (like, "how should the system trade off between different stakeholders' interests and needs?") from empirical questions of system implementation (like, "is the system achieving the desired tradeoff in practice?"). We also present an approach for answering questions of the latter sort, which allows us to measure how machine learning systems and human labelers are making these tradeoffs across different relevant groups. We hope our experience integrating fairness tools and approaches into large-scale and complex production systems will be useful to other practitioners facing similar challenges, and illuminating to academics and researchers looking to better address the needs of practitioners.

show abstract

Adaptive Sampling Strategies to Construct Equitable Training Datasets

Cai¹,

Encarnacion²,

Chern³

et al. 2022

View full text Add to dashboard Cite

Adaptive Sampling Strategies to Construct Equitable Training Datasets

Cai¹,

Encarnacion²,

Chern³

et al. 2022

Preprint

View full text Add to dashboard Cite

In domains ranging from computer vision to natural language processing, machine learning models have been shown to exhibit stark disparities, often performing worse for members of traditionally underserved groups. One factor contributing to these performance gaps is a lack of representation in the data the models are trained on. It is often unclear, however, how to operationalize representativeness in specific applications. Here we formalize the problem of creating equitable training datasets, and propose a statistical framework for addressing this problem. We consider a setting where a model builder must decide how to allocate a fixed data collection budget to gather training data from different subgroups. We then frame dataset creation as a constrained optimization problem, in which one maximizes a function of group-specific performance metrics based on (estimated) group-specific learning rates and costs per sample. This flexible approach incorporates preferences of model-builders and other stakeholders, as well as the statistical properties of the learning task. When data collection decisions are made sequentially, we show that under certain conditions this optimization problem can be efficiently solved even without prior knowledge of the learning rates. To illustrate our approach, we conduct a simulation study of polygenic risk scores on synthetic genomic data-an application domain that often suffers from non-representative data collection.We find that our adaptive sampling strategy outperforms several common data collection heuristics, including equal and proportional sampling, demonstrating the value of strategic dataset design for building equitable models. CCS Concepts: • Computing methodologies → Machine learning; Artificial intelligence; • Theory of computation → Design and analysis of algorithms.

show abstract

Statistical Methods for Pharmaceutical Research Planning

Bergman¹

2020

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Stevie Bergman

Fairness On The Ground: Applying Algorithmic Fairness Approaches to Production Systems

Adaptive Sampling Strategies to Construct Equitable Training Datasets

Adaptive Sampling Strategies to Construct Equitable Training Datasets

Statistical Methods for Pharmaceutical Research Planning

Contact Info

Product

Resources

About