2023
DOI: 10.48550/arxiv.2302.04732
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Zeno: An Interactive Framework for Behavioral Evaluation of Machine Learning

Ángel Alexander Cabrera,
Erica Fu,
Donald Bertucci
et al.

Abstract: Figure 1: zeno is a framework for behavioral evaluation of machine learning (ML) models. It has two components, a Python API and an interactive UI. The API is used to generate information such as model outputs and metrics. Users then interact with the UI to see metrics, create slices, and write unit tests. In this toy example, a user is evaluating a cat and dog classifier. They see that the model has lower accuracy for dogs with pointy ears, and create a test expecting the slice accuracy to be higher than 70%.

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 49 publications
(64 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?