Proceedings of the 2014 ACM International Conference on Object Oriented Programming Systems Languages &Amp; Applications 2014
DOI: 10.1145/2660193.2660207
|View full text |Cite
|
Sign up to set email alerts
|

CheckCell

Abstract: Testing and static analysis can help root out bugs in programs, but not in data. This paper introduces data debugging, an approach that combines program analysis and statistical analysis to automatically find potential data errors. Since it is impossible to know a priori whether data are erroneous, data debugging instead locates data that has a disproportionate impact on the computation. Such data is either very important or wrong. Data debugging is especially useful in the context of data-intensive programmin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 21 publications
(3 citation statements)
references
References 44 publications
0
3
0
Order By: Relevance
“…ExceLint extracts reference vectors by first gathering data dependencies for every formula in the given spreadsheet. It obtains dependence information by parsing a sheet's formulas and building the program's dataflow graph [Barowy et al 2014;Cooper and Torczon 2005]. ExceLint can analyze all Excel functions.…”
Section: Reference Vectorsmentioning
confidence: 99%
See 2 more Smart Citations
“…ExceLint extracts reference vectors by first gathering data dependencies for every formula in the given spreadsheet. It obtains dependence information by parsing a sheet's formulas and building the program's dataflow graph [Barowy et al 2014;Cooper and Torczon 2005]. ExceLint can analyze all Excel functions.…”
Section: Reference Vectorsmentioning
confidence: 99%
“…The cutoff threshold is a user-defined parameter that represents the proportion of the worksheet that a user is willing to inspect. The default value, 5%, is based on the observed frequency of spreadsheet errors in the wild [Barowy et al 2014;Panko 2015]. Users may adjust this threshold to inspect more or fewer cells, depending on their preference.…”
Section: Entropy-based Errormentioning
confidence: 99%
See 1 more Smart Citation