2019 IEEE Conference on Visual Analytics Science and Technology (VAST) 2019
DOI: 10.1109/vast47406.2019.8986943
|View full text |Cite
|
Sign up to set email alerts
|

Interactive Correction of Mislabeled Training Data

Abstract: Figure 1: DataDebugger: (a) the navigation stack records the explored hierarchical levels; (b) the tSNE-based visualization shows the item distribution and the confusion between different classes; (c)(d) the selected item view and the trusted item view display the images of selected items and trusted items, respectively; (e) The action trail records the correction history. ABSTRACTIn this paper, we develop a visual analysis method for interactively improving the quality of labeled data, which is essential to t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
43
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
3

Relationship

2
7

Authors

Journals

citations
Cited by 65 publications
(43 citation statements)
references
References 47 publications
0
43
0
Order By: Relevance
“…To handle data quality problems not restricted to crowdsourcing settings, Xiang et al [18] developed a scalable data correction algorithm to propagate the labels of trusted items to other unverified items. A hierarchical visualization was proposed to facilitate the exploration and identification of trusted items.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…To handle data quality problems not restricted to crowdsourcing settings, Xiang et al [18] developed a scalable data correction algorithm to propagate the labels of trusted items to other unverified items. A hierarchical visualization was proposed to facilitate the exploration and identification of trusted items.…”
Section: Related Workmentioning
confidence: 99%
“…To allow efficient exploration of a large number of samples, we organize the samples in a hierarchy. An uncertainty-biased sampling method [18], [59] is used to build the hierarchy. This sampling method takes both region density and classification uncertainty into consideration and increases the sampling ratio from dense regions with low uncertainty to sparse regions with high uncertainty.…”
Section: Samples and Their Relationshipsmentioning
confidence: 99%
“…Methods. We compare our method with seven sampling methods: random sampling (RS), blue noise sampling (BNS) [11], density-based sampling (DBS) [32], non-uniform sampling (NUS) [5], outlier biased density-based sampling (OBDBS) [46], multi-view z-order sampling (MVZS) [24], and KD-tree-based sampling (KBS) [10]. Though KBS and MVZS are designed for multi-class sampling, we use them on single-class data without multi-class constraints.…”
Section: Comparative Evaluation Of Static Samplingmentioning
confidence: 99%
“…For example, the ImageNet dataset [277] only contains the labels cleaned by automatic noise removal methods. To tackle these datasets, Xiang et al [43] developed DataDebugger to interactively improve data quality by utilizing user-selected trusted items. Hierarchical visualization combined with an incremental projection method and an outlier biased sampling method facilitates the exploration and identification of trusted items.…”
Section: Label-level Improvementmentioning
confidence: 99%