2021
DOI: 10.1007/s10462-021-10021-3
|View full text |Cite
|
Sign up to set email alerts
|

A review and experimental analysis of active learning over crowdsourced data

Abstract: Training data creation is increasingly a key bottleneck for developing machine learning, especially for deep learning systems. Active learning provides a cost-effective means for creating training data by selecting the most informative instances for labeling. Labels in real applications are often collected from crowdsourcing, which engages online crowds for data labeling at scale. Despite the importance of using crowdsourced data in the active learning process, an analysis of how the existing active learning a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 21 publications
(15 citation statements)
references
References 48 publications
(49 reference statements)
0
15
0
Order By: Relevance
“…This process can be further improved by combining active learning with the crowd sourcing data and labelling, produced by citizen science activities as discussed in Støttrup et al (2018) andDiBattista et al (2021). In this case, several categories spanning from students to professional fisherman or amatorial divers can efficiently contribute to the labelling of the acquired images and combine this effort with active learning techniques to select the most relevant images for training/updating the classifier, as discussed in Sayin et al (2021).…”
Section: Discussionmentioning
confidence: 99%
“…This process can be further improved by combining active learning with the crowd sourcing data and labelling, produced by citizen science activities as discussed in Støttrup et al (2018) andDiBattista et al (2021). In this case, several categories spanning from students to professional fisherman or amatorial divers can efficiently contribute to the labelling of the acquired images and combine this effort with active learning techniques to select the most relevant images for training/updating the classifier, as discussed in Sayin et al (2021).…”
Section: Discussionmentioning
confidence: 99%
“…Such an approach could be feasible as components of the catenary arch have strong geometric shapes. An alternative approach is to use an active learning paradigm [ 50 ] for reducing the labelling cost. It is possible to leverage the trained model in this feat for the human-in-loop approach.…”
Section: Discussionmentioning
confidence: 99%
“…In addition, modern machine learning models are extremely complex, and it is appropriate to optimize models by iteration. Active learning tries to solve the labeling bottleneck of samples, and actively selects the most valuable unlabeled samples for labeling first, so as to achieve excellent model accuracy with as few labeled samples as possible (Faulon and Faure 2021;Sayin et al 2021). Active learning could overcome the shortcomings of traditional machine learning to some extent.…”
Section: Statistical Analysis and General Guidance For Biocharmentioning
confidence: 99%