2019
DOI: 10.33851/jmis.2019.6.4.225
|View full text |Cite
|
Sign up to set email alerts
|

Training Data Sets Construction from Large Data Set for PCB Character Recognition

Abstract: Deep learning has become increasingly popular in both academic and industrial areas nowadays. Various domains including pattern recognition, Computer vision have witnessed the great power of deep neural networks. However, current studies on deep learning mainly focus on quality data sets with balanced class labels, while training on bad and imbalanced data set have been providing great challenges for classification tasks. We propose in this paper a method of data analysis-based data reduction techniques for se… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(12 citation statements)
references
References 20 publications
0
12
0
Order By: Relevance
“…We have identified several points in the previous study [3,4], and the learning data in the previous study has been subdivided by case, and the test data is as shown in Table 1. -Amount of data from the 1st factory, 2nd factory: 10% of the total amount -Amount of other factories data: 90% of the total amount -Collect at a different period…”
Section: Process Of the Proposed Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…We have identified several points in the previous study [3,4], and the learning data in the previous study has been subdivided by case, and the test data is as shown in Table 1. -Amount of data from the 1st factory, 2nd factory: 10% of the total amount -Amount of other factories data: 90% of the total amount -Collect at a different period…”
Section: Process Of the Proposed Methodsmentioning
confidence: 99%
“…Moreover, when labeling separated character images, there was an error of specifying a different label. Figure 3 shows a problem that cannot be properly divided because of the short distance between characters [3][4][5]. All data were automatically labeled using a pattern matching software that was previously used for PCB inspection during the collection process.…”
Section: Data Collectionmentioning
confidence: 99%
See 2 more Smart Citations
“…Although the HMMs in the network were designed, they were first trained together in a network and learned the component boundaries simultaneously. With a small training set, this, however, didn't turn out well [7]. So the model has been primed by prior samples, about 10% of the training set selected at random from the training set.…”
Section: Experimental Set Up and Datasetmentioning
confidence: 99%