2022
DOI: 10.1016/j.cmpb.2021.106504
|View full text |Cite
|
Sign up to set email alerts
|

Effects of dataset size and interactions on the prediction performance of logistic regression and deep learning models

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
18
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
3

Relationship

0
10

Authors

Journals

citations
Cited by 90 publications
(27 citation statements)
references
References 16 publications
1
18
0
Order By: Relevance
“…The table shows steady improvement for most models with a larger set of training data over all metrics, except for the small model SqueezeNet. Generally, deep learning models, unlike traditional machine learning, benefit from larger datasets [44], which may be the reason for improved performance. The sample confusion matrix for DarkNet-53 in Figure 6 shows considerably better performance in terms of entries with one or fewer false misclassifications.…”
Section: Resultsmentioning
confidence: 99%
“…The table shows steady improvement for most models with a larger set of training data over all metrics, except for the small model SqueezeNet. Generally, deep learning models, unlike traditional machine learning, benefit from larger datasets [44], which may be the reason for improved performance. The sample confusion matrix for DarkNet-53 in Figure 6 shows considerably better performance in terms of entries with one or fewer false misclassifications.…”
Section: Resultsmentioning
confidence: 99%
“…The other tested models with more parameters performed worse, as they seemed overparameterized and likely learned aberrant features, thus overfitting to the training data. Not many studies explore this phenomenon in detail, but a similar phenomenon was noted in the results of a recent study of Bailly et al 64 studying the effects of dataset size, dataset complexity, and model complexity on performance.…”
Section: Deep-learning Model Architecturementioning
confidence: 62%
“…The output of logistic regression is always between (0 and 1), which is suitable for the binary classification task. The higher the value, the higher the probability that the current sample will be classified as class 1 and vice versa (Bailly et al., 2022; Ma et al., 2023; van den Goorbergh et al., 2022; Zabor et al., 2022).…”
Section: Methodsmentioning
confidence: 99%