2023
DOI: 10.1038/s41598-023-32559-8
|View full text |Cite
|
Sign up to set email alerts
|

Efficient shallow learning as an alternative to deep learning

Abstract: The realization of complex classification tasks requires training of deep learning (DL) architectures consisting of tens or even hundreds of convolutional and fully connected hidden layers, which is far from the reality of the human brain. According to the DL rationale, the first convolutional layer reveals localized patterns in the input and large-scale patterns in the following layers, until it reliably characterizes a class of inputs. Here, we demonstrate that with a fixed ratio between the depths of the fi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(2 citation statements)
references
References 33 publications
0
2
0
Order By: Relevance
“…For the purpose of digit recognition, a modified version of the convolutional neural network LENET5 [18] is utilized [19]. The original neural network model exhibits issues of significant bias and variance, and suitable optimizations have been implemented to address these challenges effectively [20][21][22].…”
Section: Short Answermentioning
confidence: 99%
“…For the purpose of digit recognition, a modified version of the convolutional neural network LENET5 [18] is utilized [19]. The original neural network model exhibits issues of significant bias and variance, and suitable optimizations have been implemented to address these challenges effectively [20][21][22].…”
Section: Short Answermentioning
confidence: 99%
“…AI models are broadly categorized into shallow or deep learning models based on the number of linear or non-linear transformations the input data undergo before yielding an output ( Ahmad et al, 2018 ). Shallow models typically convert inputs once or twice before transmitting outputs, while deep models, derived from conventional neural networks, commonly convert inputs multiple times ( Meir et al, 2023 ). As a result, deep models can learn more complex patterns, thereby facilitating end-to-end learning without the need for manual feature engineering and exhibit robust performance in CV and sequential data analysis tasks.…”
Section: Artificial Intelligencementioning
confidence: 99%