2020
DOI: 10.1016/j.neucom.2020.07.053
|View full text |Cite
|
Sign up to set email alerts
|

Survey on Deep Neural Networks in Speech and Vision Systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
66
0
1

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 184 publications
(67 citation statements)
references
References 153 publications
0
66
0
1
Order By: Relevance
“…The following aspects describe the differences between end-to-end systems and traditional systems, although they may not necessarily be considered as fundamental differences. Some end-to-end considered neural networks, as the one in Section 3.5, which is introduced in [13], are capable of processing raw audio signal, the feature extraction being integrated into the network [14], but also some hand-crafted features which have been already extracted in a previous step. From the acoustic model point of view, while the traditional systems are modeling phonemes, the end-to-end systems are trained on graphemes (characters) [15], word-pieces [16] or even entire words [17].…”
Section: End-to-end Asrmentioning
confidence: 99%
“…The following aspects describe the differences between end-to-end systems and traditional systems, although they may not necessarily be considered as fundamental differences. Some end-to-end considered neural networks, as the one in Section 3.5, which is introduced in [13], are capable of processing raw audio signal, the feature extraction being integrated into the network [14], but also some hand-crafted features which have been already extracted in a previous step. From the acoustic model point of view, while the traditional systems are modeling phonemes, the end-to-end systems are trained on graphemes (characters) [15], word-pieces [16] or even entire words [17].…”
Section: End-to-end Asrmentioning
confidence: 99%
“…Nosúltimos anos, esforços significativos de pesquisa foram feitos em técnicas de deep neural network [Mallmann et al 2020], que atualmente fornece os resultados mais promissores em campos como classificação de imagens e detecção de objetos [Alam et al 2020]. Uma dessas arquiteturas de deep neural são os deep autoencoders, que normalmente são usados para redução de dimensionalidade [Li et al 2020], e eliminação de ruído de imagem [Gondara 2016], através de um processo de duas etapas, o codificador e o decodificador.…”
Section: Deep Autoencodersunclassified
“…Deep learning is a class of ML that allows multiple hidden layers for data processing [ 19 ]. A deep neural network (DNN) combines an artificial neural network with deep learning and is capable of providing a better solution to problems in cognitive learning such as speech and image recognition [ 20 ]. So far, DNN models have been successfully applied to learn and predict a range of properties of diverse types of materials, including metals, ceramics, and macromolecular materials [ 21 , 22 , 23 ].…”
Section: Introductionmentioning
confidence: 99%