ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019
DOI: 10.1109/icassp.2019.8683043
|View full text |Cite
|
Sign up to set email alerts
|

How Transferable Are Features in Convolutional Neural Network Acoustic Models across Languages?

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
1
1
1

Relationship

2
4

Authors

Journals

citations
Cited by 18 publications
(14 citation statements)
references
References 5 publications
0
14
0
Order By: Relevance
“…The freeze-trained transfer networks from Thompson et al (2019), which were initialized with parameters from a network previously trained on one language and then freeze-trained on another, outperformed all other freeze-trained networks (no transfer) and other transfer networks (no freeze training). Here, we compare the activations of the English standard, Dutch standard and Dutch-to-English freeze-trained networks from Thompson et al (2019). We predict with high confidence that the early layers of the Dutchto-English freeze-trained network will be more similar to the Dutch than the English standard model since they were initialized with the parameters from the Dutch standard network and received relatively little training afterwards.…”
Section: Freeze Trainingmentioning
confidence: 97%
“…The freeze-trained transfer networks from Thompson et al (2019), which were initialized with parameters from a network previously trained on one language and then freeze-trained on another, outperformed all other freeze-trained networks (no transfer) and other transfer networks (no freeze training). Here, we compare the activations of the English standard, Dutch standard and Dutch-to-English freeze-trained networks from Thompson et al (2019). We predict with high confidence that the early layers of the Dutchto-English freeze-trained network will be more similar to the Dutch than the English standard model since they were initialized with the parameters from the Dutch standard network and received relatively little training afterwards.…”
Section: Freeze Trainingmentioning
confidence: 97%
“…Freeze training refers to the procedure by which layers are gradually removed from the set of trainable variables over the course of training and in order of depth. Previous work has shown that freeze training can speed up training (Raghu et al, 2017) and facilitate transfer across related tasks (Thompson et al, 2019a).…”
Section: Convolutional Neural Network Activationsmentioning
confidence: 99%
“…This quilting procedure, described below, allowed us to focus our comparison on representational transformations only up to the sub-word level in both the convnets and the human auditory system. The audio corpora from which the stimuli were constructed were the same datasets that were used in (Thompson et al, 2019a) and (Thompson et al, 2019b), which are owned by Nuance Communications. Each of the three datasets, one for English, Dutch and German, contained 64-83 hours of spoken text read by several native speakers in a quiet room.…”
Section: Experimental Stimulimentioning
confidence: 99%
See 1 more Smart Citation
“…As an example of other machine learning models applied to EBSD, a nearest neighbor machine learning model has been previously explored by Goulden et al (2017) to address the challenge of separating ferrite and martensite by machine learning aided pattern quality analysis, but pattern quality is too rigid a metric for general use, and the method was reliant on human analysis and confirmation over multiple rounds of indexing a single map. Another advantage is the flexibility of these CNNs, allowing for the transfer of knowledge learned from discriminating images in other contexts (Gonzalez et al, 2019;Thompson et al, 2019), the development of models suitable for application in a highly specific materials space, or deployment in an application where the phases present are completely unknown. For example, during the initial analysis of a material, a pre-trained CNN could be utilized for the determination of which Bravais lattices or space groups are present (Kaufmann et al, 2020).…”
Section: Introductionmentioning
confidence: 99%