2021
DOI: 10.1162/tacl_a_00421
|View full text |Cite
|
Sign up to set email alerts
|

Identity-Based Patterns in Deep Convolutional Networks: Generative Adversarial Phonology and Reduplication

Abstract: This paper models unsupervised learning of an identity-based pattern (or copying) in speech called reduplication from raw continuous data with deep convolutional neural networks. We use the ciwGAN architecture (Beguš, 2021a) in which learning of meaningful representations in speech emerges from a requirement that the CNNs generate informative data. We propose a technique to wug-test CNNs trained on speech and, based on four generative tests, argue that the network learns to represent an identity-based pattern … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
28
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 7 publications
(28 citation statements)
references
References 31 publications
0
28
0
Order By: Relevance
“…F1, perhaps also F2; Krishnan 2002), “acoustic onsets” such as burst, and “frequency transitions”(Abrams and Kraus, 2015). Similarly, it has been shown that the same acoustic properties are encoded in the second to last convolutional layer: periodicity and F0 together with F0 transitions, low frequency formant structure (F1 and to lesser degree F2), burst, and timing of individual segments (Begušs and Zhou, 2021b,a). Figures 1a,b,c,d; 3a,b,c,d; and 4a,b,c,d,e illustrate how the signal from Conv1/4 and cABR encode the same acoustic properties.…”
Section: Discussionmentioning
confidence: 89%
See 4 more Smart Citations
“…F1, perhaps also F2; Krishnan 2002), “acoustic onsets” such as burst, and “frequency transitions”(Abrams and Kraus, 2015). Similarly, it has been shown that the same acoustic properties are encoded in the second to last convolutional layer: periodicity and F0 together with F0 transitions, low frequency formant structure (F1 and to lesser degree F2), burst, and timing of individual segments (Begušs and Zhou, 2021b,a). Figures 1a,b,c,d; 3a,b,c,d; and 4a,b,c,d,e illustrate how the signal from Conv1/4 and cABR encode the same acoustic properties.…”
Section: Discussionmentioning
confidence: 89%
“…Figures 1a,b,c,d; 3a,b,c,d; and 4a,b,c,d,e illustrate how the signal from Conv1/4 and cABR encode the same acoustic properties. Higher-level convolutional layers do not encode all these acoustic properties (Begušs and Zhou, 2021b,a). In sum, both the earlier intermediate layers and cABR signal feature encoding of the same acoustic properties (F0, burst, timing, and low frequency formant structure).…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations