2019
DOI: 10.1101/870311
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Latent space visualization, characterization, and generation of diverse vocal communication signals

Abstract: Animals produce vocalizations that range in complexity from a single repeated call to hundreds 1 of unique vocal elements patterned in sequences unfolding over hours. Characterizing complex 2 vocalizations can require considerable effort and a deep intuition about each species' vocal behavior. 3 Even with a great deal of experience, human characterizations of animal communication can be 4 affected by human perceptual biases. We present here a set of computational methods that center 5 around projecting anim… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
45
0
1

Year Published

2020
2020
2021
2021

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 29 publications
(47 citation statements)
references
References 84 publications
0
45
0
1
Order By: Relevance
“…In that scenario, as our results suggested, each successive dimension represents an orthogonal transformation capturing the maximum possible variance between the data [68,70]. Vocalizations can also be decomposed into characteristics using linear discriminant analysis [5] where characteristics are determined by their ability to explain variance in a specific dimension of data, such as individual identity [68,71]. However, the dimensionality reduction may also be non-linear, representing a greater number of data relationships (for example, the similarity between notes of birdsong).…”
Section: Discussionmentioning
confidence: 95%
See 2 more Smart Citations
“…In that scenario, as our results suggested, each successive dimension represents an orthogonal transformation capturing the maximum possible variance between the data [68,70]. Vocalizations can also be decomposed into characteristics using linear discriminant analysis [5] where characteristics are determined by their ability to explain variance in a specific dimension of data, such as individual identity [68,71]. However, the dimensionality reduction may also be non-linear, representing a greater number of data relationships (for example, the similarity between notes of birdsong).…”
Section: Discussionmentioning
confidence: 95%
“…For instance, species that due to their morphological characteristics as plumage are difficult to separate taxonomically. However, it is important to apply and define this technique in organisms with more complex vocalizations, taking into account that animals emit acoustic signals that vary in complexity, which represents from a single repeated call (note), as in the case of our species model, to hundreds of different vocal elements [68].…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Aspects of other deep networks applied to animal motor control may improve TweetyNet. Examples include object detection architectures [47,48] applied to mouse ultrasonic vocalizations and animal motion tracking, and generative architectures applied to birdsong and other vocalizations [49][50][51]. Lastly we note that in principle TweetyNet and vak library can be applied to any other annotated vocalization, including calls of bats, mouse ultrasonic vocalizations, and dolphin communication.…”
Section: Discussionmentioning
confidence: 99%
“…The application of dimension reduction algorithms or latent space visualization of communication signals is also getting much attention in the wide field of bioacoustics. Sainburg proposed a package for this purpose, called animal vocalization generative network (AVGN) (Sainburg et al 2019). They showed the various examples of the use of the latent space such as discrete latent projections of animal vocalizations and temporally continuous latent trajectories.…”
Section: Discussionmentioning
confidence: 99%