Marvin Thielk scite author profile

Animals produce vocalizations that range in complexity from a single repeated call to hundreds of unique vocal elements patterned in sequences unfolding over hours. Characterizing complex vocalizations can require considerable effort and a deep intuition about each species’ vocal behavior. Even with a great deal of experience, human characterizations of animal communication can be affected by human perceptual biases. We present a set of computational methods for projecting animal vocalizations into low dimensional latent representational spaces that are directly learned from the spectrograms of vocal signals. We apply these methods to diverse datasets from over 20 species, including humans, bats, songbirds, mice, cetaceans, and nonhuman primates. Latent projections uncover complex features of data in visually intuitive and quantifiable ways, enabling high-powered comparative analyses of vocal acoustics. We introduce methods for analyzing vocalizations as both discrete sequences and as continuous latent variables. Each method can be used to disentangle complex spectro-temporal structure and observe long-timescale organization in communication.

show abstract

Parallels in the sequential organization of birdsong and human speech

Sainburg

et al. 2019

View full text Add to dashboard Cite

Human speech possesses a rich hierarchical structure that allows for meaning to be altered by words spaced far apart in time. Conversely, the sequential structure of nonhuman communication is thought to follow non-hierarchical Markovian dynamics operating over only short distances. Here, we show that human speech and birdsong share a similar sequential structure indicative of both hierarchical and Markovian organization. We analyze the sequential dynamics of song from multiple songbird species and speech from multiple languages by modeling the information content of signals as a function of the sequential distance between vocal elements. Across short sequence-distances, an exponential decay dominates the information in speech and birdsong, consistent with underlying Markovian processes. At longer sequence-distances, the decay in information follows a power law, consistent with underlying hierarchical processes. Thus, the sequential organization of acoustic elements in two learned vocal communication signals (speech and birdsong) shows functionally equivalent dynamics, governed by similar processes.

show abstract

Latent space visualization, characterization, and generation of diverse vocal communication signals

Sainburg

Thielk

Gentner

2019

Preprint

View full text Add to dashboard Cite

Animals produce vocalizations that range in complexity from a single repeated call to hundreds 1 of unique vocal elements patterned in sequences unfolding over hours. Characterizing complex 2 vocalizations can require considerable effort and a deep intuition about each species' vocal behavior. 3 Even with a great deal of experience, human characterizations of animal communication can be 4 affected by human perceptual biases. We present here a set of computational methods that center 5 around projecting animal vocalizations into low dimensional latent representational spaces that 6 are directly learned from data. We apply these methods to diverse datasets from over 20 species, 7 including humans, bats, songbirds, mice, cetaceans, and nonhuman primates, enabling high-powered 8 comparative analyses of unbiased acoustic features in the communicative repertoires across species. 9 Latent projections uncover complex features of data in visually intuitive and quantifiable ways. We 10 introduce methods for analyzing vocalizations as both discrete sequences and as continuous latent 11 variables. Each method can be used to disentangle complex spectro-temporal structure and observe 12 long-timescale organization in communication. Finally, we show how systematic sampling from 13 latent representational spaces of vocalizations enables comprehensive investigations of perceptual 14 and neural representations of complex and ecologically relevant acoustic feature spaces. 15Of the thousands of species that communicate vocally, the repertoires of only a tiny minority have been characterized or 18 studied in detail. This is due, in large part, to traditional analysis methods that require a high level of expertise that is 19 hard to develop and often species-specific. Here, we present a set of novel methods to project animal vocalizations 20 into latent feature spaces to quantitatively compare and develop visual intuitions about animal vocalizations, and to 21 systematically synthesize novel species-typical vocalizations from learned feature sets. We demonstrate these methods 22 across a series of analyses over 19 datasets of animal vocalizations from 29 different species, including songbirds, mice, 23 monkeys, humans, and whales. We show how learned latent feature spaces untangle complex spectro-temporal structure, 24 enable unbiased comparisons, and uncover high-level features such as individual identity and population dialects. We 25 generate smoothly varying morphs between vocalizations from a songbird species with a spectro-temporally complex 26 vocal repertoire, European starlings, and show how these methods enable a new degree of control over ecologically 27 relevant signals that can be broadly applied across behavioral and physiological experimental settings. 28 2 Introduction 29Vocal communication is a social behavior common to much of the animal kingdom in which acoustic signals are 30 transmitted from sender to receiver to convey various forms of information such as identity, individual fitness, or the 31 presence ...

show abstract

In Vivo Dopamine Detection and Single Unit Recordings Using Intracortical Glassy Carbon Microelectrode Arrays

et al. 2018

View full text Add to dashboard Cite

In this study, we present a 4-channel intracortical glassy carbon (GC) microelectrode array on a flexible substrate for the simultaneous in vivo neural activity recording and dopamine (DA) concentration measurement at four different brain locations (220μm vertical spacing). The ability of GC microelectrodes to detect DA was firstly assessed in vitro in phosphate-buffered saline solution and then validated in vivo measuring spontaneous DA concentration in the Striatum of European Starling songbird through fast scan cyclic voltammetry (FSCV). The capability of GC microelectrode arrays and commercial penetrating metal microelectrode arrays to record neural activity from the Caudomedial Neostriatum of European starling songbird was compared. Preliminary results demonstrated the ability of GC microelectrodes in detecting neurotransmitters release and recording neural activity in vivo. GC microelectrodes array may, therefore, offer a new opportunity to understand the intimate relations linking electrophysiological parameters with neurotransmitters release.

show abstract

Learned context dependent categorical perception in a songbird

Sainburg

Thielk

Gentner

2018

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Marvin Thielk

Finding, visualizing, and quantifying latent structure across diverse animal vocal repertoires

Parallels in the sequential organization of birdsong and human speech

Latent space visualization, characterization, and generation of diverse vocal communication signals

In Vivo Dopamine Detection and Single Unit Recordings Using Intracortical Glassy Carbon Microelectrode Arrays

Learned context dependent categorical perception in a songbird

Contact Info

Product

Resources

About