2020
DOI: 10.1101/2020.08.28.272088
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

TweetyNet: A neural network that enables high-throughput, automated annotation of birdsong

Abstract: Songbirds provide an excellent model system for understanding sensorimotor learning. Many analyses of learning require annotating song, but songbirds produce more songs than can be annotated by hand. Existing methods for automating annotation are challenged by variable song, like that of Bengalese finches. For particularly complex song like that of canaries, no methods exist, limiting the questions researchers can investigate. We developed an artificial neural network, TweetyNet, that automates annotation. Fir… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
50
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 12 publications
(50 citation statements)
references
References 55 publications
0
50
0
Order By: Relevance
“…MFCC computations where performed using the Librosa [13] Python library. Song spectrograms are first extracted using Short Time Fourier Transform every 11ms (often called frame stride) and computed on overlapping windows of 23ms (often called window width) 4 , using a Hanning window to reduce edge effects. Then, we set the frequency range of a 128 filters Mel filterbank to [500Hz; 8kHz], as canaries vocal patterns occur below 8kHz and as the [0Hz; 500Hz] bandwidth represents mostly noise.…”
Section: Data Preprocessingmentioning
confidence: 99%
See 4 more Smart Citations
“…MFCC computations where performed using the Librosa [13] Python library. Song spectrograms are first extracted using Short Time Fourier Transform every 11ms (often called frame stride) and computed on overlapping windows of 23ms (often called window width) 4 , using a Hanning window to reduce edge effects. Then, we set the frequency range of a 128 filters Mel filterbank to [500Hz; 8kHz], as canaries vocal patterns occur below 8kHz and as the [0Hz; 500Hz] bandwidth represents mostly noise.…”
Section: Data Preprocessingmentioning
confidence: 99%
“…Comparison with [4] can not be done fairly, as our method operate at phrase level and not at syllable level, and as we did not use the same dataset. We discuss the possibility of extending this work in Discussion.…”
Section: Performance Of Transductionmentioning
confidence: 99%
See 3 more Smart Citations