2022
DOI: 10.48550/arxiv.2203.00756
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Real time spectrogram inversion on mobile phone

Abstract: With the growth of computing power on mobile phones and privacy concerns over user's data, on-device real time speech processing has become an important research topic. In this paper, we focus on methods for real time spectrogram inversion, where an algorithm receives a portion of the input signal (e.g., one frame) and processes it incrementally, i.e., operating in streaming mode. We present a real time Griffin Lim(GL) algorithm using a sliding window approach in STFT domain. The proposed algorithm is 2.4x fas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 22 publications
0
1
0
Order By: Relevance
“…Besides, the potential of the high-resolution spectrogram, e.g., with a one-millisecond hop size, is still unclear. Some popular choices of hop size including 10 ms (Böck et al 2012;Kong et al 2020;Gong, Chung, and Glass 2021a) and 12.5 ms (Rybakov et al 2022). Previous studies (Kong et al 2020;Ferraro et al 2021) show classification performance can be steadily improved with the increase of resolution.…”
Section: Introductionmentioning
confidence: 99%
“…Besides, the potential of the high-resolution spectrogram, e.g., with a one-millisecond hop size, is still unclear. Some popular choices of hop size including 10 ms (Böck et al 2012;Kong et al 2020;Gong, Chung, and Glass 2021a) and 12.5 ms (Rybakov et al 2022). Previous studies (Kong et al 2020;Ferraro et al 2021) show classification performance can be steadily improved with the increase of resolution.…”
Section: Introductionmentioning
confidence: 99%