ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9054194
|View full text |Cite
|
Sign up to set email alerts
|

Bandwidth Extension of Musical Audio Signals With No Side Information Using Dilated Convolutional Neural Networks

Abstract: Bandwidth extension has a long history in audio processing. While speech processing tools do not rely on side information, production-ready bandwidth extension tools of general audio signals rely on side information that has to be transmitted alongside the bitstream of the low frequency part, mostly because polyphonic music has a more complex and less predictable spectral structure than speech.This paper studies the benefit of considering a dilated fully convolutional neural network to perform the bandwidth ex… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(8 citation statements)
references
References 13 publications
0
8
0
Order By: Relevance
“…processing [8], [9], [10], whereas most of these studies focus on processing speech [11], [12], [13], [14], [15]. Although music and speech share the same domain of acoustic signals, the two are fundamentally different.…”
Section: Bandwidthextendedmentioning
confidence: 99%
“…processing [8], [9], [10], whereas most of these studies focus on processing speech [11], [12], [13], [14], [15]. Although music and speech share the same domain of acoustic signals, the two are fundamentally different.…”
Section: Bandwidthextendedmentioning
confidence: 99%
“…BWE is alternatively referred to as audio re-sampling or sample-rate conversion in the field of Digital Signal Processing (DSP), or as audio super-resolution in the Machine Learning (ML) literature. Methods for BWE have been extensively studied in areas like audio streaming and restoration, mainly for legacy speech telephony communication systems [13,16,17,27] or, less commonly, for degraded musical material [19,20].…”
Section: Bandwidth Extensionmentioning
confidence: 99%
“…For example, Convolutional Neural Networks (CNNs) [12], WaveNet-like architectures [8,13], and UNets [14,15]. However, most of the works in this line of research tackle the enhancement of speech signals [7][8][9][10][12][13][14][15][16][17][18], and only a few publications exist for musical audio restoration [11,[19][20][21]. This focus on speech is understandable, given the wide range of speech enhancement techniques in telephony, automatic speech recognition, and hearing aids.…”
Section: Introductionmentioning
confidence: 99%
“…Bandwidth Extension (BE) is the task of reconstructing a high-bandwidth signal from its lowbandwidth version, and is usually demonstrated on speech [29,6,26,19,57] and music [30,54]. To perform BE using CAW, we first train it on a high-bandwidth short audio example of a specific speaker.…”
Section: Bandwidth Extensionmentioning
confidence: 99%