Interspeech 2022 2022
DOI: 10.21437/interspeech.2022-10829
|View full text |Cite
|
Sign up to set email alerts
|

INTERSPEECH 2022 Audio Deep Packet Loss Concealment Challenge

Abstract: Audio packet loss concealment is the hiding of gaps in VoIP audio streams caused by network packet loss. With the ICASSP 2024 Audio Deep Packet Loss Concealment Grand Challenge, we build on the success of the previous Audio PLC Challenge held at INTERSPEECH 2022. We evaluate models on an overall harder dataset, and use the new ITU-T P.804 evaluation procedure to more closely evaluate the performance of systems specifically on the PLC task.We evaluate a total of 9 systems, 8 of which satisfy the strict real-tim… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 21 publications
(6 citation statements)
references
References 1 publication
0
6
0
Order By: Relevance
“…While this challenge focuses on improving speech quality by reducing background noise (improving BAK) and reverberation which improves OVRL, other related challenges target echo cancellation [41]- [44], packet loss concealment [45], and general speech signal improvements [46]. The ICASSP 2023 Speech Signal Improvement challenge [46] provides test sets with various types of SIG regressions such as poor-quality microphones and speech enhancement.…”
Section: Related Workmentioning
confidence: 99%
“…While this challenge focuses on improving speech quality by reducing background noise (improving BAK) and reverberation which improves OVRL, other related challenges target echo cancellation [41]- [44], packet loss concealment [45], and general speech signal improvements [46]. The ICASSP 2023 Speech Signal Improvement challenge [46] provides test sets with various types of SIG regressions such as poor-quality microphones and speech enhancement.…”
Section: Related Workmentioning
confidence: 99%
“…Noisiness Deep Noise Suppression [21], [22], [3], [2], [4] Coloration None Discontinuity Packet Loss Concealment [23] Loudness None Reverberation REVERB [24] Echo AEC Challenge [25], [26], [27], [28] • A overlap-save processing has a buffering latency corresponding to the frame size. • A time-domain convolution with stride 1 introduces a buffering latency of 1 sample.…”
Section: Area Related Challengementioning
confidence: 99%
“…The dataset for the ICASSP 2024 challenge is built upon the same framework as before, leveraging real-world packet loss patterns combined with data that is either in the public domain (conversational speech, sourced from the LibriVox Community Podcast) 1 or was collected by us explicitly for use in challenges (read speech), allowing us to have a realistic dataset while avoiding the potential for privacy issues. Audio segments were selected by filtering using DNSMOS [3] and manual inspection to avoid very noisy base audio clips, and were cut to 10 to 15 seconds of length using the WebRTC Voice Activity Detection to avoid cutting off parts of words.…”
Section: Dataset Constructionmentioning
confidence: 99%
“…• 0-120 ms burst: 20 per loss bracket, 100 total Additionally, we include a total of 200 traces also used in the 2022 PLC Challenge to allow for limited comparability. Further details about the dataset creation procedure and data sourcing can be found in our previous work [1]. On Oct. 11, 2023, we first released a validation set constructed in this way, followed by a blind set with no references on Dec. 1, 2023.…”
Section: Dataset Constructionmentioning
confidence: 99%
See 1 more Smart Citation