ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9053356
|View full text |Cite
|
Sign up to set email alerts
|

Towards Blind Quality Assessment of Concert Audio Recordings Using Deep Neural Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
4
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
1
1
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 12 publications
0
4
0
Order By: Relevance
“…Other NR tools produce estimates of objective values including FR speech quality values [23], [30], [32], [38], [44], [51], [54], [56], [57], FR speech intelligibility values [30], [32], [38], [44], [52], [54], [56], [57], speech transmission index [22], codec bit-rate [46], and detection of specific impairments, artifacts, or noise types [34], [39], [41], [52]. Some of these tools perform a single task and others perform multiple tasks.…”
Section: A Existing Machine Learning Approachesmentioning
confidence: 99%
See 1 more Smart Citation
“…Other NR tools produce estimates of objective values including FR speech quality values [23], [30], [32], [38], [44], [51], [54], [56], [57], FR speech intelligibility values [30], [32], [38], [44], [52], [54], [56], [57], speech transmission index [22], codec bit-rate [46], and detection of specific impairments, artifacts, or noise types [34], [39], [41], [52]. Some of these tools perform a single task and others perform multiple tasks.…”
Section: A Existing Machine Learning Approachesmentioning
confidence: 99%
“…They address application areas that include room acoustics (noise and reverberation), speech enhancers, telecommunications systems, and hearing aids. Each addresses one or more of narrowband (NB) (nominally 300 Hz-3.5 kHz), wideband (WB) (nominally 50 Hz-7 kHz), super-wideband (SWB) (nominally 50 Hz-16 kHz), or fullband (FB) (nominally 20 Hz-20 kHz) speech, except for [34], [46] which address music. Recent work shows that NR-tools can measure the speech quality at a system input in spite of the fact that such tools can only access the system output.…”
Section: A Existing Machine Learning Approachesmentioning
confidence: 99%
“…The non-intrusive speech quality assessment model called NISQA [50] produces estimates of subjective speech quality as well as four constituent dimensions: noisiness, coloration, discontinuity, and loudness. Other NR tools produce estimates of objective values including FR speech quality values [22], [29], [31], [42], [48], FR speech intelligibility values [29], [31], [42], [49], speech transmission index [21], codec bit-rate [43], and detection of specific impairments, artifacts, or noise types [33], [37], [39], [49]. Some of these tools perform a single task and others perform multiple tasks.…”
Section: A Existing Machine Learning Approachesmentioning
confidence: 99%
“…They address application areas that include room acoustics (noise and reverberation), speech enhancers, telecommunications systems, and hearing aids. Each addresses one or more of narrowband (NB) (nominally 300 Hz-3.5 kHz), wideband (WB) (nominally 50 Hz-7 kHz), super-wideband (SWB) (nominally 50 Hz-16 kHz), or fullband (FB) (nominally 20 Hz-20 kHz) speech, except for [33], [43] which address music. Recent work shows that NR-tools can measure the speech quality at a system input in spite of the fact that such tools can only access the system output.…”
Section: A Existing Machine Learning Approachesmentioning
confidence: 99%