In this paper, the results of automated subjective assessment of Ukrainian speech intelligibility are presented. Speech monosyllables of the consonant-vowel-consonant (CVC) type were listened in two modes: through headphones and through acoustic monitors. The assessment was carried out with the help of specially developed software that allowed automating of articulation tests. Speech listening was done for four situations: pure language; speech distorted by noise; speech distorted by reverberation; speech distorted by the combined effect of noise and reverberation. In the first case, speech monosyllables of 3 articulation tables were listened, each of which contained 50 monosyllables. In the second case, speech distorted by the additive noise with the signal-to-noise ratios (SNR) varied in the range-15…+10 dB was listened. In this case, models of white, pink and brown noises were used, the masking properties of which are rather well-studied. In the third case, the reverberant speech for reverberation times in the range 0.3…2.7 s was modeled by convolution of pure speech signals with room impulse responces (RIRs) of various rooms, and in the fourth case the joint action of pink noise and reverberation was considered. It turned out that the masking ability of white noise exceeds one for brown noise for SNR less than minus 5 dB, which is not entirely consistent with preliminary predictive estimates. In addition, it turned out that listening to speech distorted by noise through acoustic monitors could lead to a significant increase in the speech intelligibility, compared to the case of listening through headphones. The analysis of possible causes of abnormal increase in speech intelligibility has been carried out. Early reflections, presence of two loudspeakers, binaural listening, psychophysical features of listeners, as well as peculiarities of software and articulatory testing organization were considered as possible reasons of the phenomenon. After correction of the software and some features of articulation tests it turned out that the results of the speech intelligibility estimation almost coincide when listening to the signals through the headphones and through acoustic monitors, if the distance between the listener and acoustic monitors does not exceed 0.6-0.8 meters. At the same time, these corrections did not differ in the behavior of the dependencies of speech intelligibility on the SNR for small (less minus 5 dB) SNR values The general conclusion may be that listening to speech signals distorted by noise and reverberation interferences, performed with the application of the proposed automated system of articulation tests, indicates the performance and high quality of the developed system. Ref. 13, fig. 7.
Національний технічний університет України «Київський політехнічний інститут імені Ігоря Сікорського», kpi.ua Київ, Україна Реферат-В даній роботі представлено результати суб'єктивного оцінювання, здійснюваного шляхом артикуляційних випробувань, розбірливості односкладових звукосполучень на тлі шуму та реверберації. Оцінювання здійснювалося за допомогою спеціально розробленого програмного забезпечення, що дозволило автоматизувати й таким чином суттєво полегшити та пришвидшити процедуру артикуляційних випробувань. За результатами випробувань маскувальна здатність білого шуму виявилася кращою за таку для коричневого шуму при відношеннях сигнал-шум, менших за мінус 5 дБ, що не повністю узгоджується із попередніми прогнозними оцінками. Крім того, виявилося, що слухання мови, спотвореної шумом, через акустичні монітори може призводити до суттєвого підвищення (до 0,85-0,93) оцінок розбірливості мови, порівняно із випадком слухання через навушники (0,1-0,3). Аналогічні результати одержано для ревербераційної завади: для часу реверберації 2,7 с розбірливість збільшилася із 0,65 до 0,94. Даний феномен можна в значній мірі пояснити дією ранніх відбить звуку в приміщеннях, наявністю двох джерел випромінювання та бінауральним прослуховуванням. Додатковими причинами можуть бути особливості психофізичного стану слухачів та розробленої автоматизованої системи артикуляційних випробувань. Бібл. 13, рис. 5. Ключові словарозбірливість мови; суб'єктивне оцінювання; артикуляційні випробування; шум; реверберація; односкладові звукосполучення.
Correcting the public address (PA) system during a concert event is one of the crucial tasks in ensuring acoustic comfort. However, the existing approaches to such correction do not allow for real-time adaptation to changes in the acoustic properties of the venue that occur during the event. To address this limitation, this article proposes the use of a multiband compressor. It is shown that a zero-latency VST plugin can serve as a multiband compressor. Pink noise can be used as a test signal for system calibration. The results of testing the proposed algorithm, conducted through model and real-world experiments, demonstrate the feasibility and effectiveness of the proposed approach.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.