Radek Safarik scite author profile

Radek Safarik

5Publications

29Citation Statements Received

41Citation Statements Given

How they've been cited

How they cite others

Affiliations

Technical University of Liberec

Publications

Order By: Most citations

Using Deep Neural Networks for Identification of Slavic Languages from Acoustic Signal

Mateju

Červa

Žďánský

et al. 2018

View full text Add to dashboard Cite

This paper investigates the use of deep neural networks (DNNs) for the task of spoken language identification. Various feed-forward fully connected, convolutional and recurrent DNN architectures are adopted and compared against a baseline i-vector based system. Moreover, DNNs are also utilized for extraction of bottleneck features from the input signal. The dataset used for experimental evaluation contains utterances belonging to languages that are all related to each other and sometimes hard to distinguish even for human listeners: it is compiled from recordings of the 11 most widespread Slavic languages. We also released this Slavic dataset to the general public, because a similar collection is not publicly available through any other source. The best results were yielded by a bidirectional recurrent DNN with gated recurrent units that was fed by bottleneck features. In this case, the baseline ER was reduced from 4.2% to 1.2% and Cavg from 2.3% to 0.6%.

show abstract

ASR for South Slavic Languages Developed in Almost Automated Way

Nouza¹,

Safarik²,

Červa³

2016

View full text Add to dashboard Cite

Slavic languages pose several specific challenges that need to be addressed in an ASR system design. Since we have already built an engine suited for highly-inflected languages, we focus on adopting it for new languages, now. In this case, we present an efficient way to adapt the system to all (seven) South Slavic languages, using methods and tools that benefit from language similarities, easily adjustable G2P rules or common phonetic subsets. We show that it is possible to build accurate language and acoustic models in an almost automated way, entirely from resources found on the web. The AMs are trained via cross-lingual bootstrapping followed by lightly supervised retraining from public data, like broadcast and parliament archives. Tests done on a set of main broadcast news in each language show WER values in range 16.8 to 21.5 %, which includes also errors caused by OOL (out-of-language) utterances often occurring in this type of spoken programs.

show abstract

Cross-Lingual Adaptation of Broadcast Transcription System to Polish Language Using Public Data Sources

Nouza

Červa

Safarik

2018

View full text Add to dashboard Cite

Unified Approach to Development of ASR Systems for East Slavic Languages

Safarik

Nouza

2017

View full text Add to dashboard Cite

Impact of phonetic annotation precision on automatic speech recognition systems

Safarik

Mateju

2016

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Radek Safarik

Using Deep Neural Networks for Identification of Slavic Languages from Acoustic Signal

ASR for South Slavic Languages Developed in Almost Automated Way

Cross-Lingual Adaptation of Broadcast Transcription System to Polish Language Using Public Data Sources

Unified Approach to Development of ASR Systems for East Slavic Languages

Impact of phonetic annotation precision on automatic speech recognition systems

Contact Info

Product

Resources

About