In this paper, we present a software pipeline for speech recognition that automates the creation of training datasets from unlabeled audio of interest, targeting low-resource languages and domain-specific areas. As speech recognition becomes commoditized, more teams build domain-specific models as well as models for local languages. At the same time, the lack of training datasets for low- and middle-resource languages makes it significantly harder to exploit the latest achievements and frameworks in the speech recognition area and prevents a wide range of software engineers from working on speech recognition problems. This problem is even more acute for domain-specific datasets. The pipeline was tested by building a dataset for Ukrainian speech recognition, which confirmed that the design is adaptable to different data source formats and can be extended to integrate with existing frameworks.
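As a rough illustration of the kind of pipeline the abstract describes, the sketch below bootstraps a labeled dataset by pseudo-labeling unlabeled audio with a pretrained ASR model. This is a minimal sketch under stated assumptions, not the authors' implementation: the model checkpoint, directory layout, and manifest format are all hypothetical choices for illustration.

```python
# Hedged sketch: build a training manifest from unlabeled audio by
# pseudo-labeling with a pretrained ASR model (illustrative only; the
# checkpoint and file layout below are assumptions, not the paper's design).
import json
from pathlib import Path

from transformers import pipeline  # Hugging Face Transformers

# Model choice is an assumption; a Ukrainian or domain-specific
# checkpoint would be substituted in practice.
asr = pipeline("automatic-speech-recognition",
               model="facebook/wav2vec2-base-960h")

def build_manifest(audio_dir: str, manifest_path: str) -> None:
    """Transcribe each WAV file and write one (audio, text) record per line."""
    with open(manifest_path, "w", encoding="utf-8") as out:
        for wav in sorted(Path(audio_dir).glob("*.wav")):
            text = asr(str(wav))["text"]
            out.write(json.dumps(
                {"audio_filepath": str(wav), "text": text},
                ensure_ascii=False) + "\n")

if __name__ == "__main__":
    build_manifest("unlabeled_audio", "train_manifest.jsonl")
```

The resulting manifest could then be filtered (e.g., by model confidence or human review) before being used to train or fine-tune a recognizer for the target language or domain.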
Structure optimization of the multi-channel on-board radar with antenna aperture synthesis and an algorithm for power line selection against the background of the Earth's surface
Language models have recently shown a dramatic improvement in their ability to deliver state-of-the-art accuracy on a number of Natural Language Processing tasks. These improvements open many possibilities for solving NLP downstream tasks, including machine translation, speech recognition, information retrieval, sentiment analysis, summarization, question answering, multilingual dialogue system development, and more. Language models are one of the most important components in solving each of these tasks. This paper is devoted to the research and analysis of the most widely adopted techniques and designs for building and training language models that show state-of-the-art results. It surveys the techniques and components applied in the creation of language models and their parts, paying attention to neural networks, embedding mechanisms, bidirectionality, encoder-decoder architectures, attention and self-attention, and parallelization through the Transformer. Results: the most promising techniques involve pretraining and fine-tuning of a language model, attention-based neural networks as part of the model design, and a complex ensemble of multidimensional embeddings to build deep contextual understanding. The latest architectures based on these approaches require a lot of computational power to train, which is a direction for further improvement.
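For reference, the self-attention mechanism the abstract refers to is commonly the scaled dot-product attention of the Transformer, where the queries Q, keys K, and values V are linear projections of the input embeddings and d_k is the key dimension:

\[
\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\!\left(\frac{QK^{\top}}{\sqrt{d_k}}\right)V
\]

The softmax weights express how strongly each position attends to every other position, and because they are computed for all positions at once as matrix products, the mechanism parallelizes well, which is the property the abstract highlights.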