Yuri Matveev scite author profile

Yuri Matveev

5Publications

47Citation Statements Received

105Citation Statements Given

How they've been cited

How they cite others

152

103

Affiliations

Federal Penitentiary Service, ITMO University, Tver State Technical University

Publications

Order By: Most citations

Deep Speaker Embeddings for Far-Field Speaker Recognition on Short Utterances

Gusev¹,

Volokhov²,

Andzhukaev³

et al. 2020

View full text Add to dashboard Cite

Speaker recognition systems based on deep speaker embeddings have achieved significant performance in controlled conditions according to the results obtained for early NIST SRE (Speaker Recognition Evaluation) datasets. From the practical point of view, taking into account the increased interest in virtual assistants (such as Amazon Alexa, Google Home, Apple Siri, etc.), speaker verification on short utterances in uncontrolled noisy environment conditions is one of the most challenging and highly demanded tasks. This paper presents approaches aimed to achieve two goals: a) improve the quality of far-field speaker verification systems in the presence of environmental noise, reverberation and b) reduce the system quality degradation for short utterances. For these purposes, we considered deep neural network architectures based on TDNN (Time Delay Neural Network) and ResNet (Residual Neural Network) blocks. We experimented with state-of-the-art embedding extractors and their training procedures. Obtained results confirm that ResNet architectures outperform the standard x-vector approach in terms of speaker verification quality for both longduration and short-duration utterances. We also investigate the impact of speech activity detector, different scoring models, adaptation and score normalization techniques. The experimental results are presented for publicly available data and verification protocols for the VoxCeleb1, VoxCeleb2, and VOiCES datasets.

show abstract

A Highly Efficient Neural Network Solution for Automated Detection of Pointer Meters with Different Analog Scales Operating in Different Conditions

Alexeev¹,

Kukharev

Matveev

et al. 2020

Mathematics

View full text Add to dashboard Cite

We investigate a neural network–based solution for the Automatic Meter Reading detection problem, applied to analog dial gauges. We employ a convolutional neural network with a non-linear Network in Network kernel. Presently, there is a significant interest in systems for automatic detection of analog dial gauges, particularly in the energy and household sectors, but the problem is not yet sufficiently addressed in research. Our method is a universal three-level model that takes an image as an input and outputs circular bounding areas, object classes, grids of reference points for all symbols on the front panel of the device and positions of display pointers. Since all analog pointer meters have a common nature, this multi-cascade model can serve various types of devices if its capacity is sufficient. The model is using global regression for locations of symbols, which provides resilient results even for low image quality and overlapping symbols. In this work, we do not focus on the pointer location detection since it heavily depends on the shape of the pointer. We prepare training data and benchmark the algorithm with our own framework a3net, not relying on third-party neural network solutions. The experimental results demonstrate the versatility of the proposed methods, high accuracy, and resilience of reference points detection.

show abstract

Deep Speaker Embeddings for Far-Field Speaker Recognition on Short Utterances

Gusev¹,

Volokhov²,

Andzhukaev³

et al. 2020

Preprint

View full text Add to dashboard Cite

Facial Expression

Lyakso

Frolova

Matveev

2021

View full text Add to dashboard Cite

The description of the results of five psychophysiological studies using automatic coding facial expression in adults and children (from 4 to 16 years) in the FaceReader software version 8.0 is presented. The model situations of reading the emotional text and pronouncing emotional phrases and words, natural interaction in mother-child dyads, child and adult (experimenter), and interaction of children with each other were analyzed. The difficulties of applying the program to analyze the behavior of children in natural conditions, to analyze the emotional facial expressions of the children with autism spectrum disorders and children with Down syndrome are described. The ways to solve them are outlined.

show abstract

Automatic Recognition of the Psychoneurological State of Children: Autism Spectrum Disorders, Down Syndrome, Typical Development

Matveev¹,

Matveev²,

Frolova

et al. 2021

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yuri Matveev

Deep Speaker Embeddings for Far-Field Speaker Recognition on Short Utterances

A Highly Efficient Neural Network Solution for Automated Detection of Pointer Meters with Different Analog Scales Operating in Different Conditions

Deep Speaker Embeddings for Far-Field Speaker Recognition on Short Utterances

Facial Expression

Automatic Recognition of the Psychoneurological State of Children: Autism Spectrum Disorders, Down Syndrome, Typical Development

Contact Info

Product

Resources

About