2021
DOI: 10.3390/electronics10192420

Edge Container for Speech Recognition

Abstract: Containerization has mainly been used in pure software solutions, but it is gradually finding its way into industrial systems. This paper introduces an edge container with artificial intelligence for speech recognition, which performs the voice control function of the actuator as part of the Human Machine Interface (HMI). This work proposes a procedure for creating voice-controlled applications with modern hardware and software resources. The created architecture integrates well-known digital technologi…

Cited by 4 publications (3 citation statements)
References 11 publications

“…Beňo et al. [79] describe an implementation of Microsoft Cognitive Speech Service on the edge utilizing Microsoft Azure services. The solution is made up of two containers.…”
Section: Virtual Assistants (mentioning)
confidence: 99%
“… | Year | Application Area | Description | EI Level
[75] | 2023 | Federated learning | Dynamic FL deployment and learning scheme | 4
[76] | 2021 | Robotics | Design methodology for ROS-based applications | 4
[77] | 2021 | Healthcare | Readmission prediction system for healthcare facilities | 3
[78] | 2022 | Healthcare | Electronic health records decomposing the patient's body into containers | -
[79] | 2021 | Virtual assistant | Voice control for human-machine interaction | 3
[81] | 2022 | Composite AI | Ontology model for development of multi-agent AI systems | -
[82] | 2018 | Healthcare | Human activity recognition | 3
[83] | 2021 | Security | Re-identification of people across multiple cameras | 2
[84] | 2022 | Wildfire modelling | A federation architecture to enable a composable infrastructure | -
[85] | 2019 | Computer vision | Architecture for image processing | 3 …”
Section: Reference (mentioning)
confidence: 99%
“…A powerful computer with graphics processing units (GPUs) is mainly used to train the ASR model to achieve a word error rate (WER) of less than a few percent using hundreds of millions of weights. Most of the high-accuracy ASR models are full-context models, which wait to hear the complete utterance before generating output [3][4][5][6][7][8]. On the contrary, streaming ASR models try to generate output as fast as possible without waiting for the completion of utterance [9,10].…”
Section: Introduction (mentioning)
confidence: 99%
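
The word error rate (WER) mentioned in the citation statement above is conventionally defined as the word-level edit distance between a reference transcript and the recognizer output, divided by the number of reference words. The following is a minimal sketch of that standard definition, not code from the cited papers; the function name `word_error_rate` and the example voice commands are hypothetical, and whitespace tokenization is an assumption (real evaluations usually normalize case and punctuation first).

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference length.

    Assumption: words are separated by whitespace and compared exactly.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion of a reference word
                curr[j - 1] + 1,                  # insertion of a hypothesis word
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical voice command with one dropped word: 1 error / 4 words.
    print(word_error_rate("turn the motor on", "turn motor on"))
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.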