2022
DOI: 10.3389/frai.2022.856232
|View full text |Cite
|
Sign up to set email alerts
|

DeepSpectrumLite: A Power-Efficient Transfer Learning Framework for Embedded Speech and Audio Processing From Decentralized Data

Abstract: Deep neural speech and audio processing systems have a large number of trainable parameters, a relatively complex architecture, and require a vast amount of training data and computational power. These constraints make it more challenging to integrate such systems into embedded devices and utilize them for real-time, real-world applications. We tackle these limitations by introducing DeepSpectrumLite, an open-source, lightweight transfer learning framework for on-device speech and audio recognition using pre-t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
4

Relationship

3
6

Authors

Journals

citations
Cited by 22 publications
(7 citation statements)
references
References 46 publications
0
7
0
Order By: Relevance
“…There have consequently been increasing efforts to develop AutoML approaches that optimise a large network until it is executable on a low-resource device ( 77 , 78 ). Many of these approaches focus on reducing the memory footprint and the computational complexity of a network while preserving its accuracy.…”
Section: Earliermentioning
confidence: 99%
“…There have consequently been increasing efforts to develop AutoML approaches that optimise a large network until it is executable on a low-resource device ( 77 , 78 ). Many of these approaches focus on reducing the memory footprint and the computational complexity of a network while preserving its accuracy.…”
Section: Earliermentioning
confidence: 99%
“…It has been used in previous challenges [31,32] and is described in [3]. A lightweight version of DeepSpectrum for audio signal processing on-device can be found in [5] 5 . auDeep: 6 This feature set is obtained through unsupervised representation learning with recurrent sequence-to-sequence autoencoders [2,11]; it has as well been employed in previous challenges [31,32].…”
Section: Approachesmentioning
confidence: 99%
“…Mobile cough devices are portable, low-cost, real-time, and patient-friendly. Thus, in recent years, there has been an increased focus on the advanced techniques based on mobile technologies and wearable devices in the acquisition and automatic processing of cough sounds, especially after the outbreak of coronavirus disease 2019 (COVID-19), more studies have attempted to rapidly identify COVID-19 by this technique (158)(159)(160)(161)(162)(163)(164)(165)(166). Hoyos-Barceló et al proposed an efficient and power-saving smartphone-based cough detection system, which could classify cough in a noisy environment with 88.94% sensitivity and 98.64% specificity (167).…”
Section: Mobile Device Technologiesmentioning
confidence: 99%