Acceleration of HMM-based speech recognition system by parallel FPGA Gaussian calculation

Veitch, Richard; Aubert, Louis-Marie; Woods, Roger; Fischaber, Scott

doi:10.1109/spl.2010.5483010

Cited by 5 publications

(7 citation statements)

References 8 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Performance figures are given for two implementations. The first case, which was originally presented in [12], has been implemented in C++ compiled with Microsoft compilers via Visual Studio 2008. This implementation achieves speeds of 114 frames per second on average which is better than the real-time figure of 100 frames per second.…”

Section: Software Performancementioning

confidence: 99%

“…For this reason, the approach taken in designing the Gaussian core was to first build a single, efficient pipeline with minimal control and then to build a parallel architecture containing multiple pipelines which could be configured to achieve specific design goals. In this section, we will first present the single core implementation which was first proposed in the previous paper [12] and then describe a number of new multi-core architectures that have been implemented since the original publication, in order to provide a solution tailored to the required performance of a range of speech recognition systems.…”

Section: Fpga Implementationmentioning

confidence: 99%

“…The first is based on a buffered input system and is new for this paper. The second was first proposed in [12] and uses parallel acoustic models on a single Acoustic Observation Vector.…”

Section: 2mentioning

confidence: 99%

“…This work, originally presented at the 6th Southern Programmable Logic Conference [12], outlined the Gaussian core as a hardware peripheral capable of real-time operation; dual cores were implemented in order to achieve this. Improvements to that paper that have been included here are given below.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

FPGA Implementation of a Pipelined Gaussian Calculation for HMM-Based Large Vocabulary Speech Recognition

Veitch

Aubert

Woods

et al. 2011

International Journal of Reconfigurable Computing

Self Cite

View full text Add to dashboard Cite

A scalable large vocabulary, speaker independent speech recognition system is being developed using Hidden Markov Models (HMMs) for acoustic modeling and a Weighted Finite State Transducer (WFST) to compile sentence, word, and phoneme models. The system comprises a software backend search and an FPGA-based Gaussian calculation which are covered here. In this paper, we present an efficient pipelined design implemented both as an embedded peripheral and as a scalable, parallel hardware accelerator. Both architectures have been implemented on an Alpha Data XRC-5T1, reconfigurable computer housing a Virtex 5 SX95T FPGA. The core has been tested and is capable of calculating a full set of Gaussian results from 3825 acoustic models in 9.03 ms which coupled with a backend search of 5000 words has provided an accuracy of over 80%. Parallel implementations have been designed with up to 32 cores and have been successfully implemented with a clock frequency of 133 MHz.

show abstract

Section: Software Performancementioning

confidence: 99%

Section: Fpga Implementationmentioning

confidence: 99%

“…The first is based on a buffered input system and is new for this paper. The second was first proposed in [12] and uses parallel acoustic models on a single Acoustic Observation Vector.…”

Section: 2mentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

FPGA Implementation of a Pipelined Gaussian Calculation for HMM-Based Large Vocabulary Speech Recognition

Veitch

Aubert

Woods

et al. 2011

International Journal of Reconfigurable Computing

Self Cite

View full text Add to dashboard Cite

show abstract

“…The recognition of large vocabulary and continuous speech requires complicated algorithms with huge amounts of calculations, large quantities of memory [3], [4]. This can result in enlarged power consumption, longer recognition time and higher recognition error rate.…”

Section: Introductionmentioning

confidence: 99%

Upgrading FPGA Implementation of Isolated Word Recognition System for a Real-Time Operation

Sledevič

Tamulevičius

Navakauskas

2013

ElAEE

View full text Add to dashboard Cite

The article reports on the upgrading of the FPGA based isolated word recognition system for real-time tasks. All recognition system components (except some feature calculation steps) were implemented using VHDL. Some high precision calculations were implemented on soft core processor. The employed Dynamic time warping algorithm was speeded-up 2.8 times by restricting the calculated error matrix size. This enabled us to reduce the average word recognition time to 12.81 ms. Linear predictive coding, linear predictive coding cepstral and linear frequency cepstral coefficients feature analyses were investigated for 100 Lithuanian word recognition. In speaker dependent experiments linear predictive coding cepstral analysis gave the highest average recognition rate of 95 % and the highest robustness to white noise in speech. 15 dB noise level lowered average recognition rate to 86.2 %. Index Terms-Cepstral analysis, dynamic time warping, field programmable gate array, intellectual property core, isolated word recognition, linear predictive coefficients.

show abstract

Improved parameterized efficient FPGA implementations of parallel 1-D filtering algorithms using Xilinx System Generator

Hasan

Boussakta

Yakovlev

2010

The 10th IEEE International Symposium on Signal Processing and Information Technology

View full text Add to dashboard Cite

Acceleration of HMM-based speech recognition system by parallel FPGA Gaussian calculation

Cited by 5 publications

References 8 publications

FPGA Implementation of a Pipelined Gaussian Calculation for HMM-Based Large Vocabulary Speech Recognition

FPGA Implementation of a Pipelined Gaussian Calculation for HMM-Based Large Vocabulary Speech Recognition

Upgrading FPGA Implementation of Isolated Word Recognition System for a Real-Time Operation

Improved parameterized efficient FPGA implementations of parallel 1-D filtering algorithms using Xilinx System Generator

Contact Info

Product

Resources

About