Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Langua 2015
DOI: 10.3115/v1/n15-1038
|View full text |Cite
|
Sign up to set email alerts
|

Lexicon-Free Conversational Speech Recognition with Neural Networks

Abstract: We present an approach to speech recognition that uses only a neural network to map acoustic input to characters, a character-level language model, and a beam search decoding procedure. This approach eliminates much of the complex infrastructure of modern speech recognition systems, making it possible to directly train a speech recognizer using errors generated by spoken language understanding tasks. The system naturally handles out of vocabulary words and spoken word fragments. We demonstrate our approach usi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
85
0
1

Year Published

2017
2017
2019
2019

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 119 publications
(87 citation statements)
references
References 13 publications
1
85
0
1
Order By: Relevance
“…In contrast to this, we follow recent work [22,18,23,19] where a neural network learns context-dependence implicitly.…”
Section: Relation To Prior Workmentioning
confidence: 99%
See 2 more Smart Citations
“…In contrast to this, we follow recent work [22,18,23,19] where a neural network learns context-dependence implicitly.…”
Section: Relation To Prior Workmentioning
confidence: 99%
“…Our approach is most similar to the CTC methods of [20,28,19,18,17]. In contrast to [20,28,17], we use a ReLU-RNN rather than an LSTM, and find it to be effective and much faster.…”
Section: Relation To Prior Workmentioning
confidence: 99%
See 1 more Smart Citation
“…There have been interesting developments in the creation of 'end-to-end' speech recognition systems [5,20] that create joint audio and language models, often working with the character as basic unit. In contrast to these systems, we explore the use of characters in a 'conventional' state-of-the-art LVCSR system.…”
Section: Subwords To the Extreme: Charactersmentioning
confidence: 99%
“…domain-dependent) manner. [9][10][11][12][13] To reduce the limitations of the statistical approach, we employ a representational approach based on the focus tree model of attentional information in human-machine interaction. Various adaptations of this model were successfully applied in several prototypical conversational agents for the purposes of natural language understanding and dialogue management.…”
Section: The Methodological Aspectmentioning
confidence: 99%