2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU)
DOI: 10.1109/asru.2007.4430156

An algorithm for fast composition of weighted finite-state transducers

Abstract: In automatic speech recognition based on weighted finite-state transducers, a static decoding graph HC ∘ L ∘ G is typically constructed. In this work, we first show how the size of the decoding graph can be reduced and the necessity of determinizing it can be eliminated by removing the ambiguity associated with transitions to the backoff state or states in G. We then show how the static construction can be avoided entirely by performing fast on-the-fly composition of HC and L ∘ G. We demonstrate that speech recognit…
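To make the on-the-fly construction concrete: a lazy composition never builds the full product of HC and L ∘ G; it creates pair states and their arcs only when the decoder first visits them. Below is a minimal, epsilon-free sketch in Python (the `Arc`/`LazyCompose` representation and names are illustrative assumptions, not the paper's implementation):

```python
from collections import namedtuple

# Toy WFST representation: a dict mapping state -> list of arcs.
# Weights are tropical (negative log probabilities, combined by +).
Arc = namedtuple("Arc", "ilabel olabel weight nextstate")

class LazyCompose:
    """On-demand composition of two transducers (epsilon-free sketch).

    States of the composed machine are pairs (s1, s2); their outgoing
    arcs are built only when the decoder first asks for them, so the
    full product graph is never materialized.
    """
    def __init__(self, t1, t2, start1, start2):
        self.t1, self.t2 = t1, t2
        self.start = (start1, start2)
        self._cache = {}  # (s1, s2) -> list of composed arcs

    def arcs(self, state):
        if state in self._cache:
            return self._cache[state]
        s1, s2 = state
        out = []
        for a1 in self.t1.get(s1, []):
            for a2 in self.t2.get(s2, []):
                if a1.olabel == a2.ilabel:  # match T1 output to T2 input
                    out.append(Arc(a1.ilabel, a2.olabel,
                                   a1.weight + a2.weight,
                                   (a1.nextstate, a2.nextstate)))
        self._cache[state] = out
        return out
```

A decoder then expands only the pair states its search beam actually reaches. Epsilon arcs, such as the back-off transitions in G, complicate the matching step; see the composition-filter excerpt below.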

Cited by 12 publications (12 citation statements) · References 20 publications

“…Component n-gram probabilities and interpolation weights will be applied for each context on request during decoding. Similar approaches have been previously shown to be effective for the composition between one single back-off n-gram LM and a lexicon transducer (Caseiro and Trancoso, 2006; Cheng et al., 2007; McDonough et al., 2007; Oonishi et al., 2009).…”
Section: Decoding With Context-Dependent LM Interpolation
confidence: 79%
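For context, "applied for each context on request" refers to the standard back-off recursion of an n-gram model, evaluated lazily during decoding rather than baked into a static graph. A hedged sketch (the toy tables and log-probability values are illustrative):

```python
# Toy back-off bigram model: probs maps n-gram tuples to log10
# probabilities; backoff maps a history to its back-off weight.
probs = {("the", "cat"): -0.5, ("cat",): -2.0, ("the",): -1.2}
backoff = {("the",): -0.3}

def logprob(history, word):
    """Score `word` given `history`, following back-off steps as needed.

    In the WFST picture, each back-off step is an epsilon arc from the
    current context state of G to a lower-order context state; that arc
    is exactly the ambiguity the paper removes from the decoding graph.
    """
    ngram = history + (word,)
    if ngram in probs:
        return probs[ngram]
    if not history:
        return probs.get((word,), -99.0)  # floor for unseen unigrams
    # Back off: pay the back-off weight, shorten the context, retry.
    return backoff.get(history, 0.0) + logprob(history[1:], word)

print(logprob(("the",), "cat"))  # seen bigram: -0.5
print(logprob(("the",), "dog"))  # backs off:   -0.3 + (-99.0)
```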
“…Finally, M1 is composed on-the-fly with a hidden Markov model H along with the cross-word computation. This was initially described by Hori et al. [22], [23] and further improved by McDonough et al. [24] and Allauzen et al. [25], [26]. The probability for a phoneme sequence given a sequence of speech features can be computed [27] using a token-passing time-synchronous Viterbi beam search [28].…”
Section: Recognition On The Server
confidence: 99%
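A token-passing time-synchronous Viterbi beam search over an on-the-fly composed graph can be sketched as follows (a simplified illustration, not the cited systems' decoder; the `fst.arcs`/`fst.start` interface from the lazy-composition sketch above and the frame-score layout are assumptions):

```python
import heapq

def beam_search(fst, frame_scores, beam_width=1000):
    """Time-synchronous Viterbi beam search with token passing.

    `fst` exposes .start and .arcs(state); `frame_scores[t][label]` is
    the acoustic cost of input label `label` at frame t. A token is a
    (cost, output-labels) pair attached to a state; at each frame only
    the best token per state is kept (Viterbi recombination), and only
    the `beam_width` cheapest tokens survive pruning.
    """
    tokens = {fst.start: (0.0, [])}
    for frame in frame_scores:
        new_tokens = {}
        for state, (cost, out) in tokens.items():
            for arc in fst.arcs(state):
                if arc.ilabel not in frame:
                    continue  # no acoustic evidence for this label
                c = cost + arc.weight + frame[arc.ilabel]
                o = out + [arc.olabel] if arc.olabel else out  # skip epsilons
                best = new_tokens.get(arc.nextstate)
                if best is None or c < best[0]:  # keep best token per state
                    new_tokens[arc.nextstate] = (c, o)
        # Beam pruning: keep only the cheapest `beam_width` tokens.
        kept = heapq.nsmallest(beam_width, new_tokens.items(),
                               key=lambda kv: kv[1][0])
        tokens = dict(kept)
    return min(tokens.values(), key=lambda v: v[0]) if tokens else None
```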
“…If the new filter state is not the blocking state ⊥, a new transition is created from the filter-rewritten transitions (e′1, e′2) (line 14). If the destination state (n[e′1], n[e′2], q′3) has not been found previously, it is added to Q and inserted in S (lines 11–13). The composition algorithm presented here is available in the OpenFst library [3].…”
Section: Composition
confidence: 99%
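The excerpt describes the main loop of filtered composition: composed states are triples (s1, s2, q3) of two transducer states and a filter state, discovered states go into the set S, and states awaiting expansion go into the queue Q. Below is a structural sketch of that loop; the filter itself is passed in as a parameter, and a real epsilon filter (as defined by Allauzen et al. and implemented in OpenFst) is more involved than the trivial one shown:

```python
from collections import deque, namedtuple

Arc = namedtuple("Arc", "ilabel olabel weight nextstate")
BLOCK = object()  # the blocking filter state, written ⊥ in the excerpt

def compose(t1, t2, start1, start2, filter_step, q0=0):
    """Generic filtered composition (a sketch of the loop's structure).

    t1/t2 map states to arc lists. `filter_step(q3, a1, a2)` returns
    the next filter state for an arc pair, or BLOCK to reject it.
    """
    start = (start1, start2, q0)
    S = {start}            # states found so far
    Q = deque([start])     # states whose arcs are still to be built
    arcs = {}
    while Q:
        s1, s2, q3 = Q.popleft()
        out = arcs.setdefault((s1, s2, q3), [])
        for a1 in t1.get(s1, []):
            for a2 in t2.get(s2, []):
                q3_next = filter_step(q3, a1, a2)
                if q3_next is BLOCK:
                    continue  # filter rejects this pair of transitions
                dest = (a1.nextstate, a2.nextstate, q3_next)
                out.append(Arc(a1.ilabel, a2.olabel,
                               a1.weight + a2.weight, dest))
                if dest not in S:  # new destination: remember and enqueue
                    S.add(dest)
                    Q.append(dest)
    return arcs

def trivial_filter(q3, a1, a2):
    """Epsilon-free case: accept arcs whose labels match, keep q3 = 0."""
    return q3 if a1.olabel == a2.ilabel else BLOCK
```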
“…For some problems, it is possible to find equivalent inputs that will compose more efficiently, but it is not always possible or desirable to do so. This has especially been an issue in natural language processing applications and has led to special-purpose composition algorithms for use in speech recognition [6,7,11,15] and speech synthesis [2].…”
Section: Introduction
confidence: 99%