2006
DOI: 10.1007/11872436_20

A Discriminative Model of Stochastic Edit Distance in the Form of a Conditional Transducer

Abstract: Many real-world applications such as spell-checking or DNA analysis use the Levenshtein edit distance to compute similarities between strings. In practice, the costs of the primitive edit operations (insertion, deletion and substitution of symbols) are generally hand-tuned. In this paper, we propose an algorithm to learn these costs. The underlying model is a probabilistic transducer, computed by using grammatical inference techniques, that allows us to learn both the structure and the probabilities o…
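The hand-tuned cost setting the abstract contrasts with can be illustrated by the standard dynamic-programming edit distance, where the three operation costs are explicit parameters chosen by the practitioner (a minimal sketch; the default weights below are illustrative, not taken from the paper):

```python
def edit_distance(x, y, ins=1.0, dele=1.0, sub=1.0):
    """Levenshtein edit distance with hand-tunable operation costs."""
    m, n = len(x), len(y)
    # d[i][j] = minimal cost of editing x[:i] into y[:j]
    d = [[0.0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        d[i][0] = d[i - 1][0] + dele
    for j in range(1, n + 1):
        d[0][j] = d[0][j - 1] + ins
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            same = 0.0 if x[i - 1] == y[j - 1] else sub
            d[i][j] = min(d[i - 1][j] + dele,      # delete x[i-1]
                          d[i][j - 1] + ins,       # insert y[j-1]
                          d[i - 1][j - 1] + same)  # substitute or match
    return d[m][n]
```

The paper's contribution is to replace these fixed scalar costs with probabilities estimated from data.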

Cited by 6 publications (9 citation statements)
References 10 publications
“…In the context of probabilistic machines, the maximization of the likelihood is often used. In this paper, we follow the same idea that explains why we are interested in learning string edit similarities in a probabilistic context rather than learning a true edit metric 1 . In our approach, we aim to learn a conditional (or discriminative) model that takes into account information about the input string X.…”
Section: Definitions and Notations
confidence: 99%
“…To deal with such situations, non-memoryless approaches have been proposed in the literature in the form of probabilistic state machines that are able to take into account the string context. They are mainly based on pair-Hidden Markov Models (pair-HMM) [2,6], probabilistic deterministic automata [1], or stochastic transducers [7]. The string context is described in each state by a statistical distribution over the edit operations.…”
Section: Introduction
confidence: 99%
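As a point of contrast with the stateful models described in this excerpt, a memoryless stochastic edit model assigns each primitive operation a fixed probability (independent of context) and sums over all edit scripts with a forward-style dynamic program (a minimal sketch under that memoryless assumption; the parameter names and toy probabilities are illustrative):

```python
def joint_prob(x, y, p_ins, p_del, p_sub, p_end):
    """P(x, y) under a memoryless stochastic edit model:
    sum of the probabilities of all edit scripts turning x into y."""
    m, n = len(x), len(y)
    # a[i][j] = total probability of generating (x[:i], y[:j])
    a = [[0.0] * (n + 1) for _ in range(m + 1)]
    a[0][0] = 1.0
    for i in range(m + 1):
        for j in range(n + 1):
            if i > 0:
                a[i][j] += a[i - 1][j] * p_del.get(x[i - 1], 0.0)
            if j > 0:
                a[i][j] += a[i][j - 1] * p_ins.get(y[j - 1], 0.0)
            if i > 0 and j > 0:
                a[i][j] += a[i - 1][j - 1] * p_sub.get((x[i - 1], y[j - 1]), 0.0)
    return a[m][n] * p_end
```

In the non-memoryless machines the excerpt cites, these per-operation probabilities would instead depend on the current state, so the distribution over edits changes with the string context.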
“…Most training algorithms for learning FSTs rely on gradient-based or EM optimizations which can be computationally expensive and suffer from local optima issues [8,10]. There are also methods that are based on grammar induction techniques [5,3], which have the advantage of inferring both the structure of the model and the parameters.…”
Section: Introduction
confidence: 99%
“…3 . Table 3 shows the time it takes to complete an iteration under EM, together with the number of iterations it takes to reach the best error rates at tests.…”
confidence: 99%
“…This approach has shown its efficiency in handwritten digit recognition [16] and has been recently extended to tree-structured data [10,14]. Note that non memoryless models have been proposed during the past few years [11,13,15]. While these approaches allow us to model more complex phenomena, their understandability is more complicated because the knowledge is captured in several matrices of finite state machines.…”
Section: Introduction
confidence: 99%