Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies Short Pa 2008
DOI: 10.3115/1557690.1557736
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised learning of acoustic sub-word units

Abstract: Accurate unsupervised learning of phonemes of a language directly from speech is demonstrated via an algorithm for joint unsupervised learning of the topology and parameters of a hidden Markov model (HMM); states and short state-sequences through this HMM correspond to the learnt sub-word units. The algorithm, originally proposed for unsupervised learning of allophonic variations within a given phoneme set, has been adapted to learn without any knowledge of the phonemes. An evaluation methodology is also propo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
50
0

Year Published

2009
2009
2018
2018

Publication Types

Select...
5
2
1

Relationship

2
6

Authors

Journals

citations
Cited by 70 publications
(50 citation statements)
references
References 4 publications
0
50
0
Order By: Relevance
“…Speaker independence remains a major stumbling block [1] and improving it can be tackled in any of these three components. Given limited success of core recognition architectures in the zero resource setting, several alternative acoustic frontends and unsupervised acoustic models have been proposed in recent years [2,3,4,5,1,6,7,8,9,10], though there has been limited effort to evaluate these methods in a systematic way. Lexical discovery is the process of automatically identifying meaningful word-sized units from speech.…”
Section: Introductionmentioning
confidence: 99%
“…Speaker independence remains a major stumbling block [1] and improving it can be tackled in any of these three components. Given limited success of core recognition architectures in the zero resource setting, several alternative acoustic frontends and unsupervised acoustic models have been proposed in recent years [2,3,4,5,1,6,7,8,9,10], though there has been limited effort to evaluate these methods in a systematic way. Lexical discovery is the process of automatically identifying meaningful word-sized units from speech.…”
Section: Introductionmentioning
confidence: 99%
“…In these approaches, the transcription into labeled phones, syllables or words assumes a prior definition of these categories-even if the "semi-supervision" is only used to initialize a model that is then refined in an unsupervised fashion (cf. Ljolje et al 1997;Toledano et al 2003;Varadarajan et al 2008). …”
Section: Semi-supervised Approachesmentioning
confidence: 99%
“…We apply one such algorithm by Varadarajan et al [11], called the modified successive state splitting (SSS) algorithm, to our problem. We begin with a single-state HMM for each surgeme, and iteratively estimate the HMM parameters and increment the number of HMM states via SSS .…”
Section: Data-derived Hmm Topologiesmentioning
confidence: 99%