Speech segmentation using probabilistic phonetic feature hierarchy and support vector machines

Juneja, Amit; Espy‐Wilson, Carol Y.

doi:10.1109/ijcnn.2003.1223445

Cited by 30 publications

(15 citation statements)

References 9 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Many works reported cases regarding distinctive feature-based speech recognition framework, ranging from studying and proposing measurable acoustic parameters (APs) to proposing and evaluating complete distinctive feature-based speech recognition tasks in limited domains [17][18][19][20], which yielded satisfactory results. Juneja and Espy-Wilson [21][22][23] also proposed acoustic parameters (APs) for classifying speech signals into defined manner classes. They also proposed a segmentation algorithm, and complete event-based speech recognition on a limited domain, respectively.…”

Section: Distinctive Features (Phonetic Approach)mentioning

confidence: 99%

“…"Speech", "Sonorant", "Syllabic" and "Continuant", which were adopted from the researches of Juneja and Espy-Wilson [21,22] [-Continuant] indicates that there is a narrow constriction blocking the air stream in the oral cavity while uttering the sound. We can combine manners of articulation into a hierarchical structure to classify phones into "broad classes" such as silence, vowels, sonorant consonants, fricatives, and stop consonants, as shown in Fig.…”

Section: Distinctive Features (Phonetic Approach)mentioning

confidence: 99%

“…5. A hierarchical structure of speech manners [21,22]. Each label at the branch represents the binary value of the parent manner and its associated probability.…”

Section: Distinctive Features (Phonetic Approach)mentioning

confidence: 99%

See 2 more Smart Citations

Acoustic-Phonetic Approaches for Improving Segment-Based Speech Recognition for Large Vocabulary Continuous Speech

Likitsupin

Punyabukkana

Wutiwiwatchai

et al. 2016

View full text Add to dashboard Cite

Abstract. Segment-based speech recognition has shown to be a competitive alternative to the state-of-theart HMM-based techniques. Its accuracies rely heavily on the quality of the segment graph from which the recognizer searches for the most likely recognition hypotheses. In order to increase the inclusion rate of actual segments in the graph, it is important to recover possible missing segments generated by segmentbased segmentation algorithm. An aspect of this research focuses on determining the missing segments due to missed detection of segment boundaries. The acoustic discontinuities, together with manner-distinctive features are utilized to recover the missing segments. Another aspect of improvement to our segment-based framework tackles the restriction of having limited amount of training speech data which prevents the usage of more complex covariance matrices for the acoustic models. Feature dimensional reduction in the form of the Principal Component Analysis (PCA) is applied to enable the training of full covariance matrices and it results in improved segment-based phoneme recognition. Furthermore, to benefit from the fact that segment-based approach allows the integration of phonetic knowledge, we incorporate the probability of each segment being one type of sound unit of a certain specific common manner of articulation into the scoring of the segment graphs. Our experiment shows that, with the proposed improvements, our segment-based framework approximately increases the phoneme recognition accuracy by approximately 25% of the one obtained from the baseline segment-based speech recognition.

show abstract

Section: Distinctive Features (Phonetic Approach)mentioning

confidence: 99%

Section: Distinctive Features (Phonetic Approach)mentioning

confidence: 99%

See 1 more Smart Citation

Acoustic-Phonetic Approaches for Improving Segment-Based Speech Recognition for Large Vocabulary Continuous Speech

Likitsupin

Punyabukkana

Wutiwiwatchai

et al. 2016

View full text Add to dashboard Cite

show abstract

“…Discovering particular section of a speech as a meaningful unit presumes recognition of that unit; on the contrary, the recognition of the unit is possible only after segmentation. While the former approach is called top-down segmentation [5], the latter approach is called the bottom-up segmentation [6]. Generally, top-down and bottom-up approaches are integrated to harness the strengths of both approaches [7], thereby increasing the performance of the system.…”

Section: Introductionmentioning

confidence: 99%

“…Thus, it employs the information from both segmentation and classification; and eventually generates appropriate segments and labels of the continuous speech signal [8]. Alternatively, sequential segmentation and recognition approach, generates segments from the acoustic cues independent of the labels, which are then fed to the classifier to identify the labels ( [6], [9]). …”

Section: Introductionmentioning

confidence: 99%

Automatic Speech Segmentation and Recognition using Class-Specific Features

Rekha¹,

Chatrapati²,

Babu³

2015

IJCA

View full text Add to dashboard Cite

The class-specific automatic speech recognition systems construct an individual classifier for each class based on its own feature set, wherein the feature set for each class is selected such that it distinguishes that class from the other classes most accurately. Consequently, different feature set sequences must be fed into each of the classifiers, and the output of each of the classifiers must be combined to predict the actual class of the observation sequences. However, speech is continuous, and to be able to apply class-specific features, speech should be segmented and fed to the classifiers, which requires the identification of segmentation cues. This paper proposes a framework that jointly segments, and combines the output of the class-specific classifiers in the absence of any segmentation cues using a recursive formulation.

show abstract

A New Hierarchical Decision Structure Using Wavelet Packet and SVM for Brazilian Phonemes Recognition

Bresolin

Neto

Alsina

2006

Neural Information Processing

View full text Add to dashboard Cite

Speech segmentation using probabilistic phonetic feature hierarchy and support vector machines

Cited by 30 publications

References 9 publications

Acoustic-Phonetic Approaches for Improving Segment-Based Speech Recognition for Large Vocabulary Continuous Speech

Acoustic-Phonetic Approaches for Improving Segment-Based Speech Recognition for Large Vocabulary Continuous Speech

Automatic Speech Segmentation and Recognition using Class-Specific Features

A New Hierarchical Decision Structure Using Wavelet Packet and SVM for Brazilian Phonemes Recognition

Contact Info

Product

Resources

About