2006
DOI: 10.1007/11846406_79
|View full text |Cite
|
Sign up to set email alerts
|

Using Prosody for Automatic Sentence Segmentation of Multi-party Meetings

Abstract: Abstract. We explore the use of prosodic features beyond pauses, including duration, pitch, and energy features, for automatic sentence segmentation of ICSI meeting data. We examine two different approaches to boundary classification: score-level combination of independent language and prosodic models using HMMs, and feature-level combination of models using a boosting-based method (BoosTexter). We report classification results for reference word transcripts as well as for transcripts from a state-of-the-art a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2007
2007
2016
2016

Publication Types

Select...
4
2
1

Relationship

4
3

Authors

Journals

citations
Cited by 18 publications
(15 citation statements)
references
References 16 publications
0
15
0
Order By: Relevance
“…BoosTexter was initially designed for the task of text categorization, employment of this method for tasks related to DA segmentation was firstly presented in [9,19]. The method combines weak classifiers having a basic form of one-level decision trees (stumps) using confidence-rated predictions.…”
Section: Boosting-based Models (Boostexter)mentioning
confidence: 99%
“…BoosTexter was initially designed for the task of text categorization, employment of this method for tasks related to DA segmentation was firstly presented in [9,19]. The method combines weak classifiers having a basic form of one-level decision trees (stumps) using confidence-rated predictions.…”
Section: Boosting-based Models (Boostexter)mentioning
confidence: 99%
“…Several different approaches utilizing both textual (lexical or syntactic) and acoustic (prosodic) information have been proposed. The proposed techniques include hidden Markov models (HMMs) (Shriberg et al, 2000;Kim and Woodland, 2003), multilayer perceptrons (Warnke et al, 1997;Srivastava and Kubala, 2003), maximum entropy (Huang and Zweig, 2002;Liu et al, 2004), conditional random fields (Liu et al, 2005;Zimmermann, 2009), support vector machines (Akita et al, 2006;Magimai-Doss et al, 2007), and adaptive boosting (Zimmermann et al, 2006;Kolář et al, 2006b). Syntactic information has been used in (Roark et al, 2006;Favre et al, 2008).…”
Section: Related Workmentioning
confidence: 99%
“…In general, the information sources we use for Czech and English are similar. See [7] for a more detailed description of the features, along with an analysis of their contribution to sentence segmentation of Czech, and [8] for more information about English.…”
Section: Knowledge Sourcesmentioning
confidence: 99%