4th European Conference on Speech Communication and Technology (Eurospeech 1995) 1995
DOI: 10.21437/eurospeech.1995-188

New telephone speech corpora at CSLU

Cited by 89 publications
(15 citation statements)
References 0 publications
“…A one-hour subset of Switchboard has also been labeled with respect to stress-accent by two individuals not involved in the phonetic annotation. These individuals also labeled two and a half hours of stress-accent material from a separate (phonetically annotated) corpus, "OGI Stories" [6], containing hundreds of telephone monologues (of ca. 60-seconds each).…”
Section: Into the Wilds (Of Spontaneous Speech)
Citation type: mentioning
confidence: 99%
“…The ALPS transcription system was evaluated using spontaneous speech material from the Numbers95 corpus [1], collected and phonetically annotated (i.e., labeled and segmented) at the Oregon Graduate Institute. This corpus contains the numerical portion (mostly street addresses and phone numbers) of thousands of telephone dialogues and possesses a lexicon of 37 words and an inventory of 29 phonetic segments.…”
Section: Corpus Materials
Citation type: mentioning
confidence: 99%
“…The architecture of the TFM networks used for classification of the articulatory acoustic features was developed using a three-dimensional representation of the log-power-spectrum distributed across frequency and time that incorporates both the mean and variance of the energy distribution associated with multiple (typically, hundreds or thousands of) instances of a specific phonetic feature or segment derived from the phonetically annotated, OGI Stories-TS corpus [1]. Each phonetic-segment class was mapped to an array of articulatory phonetic features, and this map used to construct the spectrotemporal profile (STeP) for a given feature class.…”
Section: Spectro-temporal Profiles
Citation type: mentioning
confidence: 99%
“…It is more meaningful to test the three phone types on a real-world recognition task. Four thousand phonetically transcribed names are selected from the OGI Names Corpus [2] with balanced genders. One hundred test sets of perplexity 40 are constructed by randomly choosing ten male speaking names and ten female speaking names 100 times without replacement.…”
Section: Isolated Word Recognition
Citation type: mentioning
confidence: 99%
“…A. Ljolje [8] used more detailed contextual effects to derive a set of 19 left-context classes and 18 right-context classes. (2) The data-driven approach evaluates all contexts in the training data, and uses some distance measure with a clustering algorithm to split or merge the contexts to a specified number of generalized contexts. This usually uses an information-theoretic distance measure commonly employed with Hidden Markov models.…”
Section: Introduction
Citation type: mentioning
confidence: 99%