Edward W. D. Whittaker scite author profile

Edward W. D. Whittaker

5Publications

121Citation Statements Received

13Citation Statements Given

How they've been cited

159

121

How they cite others

Affiliations

Tokyo Institute of Technology, University of Cambridge

Publications

Order By: Most citations

The 1998 HTK system for transcription of conversational telephone speech

Hain

Woodland

Niesler

et al. 1999

View full text Add to dashboard Cite

This paper describes the 1998 HTK large vocabulary speech recognition system for conversational telephone speech as used in the NIST 1998 HubSE evaluation. Front-end and language modelling experiments conducted using various training and test sets from both the Switchboard and Callhome English corpora are presented. Our complete system includes reduced bandwidth analysis, sidebased cepstral feature normalisation, vocal tract length normalisation (VTLN), triphone and quinphone hidden Markov models (HMMs) built using speaker adaptive training (SAT), maximum likelihood linear regression (MLLR) speaker adaptation and a confidence score based system combination. A detailed description of the complete system together with experimental results for each stage of our multi-pass decoding scheme is presented. The word error rate obtained is almost 20% better than our 1997 system on the development set.

show abstract

Comparison of part-of-speech and automatically derived category-based language models for speech recognition

Niesler

Whittaker

Woodland

View full text Add to dashboard Cite

Automatic Sentence Segmentation of Speech for Automatic Summarization

Mrozinski

Whittaker

Chatain

et al.

View full text Add to dashboard Cite

This paper presents an automatic sentence segmentation method for an automatic speech summarization system. The segmentation method is based on combining word-and class-based statistical language models to predict sentence and non-sentence boundaries. We study both the performance of the sentence segmentation system itself and the effect of the segmentation on the summarization accuracy. The sentence segmentation is done by modelling the probability of a sentence boundary given a certain word history with language models trained on transcriptions and texts from several sources. The resulting segmented data is used as the input to an existing automatic summarization system to determine the effect it has on the summarization process. We conduct all our experiments with two types of evaluation data: broadcast news and lecture transcriptions. The automatic summarizations are created with different sentence segmentations and different summarization ratios (30% and 40%) and evaluated by comparing them to human-made summaries. We show that a proper sentence segmentation is essential to achieve good performance with an automatic summarization system.

show abstract

Language modelling for Russian and English using words and classes

Whittaker

Woodland

2003

Computer Speech & Language

View full text Add to dashboard Cite

CLEF2006 Question Answering Experiments at Tokyo Institute of Technology

Whittaker

Novak

Chatain

et al. 2007

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.