Proceedings of the Third Conference on Applied Natural Language Processing - 1992
DOI: 10.3115/974499.974529

A practical methodology for the evaluation of spoken language systems

Cited by 5 publications (7 citation statements)
References 14 publications
“…They will be manageable if interruption points can be discerned in real time using acoustic information (Nakatani and Hirschberg, 1994). [Parsing Utterances Including Self-Repairs] It is relatively easy to evaluate technologies such as morphological analysis and information retrieval in objective and empirical terms, because unique solutions can be defined for such tasks. Such an evaluation will almost necessarily be a black-box evaluation, as in ATIS (Boisen and Bates, 1992), TREC (Harman, 1995), MUC (MUC, 1991), and so forth. To advance research on dialogue systems, there should be some empirical method for evaluating them.…”
Section: Discussion
confidence: 99%
“…Boisen and Bates developed a methodology based on the collective experiences of BBN's participation in the DARPA projects [15]. Their methodology analyzed many domain-specific evaluation methods to create a general framework for characterizing the evaluation of dialogue systems.…”
Section: Evaluating Dialogue Systems
confidence: 99%
“…The recognition work, combined with the work in this chapter, is ultimately designed to create a complete running CopyCat system for deployment and testing with live children. There has been a great deal of research on evaluating live system testing for dialogue systems designed using Wizard of Oz systems [15,66,72,114,148]. There is a tension between domain-specific criteria, intermediary evaluation and metrics, human judgement, and the input/output mapping of the final system.…”
Section: Introduction
confidence: 99%
“…Other evaluations in the same tradition include the ATIS [2] and TREC [10] evaluations, the first in the domain of database query, emphasizing a spoken language component, the second in the domain of text retrieval. A third set of evaluations, the MUC evaluations of fact extraction systems, is reported by Cowie and Lehnert in this issue and in detail in [18]. The test material for these other evaluations in the ARPA tradition is, however, critically different.…”
Section: Some Past Evaluations
confidence: 99%