Thomas Friedrichs scite author profile

Thomas Friedrichs

5Publications

94Citation Statements Received

107Citation Statements Given

How they've been cited

110

How they cite others

107

Affiliations

Robert Bosch (United States), Oldenburger Institut für Informatik, National University of Singapore

Publications

Order By: Most citations

DynaEval: Unifying Turn and Dialogue Level Evaluation

Zhang¹,

Chen²,

D’Haro³

et al. 2021

View full text Add to dashboard Cite

A dialogue is essentially a multi-turn interaction among interlocutors. Effective evaluation metrics should reflect the dynamics of such interaction. Existing automatic metrics are focused very much on the turn-level quality, while ignoring such dynamics. To this end, we propose DynaEval 1 , a unified automatic evaluation framework which is not only capable of performing turn-level evaluation, but also holistically considers the quality of the entire dialogue. In DynaEval, the graph convolutional network (GCN) is adopted to model a dialogue in totality, where the graph nodes denote each individual utterance and the edges represent the dependency between pairs of utterances. A contrastive loss is then applied to distinguish well-formed dialogues from carefully constructed negative samples. Experiments show that DynaEval significantly outperforms the state-of-the-art dialogue coherence model, and correlates strongly with human judgements across multiple dialogue evaluation aspects at both turn and dialogue level.

show abstract

Deep AM-FM: Toolkit for Automatic Dialogue Evaluation

Zhang

D’Haro

Banchs

et al. 2020

View full text Add to dashboard Cite

COMPANION -- Towards Co-operative Platoon Management of Heavy-Duty Vehicles

Eilers¹,

Mårtensson²,

Pettersson³

et al. 2015

View full text Add to dashboard Cite

MDD-Eval: Self-Training on Augmented Data for Multi-Domain Dialogue Evaluation

Zhang

D’Haro

Friedrichs³

et al. 2022

AAAI

View full text Add to dashboard Cite

Chatbots are designed to carry out human-like conversations across different domains, such as general chit-chat, knowledge exchange, and persona-grounded conversations. To measure the quality of such conversational agents, a dialogue evaluator is expected to conduct assessment across domains as well. However, most of the state-of-the-art automatic dialogue evaluation metrics (ADMs) are not designed for multi-domain evaluation. We are motivated to design a general and robust framework, MDD-Eval, to address the problem. Specifically, we first train a teacher evaluator with human-annotated data to acquire a rating skill to tell good dialogue responses from bad ones in a particular domain and then, adopt a self-training strategy to train a new evaluator with teacher-annotated multi-domain data, that helps the new evaluator to generalize across multiple domains. MDD-Eval is extensively assessed on six dialogue evaluation benchmarks. Empirical results show that the MDD-Eval framework achieves a strong performance with an absolute improvement of 7% over the state-of-the-art ADMs in terms of mean Spearman correlation scores across all the evaluation benchmarks.

show abstract

MDD-Eval: Self-Training on Augmented Data for Multi-Domain Dialogue Evaluation

Zhang¹,

D’Haro²,

Friedrichs³

et al. 2021

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.