Reina Akama scite author profile

Reina Akama

5 Publications

67 Citation Statements Received

142 Citation Statements Given

Affiliations

Tohoku University

Publications

Filtering Noisy Dialogue Corpora by Connectivity and Content Relatedness

Akama, Yokoi, Suzuki, et al. 2020

Large-scale dialogue datasets have recently become available for training neural dialogue agents. However, these datasets have been reported to contain a non-negligible number of unacceptable utterance pairs. In this paper, we propose a method for scoring the quality of utterance pairs in terms of their connectivity and relatedness. The proposed scoring method is designed based on findings widely shared in the dialogue and linguistics research communities. We demonstrate that it has a relatively good correlation with the human judgment of dialogue quality. Furthermore, the method is applied to filter out potentially unacceptable utterance pairs from a large-scale noisy dialogue corpus to ensure its quality. We experimentally confirm that training data filtered by the proposed method improves the quality of neural dialogue agents in response generation.

Word Rotator’s Distance

Yokoi, Takahashi, Akama, et al. 2020

One key principle for assessing textual similarity is measuring the degree of semantic overlap between two texts by considering the word alignment. Such alignment-based approaches are both intuitive and interpretable; however, they are empirically inferior to the simple cosine similarity between general-purpose sentence vectors. We focus on the fact that the norm of word vectors is a good proxy for word importance, and the angle of them is a good proxy for word similarity. Alignment-based approaches do not distinguish the norm and direction, whereas sentence-vector approaches automatically use the norm as the word importance. Accordingly, we propose decoupling word vectors into their norm and direction then computing the alignment-based similarity with the help of earth mover's distance (optimal transport), which we refer to as word rotator's distance. Furthermore, we demonstrate how to "grow" the norm and direction of word vectors (vector converter); this is a new systematic approach derived from the sentence-vector estimation methods, which can significantly improve the performance of the proposed method. On several STS benchmarks, the proposed methods outperform not only alignment-based approaches but also strong baselines.
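The decoupling described in this abstract can be written out as a small optimal-transport problem. The following is a sketch under assumptions, not the paper's exact formulation: the symbols, the normalization of norms into probability masses, and the choice of cosine distance as the transport cost are not stated in the abstract and are introduced here for illustration.

```latex
% Sentences s = (w_1, ..., w_n) and s' = (w'_1, ..., w'_m), given as word vectors.
% Decouple each word vector into a norm (importance) and a direction (meaning):
\lambda_i = \lVert w_i \rVert, \qquad u_i = \frac{w_i}{\lVert w_i \rVert}

% Assumption: norms are normalized so that each sentence carries total mass 1.
m_i = \frac{\lambda_i}{\sum_{k=1}^{n} \lambda_k}, \qquad
m'_j = \frac{\lambda'_j}{\sum_{k=1}^{m} \lambda'_k}

% Assumption: the cost of aligning two words is their cosine distance.
c_{ij} = 1 - u_i^{\top} u'_j

% Earth mover's distance between the two weighted sets of directions:
\mathrm{WRD}(s, s') = \min_{T \ge 0} \sum_{i=1}^{n} \sum_{j=1}^{m} T_{ij}\, c_{ij}
\quad \text{s.t.} \quad \sum_{j=1}^{m} T_{ij} = m_i, \qquad \sum_{i=1}^{n} T_{ij} = m'_j
```

Under this reading, a word with a large norm must ship more mass and so contributes more to the distance, while the angle between two words sets how expensive it is to align them.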

Unsupervised Learning of Style-sensitive Word Vectors

Akama, Watanabe, Yokoi, et al. 2018

This paper presents the first study aimed at capturing stylistic similarity between words in an unsupervised manner. We propose extending the continuous bag of words (CBOW) model (Mikolov et al., 2013a) to learn style-sensitive word vectors using a wider context window under the assumption that the style of all the words in an utterance is consistent. In addition, we introduce a novel task to predict lexical stylistic similarity and to create a benchmark dataset for this task. Our experiment with this dataset supports our assumption and demonstrates that the proposed extensions contribute to the acquisition of style-sensitive word embeddings.

arXiv:1805.05581v1 [cs.CL]
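To make the "wider context window" idea concrete, here is a minimal sketch of how CBOW training examples change when the window grows from a few neighbouring words to the whole utterance. The function names, the `window=None` convention for "whole utterance", and the sample sentence are all hypothetical; the abstract does not say how the paper builds its contexts, so this only illustrates the general idea.

```python
from typing import List, Optional, Tuple


def context(tokens: List[str], i: int, window: Optional[int]) -> List[str]:
    """Return the context words around tokens[i].

    window=k keeps up to k words on each side (standard CBOW).
    window=None keeps every other word in the utterance (assumed
    here to stand in for the "wider context window").
    """
    if window is None:
        lo, hi = 0, len(tokens)
    else:
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
    return [tokens[j] for j in range(lo, hi) if j != i]


def cbow_examples(
    tokens: List[str], window: Optional[int]
) -> List[Tuple[List[str], str]]:
    """Build (context, target) pairs, one per position in the utterance."""
    return [(context(tokens, i, window), tokens[i]) for i in range(len(tokens))]


# Hypothetical utterance used only for illustration.
utterance = "well i reckon that is mighty fine".split()
near = cbow_examples(utterance, window=2)     # local, mostly syntactic/semantic cues
wide = cbow_examples(utterance, window=None)  # utterance-level cues, assumed stylistic
```

With the near window, the target "well" is predicted from two neighbours only; with the utterance-wide window it is predicted from every other word, which is where a consistent style across the utterance could be picked up.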

Evaluating Dialogue Generation Systems via Response Selection

Sato, Akama, Ouchi, et al. 2020

Existing automatic evaluation metrics for open-domain dialogue response generation systems correlate poorly with human evaluation. We focus on evaluating response generation systems via response selection. To evaluate systems properly via response selection, we propose a method to construct response selection test sets with well-chosen false candidates. Specifically, we propose to construct test sets filtering out some types of false candidates: (i) those unrelated to the ground-truth response and (ii) those acceptable as appropriate responses. Through experiments, we demonstrate that evaluating systems via response selection with the test set developed by our method correlates more strongly with human evaluation, compared with widely used automatic evaluation metrics such as BLEU.

Dialogue System Live Competition: Identifying Problems with Dialogue Systems Through Live Event

Higashinaka, Funakoshi, Inaba, et al. 2021
