Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Confer 2021
DOI: 10.18653/v1/2021.acl-long.96
|View full text |Cite
|
Sign up to set email alerts
|

Can vectors read minds better than experts? Comparing data augmentation strategies for the automated scoring of children’s mindreading ability

Abstract: In this paper we implement and compare 7 different data augmentation strategies for the task of automatic scoring of children's ability to understand others' thoughts, feelings, and desires (or "mindreading").We recruit in-domain experts to re-annotate augmented samples and determine to what extent each strategy preserves the original rating. We also carry out multiple experiments to measure how much each augmentation strategy improves the performance of automatic scoring systems. To determine the capabilities… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 8 publications
(8 citation statements)
references
References 24 publications
0
6
0
Order By: Relevance
“…This augmentation adds noise in the form of colloquial filler phrases. 23 di erent phrases are chosen across 3 di erent categories: general filler words and phrases ("uhm", "err", "actually", "like", "you know"...), phrases emphasizing speaker opinion/mental state ("I think/believe/mean", "I would say"...) & phrases indicating uncertainty ("maybe", "perhaps", "probably", "possibly", "most likely").The la er two categories had shown promising results Kovatchev et al (2021) when they were concatenated at the beginning of the sentence unlike this implementation which perform insertions at any random positions. Filler words are based on the work of Laserna et al (2014) but have not been explored in the context of data augmentation.…”
Section: A40 Filler Word Augmentationmentioning
confidence: 99%
“…This augmentation adds noise in the form of colloquial filler phrases. 23 di erent phrases are chosen across 3 di erent categories: general filler words and phrases ("uhm", "err", "actually", "like", "you know"...), phrases emphasizing speaker opinion/mental state ("I think/believe/mean", "I would say"...) & phrases indicating uncertainty ("maybe", "perhaps", "probably", "possibly", "most likely").The la er two categories had shown promising results Kovatchev et al (2021) when they were concatenated at the beginning of the sentence unlike this implementation which perform insertions at any random positions. Filler words are based on the work of Laserna et al (2014) but have not been explored in the context of data augmentation.…”
Section: A40 Filler Word Augmentationmentioning
confidence: 99%
“…Data Generation Recent years have seen an increasing interest in the use of data generation and data augmentation for various NLP tasks (Liu, Swayamdipta, Smith and Choi, 2022;Hartvigsen, Gabriel, Palangi, Sap, Ray and Kamar, 2022;Dhole, Gangal, Gehrmann, Gupta, Li, Mahamood, Mahendiran, Mille, Srivastava, Tan et al, 2021;Kovatchev, Smith, Lee and Devine, 2021). The use of synthetic data has not been extensively explored in the context of factchecking.…”
Section: Challengesmentioning
confidence: 99%
“…The in-domain dictionary contains all keywords identified by Chapparo et al and maps each keyword to its substitutes, e.g., affect → {break, block, close, ...}. Domain knowledge guided operators have been recently shown to lead to better performance compared to more advanced but general approaches (e.g., embeddings) [25].…”
Section: Natural Language Da Operatorsmentioning
confidence: 99%