Proceedings of the Ninth Workshop on Innovative Use of NLP for Building Educational Applications 2014
DOI: 10.3115/v1/w14-1801
|View full text |Cite
|
Sign up to set email alerts
|

Automated Measures of Specific Vocabulary Knowledge from Constructed Responses ('Use These Words to Write a Sentence Based on this Picture')

Abstract: We describe a system for automatically scoring a vocabulary item type that asks test-takers to use two specific words in writing a sentence based on a picture. The system consists of a rule-based component and a machine learned statistical model which uses a variety of construct-relevant features. Specifically, in constructing the statistical model, we investigate if grammar, usage, and mechanics features developed for scoring essays can be applied to short answers, as in our task. We also explore new features… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
13
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 7 publications
(15 citation statements)
references
References 19 publications
(20 reference statements)
2
13
0
Order By: Relevance
“…In research closer to our own image-based work, Somasundaran and Chodorow (2014) analyze learner responses to a PDT where the responses were constrained by requiring the use of specific words. The pictures were annotated by experts, and the relevance of responses was calculated through the overlap of the response and annotation contents.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In research closer to our own image-based work, Somasundaran and Chodorow (2014) analyze learner responses to a PDT where the responses were constrained by requiring the use of specific words. The pictures were annotated by experts, and the relevance of responses was calculated through the overlap of the response and annotation contents.…”
Section: Related Workmentioning
confidence: 99%
“…Although much current work on analyzing learner language focuses on grammatical error detection and correction (e.g., Leacock et al, 2014), there is a growing body of work covering varying kinds of semantic analysis (e.g., Meurers et al, 2011;Bailey and Meurers, 2008;Dickinson, 2014, 2013;Petersen, 2010), including assessment-driven work (e.g., Somasundaran et al, 2015;Somasundaran and Chodorow, 2014). One goal of such work is to facilitate intelligent language tutors (ILTs) and language assessment tools that maximize communicative interaction, as suggested by research in second language instruction (cf.…”
Section: Introductionmentioning
confidence: 99%
“…In order to test if a given response tells a story that is relevant to the pictures in the prompt, we calculate the overlap of the content of the response and the content of the pictures similar to (Somasundaran and Chodorow, 2014). To facilitate this, each prompt is associated with a reference corpus containing a detailed description of each picture, and also an overall narrative that ties together the events in the pictures.…”
Section: Relevancementioning
confidence: 99%
“…Our work explores the other (complementary) dimensions of the test such as language use, content relevance and story development. Somasundaran and Chodorow (2014) construct features for awkward word usage and content relevance for a written vocabulary test which we adapt for our task. Discourse organization features have been employed for essay scoring of written essays in the expository and argumentative genre (Attali and Burstein, 2006).…”
Section: Introductionmentioning
confidence: 99%
“…In our own previous work Dickinson, 2016, 2013), we annotated a small set of PDT responses as correct or incorrect, with incorrect responses further labeled as errors of form or meaning. Somasundaran and Chodorow (2014) presented work on PDT re-sponses in which respondents used provided vocabulary words. Responses were manually annotated on a holistic four point scale, and a set of five features (relating to meaning, relevance and language use) were calculated based on statistical assumptions.…”
Section: Introductionmentioning
confidence: 99%