2012
DOI: 10.1002/j.2333-8504.2012.tb02300.x
|View full text |Cite
|
Sign up to set email alerts
|

Examining Linguistic Characteristics of Paraphrase in Test‐taker Summaries

Abstract: Since its 1947 founding, ETS has conducted and disseminated scientific research to support its products and services, and to advance the measurement and education fields. In keeping with these goals, ETS is committed to making its research freely available to the professional community and to the general public. Published accounts of ETS research, including papers in the ETS Research Report series, undergo a formal peer-review process by ETS staff to ensure that they meet established scientific and professiona… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 14 publications
0
2
0
Order By: Relevance
“…The estimation of use-in-a-corpus is more accurate for smaller n, because longer ngrams are more susceptible to paraphrasing, which leads to under-estimation of use. Assuming that paraphrasing behavior of good and bad writers is not the same -in fact, there is corpus evidence that better writers paraphrase more (Burstein et al, 2012) -the resulting inaccuracies might impact the estimation of differential use in a systematic manner, making the n > 1 models less effective than the unigrams. Given that (a) the GoodVsBad bigram model is the second best overall in spite of the shortcomings of the estimation process, and (b) that the bigram models worked better than unigram models for all the other content importance models, the GoodVsBad bigram model could probably be improved significantly by using a more flexible information realization mechanism.…”
Section: Discussionmentioning
confidence: 99%
“…The estimation of use-in-a-corpus is more accurate for smaller n, because longer ngrams are more susceptible to paraphrasing, which leads to under-estimation of use. Assuming that paraphrasing behavior of good and bad writers is not the same -in fact, there is corpus evidence that better writers paraphrase more (Burstein et al, 2012) -the resulting inaccuracies might impact the estimation of differential use in a systematic manner, making the n > 1 models less effective than the unigrams. Given that (a) the GoodVsBad bigram model is the second best overall in spite of the shortcomings of the estimation process, and (b) that the bigram models worked better than unigram models for all the other content importance models, the GoodVsBad bigram model could probably be improved significantly by using a more flexible information realization mechanism.…”
Section: Discussionmentioning
confidence: 99%
“…In a study about paraphrasing in TOEFL iBT integrated writing, Burnstein et al () delved more deeply into the nature of the integration of source material using computational tools to explore paraphrasing. They discovered a difference in paraphrases from the listening source materials and from reading.…”
Section: What?mentioning
confidence: 99%