2021
DOI: 10.21031/epod.817396

How Reliable Is It to Automatically Score Open-Ended Items? An Application in the Turkish Language

Abstract: The use of open-ended items, especially in large-scale tests, creates difficulties in scoring. This problem can be overcome with an approach based on automated scoring of open-ended items. The aim of this study was to examine the reliability of the data obtained by scoring open-ended items automatically. One of the objectives was to compare different machine-learning algorithms for automated scoring (support vector machines, logistic regression, multinomial Naive Bayes, long-…

Cited by 2 publications (1 citation statement)
References 33 publications

“…The QWK coefficient is generally used as an evaluation index in the automatic grading of related essays and articles. It can well reflect the consistency between the predicted quality index (mostly in the form of score) and the actual quality index of the essay and articles [25] and improves the validity of the evaluation results. The QWK coefficient introduces a penalty mechanism based on the KAPPA coefficient.…”
Section: B Evaluation Methods (mentioning)
confidence: 90%
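
The penalty mechanism mentioned in the citation statement can be illustrated with a minimal sketch. It assumes the commonly used quadratic weights, (i - j)^2 / (k - 1)^2 for k score categories, so that a machine score two points away from the human score costs four times as much as one that is a single point away. The function name, its parameters, and the sample scores are illustrative assumptions; they are not taken from the cited study.

```python
def quadratic_weighted_kappa(human, machine, min_score, max_score):
    """Quadratic weighted kappa (QWK) between two lists of integer scores.

    Illustrative sketch: weights are assumed to be (i - j)^2 / (k - 1)^2.
    """
    if len(human) != len(machine) or not human:
        raise ValueError("score lists must be non-empty and of equal length")
    k = max_score - min_score + 1
    if k < 2:
        raise ValueError("at least two score categories are required")
    n = len(human)

    # Observed agreement matrix: rows are human scores, columns machine scores.
    observed = [[0] * k for _ in range(k)]
    for h, m in zip(human, machine):
        observed[h - min_score][m - min_score] += 1

    # Marginal score histograms for each rater.
    hist_h = [sum(row) for row in observed]
    hist_m = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(k):
        for j in range(k):
            weight = (i - j) ** 2 / (k - 1) ** 2  # quadratic penalty
            expected = hist_h[i] * hist_m[j] / n  # count expected by chance
            weighted_observed += weight * observed[i][j]
            weighted_expected += weight * expected

    if weighted_expected == 0:
        # Both raters used one and the same category throughout; returning
        # 1.0 here is a convention chosen for this sketch.
        return 1.0
    return 1.0 - weighted_observed / weighted_expected


# Hypothetical scores on a 0-2 scale, differing only in the last response.
human = [0, 1, 2, 0]
near_miss = [0, 1, 2, 1]  # last score is off by one point
far_miss = [0, 1, 2, 2]   # last score is off by two points
qwk_near = quadratic_weighted_kappa(human, near_miss, 0, 2)
qwk_far = quadratic_weighted_kappa(human, far_miss, 0, 2)
```

With these sample scores the one-point miss yields a higher coefficient than the two-point miss, which is the behaviour that distinguishes QWK from the unweighted kappa coefficient, where both misses would count equally.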