2020
DOI: 10.1002/wics.1543

Item response theory and its applications in educational measurement Part II: Theory and practices of test equating in item response theory

Abstract: Item response theory (IRT) is a class of latent variable models used to develop educational and psychological tests (e.g., standardized tests, personality tests, tests for licensure and certification). We offer readers comprehensive overviews of the theory and applications of IRT through two articles. While Part 1 of the review discusses topics such as foundations of educational measurement, IRT models, item parameter estimation, and applications of IRT with R, this Part 2 reviews areas of test…

Cited by 4 publications (2 citation statements)
References 41 publications
“…The results of the goodness of fit analysis Table 1 shows that the item analysis based on the Item Response Theory fits the 2-PL model for both packages. Then, based on the logistic parameter two (2PL), the model adds completion parameters to the difficulty level [63], [64]. Further analysis carried out to see the good or poor characteristics of the questions with the 2-PL model show the following results: package A had 27 good items and 13 poor items, while package B had 24 good item and 16 poor items.…”
Section: Results Question Analysis
Citation type: mentioning
confidence: 99%
“…Test equating -LID can impact test equating, which is the process of linking scores from several forms of a test. LID may affect the comparability of scores across different test versions (Hori et al, 2022). It is typically seen for instruments composed of items or groups of items that measure various facets of the latent variable or different domains of an underlying construct.…”
Section: Introduction
Citation type: mentioning
confidence: 99%