Homogeneity Within Item Forms in Domain Referenced Testing

Macready, George B.; Merwin, Jack C.

doi:10.1177/001316447303300215

Cited by 18 publications

(8 citation statements)

References 6 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…When items in a domain are expected to be relatively homogeneous, and there are many times when this is not a reasonable assumption (Macready & Merwin, 1973), it has become a fairly common practice for a test developer to compare estimates of item difficulty parameters, or item discrimination parameters, or both. Since one would expect items measuring an objec tive equally well to have similar item parameters, estimates of the parameters are compared to detect items that deviate from the norm.…”

Section: Review Of Educational Researchmentioning

confidence: 99%

Criterion-Referenced Testing and Measurement: A Review of Technical Issues and Developments

Hambleton

Swaminathan

Algina

et al. 1978

Review of Educational Research

192

View full text Add to dashboard Cite

Glaser (1963) and Popham and Husek (1969) were the first to introduce and to popularize the field of criterion-referenced testing. Their motive was to provide the kind of test score information needed to make a variety of individual and programmatic decisions arising in objectivesbased instructional programs. Norm-referenced tests were seen as less than ideal for providing the desired kind of test score information.At present students at all levels of education are taking criterion-

show abstract

Section: Review Of Educational Researchmentioning

confidence: 99%

Criterion-Referenced Testing and Measurement: A Review of Technical Issues and Developments

Hambleton

Swaminathan

Algina

et al. 1978

Review of Educational Research

192

View full text Add to dashboard Cite

show abstract

“…In the preceding paragraphs it was assumed that a single (mastery) continuum underlies the items. Although occasionally the opinion can be met that heterogeneous item domains are usable as well (Millman, 1974), homogeneous item domains are generally considered as a requisite for a criterion-referenced interpretation of test scores (see, e.g., Hambleton & Gorth, 1971;Macready & Merwin, 1973;Popham & Husek, 1967). We denote this underlying mastery continuum by 0 and the cut-off score on it that is used for mastery decisions by the value 0c.…”

Section: Latent Trait Analysismentioning

confidence: 99%

A Latent Trait Look at Pretest-Posttest Validation of Criterion-Referenced Test Items

Linden

1981

Review of Educational Research

View full text Add to dashboard Cite

Since Cox and Vargas (1966) introduced their pretest-posttest validity index for criterion-referenced test items, a great number of additions and modifications havefollowed. All are based on the idea of gain scoring; that is, they are computedfrom the differences between proportions ofpretest andposttest item responses. Although the method is simple and generally considered as the prototype of criterion-referenced item analysis, it has many and serious disadvantages. Some of these go back to thefact that it leads to indices based on a dual test administration-and population-dependent item p values. Others have to do with the global information about the discriminating power that these indices provide, the implicit weighting they suppose, and the meaningless maximization of posttest scores they lead to. Analyzing the pretest-posttest methodfrom a latent trait point of view, it is proposed to replace indices like Cox and Vargas' Dpp by an evaluation of the item informationfunctionfor the mastery score. An empirical study was conducted to compare the differences in item selection between both methods. As in any other area of educational and psychological measurement, more attention has been paid to reliability than to validity aspects of criterion-referenced measurement. Several test parameters have been proposed and compared with their normreferenced counterparts, assessment methods have been introduced and examined using both real and simulated data, and the criterion-referenced reliability problem seems on its way to a great diversity of solutions (

show abstract

“…The tasks were too easy. Consequently, it was necessary to select a new set of algebra tasks and psychometrically validate these before proceding to the studies for Task Macready and Merwin (1973) have called a homogeneous item domain.…”

Section: Second Year Researchmentioning

confidence: 99%