Current research in computerised adaptive testing (CAT) focuses on applications, in small and large scale, that address self assessment, training, employment, teacher professional development for schools, industry, military, assessment of non-cognitive skills, etc. Dynamic item generation tools and automated scoring of complex, constructed response examinations are coming into use. Therefore it is important to extend CAT's functionality to include more variables in its student model that define the examinee as an individual beyond the mastery level, for improved performance and more efficient test delivery. This paper examines variables that can prompt adaptation and discusses their potential use in a hypothetical student model for CAT. The objective of this effort is to provide researchers, designers, and developers of CAT a perspective for exploiting research outcomes from the area of personalised hypermedia applications.
IntroductionDue to the advances in communication and information technology, the popularity of computer based testing has increased in recent years. Computer delivery of tests has become feasible for processes such as licensure, certification and admission. Moreover, computers can be used to increase the statistical accuracy of test scores using computerised adaptive testing (CAT). As an alternative to giving each examinee the same fixed test, CAT item selection adapts to the ability level of individual examinees, and after each response the ability estimate is updated and the next item is selected to have optimal properties at the new estimate (van der Linden & Glas, 2003). The computer continuously re-evaluates the ability of the examinee until the accuracy of the estimate reaches a statistically acceptable level or when some limit is reached, such as a maximum number of test items presented. The score is determined from the level of
Triantafillou, Georgiadou and Economides
351the difficulty, and as a result, while all examinees may answer the same percentage of questions correctly, high ability examinees will attain a better score as they answer correctly more difficult items. The vast majority of CAT systems rely on Item Response Theory as the underlying model (Lord, 1980;Wainer, 1990). However, Decision Theory provides an alternative underlying model for sequential testing (Rudner, 2002), and Knowledge Space Theory (Doignon & Falmagne, 1985) is a third basis for small scale construction of adaptive tests.Regardless of some disadvantages reported in the literature, for example, high cost of development, item calibration, item exposure control (Eggen, 2001;Boyd, 2003), effect of a flawed item (Abdullah, 2003), or the use of CAT for summative assessment (Lilley & Barker, 2002, CAT has several advantages. Testing on demand can be facilitated, so an examinee can take the test whenever and wherever he or she is ready. Multiple media can be used to create innovative item formats and more realistic testing environments. Other possible advantages are flexibility of test management, immediate avai...