Psychometric modeling of decision making via game play

Regan, Kenneth W.; Biswas, Tamal

doi:10.1109/cig.2013.6633653

Cited by 6 publications

(4 citation statements)

References 21 publications

“…This makes it possible to analyze the moves of strong human players, in a large-scale fashion, comparing their choices to those of an engine. This has been pursued very effectively in the last several years by Biswas and Regan [2,3,23]; they have used the approach to derive interesting insights including proposals for how to estimate the depth at which human players are analyzing a position.…”

Section: Introduction (mentioning)

confidence: 99%

Assessing Human Error Against a Benchmark of Perfection

Anderson

Kleinberg

Mullainathan

2016

Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

An increasing number of domains are providing us with detailed trace data on human decisions in settings where we can evaluate the quality of these decisions via an algorithm. Motivated by this development, an emerging line of work has begun to consider whether we can characterize and predict the kinds of decisions where people are likely to make errors. To investigate what a general framework for human error prediction might look like, we focus on a model system with a rich history in the behavioral sciences: the decisions made by chess players as they select moves in a game. We carry out our analysis at a large scale, employing datasets with several million recorded games, and using chess tablebases to acquire a form of ground truth for a subset of chess positions that have been completely solved by computers but remain challenging even for the best players in the world. We organize our analysis around three categories of features that we argue are present in most settings where the analysis of human error is applicable: the skill of the decision-maker, the time available to make the decision, and the inherent difficulty of the decision. We identify rich structure in all three of these categories of features, and find strong evidence that in our domain, features describing the inherent difficulty of an instance are significantly more powerful than features based on skill or time.

“…They began by showing that the skill levels of individual players can be estimated accurately by examining how their individual move choices correlate with those of top chess engines: Stronger players more often play one of the computer's preferred moves, which is taken to mean that the stronger players are playing better moves (Regan & Haworth; Regan & Biswas). Interestingly, the correspondence between individual move quality and rating has been consistent over time, meaning that the top players of the current era (who are rated in the 2750–2850 range on the Elo scale) are playing better chess than the top players of earlier eras (who were rated 2600–2750).…”

Section: Use (Near)-optimal Play To Understand Human Play and Players (mentioning)

confidence: 99%

Six Suggestions for Research on Games in Cognitive Science

Chabris

2017

Topics in Cognitive Science

Games are more varied and occupy more of daily life than ever before. At the same time, the tools available to study game play and players are more powerful than ever, especially massive data sets from online platforms and computational engines that can accurately evaluate human decisions. This essay offers six suggestions for future cognitive science research on games: (1) Don't forget about chess, (2) Look beyond action games and chess, (3) Use (near)-optimal play to understand human play and players, (4) Investigate social phenomena, (5) Raise the standards for studies of games as treatments, (6) Talk to real experts.
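
The citation statement above describes estimating skill from how often a player's moves coincide with an engine's preferred moves. A minimal sketch of that match-rate idea follows. The function name `match_rate`, the move lists, and the engine preference sets are all hypothetical illustrations, not data or code from any of the cited papers, and the published models are considerably richer than a raw match count.

```python
def match_rate(played, engine_preferred):
    """Fraction of played moves that fall within the engine's preferred set.

    played:           list of moves the human chose, one per position
    engine_preferred: list of sets, the engine's preferred move(s) per position
    Both inputs are hypothetical; a real study would obtain them from
    recorded games and engine analysis.
    """
    if len(played) != len(engine_preferred):
        raise ValueError("need one engine preference set per played move")
    if not played:
        raise ValueError("no moves to evaluate")
    hits = sum(1 for move, prefs in zip(played, engine_preferred) if move in prefs)
    return hits / len(played)


# Made-up example: four positions, three of which match the engine.
played = ["e4", "Nf3", "Bb5", "a3"]
engine_preferred = [{"e4", "d4"}, {"Nf3"}, {"Bc4", "Bb5"}, {"O-O"}]
rate = match_rate(played, engine_preferred)
print(rate)
```

Under the reading in the quoted passage, a higher match rate would be taken as a sign of stronger play; turning it into a rating estimate requires a fitted model, which this sketch does not attempt.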

“…Table 1 highlights some games analysed with STOCKFISH 3.0. [14,22,23] identifies the $BP_i$ which best fits the observed play: it is essentially frequentist. The probability of $BP(c_i)$ playing moves $m_1$–$m_k$ is $p(c_i) = \prod_{j=1}^{k} q_{j,i}$ and $c_i$ is found to maximize $p(c_i)$.…”

Section: Fig (mentioning)

confidence: 99%

A Comparative Review of Skill Assessment: Performance, Prediction and Profiling

Haworth

Biswas

Regan

2015

Lecture Notes in Computer Science

Self Cite

Abstract. The assessment of chess players is both an increasingly attractive opportunity and an unfortunate necessity. The chess community needs to limit potential reputational damage by inhibiting cheating and unjustified accusations of cheating: there has been a recent rise in both. A number of counter-intuitive discoveries have been made by benchmarking the intrinsic merit of players' moves: these call for further investigation. Is Capablanca actually, objectively the most accurate World Champion? Has ELO rating inflation not taken place? Stimulated by FIDE/ACP, we revisit the fundamentals of the subject to advance a framework suitable for improved standards of computational experiment and more precise results. Other games and domains look to chess as demonstrator of good practice, including the rating of professionals making high-value decisions under pressure, personnel evaluation by Multichoice Assessment and the organization of crowd-sourcing in citizen science projects. The '3P' themes of performance, prediction and profiling pervade all these domains.
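
The fitting step quoted from this paper (pick the benchmark player whose parameter makes the observed moves most probable) can be sketched as a small maximum-likelihood search. The sketch assumes that p(c_i) is the product over the observed moves of the per-move probabilities q_{j,i}; the candidate parameter values and probabilities are invented for illustration, and the names `log_likelihood` and `best_fitting_player` are hypothetical rather than taken from the authors' software.

```python
import math


def log_likelihood(move_probs):
    """Log of the product of per-move probabilities q_{j,i}.

    Summing logs avoids numerical underflow when many moves are observed.
    A zero probability makes the whole product zero, so return -infinity.
    """
    if any(q <= 0.0 for q in move_probs):
        return float("-inf")
    return sum(math.log(q) for q in move_probs)


def best_fitting_player(q_by_player):
    """Return the parameter c_i whose benchmark player maximizes p(c_i).

    q_by_player maps each candidate parameter c_i to the list of
    probabilities q_{j,i} that this benchmark player assigns to the
    moves m_1..m_k actually played (hypothetical input format).
    """
    return max(q_by_player, key=lambda c: log_likelihood(q_by_player[c]))


# Made-up probabilities for three candidate benchmark players and three moves.
q_by_player = {
    0.05: [0.60, 0.10, 0.55],
    0.10: [0.50, 0.20, 0.45],
    0.20: [0.35, 0.30, 0.30],
}
best = best_fitting_player(q_by_player)
print(best)
```

Maximizing the sum of logs selects the same parameter as maximizing the product itself, since the logarithm is monotonic. A grid of candidates is used here only for simplicity.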
