“…It is thought by the author of this paper that there are some reasons behind it. To elaborate, as mentioned earlier most popular rater monitoring systems currently available are based on Many Facets Rasch Measurement (MFRM; Wang et al, 2020;Myford & Wolfe, 2009;Wigglesworth, 1993;Davis, 2016), Bayesian approach (Cao, et al, 2010), Rasch Partial Credit model (Wang, et al, 2017), Hierarchical rater model (DeCarlo, et al, 2011), and automated scoring engines (Shin, et al, 2019). These monitoring systems provide the administrators with highly detailed and robust systems which can be attained by using complex mathematical models, and methods like the Bayesian method (Cao, et al, 2010), Maximum likelihood estimation (Shin, et al, 2019), log-ratio test (Wang, et al, 2017), time facet model (Myford & Wolfe, 2009), Signal detection rater model (DeCarlo, et al, 2011), generalized partial credit model (DeCarlo, et al, 2011), mixedeffects ordinal probit model (Shin, et al, 2019) and some specialized software like Facets (Linacre, 2014), Winsteps (Linacre, 2018), Stata (Stata Corp., 2013, latent gold (Vermunt & Magidson, 2005), glam (Rabe-Hesketh, et al, 2004), and WinBUGS (Lunn, et al, 2009).…”