“…In software engineering, there is a substantial body of work on visual artifacts, including recordings, tutorials, and bug reports [27,29,32,38,61,78,82], some of it specifically targeting usability and accessibility testing [30,49,76,81]. In particular, Bao et al. [20] focused on extracting user interactions, such as code editing, text selection, and window scrolling, to facilitate behavioral analysis of developers during programming tasks. Krieter et al. [44] analyzed every frame of a video to generate log files describing the events occurring at the app level (e.g., the "k" key is pressed).…”