“…Each video was evaluated by three different viewers through the Amazon Mechanical Turk crowd-sourcing platform [Mason and Suri 2011]. To mitigate the impact of low inter-rater agreement, measured by the intraclass correlation coefficient (ICC) [Bartko 1966], we took the root mean square (RMS) of the three annotators' ratings as the final score, similar to the approach followed in [Dinkar et al. 2020a] (for more details, see [Biancardi et al. 2022]). To analyse the speech transcripts, we processed the data set using a speech transcription library (footnote 3).…”
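
The RMS aggregation described in the excerpt can be illustrated with a minimal sketch. The function name `rms_score`, the video identifiers, and the rating values are hypothetical and chosen only for illustration; the excerpt does not specify the rating scale or the authors' implementation.

```python
import math


def rms_score(ratings):
    """Return the root mean square of one video's ratings."""
    if not ratings:
        raise ValueError("at least one rating is required")
    # Square each rating, average the squares, then take the square root.
    return math.sqrt(sum(r * r for r in ratings) / len(ratings))


# Hypothetical ratings from three annotators per video (not data from the paper).
video_ratings = {
    "video_a": [3, 4, 5],
    "video_b": [1, 1, 7],
}

# One aggregated score per video.
final_scores = {vid: rms_score(r) for vid, r in video_ratings.items()}

for vid, score in final_scores.items():
    print(f"{vid}: {score:.3f}")
```

For non-negative ratings the RMS is never smaller than the arithmetic mean, and the two coincide only when all annotators give the same rating, so the RMS gives more weight to the higher ratings when annotators disagree.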