“…Few papers use the Cohen's Kappa score [89,233], the Kappa statistic [53,88,231], the Spearman correlation [197,292], the precision-recall curve with the precision-recall breakeven point [110,160,255], and the Hamming loss [229] as an evaluation metric. Others use error calculation-based metrics such as the mean squared error [53,81,171,180], the root mean square forecasting error, and the mean absolute percent error [210].…”