“…Therefore, the F1-score is applied, which incorporates both precision and recall and is thus more reliable. ARSH-SV achieves the highest F1-score on the Kyoto7 (91%), Kasteren (91%), and Kasteren 10 (80%) datasets compared with the existing approaches in [6,37,38,45,48]. The results show that ARSH-SV recognizes activities correctly, while incorrect labels are identified through a confidence measure that remains useful in reducing false positives.…”
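
For readers unfamiliar with the metric, the F1-score is the harmonic mean of precision and recall. The following is a minimal sketch of that standard definition, not the evaluation code used for ARSH-SV. The function name `f1_score` and the counts in the usage lines are hypothetical and are not taken from the cited results.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Return the F1-score from true-positive, false-positive and false-negative counts."""
    # Precision: fraction of predicted positives that are correct.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: fraction of actual positives that were found.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return 0.0
    # Harmonic mean of precision and recall.
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts, for illustration only (not from the paper).
baseline = f1_score(tp=90, fp=10, fn=30)
fewer_false_positives = f1_score(tp=90, fp=5, fn=30)
```

With recall held fixed, lowering the false-positive count raises precision and therefore the F1-score, which is consistent with the excerpt's point that reducing false positives is useful.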