“…In total, 26 distinct combinations were identified using data of six different modalities.

[188], [262], [265], [273], [274], [277], [294], [300], [376], [379], [393]
Video & Audio & Sensor [252], [296], [409]
Video & Audio [153], [171], [229], [253], [255], [266], [275], [281], [287], [292], [295], [298], [315]
Video & Text [199]
Video & Sensor [271]
Video & Signal [250], [283]
Image & Audio & Text [111], [175], [204], [213], [216], [263], [264], [288], [340], [396], [406]
Image & Audio & Sensor & Signal [366]
Image & Audio & Sensor [249], [334], [335]
Image & Audio …”