“…Sensors include ECG and EMG [73,83] to record physiological signals, microphones [83,90,199,200,202,203,205] to record voice intonations, and 2D cameras to capture facial information [90,199,200,202,205], and body language information [203]. These sensors have been integrated together in order to extract many features, including heart rate [73,83], voice pitch [83,90,199,200,202,203,205], gait features [203] and facial features [90,199,200,202,205]. Future research should continue to investigate a wide range of features for all modes in order to determine which combinations of features result in the highest recognition rates during real-world interactions.…”