“…A typical problem setting in first-person vision is to recognize the activities of camera wearers. Recent work has focused on activity recognition [7,22,23,28], activity forecasting [6,9,26,31], person identification [11], gaze anticipation [45] and grasp recognition [3,4,21,35]. As in our setting, other work has tried to recognize the behaviors of other people observed in first-person videos, e.g., group discovery [2], eye contact detection [42] and activity recognition [33,34,44].…”