“…Whereas, in [1], the common scene observed by the wearer and a surveillance camera has been used Poster Session F1: Deep Learning for Multimedia MM '20, October 12-16, 2020, Seattle, WA, USA to identify the wearer. Other works compute the location of the wearer directly [7,15] or indirectly (using gaze, social interactions, etc.) [16,17], which is then used to identify the wearer.…”