The current article describes an exploratory study that focussed on joint attention behaviour—the basis of interaction predicting productive collaboration—to better understand collaborative problem solving, particularly its social aspects during remote dyadic interaction. The study considered joint attention behaviour as a socio-linguistic phenomenon and relied on detailed qualitative interaction analysis on event-related measures of multiple observational data (i.e. log files, eye-tracking data). The aim was to illustrate and exemplify how the diverse attentional levels of joint attention behaviour (i.e. monitoring, common, mutual and shared attention) delineated by Siposova and Carpenter (Cognition 89:260–274, 2019) were achieved in remote collaborative problem solving in dyads, including the underlying basis of joint attention behaviour (i.e. individual attention experience). The results made visible the complex functioning of the social aspects of remote collaborative problem solving and provided preliminary insights into how the hierarchical and nested levels of ‘jointness’ and common knowledge were achieved in this context. The analysis reproduced all the theorised attentional levels as both isolated and parallel individualistic attention experiences whilst acknowledging the restrictions of the remote interaction environment and the specific task structures.