Aims: This study provides a multimodal conversation analytic account of directive sequences produced in the presence of a child aged 1;8-2;4 who is growing up in an English-dominant environment and acquiring Polish as a heritage language. Design: The video-recorded data are drawn from naturally occurring interactions in which the child is present when one caregiver produces a directive turn (request, proposal, or suggestion) and another carries it out in an embodied fashion. Analysis: The analysis focuses on the sequential unfolding of the triadic interactions, including the two adults’ and the child’s verbal turns, gaze direction, manual actions, and handling of objects. Findings: Although the child is not always verbally active in the directive sequences, she observes them and sometimes takes part, either through bodily actions or verbal utterances. The multimodal analysis also shows that even when the child’s verbal activity appears to indicate that she has understood a prior turn and is responding to it, she may not actually be orienting to the conversation. Observing adults carrying out heritage language directive sequences is possible only when the child is interacting with two speakers of her heritage language, or with one speaker of that language and a person who has some passive knowledge of it. Seeing adults’ mutual social actions offers the child a social environment in which she can access a rich linguistic model and observe the benefits of using the heritage language in everyday interactional situations. Originality: This is the first study to offer a multimodal conversation analysis of directive sequences in which one adult produces the directive and another adult responds to it in the presence of a child acquiring a heritage language.