“…From the perspective of SLA, Long (2020) has argued that bi-modal and tri-modal input may improve enhanced incidental learning, an internal process in the mind of the learner that increases unconscious detection, without necessarily raising learning to the level of conscious awareness at all. Indeed, research on learning from audiovisual input has consistently shown that watching audiovisual material, especially with captions (subtitles in the L2), is beneficial for comprehension (e.g., Gass et al., 2019), vocabulary items and formulaic sequences (e.g., Peters & Webb, 2018; Pujadas & Muñoz, 2019), grammar learning (e.g., Lee & Révész, 2020; Pattemore & Muñoz, 2020), and pronunciation (Wisniewska & Mora, 2020). On the other hand, watching non-captioned video has been shown to especially benefit aural recognition of word forms (Sydorenko, 2010).…”