“…Web Content Accessibility Guidelines (WCAG 2.0) recommend that, to make videos accessible, authors either create a summary of the content or add audio descriptions that narrate the visual content in sync with the video while avoiding overlap with important audio content [72]. To support authors in creating audio descriptions, prior work has proposed manual [36,40], collaborative [51], and (semi-)automated [15,17,24,46,52,60,73,76] approaches for a range of videos, including long-form traditional films and TV shows [17,24], user-generated videos [36,46,51,52,60,73], livestreams [38,40], and 360-degree videos [18,23,37]. Short-form videos, however, often include continual audio, leaving no clear gaps in which to add audio descriptions.…”