“…They produce statistics of events within a game by analyzing either camera shots or semantic information. Human activity localization in sports videos is studied in [192], [193], [194], [195]; salient game actions are identified in [196], [197]; and automatic identification and summarization of game highlights are performed in [198], [199], [200], [201], [202]. Moreover, action spotting, the task of temporally localizing human-induced events, has been popular for soccer game broadcasts [3], [203], and some methods aim to automatically detect goals, penalties, corner kicks, and card events [204].…”