“…The majority of existing methods for video grounding can be categorized into two families: 1) proposal-based methods [2,5,13,14,17,25,26,33,43,45,48,50,54,56,57,58], which All codes and models will be made available shortly. generate a bunch of proposals in advance and select the best match with target spans, and 2) proposal-free methods [6,7,8,15,29,31,36,42,45,52,53,55], which estimate start and end timestamps aligned to the given description directly. The proposal-based approaches generally show strong performance at the expense of prohibitive cost of proposal generation.…”