“…This suggests that the sketching step helps generate a more fluent summary even with lower unigram matching. Furthermore, recognizing the limitation of ROUGE scores in their ability to fully capture the resemblance between the generated summary and the reference, in Table 2, we follow (Fabbri et al, 2020) rics, including ROUGE-Word Embedding (Ng and Abrecht, 2015), BERTScore (Zhang et al, 2019b), MoverScore (Zhao et al, 2019), Sentence Mover's Similarity (SMS) (Clark et al, 2019), BLEU (Papineni et al, 2002), and CIDEr (Vedantam et al, 2015). As shown in Table 2, CODS consistently outperforms PEGASUS and BART.…”