“…Most COVID-19 related datasets are constructed from two types of sources. The first one is scientific publications, including the datasets CORD-19 (Wang et al, 2020) and LitCovid (Chen et al, 2020), that help facilitate many types of research works, such as building search engines to retrieve relevant information from scholarly articles (Esteva et al, 2020;Zhang et al, 2020;Verspoor et al, 2020), question answering and summarization (Lee et al, 2020;Su et al, 2020). Recently, Colic et al (2020) fine-tune a BERT-based NER model on the CRAFT corpus (Verspoor et al, 2012) to recognize and then normalize biomedical ontology and terminology entities in LitCovid.…”