2020
DOI: 10.1007/978-3-030-58219-7_18
|View full text |Cite
|
Sign up to set email alerts
|

Overview of ChEMU 2020: Named Entity Recognition and Event Extraction of Chemical Reactions from Patents

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
35
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
2

Relationship

2
6

Authors

Journals

citations
Cited by 27 publications
(35 citation statements)
references
References 11 publications
0
35
0
Order By: Relevance
“…The ChEMU 2020 benchmark dataset 1 [He et al, 2021] contains snippets sampled from 170 English patents from the European Patent Office and the United States Patent and Trademark Office [He et al, 2020a[He et al, ,b, 2021. As shown in Fig.…”
Section: Benchmark Data For Chemical Entity Recognition -Chemu 2020 Dmentioning
confidence: 99%
See 1 more Smart Citation
“…The ChEMU 2020 benchmark dataset 1 [He et al, 2021] contains snippets sampled from 170 English patents from the European Patent Office and the United States Patent and Trademark Office [He et al, 2020a[He et al, ,b, 2021. As shown in Fig.…”
Section: Benchmark Data For Chemical Entity Recognition -Chemu 2020 Dmentioning
confidence: 99%
“…Particularly, we leverage different pre-trained models available in the literature to create ensembles of named entity recognizers. We evaluate our models in chemistry, clinical and wet lab corpora provided in the context of the ChEMU (Cheminformatics Elsevier Melbourne University) [He et al, 2020a], DEFT (Défi Fouille de Textes) [Grabar et al, 2018] and WNUT (Workshop on Noisy User-generated Text) [Tabassum et al, 2020] challenges, respectively. Our results show that ensembles of named entity recognizers based on masked language models can achieve effective NER performances in these different domains and languages.…”
Section: Introductionmentioning
confidence: 99%
“…In this paper, we provide a detailed overview of the activities within ChEMU2020 lab, including the new ChEMU corpus, the tasks, the evaluation framework, the evaluation results, and a summary of participants' approaches. This paper is an extension of our previous overview papers (He et al, 2020a , b ) and thereby the task descriptions (section 4) and core evaluation results (section 5) are repeated here from those papers. Our focus is to provide additional detail about the preparation of the corpus we developed (section 3) and to provide more comprehensive analysis of the evaluation results.…”
Section: Introductionmentioning
confidence: 97%
“…The ChEMU (Cheminformatics Elsevier Melbourne University) lab is an initiative to encourage research on methods for automated information extraction from chemical patents. As a first running of ChEMU, ChEMU2020 lab focused on extraction of chemical reactions from patents (He et al, 2020a ; Nguyen et al, 2020 ). We prepared two fundamental information extraction tasks.…”
Section: Introductionmentioning
confidence: 99%
“…To reduce the time and effort needed for information extraction from chemical literature and patents, datasets and text mining approaches have been developed on a wide range of information extraction tasks, including named entity recognition and relation extraction [7,8,9,10,11,12]. Most of these methods focus on processing plain text by leveraging state of the art Natural Language Processing (NLP) approaches, and tabular data is usually ignored or discarded, which causes significant loss in the amount of compound-related information that can be extracted from patents.…”
Section: Introductionmentioning
confidence: 99%