Christian Druckenbrodt scite author profile

Chemical patents are an important resource for chemical information. However, few chemical Named Entity Recognition (NER) systems have been evaluated on patent documents, due in part to their structural and linguistic complexity. In this paper, we explore the NER performance of a BiLSTM-CRF model utilising pre-trained word embeddings, characterlevel word representations and contextualized ELMo word representations for chemical patents. We compare word embeddings pre-trained on biomedical and chemical patent corpora. The effect of tokenizers optimized for the chemical domain on NER performance in chemical patents is also explored. The results on two patent corpora show that contextualized word representations generated from ELMo substantially improve chemical NER performance w.r.t. the current state-of-the-art. We also show that domain-specific resources such as word embeddings trained on chemical patents and chemical-specific tokenizers have a positive impact on NER performance.

show abstract

Dimere Dialkylphosphanylgermylene: Ylidische Diphosphadigermetane

Druckenbrodt

Mont

Ruthe

et al. 1998

Z. anorg. allg. Chem.

View full text Add to dashboard Cite

Overview of ChEMU 2020: Named Entity Recognition and Event Extraction of Chemical Reactions from Patents

Nguyen

Akhondi

et al. 2020

View full text Add to dashboard Cite

The Bromination of Bulky Trialkylphosphane Selenides R₂R′PSe (R, R′ = iPr or tBu) Studied by Physical and Computational Methods

et al. 2005

View full text Add to dashboard Cite

show abstract

The first trialkylphosphane telluride complexes of Ag(i): molecular, ionic and supramolecular structural alternatives

et al. 2007

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.