Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conferen 2019
DOI: 10.18653/v1/d19-1648
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Task Learning for Chemical Named Entity Recognition with Chemical Compound Paraphrasing

Abstract: We propose a method to improve named entity recognition (NER) for chemical compounds using multi-task learning by jointly training a chemical NER model and a chemical compound paraphrase model. Our method enables the long short-term memory (LSTM) of the NER model to capture chemical compound paraphrases by sharing the parameters of the LSTM and character embeddings between the two models. The experimental results on the BioCreative IV's CHEMDNER task show that our method improves chemical NER and achieves stat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 18 publications
(11 citation statements)
references
References 20 publications
0
11
0
Order By: Relevance
“…[137] transfers knowledge from a large opendomain corpus to the data-scarce medical domain for Chinese word segmentation by developing a parallel MTL architecture. HanPaNE [130] improves NER for chemical compounds by jointly training a chemical compound paraphrase model. [134] enhances Chinese semantic role labeling by adding a dependency parsing model and uses the output of dependency parsing as additional features.…”
Section: Auxiliary Mtlmentioning
confidence: 99%
See 1 more Smart Citation
“…[137] transfers knowledge from a large opendomain corpus to the data-scarce medical domain for Chinese word segmentation by developing a parallel MTL architecture. HanPaNE [130] improves NER for chemical compounds by jointly training a chemical compound paraphrase model. [134] enhances Chinese semantic role labeling by adding a dependency parsing model and uses the output of dependency parsing as additional features.…”
Section: Auxiliary Mtlmentioning
confidence: 99%
“…[116] uses hashtags to represent genres of tweet posts. [130] generates sentence pairs by replacing chemical named entities with their paraphrases in the PubChemDic database. Unicoder [49] uses translated text from the source language to fine-tune on the target language.…”
Section: Multi-label Datasetsmentioning
confidence: 99%
“…The performance of BioNER models has been further improved with the introduction of multi-task learning on multiple biomedical corpora (Crichton et al, 2017;Yoon et al, 2019). Several works demonstrated the effectiveness of jointly learning the BioNER task and other biomedical NLP tasks Watanabe et al, 2019;Zhao et al, 2019;Peng et al, 2020b).…”
Section: Related Workmentioning
confidence: 99%
“…The question of retrieving multiword named entities that may not have been lexicalised yet or that may have a different form compared to the lexicalised ones, is addressed in Nayel et al (2019) for the biomedical domain and in Watanabe et al (2019) for the chemical domain. Our study here also aims at showing how to detect INE that may not have been lexicalised and that may not be detected by traditional NER tools, but without focusing on specific domains such as the biomedical or chemical ones.…”
Section: Nermentioning
confidence: 99%