Proceedings of the 4th Workshop on Research in Computational Linguistic Typology and Multilingual NLP 2022
DOI: 10.18653/v1/2022.sigtyp-1.15
|View full text |Cite
|
Sign up to set email alerts
|

ParaNames: A Massively Multilingual Entity Name Corpus

Abstract: We present ParaNames, a Wikidata-derived multilingual parallel name resource consisting of over 118 million names for 13.7 million entities, spanning over 400 languages. ParaNames is useful for multilingual language processing, both for defining name translation tasks and as supplementary data for other tasks. We demonstrate an application of ParaNames by training a multilingual model for canonical name translation to and from English.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 12 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?