2007
DOI: 10.1002/asi.20671
|View full text |Cite
|
Sign up to set email alerts
|

Approximate personal name‐matching through finite‐state graphs

Abstract: This article shows how finite-state methods can be employed in a new and different task: the conflation of personal name variants in standard forms. In bibliographic databases and citation index systems, variant forms create problems of inaccuracy that affect information retrieval, the quality of information from databases, and the citation statistics used for the evaluation of scientists' work. A number of approximate string matching techniques have been developed to validate variant forms, based on similarit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
17
0
1

Year Published

2008
2008
2015
2015

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 30 publications
(18 citation statements)
references
References 78 publications
0
17
0
1
Order By: Relevance
“…The Analysis and recognition of the variants is very high through slightly hampered by a problem of over analysis owing to the fact that some strings contain errors. An inherent limitation of such string matching [6] approaches is that they cannot identify aliases. D. Bollegala, Y. Matsuo, and M. Ishizuka [7] the techniques involved in measuring similarities between words are pattern extraction, page count and word cooccurrence.…”
Section: Literature Surveymentioning
confidence: 99%
“…The Analysis and recognition of the variants is very high through slightly hampered by a problem of over analysis owing to the fact that some strings contain errors. An inherent limitation of such string matching [6] approaches is that they cannot identify aliases. D. Bollegala, Y. Matsuo, and M. Ishizuka [7] the techniques involved in measuring similarities between words are pattern extraction, page count and word cooccurrence.…”
Section: Literature Surveymentioning
confidence: 99%
“…These studies did not aim to determine spelling mistakes for organization names (Galvez & Moya-Anegón 2006a;Galvez & Moya-Anegón 2007a). On the other hand, the study on standardizing author names was designed to find different versions of an author name (Galvez & Moya-Anegón 2007b).…”
Section: Previous Studies About Data Accuracy In Citation Indexesmentioning
confidence: 99%
“…Estas iniciativas suelen centrarse fundamentalmente en tres áreas: las relacionadas con el diseño de esquemas para la descripción de registros de autoridades (desde las ISAAR CPF hasta los más recientes esquemas de metadatos como MADS o microformatos como VCard), las relacionadas con la creación de identifi cadores únicos, como el ISAN (Snyman, 2000) o DAI (Spanje, 2007) y fi nalmente las relacionadas con procesos de desambiguación de nombres en bases de datos (Torvik, 2005;Wooding, 2006;Gálvez, 2007). Asimismo, destaca el reciente trabajo aportado por el centro de investigación Ingenio (Pinar, 2007).…”
Section: La Variabilidad En La Forma De Los Nombresunclassified