2011
DOI: 10.1145/1891879.1891883
|View full text |Cite
|
Sign up to set email alerts
|

On Graph-Based Name Disambiguation

Abstract: Name ambiguity stems from the fact that many people or objects share identical names in the real world. Such name ambiguity decreases the performance of document retrieval, Web search, information integration, and may cause confusion in other applications. Due to the same name spellings and lack of information, it is a nontrivial task to distinguish them accurately. In this article, we focus on investigating the problem in digital libraries to distinguish publications written by authors with identical names. W… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
129
0

Year Published

2012
2012
2016
2016

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 111 publications
(134 citation statements)
references
References 28 publications
0
129
0
Order By: Relevance
“…For example, the institutional repository of the University of Trento does not provide any mechanism to disambiguate locations (of conferences and of editors). Examples of useful heuristics, though restricted to author names only, can be found in [12].…”
Section: E N T I T Y C L a S S : C O N C E P T N A M E : N A M E [ ]mentioning
confidence: 99%
“…For example, the institutional repository of the University of Trento does not provide any mechanism to disambiguate locations (of conferences and of editors). Examples of useful heuristics, though restricted to author names only, can be found in [12].…”
Section: E N T I T Y C L a S S : C O N C E P T N A M E : N A M E [ ]mentioning
confidence: 99%
“…In [2] the authors propose a general object distinction methodology called DISTINCT. DISTINCT assumes a relational structure between neighbor tuples then computes the linkage strengths using two measures -set resemblance and random walks.Fan,et al,proposed another kind of network based name disambiguation method called GHOST [4]. By using only one type of the publication attributes (coautorship) GHOST is claimed to get better performance than DISTINCT.…”
Section: Related Workmentioning
confidence: 99%
“…As an example, we take the one proposed in [4] which is described as follows (the title of each publication is replaced with its venue label like "SIGMOD'12" to save space).…”
Section: Name Disambiguationmentioning
confidence: 99%
See 1 more Smart Citation
“…The data are represented as vector space model where the distance between vectors represents the similarity. Such algorithms include the Cosine Similarity (CS) with TF-IDF, Jaccard Similarity, Jaro Winkler, and Levenshtein algorithms [7,9,[12][13][14].…”
Section: Related Workmentioning
confidence: 99%