Proceedings of the 10th Annual Joint Conference on Digital Libraries 2010
DOI: 10.1145/1816123.1816130
|View full text |Cite
|
Sign up to set email alerts
|

Effective self-training author name disambiguation in scholarly digital libraries

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
77
3
12

Year Published

2012
2012
2017
2017

Publication Types

Select...
8
1

Relationship

3
6

Authors

Journals

citations
Cited by 63 publications
(93 citation statements)
references
References 29 publications
1
77
3
12
Order By: Relevance
“…They use a mix of techniques. While some use similarity functions [2,7,12,18,21,27,30], others use learning techniques [1,14,16,28,32,35], heuristics [17,19,20,24], classifiers [9,10,34] and clustering methods [11,31].…”
Section: Background and Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…They use a mix of techniques. While some use similarity functions [2,7,12,18,21,27,30], others use learning techniques [1,14,16,28,32,35], heuristics [17,19,20,24], classifiers [9,10,34] and clustering methods [11,31].…”
Section: Background and Related Workmentioning
confidence: 99%
“…In several cases, we cannot locate a given publication in the coauthor's CV due to differences in the title spelling (lines 1-3 of Algorithm 2). In this case, our heuristic attempts to retrieve it by using a set of attributes: the Id of the coauthor's CV, year of publication, volume, and number of first and last pages (lines [5][6][7][8][9][10]. If the publication is not located, we use the most stable attributes, which are the id of the coauthor's CV and the publication year (lines 12-14).…”
Section: Heuristic Matching Algorithmmentioning
confidence: 99%
“…A group of techniques use machine learning algorithms (Veloso et al 2012;D'Angelo et al 2011;Cota et al 2010;Treeratpituk and Giles 2009;Kang 2008). Levin et al (2012Levin et al ( , 1031 (Ferreira et al 2010;Dai and Storkey 2009;Kang et al 2009b;Masada et al 2007), ontology-based method using properties (Kim et al 2011;Kim and Park 2009), and author profiling (Ferreira et al 2012b). Ferreira et al (2012a, 18-19) 2.1 Author disambiguation using unsupervised algorithm As a result, for a group of 5,332 authors with same names, they found 9,133 'real' individual authors.…”
Section: Review Of Author Name Disambiguation Techniquesmentioning
confidence: 99%
“…HHC disambiguates a set of citation records by successively fusing clusters of citation records with similar author names based on a real-world heuristic applied to their citation attributes. Then, we present SAND -Self-training Associative Name Disambiguator [9,8]. SAND is a three-step selftraining method for author name disambiguation that requires no manual labeling and no parameterization (in real world scenarios).…”
Section: Introductionmentioning
confidence: 99%