2018
DOI: 10.18495/comengapp.v7i1.223
|View full text |Cite
|
Sign up to set email alerts
|

Improving Data Integrity of Individual-based Bibliographic Repository Using Clustering Techniques

Abstract: This paper presents a method to improve data integrity of individual-based bibliographic repository. Integrity improvement is done by comparing individual-based publication raw data with individual-based clustered publication data. Hierarchical Agglomerative Clustering is used to cluster the publication data with similar author names. Clustering is done by two steps of clustering. The first clustering is based on the co-author relationship and the second is by title similarity and year difference. The two-step… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 0 publications
0
1
0
Order By: Relevance
“…DL content and service quality are strongly influenced by the ambiguity problem of the author's name in the citation and are considered as one of the most difficult problems faced by digital library researchers [3]. AND becomes a problem when a set of publication notes contains the name of the author which gives rise to more than one interpretation, ie the same author can appear with a different name [4] [5]. This becomes a point that reduces the quality of information and also reduces the reliability of the information because it impacts the information on the author, organization and other things that are displayed as part of the publication's notes [6].…”
Section: Introductionmentioning
confidence: 99%
“…DL content and service quality are strongly influenced by the ambiguity problem of the author's name in the citation and are considered as one of the most difficult problems faced by digital library researchers [3]. AND becomes a problem when a set of publication notes contains the name of the author which gives rise to more than one interpretation, ie the same author can appear with a different name [4] [5]. This becomes a point that reduces the quality of information and also reduces the reliability of the information because it impacts the information on the author, organization and other things that are displayed as part of the publication's notes [6].…”
Section: Introductionmentioning
confidence: 99%