2016
DOI: 10.1587/transinf.2015edp7392
|View full text |Cite
|
Sign up to set email alerts
|

Linked Data Entity Resolution System Enhanced by Configuration Learning Algorithm

Abstract: SUMMARY Linked data entity resolution is the detection of instances that reside in different repositories but co-describe the same topic. The quality of the resolution result depends on the appropriateness of the configuration, including the selected matching properties and the similarity measures. Because such configuration details are currently set differently across domains and repositories, a general resolution approach for every repository is necessary. In this paper, we present cLink, a system that can p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 20 publications
0
4
0
Order By: Relevance
“…All the candidates pairs belonging to S are labeled positive (ie, ( s , t )∈ S ). The other candidates pairs that include the first element ( s ) of an existing positive labeled pair are labeled negative (ie, ( s , z ) where z ≠ t is labeled negative) . Finally, the candidates pairs are divided into training and validation sets and they are provided to the fourth configuration module.…”
Section: Context‐free Approachesmentioning
confidence: 99%
See 2 more Smart Citations
“…All the candidates pairs belonging to S are labeled positive (ie, ( s , t )∈ S ). The other candidates pairs that include the first element ( s ) of an existing positive labeled pair are labeled negative (ie, ( s , z ) where z ≠ t is labeled negative) . Finally, the candidates pairs are divided into training and validation sets and they are provided to the fourth configuration module.…”
Section: Context‐free Approachesmentioning
confidence: 99%
“…cLink is a supervised approach, which is implemented as a part of ScSLINT. It uses a heuristic search method permitting to learn an optimal matching configuration (ie, combination of similarity functions as well as other settings of the IM process) defined in ScSLINT.…”
Section: Context‐free Approachesmentioning
confidence: 99%
See 1 more Smart Citation
“…ActiveAtlas,TAILOR,MARLIN,andGenLinkarelinkingsystemsbasedonsupervised learningmodels.InActiveAtlas (Tejada,Knoblock,&Minton,2001)andTAILOR (Elfeky& Verykios,2002)authorshavedesignedadecisiontreebasedclassifiersystemtoidentifymatches andnon-matchestolinkentities.ActiveAtlasusesaC4.5algorithmtobuildadecisiontreewhile TAILORusestheID3algorithm.MARLIN(MultiplyAdaptiveRecordLinkagewithInduction) (Bilenko&Mooney,2003)learnsstringsimilaritymeasuretocalculatethesimilaritybetween entitypropertiesandusesasupportvectormachine (Cortes&Vapnik,1995)tocombinelearned similaritymeasurestodesignaclassifiermodel.Insupervisedlearning,basedapproachesmost closedtoourworkareGenLink (RIsele&Bizer,2011),EAGLE(Ngomoetal.,2012,cLink and ScLink (Nguyen & Ichise, 2016a, 2016b. The GenLink and EAGLE systems use GP to learnLRsfromtheexistingsetoflinks.IncLinkandScLinkauthorshaveusedheuristicsto optimallylearnconfigurationstoperformlinking.…”
Section: Literature Reviewmentioning
confidence: 99%