2021 the 4th International Conference on Software Engineering and Information Management 2021
DOI: 10.1145/3451471.3451489
|View full text |Cite
|
Sign up to set email alerts
|

Analysis of Clustering Algorithms to Clean and Normalize Early Modern European Book Titles

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0
1

Year Published

2021
2021
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 9 publications
0
2
0
1
Order By: Relevance
“…The first data set will be one preferable for our new algorithm, and is the same data set we used in our last paper to compare the different clustering algorithms (Bryer et al, 2021). The data set is made up of approximately 1,000 idioms and proverbs taken from English, French, German and Latin.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The first data set will be one preferable for our new algorithm, and is the same data set we used in our last paper to compare the different clustering algorithms (Bryer et al, 2021). The data set is made up of approximately 1,000 idioms and proverbs taken from English, French, German and Latin.…”
Section: Methodsmentioning
confidence: 99%
“…This dataset has been created and maintained by the Online Computer Library Center (OCLC). A more detailed history of this organization as well as our relationship with them can be found in our previous paper (Bryer et al, 2021). The data contained within the MARC records (in MARC 21 format) stores the metadata of a book, all of the bibliographic and publishing information without containing any of the actual data from the book.…”
Section: Introductionmentioning
confidence: 99%
“…Evan Bryer wraz z zespołem (Bryer et al, 2021) opracowali metodę deduplikacji rekordów metadanych w oparciu o strategię oczyszczania i analizę skupień z wykorzystaniem metod i technologii uczenia maszynowego. Przedmiotem badań była baza ponad 5 mln rekordów w formacie MARC 21 dla wydawnictw zwartych opublikowanych między XVI a XIX w. i dostępnych za pośrednictwem bazy WorldCat.…”
Section: Wymiar Ontologiczny Bdsunclassified