2016
DOI: 10.1016/j.joi.2016.07.003
|View full text |Cite
|
Sign up to set email alerts
|

Empirical analysis and classification of database errors in Scopus and Web of Science

Abstract: In the last decade, a growing number of studies focused on the qualitative/quantitative analysis of bibliometric-database errors. Most of these studies relied on the identification and (manual) examination of relatively limited samples of errors. Using an automated procedure, we collected a large corpus of more than 10,000 errors in the two multidisciplinary databases Scopus and Web of Science (WoS), mainly including articles in the Engineering-Manufacturing field. Based on the manual examination of a portion … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
92
0
6

Year Published

2017
2017
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 159 publications
(111 citation statements)
references
References 25 publications
3
92
0
6
Order By: Relevance
“…A better practice is to search all articles in all SLA related journals indexed in the WoS database, as done in a recent bibliometric study of applied linguistics [41]. Second, as found in a number of studies, automated search results are not error-free [104][105][106]. Although this study has refined the search results to "linguistics or education educational research or language linguistics" categories, some erroneous entries may still exist within the dataset.…”
Section: Discussionmentioning
confidence: 99%
“…A better practice is to search all articles in all SLA related journals indexed in the WoS database, as done in a recent bibliometric study of applied linguistics [41]. Second, as found in a number of studies, automated search results are not error-free [104][105][106]. Although this study has refined the search results to "linguistics or education educational research or language linguistics" categories, some erroneous entries may still exist within the dataset.…”
Section: Discussionmentioning
confidence: 99%
“…Using the query string ‘ISSN(2071‐1050) AND (LIMIT‐TO(PUBYEAR,2015))’, the abstracting and indexing database provides 843 results. It deserves mentioning that at least one of the results is a duplicated item, probably due to matching issues within the indexing engine (Franceschini, Maisano, & Mastrogiacomo, , ; Meester, Colledge, & Dyas, ), and one is a correction. Therefore, the number of articles analysed here is equal to 841.…”
Section: Methodsmentioning
confidence: 99%
“…During the data verification process, some inconsistencies were detected. Nevertheless, this is a common problem of large databases given that they contain a huge amount of information from a variety of sources [35,36].…”
Section: Data Parsing and Text Refiningmentioning
confidence: 99%