2018
DOI: 10.2139/ssrn.3287149
|View full text |Cite
|
Sign up to set email alerts
|

Characterising Dataset Search – An Analysis of Search Logs and Data Requests

Abstract: Large amounts of data are becoming increasingly available online. In order to benefit from it we need tools to retrieve the most relevant datasets that match ones data needs. Several vocabularies have been developed to describe datasets in order to increase their discoverability, but for data publishers is costly to cumbersome to annotate them using all, leading to the question of what properties are more important. In this work we contribute with a systematic study of the patterns and specific attributes that… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
16
1

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 9 publications
(18 citation statements)
references
References 48 publications
1
16
1
Order By: Relevance
“…The study of user data accessing behaviors is still in its early stage. Kacprzak et al (2019) mentioned that there are no systematic studies that investigate what properties of data users' needs are crucial for them to effectively search and discover datasets. Also, understanding users' data accessing behaviors is essential to the findability of datasets, yet “a user‐focused, cross‐disciplinary analysis of data retrieval practices is lacking” (Gregory et al, 2019; Ohno‐Machado et al, 2017).…”
Section: Literature Reviewmentioning
confidence: 99%
See 2 more Smart Citations
“…The study of user data accessing behaviors is still in its early stage. Kacprzak et al (2019) mentioned that there are no systematic studies that investigate what properties of data users' needs are crucial for them to effectively search and discover datasets. Also, understanding users' data accessing behaviors is essential to the findability of datasets, yet “a user‐focused, cross‐disciplinary analysis of data retrieval practices is lacking” (Gregory et al, 2019; Ohno‐Machado et al, 2017).…”
Section: Literature Reviewmentioning
confidence: 99%
“…Both studies used qualitative research methods, which is helpful to obtain insights on OGD users, but the accuracy and comprehensiveness of their results could be limited by the low numbers of participants. The third study on OGD users' search behaviors employed the search log analysis method (Kacprzak et al, 2019). It examined the patterns and specific attributes of search queries and compared the OGD search with a general web search.…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…Real Queries. We used crowdsourced natural language queries 9 that were originally submitted to data.gov.uk for datasets [20]. They were transformed into keyword queries by removing stop words using Apache Lucene.…”
Section: Empirical Evaluationmentioning
confidence: 99%
“…Some recent studies have explored the issues of research data curation and discovery services from the perspectives of researchers and data curation and discovery service providers (Faniel, Frank, & Yakel, 2019; Kacprzak et al, 2019; Walsh et al, 2019). Yet a relatively small part of the research data has been shared outside the research community; the roles played by the different stakeholders and how to facilitate the practices of data sharing are still emerging (Cox, Kennan, Lyon, Pinfield, & Sbaffi, 2019; Khalsa, Cotroneo, & Wu, 2018; Polona, 2019; Wu, Psomopoulos, Khalsa, & de Waard, 2019).…”
Section: Introductionmentioning
confidence: 99%