2021
DOI: 10.1007/s10606-021-09407-2
|View full text |Cite
|
Sign up to set email alerts
|

Encoding Collective Knowledge, Instructing Data Reusers: The Collaborative Fixation of a Digital Scientific Data Set

Abstract: This article provides a novel perspective on the use and reuse of scientific data by providing a chronological ethnographic account and analysis of how a team of researchers prepared an astronomical catalogue (a table of measured properties of galaxies) for public release. Whereas much existing work on data reuse has focused on information about data (such as metadata), whose form or lack has been described as a hurdle for reusing data successfully, I describe how data makers tried to instruct users through th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 109 publications
(79 reference statements)
0
7
0
Order By: Relevance
“…All the literature uploaded into GeoDeepShovel is all automatically parsed with Grobid (GRO, 2008-2021 and F I G U R E 8 GeoDeepShovel System overview. GeoDeepShovel consists a PDF prasing module, a backend server including some artificial intelligence models and a interactive graphical user interface.…”
Section: Data Extraction Processmentioning
confidence: 99%
See 2 more Smart Citations
“…All the literature uploaded into GeoDeepShovel is all automatically parsed with Grobid (GRO, 2008-2021 and F I G U R E 8 GeoDeepShovel System overview. GeoDeepShovel consists a PDF prasing module, a backend server including some artificial intelligence models and a interactive graphical user interface.…”
Section: Data Extraction Processmentioning
confidence: 99%
“…For each uploaded document, GeoDeepShovel uses multiple parsing tools (e.g. Grobid (GRO, 2008-2021, Science Parse and PdfFigures 2.0 (Clark & Divvala, 2016b)) to independently extract its meta-information and mix all the information with a voting mechanism. The meta-information of papers (e.g.…”
Section: Metadata Extractionmentioning
confidence: 99%
See 1 more Smart Citation
“…Some researchers noticed and analyzed the problems in team collaboration that existing platforms are facing. Hoeppe [22] mentions that the large scientific databases often require large teams to extract and clean the data, and it is a complex task that needs to be done through CSCW. Schmidt [45] demonstrates that task distribution, allocation, and interrelating of 'distributed individual activities' are some important issues in team collaboration.…”
Section: Team Collaboration In Building a Scientific Databasementioning
confidence: 99%
“…Scientific research is currently advancing faster than ever before, with tens of thousands of scientific studies published every day in various forms such as papers, preprints, and datasets. Scientific knowledge bases [16,23], a collection of structured and verified research results that consists of various numeric, word-oriented, or image-organized data, emerge in this context and bring entirely new approaches and opportunities to scientific research. Researchers in many disciplines uses AI techniques and the scientific knowledge bases, often constructed from the published literature, to drive scientific discoveries [38,45,46], such as Geoscience [10,64], Medicine [9], Biology [3], Chemistry [50].…”
Section: Introductionmentioning
confidence: 99%