2010
DOI: 10.1145/1743546.1743568
|View full text |Cite
|
Sign up to set email alerts
|

Managing scientific data

Abstract: 68 communications of th e ac m | j u n e 2 0 1 0 | vo l. 5 3 | n o. 6 contributed articles DATA-orienTeD sC i e nT i f iC P ro C es se s depend on fast, accurate analysis of experimental data generated through empirical observation and simulation. However, scientists are increasingly overwhelmed by the volume of data produced by their own experiments. With improving instrument precision and the complexity of the simulated models, data overload promises to only get worse. The inefficiency of existing database m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
55
0

Year Published

2012
2012
2020
2020

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 79 publications
(55 citation statements)
references
References 18 publications
0
55
0
Order By: Relevance
“…Using vocabularies in 69 vocabulary governance groups to request new concepts, rather than attempting to add new 75 concepts to existing vocabularies at the end of a project, when timescales for completion of 76 work are often compressed, and may be insufficient to allow for requests to external 77 agencies to be processed. Once the desired vocabularies have been selected, development 78 of automated methods for tagging datasets provides the advantages of minimising the time 79 required for tagging and increasing the accuracy with which it is carried out, since it reduces 80 human error (Ailamaki et al 2010). Data can also be tagged at point of source i.e.…”
Section: Sector 59 60mentioning
confidence: 99%
“…Using vocabularies in 69 vocabulary governance groups to request new concepts, rather than attempting to add new 75 concepts to existing vocabularies at the end of a project, when timescales for completion of 76 work are often compressed, and may be insufficient to allow for requests to external 77 agencies to be processed. Once the desired vocabularies have been selected, development 78 of automated methods for tagging datasets provides the advantages of minimising the time 79 required for tagging and increasing the accuracy with which it is carried out, since it reduces 80 human error (Ailamaki et al 2010). Data can also be tagged at point of source i.e.…”
Section: Sector 59 60mentioning
confidence: 99%
“…Editing and updating of data also generates data. Produced data are schema-less, semi or fully structured persisting in different repositories [5]. According to some sources [2], the data volumes are approximately doubling each year.…”
Section: Introductionmentioning
confidence: 99%
“…The major software companies, such as Google, Amazon, Microsoft or Facebook have adapted their architectures in order to support the enormous quantity of information that they have to manage. Scientific applications are also struggling with those kinds of scenarios and significant research efforts are directed to deal with it [4]. An example of these applications is the management of astronomical catalogs; for instance those generated by the Dark Energy Survey (DES) [1] project with which we are collaborating.…”
Section: Introductionmentioning
confidence: 99%