2003
DOI: 10.1016/s0743-7315(03)00036-4
|View full text |Cite
|
Sign up to set email alerts
|

High-performance scientific data management system

Abstract: Many scientific applications have large I/O requirements, in terms of both the size of data and the number of files or data sets. Management, storage, efficient access, and analysis of this data present an extremely challenging task. Traditionally, two different solutions have been used for this task: file I/O or databases. File I/O can provide high performance but is tedious to use with large numbers of files and large and complex data sets. Databases can be convenient, flexible, and powerful but do not perfo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2004
2004
2006
2006

Publication Types

Select...
5
2

Relationship

3
4

Authors

Journals

citations
Cited by 12 publications
(8 citation statements)
references
References 28 publications
0
7
0
Order By: Relevance
“…The system must have capabilities for managing metadata generated and derived from scientific data, a query and analysis capability on these metadata, and the ability to manage a large number of files with associated query functions. The metadata management component entails use of a "lightweight" database system (e.g., MDMS [Chou00,No03]). Such a system is responsible for storing and managing metadata and the results of queries.…”
Section: Ii-32 Query Processing Over Filesmentioning
confidence: 99%
“…The system must have capabilities for managing metadata generated and derived from scientific data, a query and analysis capability on these metadata, and the ability to manage a large number of files with associated query functions. The metadata management component entails use of a "lightweight" database system (e.g., MDMS [Chou00,No03]). Such a system is responsible for storing and managing metadata and the results of queries.…”
Section: Ii-32 Query Processing Over Filesmentioning
confidence: 99%
“…We integrated those replication techniques to GEDAS (Grid Environment-based Data Management System ) [6,7] that is a grid toolkit providing a high-level, user-friendly interface to share the remotely produced data among the grid communities.…”
Section: Introductionmentioning
confidence: 99%
“…Although both parallelization and mapping strategies of parallel jobs have improved system utilization and response time to certain extent, obtaining maximum performance of the parallel programs have remained elusive 131. This is because many scientific and engineering applications have large I/O requirements, in terms of both the size of data and the number of files or data sets [20]. Moreover, regardless of the fact that disk technology has been advancing rapidly especially with respect to the storage density, capacity and bandwidth, I/O subsystem continues to be a major bottleneck in many parallel applications [22] 1241 [dl.…”
Section: Introductionmentioning
confidence: 99%