Proceedings of the 28th International Conference on Software Engineering 2006
DOI: 10.1145/1134285.1134400
|View full text |Cite
|
Sign up to set email alerts
|

A software architecture-based framework for highly distributed and data intensive scientific applications

Abstract: Modern scientific research is increasingly conducted by virtual communities of scientists distributed around the world. The data volumes created by these communities are extremely large, and growing rapidly. The management of the resulting highly distributed, virtual data systems is a complex task, characterized by a number of formidable technical challenges, many of which are of a software engineering nature. In this paper we describe our experience over the past seven years in constructing and deploying OODT… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
59
0

Year Published

2009
2009
2020
2020

Publication Types

Select...
6
3

Relationship

6
3

Authors

Journals

citations
Cited by 78 publications
(59 citation statements)
references
References 19 publications
0
59
0
Order By: Relevance
“…We assess the accuracy of these techniques on eight architectures from six different open-source systems: ArchStudio [16], Bash [13], Hadoop [4], Linux [12], Mozilla [2], and OODT [28]. In the case of two of the systems-Linux and Mozilla-we use a pair of architectures each, at different levels of detail.…”
Section: Introductionmentioning
confidence: 99%
“…We assess the accuracy of these techniques on eight architectures from six different open-source systems: ArchStudio [16], Bash [13], Hadoop [4], Linux [12], Mozilla [2], and OODT [28]. In the case of two of the systems-Linux and Mozilla-we use a pair of architectures each, at different levels of detail.…”
Section: Introductionmentioning
confidence: 99%
“…DRAT is a MapReduce (M/R) style [7] RAT workflow that runs on top of Apache Object Oriented Data Technology (OODT) [8], a scientific data processing, acquisition, and dissemination system. M/R, a methodology for processing a large amount of unstructured data with a parallel, distributed algorithm, consists of two components: "mapper" and "reducer".…”
Section: Drat Architecturementioning
confidence: 99%
“…While we have met with success in applying software architectures to other software built at JPL, such as the Planetary Data System [5], the project described in this paper is the first that incorporates JPL's more recently developed software services built in a componentized fashion with well-defined interfaces, as well as multiple scientific processing codes known as PGEs. These PGEs include a number of legacy radar packages (imaging and interferometry) and trajectory packages.…”
Section: Motivationmentioning
confidence: 99%