2007 8th IEEE/ACM International Conference on Grid Computing 2007
DOI: 10.1109/grid.2007.4354139
|View full text |Cite
|
Sign up to set email alerts
|

Efficient access to many small files in a filesystem for grid computing

Abstract: Abstract-Many potential users of grid computing systems have a need to manage large numbers of small files. However, computing and storage grids are generally optimized for the management of large files. As a result, users with small files achieve performance several orders of magnitude worse than possible. Archival tools and custom storage structures can be used to improve small-file performance, but this requires the end user to change the behavior of the application, which is not always practical. To addres… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2008
2008
2019
2019

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 18 publications
(16 citation statements)
references
References 21 publications
0
16
0
Order By: Relevance
“…Once acknowledgement for transfer completion is received, another file transfer can be requested. This adds at least three additional round-trip-times over the control channel [8,23]. The data channel stays idle while waiting for the next transfer command to be issued.…”
Section: Climate Data Distribution Over 100gbpsmentioning
confidence: 99%
See 1 more Smart Citation
“…Once acknowledgement for transfer completion is received, another file transfer can be requested. This adds at least three additional round-trip-times over the control channel [8,23]. The data channel stays idle while waiting for the next transfer command to be issued.…”
Section: Climate Data Distribution Over 100gbpsmentioning
confidence: 99%
“…This puts extra burden on filesystem access and network transfer protocols. An important challenge in dealing with climate data movement is the lots-of-small-files problem [15,23,8]. Most of the end-to-end data transfer tools are designed for moving large data files.…”
Section: Motivationmentioning
confidence: 99%
“…3, the client tools use iRODS API to call relevant iRODS operations at the lowest level to handle I/O requests of an application. Petashell uses an existing open-source software, Parrot [19], to catch system I/O calls of an application and to match them with the respective iRODS I/O calls. On the other hand, petafs pretends as a filesystem to handle system I/O calls and maps filesystem calls to respective the iRODS calls through a special interface called FUSE.…”
Section: Client Toolsmentioning
confidence: 99%
“…In this paper, we provide a broad overview of all aspects of the Chirp distributed filesystem, building on several previous publications [44,27,43,32,47] that introduced different technical elements independently. Section 2 discusses the unique properties needed for a grid filesystem.…”
Section: Introductionmentioning
confidence: 99%