Proceedings of the Seventh Workshop on Hot Topics in Operating Systems
DOI: 10.1109/hotos.1999.798371
|View full text |Cite
|
Sign up to set email alerts
|

The case for efficient file access pattern modeling

Abstract: Most modern I/O systems treat each file access independently. However, events in a computer system are driven by programs. Thus, accesses to files occur in consistent patterns and are by no means independent. The result is that modern I/O systems ignore useful information.Using traces of file system activity we show that file accesses are strongly correlated with preceding accesses. In fact, a simple last-successor model (one that predicts each file access will be followed by the same file that followed the la… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
63
0
1

Publication Types

Select...
3
3
3

Relationship

1
8

Authors

Journals

citations
Cited by 59 publications
(64 citation statements)
references
References 9 publications
0
63
0
1
Order By: Relevance
“…Kroeger and Long [30] study several spatial access pattern modelings techniques, some of which are inspired by text compression algorithms such as variants of PPM (prediction by partial matching). The contexts (or symbols) used in these models are parameters of system-level I/O calls (i.e., file name, offset, size, etc.).…”
Section: ) Spatial Predictionsmentioning
confidence: 99%
“…Kroeger and Long [30] study several spatial access pattern modelings techniques, some of which are inspired by text compression algorithms such as variants of PPM (prediction by partial matching). The contexts (or symbols) used in these models are parameters of system-level I/O calls (i.e., file name, offset, size, etc.).…”
Section: ) Spatial Predictionsmentioning
confidence: 99%
“…Second, a dataset collected directly from the visualization software, VisIt, integrated with the ADIOS provenance module, ADIOS-P. The DFSTrace consists of 4 different datasets collected from the different machines, barber, ives, dvorak, and mozart, each of which has unique file access characteristics [19]; barber has the highest rate of system calls per second, dvorak has the highest percentage of write activity, ives the largest number of users, and mozart a typical desktop work-station.…”
Section: Resultsmentioning
confidence: 99%
“…In the file system ares, a series of Partitioned Context Modeling (PCM) based schemes [19][20][21] have been studied for sequence file prediction for IO prefetching. AMP [22] and TaP [23] have been proposed for sequence prediction.…”
Section: B Probabilistic Latent Semantic Analysis (Plsa)mentioning
confidence: 99%
“…Based on the previous research, Long and coworkers developed a serial of successor-based predictive prefetching algorithms in their efforts to advance the prefetching accuracy while maintaining a reasonable performance gain [24], [25], [26]. The features of these predictors are summarized below as they are state of the art and are most relevant to our design.…”
Section: Related Workmentioning
confidence: 99%