Abstract:Today computers have several levels of memory hierarchy. To obtain good performance on these processors it is necessary to design algorithms that minimize I/O traffic to slower memories in the hierarchy. In this paper, we study the computation of option pricing using the binomial and trinomial models on processors with a multilevel memory hierarchy. We derive lower bounds on memory traffic between different levels of hierarchy for these two models. We also develop algorithms for the binomial and trinomial mode… Show more
“…We then provide an algorithm to pebble − using pebbles that requires roughly half the I/O needed by previously described algorithms [8]. We also provide a lower bound that is twice the previous best known lower bound for the same problem [8]. With these improvements, one can prove that the pebbling scheme presented here does no more than twice the I/O required by an optimal pebbling scheme.…”
Section: Introductionmentioning
confidence: 88%
“…In [8] the authors derived lower bounds for memory traffic at different levels of memory hierarchy for ( ) and ( ) . The technique used in the paper is based on the concept of a -span of the DAG [3].…”
Section: Introductionmentioning
confidence: 99%
“…( ) and ( ) are sub families of this family. We then provide an algorithm to pebble − using pebbles that requires roughly half the I/O needed by previously described algorithms [8]. We also provide a lower bound that is twice the previous best known lower bound for the same problem [8].…”
Abstract. Modern computers have several levels of memory hierarchy. To obtain good performance on these processors it is necessary to design algorithms that minimize I/O traffic to slower memories in the hierarchy. In this paper, we present I/O efficient algorithms to pebble -pyramids and derive lower bounds on the number of I/O steps to do so. Thepyramid graph models financial applications which are of practical interest and where minimizing memory traffic can have a significant impact on cost saving.
“…We then provide an algorithm to pebble − using pebbles that requires roughly half the I/O needed by previously described algorithms [8]. We also provide a lower bound that is twice the previous best known lower bound for the same problem [8]. With these improvements, one can prove that the pebbling scheme presented here does no more than twice the I/O required by an optimal pebbling scheme.…”
Section: Introductionmentioning
confidence: 88%
“…In [8] the authors derived lower bounds for memory traffic at different levels of memory hierarchy for ( ) and ( ) . The technique used in the paper is based on the concept of a -span of the DAG [3].…”
Section: Introductionmentioning
confidence: 99%
“…( ) and ( ) are sub families of this family. We then provide an algorithm to pebble − using pebbles that requires roughly half the I/O needed by previously described algorithms [8]. We also provide a lower bound that is twice the previous best known lower bound for the same problem [8].…”
Abstract. Modern computers have several levels of memory hierarchy. To obtain good performance on these processors it is necessary to design algorithms that minimize I/O traffic to slower memories in the hierarchy. In this paper, we present I/O efficient algorithms to pebble -pyramids and derive lower bounds on the number of I/O steps to do so. Thepyramid graph models financial applications which are of practical interest and where minimizing memory traffic can have a significant impact on cost saving.
“…Several works followed Hong & Kung's work on I/O complexity in deriving lower bounds on data accesses [2,1,18,6,5,23,24,19,20,29,13,3,4,8,28,26]. Aggarwal et al provided several lower bounds for sorting algorithms [2].…”
Section: Related Workmentioning
confidence: 99%
“…Aggarwal et al provided several lower bounds for sorting algorithms [2]. Savage [23,24] developed the notion of S-span to derive Hong-Kung style lower bounds and that model has been used in several works [19,20,26]. Irony et al [18] provided a new proof of the Hong-Kung result on I/O complexity of matrix multiplication and developed lower bounds on communication for sequential and parallel matrix multiplication.…”
Abstract:Technology trends are making the cost of data movement increasingly dominant, both in terms of energy and time, over the cost of performing arithmetic operations in computer systems. The fundamental ratio of aggregate data movement bandwidth to the total computational power (also referred to the machine balance parameter ) in parallel computer systems is decreasing. It is therefore of considerable importance to characterize the inherent data movement requirements of parallel algorithms, so that the minimal architectural balance parameters required to support it on future systems can be well understood. In this paper, we develop an extension of the well-known red-blue pebble game to develop lower bounds on the data movement complexity for the parallel execution of computational directed acyclic graphs (CDAGs) on parallel systems. We model multi-node multi-core parallel systems, with the total physical memory distributed across the nodes (that are connected through some interconnection network) and in a multi-level shared cache hierarchy for processors within a node. We also develop new techniques for lower bound characterization of non-homogeneous CDAGs. We demonstrate the use of the methodology by analyzing the CDAGs of several numerical algorithms, to develop lower bounds on data movement for their parallel execution.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.