Search citation statements
Paper Sections
Citation Types
Year Published
Publication Types
Relationship
Authors
Journals
As part of the HORIZON project currently underway at the Supercomputing Research Center, a set of application programs are being written and their performance is being evaluated. This paper discusses two of these applications: the fast Fourier transform and sparse matrix computations. For both problems, we develop efficient implementations that take advantage of HORIZON'S many unique features, including fined-grained synchronization, multiple instruction streams, and a horizontal instruction set. Our results indicate that HORIZON will offer very high performance, sometimes in excess of a sustained rate of one floating point operation per tick. IntroductionThe HORIZON supercomputer is a 256 processor shared memory MIMD machine that is currently being designed and evaluated at the Supercomputing Research Center. As part of the evaluation of HORIZON, a set of application programs are being written and simulated to determine their potential performance. These application programs have been chosen to represent a variety of computationally intensive tasks that normally demand supercomputer performance.In this paper, we give a flavor of these application studies by discussing work that has been done to date on the fast Fourier transform and sparse matrix computations. Our results indicate that HORIZON will indeed be a high performance machine capable of outperforming existing supercomputers by at least two, possibly even three orders of magnitude.For both problems, we show how to map efficient algorithms effectively onto HORIZON in order to obtain high performance. A key part of this mapping process is the creation of multiple Istreams for each HORIZON processor, which allows the delay in accessing memory to be hidden in the algorithms. We also discuss how to take advantage of some of the novel aspects of the HOR-IZON instruction set in the code for our algorithms.For the fast Fourier transform, we have implemented an algorithm based on the radix4 Stockham variant that promises excellent performance. At each stage of the algorithm, multiple Istreams perform independent radix4 computations. Each I-stream has enough registers to support a radix-4 computation without spilling to memory in the middle, which increases performance
As part of the HORIZON project currently underway at the Supercomputing Research Center, a set of application programs are being written and their performance is being evaluated. This paper discusses two of these applications: the fast Fourier transform and sparse matrix computations. For both problems, we develop efficient implementations that take advantage of HORIZON'S many unique features, including fined-grained synchronization, multiple instruction streams, and a horizontal instruction set. Our results indicate that HORIZON will offer very high performance, sometimes in excess of a sustained rate of one floating point operation per tick. IntroductionThe HORIZON supercomputer is a 256 processor shared memory MIMD machine that is currently being designed and evaluated at the Supercomputing Research Center. As part of the evaluation of HORIZON, a set of application programs are being written and simulated to determine their potential performance. These application programs have been chosen to represent a variety of computationally intensive tasks that normally demand supercomputer performance.In this paper, we give a flavor of these application studies by discussing work that has been done to date on the fast Fourier transform and sparse matrix computations. Our results indicate that HORIZON will indeed be a high performance machine capable of outperforming existing supercomputers by at least two, possibly even three orders of magnitude.For both problems, we show how to map efficient algorithms effectively onto HORIZON in order to obtain high performance. A key part of this mapping process is the creation of multiple Istreams for each HORIZON processor, which allows the delay in accessing memory to be hidden in the algorithms. We also discuss how to take advantage of some of the novel aspects of the HOR-IZON instruction set in the code for our algorithms.For the fast Fourier transform, we have implemented an algorithm based on the radix4 Stockham variant that promises excellent performance. At each stage of the algorithm, multiple Istreams perform independent radix4 computations. Each I-stream has enough registers to support a radix-4 computation without spilling to memory in the middle, which increases performance
ICASE INTERIM REPORTS ICASE has introduced a new report series to be called ICASE lntcrim Reports. The series will complement the more familiar blue ICASE reports that have becn distributed for many years. The blue reports are intended as preprints of research that has been submitted for publication in either rcferccd journals or conference proceedings. In general, the green Interim Report will not be submitted for publication, at least not in its printed form. It will be used for research that has reached a certain level of maturity but needs additional refinement, for technical reviews or position statements, for bibliographies, and for computer software. The Interim Reports will receive the same distribution as the ICASE Reports. They will be available upon request in the future, and they may be referenced in other publications.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.