Horizon is the name currently being used to refer to a shared-memory Multiple Instruction stream - Multiple Data stream (MIMD) computer architecture under study by independent groups at the Supercomputing Research Center and at Tera Computer Company. Its performance target is a sustained rate of 100 giga (10¹¹) Floating Point Operations Per Second (FLOPS). Horizon achieves this speed with a few hundred identical scalar processors. Each processor has a horizontal instruction set that allows the production of one or more floating point results per cycle without resorting to vector operations. Memory latency is hidden, assuming enough parallelism is available, by allowing processors to switch context on each machine cycle.

In this overview, the Horizon architecture is introduced and its performance is estimated. The processor instruction set and a simple programming example are given. Additional details on the processor architecture, interconnection network design, performance analyses, machine simulator, compiler development, and application studies can be found in companion papers.
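The stated target can be sanity-checked with simple arithmetic. The sketch below is a back-of-envelope calculation only; the processor count and results-per-cycle figures are assumed for illustration (the text says only "a few hundred" processors and "one or more" results per cycle), not published Horizon parameters.

```python
# Back-of-envelope check of the 100 GFLOPS sustained target.
# All specific figures below are illustrative assumptions.
TARGET_FLOPS = 100e9       # sustained target: 100 giga-FLOPS
NUM_PROCESSORS = 256       # "a few hundred" -- assumed value
RESULTS_PER_CYCLE = 2      # "one or more" per cycle -- assumed value

# Required per-processor rate, and the clock rate that would deliver it
# at the assumed number of results per cycle.
per_proc_flops = TARGET_FLOPS / NUM_PROCESSORS           # ~390 MFLOPS each
required_clock_hz = per_proc_flops / RESULTS_PER_CYCLE   # ~195 MHz

print(f"per-processor rate: {per_proc_flops / 1e6:.0f} MFLOPS")
print(f"implied clock:      {required_clock_hz / 1e6:.0f} MHz")
```

Under these assumptions each scalar processor must sustain a few hundred MFLOPS, which is why the horizontal instruction set's ability to produce multiple results per cycle matters.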
I. Design Philosophy

Shared-memory MIMD computers that can be utilized effectively are difficult to implement. The principal difficulty is the latency associated with memory access and its consequences for processor performance. If many processors are sharing memory, propagation delays in the interconnection network, through either logic circuitry or wiring, will limit the minimum latency attainable. There are at least two ways of lowering the effective latency, each of which should be employed to approach a minimum: latency reduction and latency hiding. Latency reduction is accomplished by arranging a processor's memory accesses so that most of them are to locations that are both spatially and temporally nearby. Caches, such as those used in Cedar and the IBM RP3 [1, 2], are a very popular and effective device for latency reduction. Latency hiding is brought about by introducing virtual processors and additional parallelism. While a virtual processor waits for a memory request, the physical processor switches to another task and continues to compute, as in HEP [3]. Memory requests en route from a processor to a memory, or vice versa, in a shared-memory system will be referred to as messages.

The time needed to switch between tasks is important to the programmer because it determines the maximum message rate that the system will support. If a decomposition of a problem into parallel parts results in too few instructions executed per message sent, then the performance of each physical processor in the system will be limited by the peak message rate. In such cases, the system overhead of a virtual processor implementation is too great for the proposed problem decomposition, and another decomposition should be sought that requires less virtual processor switching. In other words, systems with lightweight (easily switched) virtual processors are more generally applicable than are systems with heavyweight (slow to switch) virtual processors.
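The latency-hiding argument above can be quantified with Little's law: if each virtual processor keeps one memory message outstanding, the physical processor stays busy only when the number of ready tasks is at least the message latency divided by the work done per message. The sketch below illustrates this relationship; the latency and instructions-per-message figures are assumed for illustration, not Horizon specifications.

```python
from math import ceil

def virtual_processors_needed(latency_cycles, instructions_per_message):
    """Minimum number of concurrent tasks so the physical processor never stalls.

    Each task does `instructions_per_message` cycles of work and then waits
    `latency_cycles` for a memory message to return.  By Little's law, the
    required concurrency is latency / work-per-message, rounded up.
    """
    return ceil(latency_cycles / instructions_per_message)

# Assumed example: 128-cycle round-trip memory latency, one message issued
# per 4 instructions of work.
print(virtual_processors_needed(128, 4))
```

This also shows the other side of the trade-off the text describes: if a decomposition issues messages too frequently (small `instructions_per_message`), the required task count grows and performance becomes bounded by the peak message rate rather than by the processors.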