[1992] Proceedings the 25th Annual International Symposium on Microarchitecture MICRO 25
DOI: 10.1109/micro.1992.697011
|View full text |Cite
|
Sign up to set email alerts
|

Interlock Collapsing ALU For Increased Instruction-level Parallelism

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 19 publications
(8 citation statements)
references
References 10 publications
0
8
0
Order By: Relevance
“…An important design technique in increasing the performance of general-purpose codes is the use of compound functional units such as the cascaded half-cycle integer ALUs (as done in the triple-issue TI SuperSPARC [2]) and fused 3-operand functional units [9]. (Special handling of dependent integer instructions is also an important part of the MIPS R4000 superpipeline design, where an ALU result can be produced every internal cycle [8].)…”
Section: Extended Fill Unit Designsmentioning
confidence: 99%
“…An important design technique in increasing the performance of general-purpose codes is the use of compound functional units such as the cascaded half-cycle integer ALUs (as done in the triple-issue TI SuperSPARC [2]) and fused 3-operand functional units [9]. (Special handling of dependent integer instructions is also an important part of the MIPS R4000 superpipeline design, where an ALU result can be produced every internal cycle [8].)…”
Section: Extended Fill Unit Designsmentioning
confidence: 99%
“…Many DSPs have specialized hardware for common computations in signal and image processing, such as dot product, sum of absolute differences, and compare-select. A number of generalized accelerator designs have also been proposed, such as 3-1 ALUs [22,25], closed-loop ALUs [27], or ALU pipelines [5]. Larger accelerators can support bigger subgraphs and thus enhance the performance advantages.…”
Section: Introductionmentioning
confidence: 99%
“…A mini-graph processor does not rely on latency reducing ALUs [16,21,22,27], but can exploit them. Support for such ALUs depends on the precise manner in which latency reduction is achieved.…”
Section: Alu Pipelinementioning
confidence: 99%