1992
DOI: 10.1145/144965.145794
|View full text |Cite
|
Sign up to set email alerts
|

Interlock collapsing ALU for increased instruction-level parallelism

Abstract: Simulation results are presented for machine implementations using a novel integer ALL7 design that allows parallel execution, in a single cycle, of two interlocked instructions, which because of true data dependency must normally be executed sequentially. This parallel execution is achieved by collapsing the execution interlocks between integer ALU operations as well as between address generation operations, but without increasing the cycle time of the base implementation. Results demonstrate that in integer … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2000
2000
2012
2012

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 16 publications
(3 citation statements)
references
References 14 publications
0
3
0
Order By: Relevance
“…"Fused" ALUs that perform two or more dependent operations at a time increase the amount of computation relative to operand shuffling, such as the common fused multiply-accumulate unit and interlock collapsing ALUs [55,56]. Other proposals cluster instructions together to reduce the communication of operand values to instructions outside the group [57,58].…”
Section: Alus and Bypassingmentioning
confidence: 99%
“…"Fused" ALUs that perform two or more dependent operations at a time increase the amount of computation relative to operand shuffling, such as the common fused multiply-accumulate unit and interlock collapsing ALUs [55,56]. Other proposals cluster instructions together to reduce the communication of operand values to instructions outside the group [57,58].…”
Section: Alus and Bypassingmentioning
confidence: 99%
“…In this dissertation, the leading non-speculative data dependency collapsing technique we focus on is instruction fusion. Instruction fusion is a technique used to execute two data dependent instructions within the same time interval through the use of a specialized execution unit which can execute two dependent instructions in a single time interval [30]. Instruction fusion in micro-architecture is well utilized.…”
Section: Non-speculative Collapsing Of Data Dependenciesmentioning
confidence: 99%
“…Initial experiments with the baseline ll unit design indicated that shadow cache lines were nalized too quickly on general-purpose codes due to RAW dependencies among integer instructions. An important design technique in increasing the performance of general-purpose codes is the use of compound functional units such as the cascaded half-cycle integer ALUs (as done in the triple-issue TI SuperSPARC [2]) and fused 3-operand functional units [9]. (Special handling of dependent integer instructions is also an important part of the MIPS R4000 superpipeline design, where an ALU result can be produced every internal cycle [8].)…”
Section: Extended Fill Unit Designsmentioning
confidence: 99%