2009
DOI: 10.1007/s11265-009-0355-2
|View full text |Cite
|
Sign up to set email alerts
|

A Multi-Shared Register File Structure for VLIW Processors

Abstract: The available instruction level parallelism allowed by current register file organizations is not always fully exploited by media processors when running a multimedia application. This paper introduces a novel register file organization, called multi-shared register file, that eliminates this superfluous instruction scheduling flexibility by reducing the number of read and write ports and partitioning the register file in a special ring structure. A parameterized generic VLIW architecture is used to explore di… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2010
2010
2024
2024

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 10 publications
(2 citation statements)
references
References 38 publications
(51 reference statements)
0
2
0
Order By: Relevance
“…The generic VLIW archi tecture allows to implement different kinds of monolithic and partitioned RF configurations in order to study the architecture trade-offs between performance and hard ware cost. In general, partitioned RF configurations are desirable in VLIW architectures with a high number of issue-slots [14]. VLIW architectures specially designed for processing video applications are optimized to take advantage of the high 459 A mechanism, called X4 operation mode, can be used to concurrently use four identical FUs without increasing the number of instructions to decode.…”
Section: Vliw Architecture Optimizationsmentioning
confidence: 99%
“…The generic VLIW archi tecture allows to implement different kinds of monolithic and partitioned RF configurations in order to study the architecture trade-offs between performance and hard ware cost. In general, partitioned RF configurations are desirable in VLIW architectures with a high number of issue-slots [14]. VLIW architectures specially designed for processing video applications are optimized to take advantage of the high 459 A mechanism, called X4 operation mode, can be used to concurrently use four identical FUs without increasing the number of instructions to decode.…”
Section: Vliw Architecture Optimizationsmentioning
confidence: 99%
“…The benefit of this concept is that the FU utilization increases because all data register can be swapped in a cycle. Otherwise, a respective VLIW approach would require several of read and write ports on a state-of-the-art register file or sophisticated optimization affecting also the code translation process [15]. However, the interconnects between the FPE registers must also be taken into account at the code translation process.…”
Section: B Data Path Organizationmentioning
confidence: 99%