2008
DOI: 10.1002/cpe.1334

LC‐GRFA: global register file assignment with local consciousness for VLIW DSP processors with non‐uniform register files

Abstract: Embedded processors developed within the past few years have employed novel hardware designs to reduce the ever-growing complexity, power dissipation, and die area. Although a distributed register file architecture is considered to require fewer read/write ports than a traditional unified register file structure, it presents challenges for compilation techniques to generate efficient code for such architectures. This paper presents a novel scheme for register allocation that includes global and l…

Citations: cited by 11 publications (10 citation statements)
References: 99 publications (113 reference statements)
“…Because the PAC DSP has a clustered organization with distributed register files, the data movement between register files in the PAC DSP is classified into intra- and intercluster communications, which means that the common cluster-assignment problem for VLIW processors [24][25][26] cannot be applied directly to the architecture. Also, the complicated communication makes it desirable to use register file assignment in handling communications, including local-conscious global register file assignment [11,12], ping-pong-aware local favorable local register file assignment [6,7], and simulated-annealing local register file assignment [13]. In addition to register file assignment, the copy propagation in the original Open64 compiler has also been improved [29].…”
Section: Platform Infrastructure (mentioning)
confidence: 99%
“…In register file assignment, we have proposed local-conscious global register file assignment [11,12], ping-pong-aware local favorable local register file assignment [6,7], and simulated-annealing local register file assignment [13] to handle the issues.…”
Section: Previous Work (mentioning)
confidence: 99%
“…The ORC frontend helps to generate the intermediate representation, WHIRL, with five representation levels from "very high" to "very low", where various target-independent optimizations are performed, such as control flow optimization, extended basic block (peephole) optimization, integrated global/local scheduling, and loop transformation at the "very low" level. We have developed specific optimization techniques in the backend for PACDSP, including copy propagation for irregular register files [17], optimal local register file assignment based on simulation annealing (SA-LRFA) [18], ping-pong aware & local favorable register file assignment (PALF-LRFA) [19], and local-conscious & global register file assignment (LC-GRFA) [20], etc. LC-GRFA is the most important optimization, which minimizes data communication costs between various registers.…”
Section: Software Development Tools (mentioning)
confidence: 99%