Proceedings of the 38th Conference on Design Automation - DAC '01 2001
DOI: 10.1145/378239.379051
|View full text |Cite
|
Sign up to set email alerts
|

High-quality operation binding for clustered VLIW datapaths

Abstract: Clustering is an effective method to increase the available parallelism in VLIW datapaths without incurring severe penalties associated with large number of register file ports. Efficient utilization of a clustered datapath requires careful binding of operations to clusters. The paper proposes a binding algorithm that effectively explores tradeoffs between in-cluster operation serialization and delays associated with data transfers between clusters. Extensive experimental evidence is provided showing that the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2002
2002
2012
2012

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 15 publications
(5 citation statements)
references
References 17 publications
0
5
0
Order By: Relevance
“…While not a heavily researched area, there has been some work on estimate-based approaches for clustering. Lapinskii et al [17] base their estimate off three major factors: the data transfer penalty, FU serialization penalty and bus serialization penalty. They use a local approach like BUG to minimize the data transfer penalties.…”
Section: Related Workmentioning
confidence: 99%
“…While not a heavily researched area, there has been some work on estimate-based approaches for clustering. Lapinskii et al [17] base their estimate off three major factors: the data transfer penalty, FU serialization penalty and bus serialization penalty. They use a local approach like BUG to minimize the data transfer penalties.…”
Section: Related Workmentioning
confidence: 99%
“…While it seems to work well for architectures which communicate only in one direction, the algorithm is not very effective for architectures which can both read from as well as write to the adjacent clusters. Partially motivated by this and partially by Lapinskii et al [2001], we thus bring in an additional iterative improvement phase, by performing a local search around this initial binding (steps 6 to 13).…”
Section: Bindingmentioning
confidence: 99%
“…We have separate schedulers for RF-to-RF type architectures and direct intercluster communication architectures. Although a few scheduling algorithms for clustered architectures have been reported in the literature, they are all specifically targeted toward RF-to-RF type of architectures [Desoli 1998;Lapinskii et al 2001;Sanchez and Gonzalez 2000]. Our scheduling algorithm again is a list scheduling algorithm with the distance of the node from the sink as the heuristic.…”
Section: Final Schedulingmentioning
confidence: 99%
“…In traditional clustered VLIW processors, the notion of clustering is applied to minimize the complexity of the register files (datapath clusters) [36], [37]. In recent years, this notion has also been applied to minimize the complexity of the front end (instruction fetch) in the form of multi-VLIWs [38], [39].…”
Section: Related Workmentioning
confidence: 99%