Proceedings 2000 International Conference on Computer Design
DOI: 10.1109/iccd.2000.878264
|View full text |Cite
|
Sign up to set email alerts
|

Fast subword permutation instructions using omega and flip network stages

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
39
0

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 39 publications
(39 citation statements)
references
References 5 publications
0
39
0
Order By: Relevance
“…This is a significant result since previously arbitrary n-bit bit permutations took O(n) cycles. Even with our recent proposals of permutation instructions [5][9] [10][11] [12], this took at least O(log(n)) cycles. We show how a different 64-bit dynamically-specified permutation can be achieved every cycle by a 4-way superscalar processor with datarich MOMR execution.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…This is a significant result since previously arbitrary n-bit bit permutations took O(n) cycles. Even with our recent proposals of permutation instructions [5][9] [10][11] [12], this took at least O(log(n)) cycles. We show how a different 64-bit dynamically-specified permutation can be achieved every cycle by a 4-way superscalar processor with datarich MOMR execution.…”
Section: Discussionmentioning
confidence: 99%
“…Several approaches were proposed. The CROSS [9] and OMFLIP [10] permutation instructions each performs the equivalent function of two stages of a "virtual" interconnection network. A sequence of log(n) CROSS or OMFLIP instructions can build a 2log(n)-stage virtual network that can achieve any one of the n!…”
Section: Past Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In one method, individual bits of the source datum are selected and shifted to their destination locations using a series of logical AND, logical OR, and shift instructions [18]. For an arbitrary permutation of the bits in an n-bit word, this procedure requires as many as 4n instructions.…”
Section: Past Workmentioning
confidence: 99%
“…Lookup tables can also be employed to perform permutations with repetitions in software [18]. First, the n-bit source datum is divided into x groups of bits; each group is used to index a unique lookup table.…”
Section: Past Workmentioning
confidence: 99%