Proceedings of the 2nd Conference on Computing Frontiers 2005
DOI: 10.1145/1062261.1062291
|View full text |Cite
|
Sign up to set email alerts
|

Matrix register file and extended subwords

Abstract: In this paper we employ two techniques suitable for embedded media processors. The first technique, extended subwords, uses four extra bits for every byte in a media register. This allows many SIMD operations to be performed without overflow and avoids packing/unpacking conversion overhead because of mismatch between storage and computational formats. The second technique, the Matrix Register File (MRF), allows flexible row-wise as well as column-wise access to the register file. It is useful for many block-ba… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2006
2006
2024
2024

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 21 publications
(2 citation statements)
references
References 19 publications
0
2
0
Order By: Relevance
“…The architecture in [ 26 ] is difficult to run at very high frequency due to the need to connect modules at a long distance, the high fan-out degree, and the complex wiring. The architecture in [ 27 , 28 ] does not really support multiple inputs and outputs at the same time.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The architecture in [ 26 ] is difficult to run at very high frequency due to the need to connect modules at a long distance, the high fan-out degree, and the complex wiring. The architecture in [ 27 , 28 ] does not really support multiple inputs and outputs at the same time.…”
Section: Methodsmentioning
confidence: 99%
“…Each computing unit adds the accumulated value of its own through the number of 1s obtained from multiple inputs, so as to calculate the accumulated histogram. The second type of multi-input and -output architecture, such as the multi-BRAM port architecture [ 27 , 28 ], can read or write in parallel, but not simultaneously. In the case of parallel writing, statistics are performed through multiple BRAM ports, and each port inputs a part of the image; then, the generated histograms are added together to generate a complete histogram.…”
Section: Related Workmentioning
confidence: 99%