2006 International Workshop on Computer Architecture for Machine Perception and Sensing 2006
DOI: 10.1109/camp.2007.4350352
|View full text |Cite
|
Sign up to set email alerts
|

Customizing CPU Instructions for Embedded Vision Systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2008
2008
2023
2023

Publication Types

Select...
4
4

Relationship

3
5

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 9 publications
0
5
0
Order By: Relevance
“…All the components are implemented in VHDL and without DSP to increase clock working frequency. Half and single precision floating point numeric formats are used in the same way as in [19] and all the units are built with the help of the FloPoCo library [20].…”
Section: Results Of Implementationmentioning
confidence: 99%
“…All the components are implemented in VHDL and without DSP to increase clock working frequency. Half and single precision floating point numeric formats are used in the same way as in [19] and all the units are built with the help of the FloPoCo library [20].…”
Section: Results Of Implementationmentioning
confidence: 99%
“…global and shared_fusion are declined in both 32-bit (F 32 ) and 16-bit (F 16 ) floating point versions. As shown in [28] the use of F 16 is sufficient for optical flow. Furthermore, the original implementation in [10] already uses F 16 for parts of the algorithm on GPU.…”
Section: Cuda Gpu Optimisationsmentioning
confidence: 99%
“…Adding such an instruction has been studied into [29]. [37] for specific domain application [38] but also new dedicated blocks. With a compiler like C2H for Altera FPGA or DIME-C for Xilinx, a complete C function can be compiled into a VHDL block and be directly called inside a C code.…”
Section: Swar Enhancementmentioning
confidence: 99%
“…Such hardware implementation can be much more faster than the sequential execution of the instructions that compose it, as no more ''register to register'' stage is required at each cycle like it is the case for pipeline execution. One of the best example of processor customization (not softcore but ASIP) is the Tensilica Xtensa architecture [37].…”
Section: Swar Enhancementmentioning
confidence: 99%