A Synchronization Method for Register Traces of Pipelined Processors

Dreesen, Ralf; Jungeblut, Thorsten; Thies, Michael; Porrmann, Mario; Kastens, Uwe; Rückert, Ulrich

doi:10.1007/978-3-642-04284-3_19

Cited by 3 publications

(1 citation statement)

References 9 publications


“…For the evaluation of the power consumption, switching activities are captured using gate-level simulation. For the validation of the reference specification (instruction set architecture) and of the RTL description (microarchitecture) the validation by simulation approach from Dreesen et al [2009] is used. For the functional verification of the hardware architecture an FPGA-based approach using our rapid prototyping environment RAPTOR [Porrmann et al 2010] is used.…”

Section: The Coreva Architecture (mentioning)

confidence: 99%

A systematic approach for optimized bypass configurations for application-specific embedded processors

Jungeblut, Hübener, Porrmann, et al. 2013

ACM Trans. Embed. Comput. Syst.

Self Cite


The diversity of today's mobile applications requires embedded processor cores with high resource efficiency; that is, the devices should provide high performance at low area requirements and power consumption. The fine-grained parallelism supported by multiple functional units of VLIW architectures offers a high throughput at reasonably low clock frequencies compared to single-core RISC processors. To efficiently utilize the processor pipeline, common system architectures have to cope with data hazards due to data dependencies between consecutive operations. On the one hand, such hazards can be resolved by complex forwarding circuits (i.e., a pipeline bypass) which forward intermediate results to a subsequent instruction. On the other hand, the pipeline bypass can strongly affect or even dominate the total resource requirements and degrade the maximum clock frequency. In this work the CoreVA VLIW architecture is used for the development and the analysis of application-specific bypass configurations. It is shown that many paths of a comprehensive bypass system are rarely used and may not be required for certain applications. For this reason, several strategies have been implemented to enhance the efficiency of the total system by introducing application-specific bypass configurations. The configuration can be carried out statically by only implementing required paths or at runtime by dynamically reconfiguring the hardware. An algorithm is proposed which derives an optimized configuration by iteratively disabling single bypass paths. The adaptation of these application-specific bypass configurations allows for a reduction of the critical path by 26%. As a result, the execution time and energy requirements could be reduced by up to 21.5%. Using Dynamic Frequency Scaling (DFS) and dynamic deactivation/reactivation of bypass paths allows for a runtime reconfiguration of the bypass system. This ensures the highest efficiency while processing varying applications.
ACM Reference Format: Jungeblut, T., Hübener, B., Porrmann, M., and Rückert, U. 2013. A systematic approach for optimized bypass configurations for application-specific embedded processors.
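
The abstract only states that the algorithm iteratively disables single bypass paths; it does not give the cost model or the stopping rule. The following is a minimal sketch of one way such a greedy search could look, not the authors' implementation. The `BypassPath` fields, the additive clock-period model, and the function names are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BypassPath:
    """Hypothetical description of one bypass path (not from the paper)."""
    name: str
    delay_ns: float    # assumed: added to the clock period while enabled
    stall_cycles: int  # assumed: extra stall cycles incurred when disabled


def execution_time(paths, enabled, base_cycles, base_period_ns):
    """Assumed cost model: total cycles multiplied by the clock period."""
    period = base_period_ns + sum(p.delay_ns for p in paths if p.name in enabled)
    cycles = base_cycles + sum(p.stall_cycles for p in paths if p.name not in enabled)
    return cycles * period


def optimize_bypass(paths, base_cycles, base_period_ns):
    """Greedy search: start with all paths enabled and, in each round,
    disable the single path whose removal lowers execution time the most.
    Stop as soon as no single removal gives an improvement."""
    enabled = {p.name for p in paths}
    best = execution_time(paths, enabled, base_cycles, base_period_ns)
    while True:
        candidate = None
        for p in paths:
            if p.name not in enabled:
                continue
            trial = enabled - {p.name}
            t = execution_time(paths, trial, base_cycles, base_period_ns)
            if t < best:
                best, candidate = t, p.name
        if candidate is None:
            return enabled, best
        enabled.remove(candidate)
```

Under this assumed model, a path that is slow but never needed by the application is dropped first, while a path whose removal would cause many stalls stays enabled. A real flow would presumably take the stall counts from simulation of the target application and the delays from synthesis results.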
