XPP-VC: A C Compiler with Temporal Partitioning for the PACT-XPP Architecture

Cardoso, João M. P.; Weinhardt, Markus

doi:10.1007/3-540-46117-5_89

Cited by 30 publications

(38 citation statements)

References 3 publications


“…Another benefit of using a coarse-grained architecture is the reduced configuration data size and hence reduced reconfiguration latency which can allow faster context switching. Because of this apparent advantage, researchers have explored a number of ASIC implementations of coarse-grained reconfigurable architectures (CGRAs) [17], [18], [19], [20], [21], [22], [23], [24], [25]. Some key features that enabled these architectures to address signal processing and high performance computing problems more efficiently include: energy efficiency, ease of programming, fast compilation and reconfiguration.…”

Section: Coarse-grained Reconfigurable Devices (mentioning)

confidence: 99%

Are Coarse-Grained Overlays Ready for General Purpose Application Acceleration on FPGAs?

Jain

Maskell

Fahmy

2016

2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing,


“…An alternative approach is loop dissevering, described in [32] as applied to the PACT-XPP CGRA, in which the underlying hardware is reconfigured inside loop bodies as many times at each kernel iteration. Loop dissevering does not need temporary arrays to store intermediate data, but presents a much higher configuration overhead.…”

Section: Partitioning Of Computational Kernels (mentioning)

confidence: 99%

Integrated Kernel Partitioning and Scheduling for Coarse-Grained Reconfigurable Arrays

Ansaloni

Tanimura

Pozzi

et al. 2012

IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst.


Abstract: Coarse-grained reconfigurable arrays (CGRAs) are a promising class of architectures conjugating flexibility and efficiency. Devising effective methodologies to map applications onto CGRAs is a challenging task, due to their parallel execution paradigm and constrained hardware resources. In order to handle complex applications, it is important to devise efficient strategies to partition a kernel into pieces that obey resource constraint and methodologies to schedule them on the underlying hardware. In this paper, we tackle these problems by proposing algorithms to address partitioning based on recursive searches over abstract trees. A novel scheduling strategy is also described that, leveraging differences in delays of various operations, is able to efficiently map operations on CGRA architectures. Experimental evidence on kernels derived from a diverse set of data flow graphs and EEMBC benchmarks demonstrate the efficacy of the described methods, which, when combined, achieve a higher runtime performance on a given mesh size than state-of-the-art approaches (as much as 38% for the benchmark applications considered).
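
The trade-off in the citation statement above (reconfiguring inside the loop body versus buffering intermediate data) can be illustrated with a small simulation. This is only a sketch under stated assumptions: the two stages `stage_a` and `stage_b`, the reconfiguration counters, and the temporary-array baseline are hypothetical and are not taken from XPP-VC or from the citing paper. The sketch assumes the kernel is split into two partitions that cannot be resident on the array at the same time.

```python
def stage_a(v):
    # Hypothetical first partition of the kernel.
    return v * 2


def stage_b(v):
    # Hypothetical second partition of the kernel.
    return v + 1


def run_with_temp_array(data):
    """Baseline (assumed): each partition is configured once and runs over
    the whole input, so intermediate values go into a temporary array."""
    reconfigs = 1                       # load partition A
    temp = [stage_a(v) for v in data]   # intermediate data must be stored
    reconfigs += 1                      # load partition B
    out = [stage_b(t) for t in temp]
    return out, reconfigs, len(temp)


def run_reconfigured_in_loop(data):
    """In-loop reconfiguration: both partitions are loaded in turn in every
    iteration, so no temporary array is needed."""
    reconfigs = 0
    out = []
    for v in data:
        reconfigs += 1                  # load partition A for this iteration
        t = stage_a(v)                  # intermediate value stays a scalar
        reconfigs += 1                  # load partition B for this iteration
        out.append(stage_b(t))
    return out, reconfigs, 0


if __name__ == "__main__":
    sample = list(range(8))
    print(run_with_temp_array(sample))
    print(run_reconfigured_in_loop(sample))
```

Both functions compute the same result. The first pays for a temporary array as long as the input, while the second pays two reconfigurations per iteration, which matches the "much higher configuration overhead" noted in the statement.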


“…Typically the applications which belong to the application domain of CGRAs are characterized by high data transfer rate between the processor and the memory. Only a few approaches ([4], [6], [7], [8], and [9]) have been followed to tackle the problem of the limited memory bandwidth in CGRAs for exploiting the hardware parallelism.…”

Section: Related Work (mentioning)

confidence: 99%

“…A series of vertical and horizontal buses establish communication among the PEs while for storing the intermediate data values shared memory banks exist on the left and the right side of each array's row. To reduce the number of memory accesses, the compiler [7] only reads one element per iteration and generates shift registers to store the data reuse values when array references inside loops read subsequent element positions.…”

Section: Related Work (mentioning)

confidence: 99%

Exploring the design space of an optimized compiler approach for mesh-like coarse-grained reconfigurable architectures

Dimitroulakos

Galanis

Goutis

2006

Proceedings 20th IEEE International Parallel & Distributed Processing Symposium
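
The data-reuse technique mentioned in the second statement above (one memory read per iteration, with a shift register holding the previously read elements) can be sketched as follows. This is an illustrative model only: the windowed-sum kernel, the function names, and the read counters are assumptions made for the example and are not the code the compiler generates.

```python
from collections import deque


def windowed_sum_naive(x, taps):
    """Re-reads every element of the window from memory in each iteration."""
    k = len(taps)
    reads = 0
    out = []
    for i in range(len(x) - k + 1):
        acc = 0
        for j in range(k):
            acc += taps[j] * x[i + j]   # one memory read per tap
            reads += 1
        out.append(acc)
    return out, reads


def windowed_sum_shift_register(x, taps):
    """Reads one new element per iteration; a k-entry shift register
    (modelled with a bounded deque) keeps the reused values."""
    k = len(taps)
    window = deque(maxlen=k)            # oldest value is shifted out
    reads = 0
    out = []
    for value in x:
        window.append(value)            # the only memory read this iteration
        reads += 1
        if len(window) == k:            # register is primed
            out.append(sum(t * w for t, w in zip(taps, window)))
    return out, reads


if __name__ == "__main__":
    data = [1, 2, 3, 4, 5]
    print(windowed_sum_naive(data, [1, 1, 1]))
    print(windowed_sum_shift_register(data, [1, 1, 1]))
```

For an input of length n and a window of k elements, the naive version performs k(n - k + 1) reads while the shift-register version performs n, which is the saving in memory accesses the statement refers to.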
