Memory-centric VDF graph transformations for practical FPGA implementation

Milford, Matthew; McAllister, John

doi:10.1109/estimedia.2012.6507023

2012 IEEE 10th Symposium on Embedded Systems for Real-Time Multimedia 2012

DOI: 10.1109/estimedia.2012.6507023

|View full text |Cite

Memory-centric VDF graph transformations for practical FPGA implementation

Matthew Milford

John McAllister

Abstract: Realising memory intensive applications such as image and video processing on FPGA requires creation of complex, multi-level memory hierarchies to achieve real-time performance; however commerical High Level Synthesis tools are unable to automatically derive such structures and hence are unable to meet the demanding bandwidth and capacity constraints of these applications. Current approaches to solving this problem can only derive either single-level memory structures or very deep, highly inefficient hierarchi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2016

Publication Types

Select...

Article1

Relationship

Self Cite1

Independent0

Authors

Journals

Cited by 1 publication

(2 citation statements)

References 18 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…This paper presents an approach to overcoming this limitation. By extending foundation work in [5][6] a constructive [7] approach is presented which derives FPGA RTL accelerators, including all requisite on-chip and off-chip memory storage, from a C kernel specification to meet a user-defined throughput requirement. To the best of the authors' knowledge this is the first reported approach to achieve this capability.…”

mentioning

confidence: 99%

See 1 more Smart Citation

Constructive Synthesis of Memory-Intensive Accelerators for FPGA From Nested Loop Kernels

Milford

McAllister

2016

IEEE Trans. Signal Process.

Self Cite

View full text Add to dashboard Cite

Field programmable gate array are ideal hosts to custom accelerators for signal, image and data processing but demand manual register transfer level design if high performance and low cost are desired. High level synthesis reduces this design burden but requires manual design of complex on-chip and off-chip memory architectures, a major limitation in applications such as video processing. This paper presents an approach to resolve this shortcoming. A constructive process is described which can derive such accelerators, including on and off-chip memory storage from a C description such that a user-defined throughput constraint it met. By employing a novel statement-oriented approach, dataflow intermediate models are derived and used to support simple approaches for on/off-chip buffer partitioning, derivation of custom on-chip memory hierarchies and architecture transformation to ensure user-defined throughput constraints are met with minimum cost. When applied to accelerators for full search motion estimation, matrix multiplication, sobel edge detection and fast fourier transform it is shown how real-time performance up to an order of magnitude in advance of existing commercial HLS tools is enabled whilst including all requisite memory infrastructure. Further, optimisations are presented which reduce the on-chip buffer capacity and physical resource cost by up to 96% and 75% respectively, whilst maintaining real-time performance.

show abstract

mentioning

confidence: 99%

“…Employing streaming interfaces5 Note that, for consistency, no manual manipulation or annotation of input source, nor manual direction, is given to any of the tools compared in this sectionApril 26, 2016 DRAFT…”

mentioning

confidence: 99%

Constructive Synthesis of Memory-Intensive Accelerators for FPGA From Nested Loop Kernels

Milford

McAllister

2016

IEEE Trans. Signal Process.

Self Cite

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Memory-centric VDF graph transformations for practical FPGA implementation

Cited by 1 publication

References 18 publications

Constructive Synthesis of Memory-Intensive Accelerators for FPGA From Nested Loop Kernels

Constructive Synthesis of Memory-Intensive Accelerators for FPGA From Nested Loop Kernels

Contact Info

Product

Resources

About