You Only Traverse Twice: A YOTT Placement, Routing, and Timing Approach for CGRAs

Canesche, Michael; Carvalho, Westerley; Reis, Lucas; Oliveira, M. M. F.; Magalhães, Salles V. G.; Jamieson, Peter; Nacif, José Augusto M.; Ferreira, Ricardo

doi:10.1145/3477038

Cited by 3 publications

(3 citation statements)

References 34 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…A ferramenta VPR [Murray and et al 2020] gerou uma soluc ¸ão pior do que o Z1000 e a implementac ¸ão GPU. A Tabela 2 compara a acelerac ¸ão no tempo de execuc ¸ão e a qualidade da soluc ¸ão dos algoritmos múltipla travessia (YOTO) [Canesche et al 2020] e YOLT [Canesche et al 2021] com o VPR [Murray and et al 2020]. Os resultados estão normalizados em relac ¸ão ao VPR Bound Box.…”

Section: Resultsunclassified

“…A quinta contribuic ¸ão foi a poda no espac ¸o de busca da das escolhas no CGRA com a verificac ¸ão do grau dos vértices e uma pré-validac ¸ão do posicionamento na proximidade de uma reconvergência, chamada de lookahead. O algoritmo de dupla travessia foi publicado no período ACM Transactions on Embedded Computing Systems [Canesche et al 2021]. A sexta contribuic ¸ão foram a implementac ¸ão do roteamento e de escalonamento.…”

Section: Contribuic ¸õEsunclassified

See 1 more Smart Citation

Algoritmos de Posicionamento e Roteamento baseados em Travessia de Grafo para Arquiteturas Reconfiguráveis de Grão Grosso (CGRA)

Canesche¹,

Ferreira²

2022

Anais Do XXXV Concurso De Teses E Dissertações (CTD 2022)

Self Cite

View full text Add to dashboard Cite

O hardware reconfigurável é flexível, eficiente energeticamente e oferece alto desempenho para vários domínios de aplicação. Os FPGAs são a tecnologia reconfigurável mais utilizada, entretanto o mapeamento da aplicação é NP-Completo. Os CGRAs simplificam o desenvolvimento de aplicações para hardware reconfigurável e alteram a granularidade das operações do nível de bit para o nível de palavras (8-32 bits). Este trabalho apresenta avanços no estado da arte para o problema de mapeamento em CGRA propondo novos algoritmos de travessia de grafos. As soluções alcançadas são ordens de grandeza mais rápidas sem comprometer a qualidade. Os algoritmos propostos foram 91x mais rápidos com uma redução de 1.7x nos recursos de balanceamento em comparação com a abordagem com Simulated Annealing.

show abstract

Section: Resultsunclassified

Section: Contribuic ¸õEsunclassified

Algoritmos de Posicionamento e Roteamento baseados em Travessia de Grafo para Arquiteturas Reconfiguráveis de Grão Grosso (CGRA)

Canesche¹,

Ferreira²

2022

Anais Do XXXV Concurso De Teses E Dissertações (CTD 2022)

Self Cite

View full text Add to dashboard Cite

show abstract

“…Heuristic approaches provide approximate solutions and scalability without the guarantee of optimality. By employing techniques such as local search based on graph level (Ferreira et al, 2005), genetic algorithms (Silva et al, 2006), graph traversal approaches (Canesche et al, 2020(Canesche et al, , 2021, or simulated annealing (Luu et al, 2011;Oliveira et al, 2020a), heuristics can efficiently explore large solution spaces and find good-quality solutions within reasonable time frames.…”

Section: Placement and Routing Heuristic Approachesmentioning

confidence: 99%

Design exploration of machine learning data-flows onto heterogeneous reconﬁgurable hardware

Oliveira

View full text Add to dashboard Cite

This work explores the placement and routing of Machine Learning applications data- ﬂow graphs on different heterogeneous Coarse-Grained Reconﬁgurable Architectures (CGRA). We analyze three different types of processing element (PE) heterogeneity, the ﬁrst concerning the interconnection pattern, the second being on the kind of ope- rations a single PE can execute, and the last concerning the PE buffer resources. This analysis aims to propose a fair reduction to the overall cost in comparison to the ho- mogeneous CGRA architecture. We compare our results with the homogeneous case and one of the state-of-the-art tools for placement and routing (P&R). Our algorithm executed, on average, 52% faster than VPR 8.1 (Versatile Place and Route), which is an open-source academic tool designed for the FPGA placement and routing pha- ses, reaching better mapping in 66% of cases and achieving the same results in 26% of cases. Furthermore, a heterogeneous architecture reduces the cost without losing performance in 76% of the cases considering multiplier heterogeneity. We propose a novel heterogeneous buffer architecture that minimizes the buffer resources by 56.3% for K-means dataﬂow patterns. We also show that a heterogeneous border chess archi- tecture outperforms a homogeneous one. In addition, our mapping reaches optimal instances of single tree dataﬂows compared to classical Lee/Choi and H-Trees. Keywords: Reconﬁgurable architecture. CGRAs. Placement. Routing.

show abstract

GCN-RA: A graph convolutional network-based resource allocator for reconfigurable systems

Mohtavipour,

Shahhoseini

2023

Journal of Computational Science

View full text Add to dashboard Cite

You Only Traverse Twice: A YOTT Placement, Routing, and Timing Approach for CGRAs

Cited by 3 publications

References 34 publications

Algoritmos de Posicionamento e Roteamento baseados em Travessia de Grafo para Arquiteturas Reconfiguráveis de Grão Grosso (CGRA)

Algoritmos de Posicionamento e Roteamento baseados em Travessia de Grafo para Arquiteturas Reconfiguráveis de Grão Grosso (CGRA)

Design exploration of machine learning data-flows onto heterogeneous reconﬁgurable hardware

GCN-RA: A graph convolutional network-based resource allocator for reconfigurable systems

Contact Info

Product

Resources

About