Dual-pipeline heterogeneous ASIP design

Radhakrishnan, S.; Guo, Hui; Parameswaran, Sri

doi:10.1145/1016720.1016727

Cited by 3 publications

(4 citation statements)

References 25 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…An effective parallel instruction scheduling algorithm is used to determine the number of pipelines and the instruction sets to be implemented by each of the pipelines such that the high performance improvement can be achieved with small area overhead. The performance improvement is achieved by specific instructions (the related work and approach can be found in [15]), improved pipeline (with forwarding logics) structure, and parallel instructions executing on the multiple pipelines. The small area overhead is retained by utilizing a distributed controller, minimized instruction set overlap between pipelines, and appropriate number of pipelines.…”

Section: Discussionmentioning

confidence: 99%

“…Using a similar architecture, in [15], the authors presented a design approach for dual-pipeline customized processor.…”

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

“…We enhance the pipeline structure presented in [15] by utilizing a forwarding scheme so that the data hazards in the pipelines can be reduced. Also, unlike in [15], where a single clock cycle penalty for memory accesses is used, we consider different wait cycles for memory access so that the effect of memory access penalty on performance can be observed. Moreover, instead of using small and non-standard benchmarks as in [15], we target the applications from Mibench benchmark suites (popular in embedded system design) in our study.…”

Section: Related Workmentioning

confidence: 99%

See 3 more Smart Citations

Customization of application specific heterogeneous multi-pipeline processors

Radhakrishnan

Guo

Parameswaran

2006

Proceedings of the Design Automation &Amp; Test in Europe Conference

View full text Add to dashboard Cite

show abstract

Section: Discussionmentioning

confidence: 99%

“…Using a similar architecture, in [15], the authors presented a design approach for dual-pipeline customized processor.…”

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

See 2 more Smart Citations

Customization of application specific heterogeneous multi-pipeline processors

Radhakrishnan

Guo

Parameswaran

2006

Proceedings of the Design Automation &Amp; Test in Europe Conference

View full text Add to dashboard Cite

show abstract

Generating ASIPs with Reduced Number of Connections to the Register-File

Asher

Lipov²,

Tartakovsky

et al. 2017

Int J Parallel Prog

View full text Add to dashboard Cite

Generating ASIPs with reduced number of connections to the register-file

Asher

Lipov

Tartakovsky

et al. 2015

2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS)

View full text Add to dashboard Cite

We propose automatic synthesis of application specific instruction set processors (ASIPs). We use pipeline execution of multi-op machine-instructions, e.g., * (reg1 * reg2) = ( * reg3)+ ( * reg4) (C-syntax) an instruction with three memory pipeline stages and two arithmetic stages. The problem is, for a given set of loops, to find a pipeline configuration and a multi-op ISA that maximizes the IPC (instructions per cycle) while minimizing the resource usage and the cost of interconnections to the registerfile of the resulting CPU. The algorithm is based on finding an efficient cover of a large graph by a small set of convex subgraphs gis that are consistent with a given set of pipeline units. Unlike previous works, gis are not synthesized to circuits that are executed in a co-processor mode but rather both gis and the rest of the program are executed by the same set of multiop pipeline units. In this way we eliminate the overhead associated with the co-processor mode of regular ASIPs but maintain high values of IPC of these ASIPs. Once the pipeline configuration and the cover g1 ∪ . . . ∪ gn = G has been computed the Verilog RTL of the corresponding CPU (extended with branch instructions) is generated and synthesized to FPGA. The results show that, for a set of selected kernels, the resulting ASIP (called Ocpu) obtains higher IPC values compare to an equivalent compilation to an ARM cpu while obtaining similar clock frequencies.

show abstract

Dual-pipeline heterogeneous ASIP design

Cited by 3 publications

References 25 publications

Customization of application specific heterogeneous multi-pipeline processors

Customization of application specific heterogeneous multi-pipeline processors

Generating ASIPs with Reduced Number of Connections to the Register-File

Generating ASIPs with reduced number of connections to the register-file

Contact Info

Product

Resources

About