A distributed timing synchronization technique for parallel multi-core instruction-set simulation

Wu, Meng-Huan; Fu, Cheng-Yang; Wang, Peng-Chih; Tsay, Ren-Song

doi:10.1145/2435227.2435250

Cited by 5 publications

(1 citation statement)

References 38 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Wu et al proposed a distributed timing synchronization technique [24] based on predicting future synchronization points. This technique addresses the errors caused by shared data access.…”

Section: Related Workmentioning

confidence: 99%

Parallel GPU Architecture Simulation Framework Exploiting Architectural-Level Parallelism with Timing Error Prediction

Lee

2016

IEEE Trans. Comput.

View full text Add to dashboard Cite

The performance analysis and study of large-scale many-core processor architectures require fast and highly accurate simulation techniques in order to reduce time consumption. State-of-the-art graphics processing units (GPUs), which are used extensively as coprocessors in the high-performance-computing area, also require fast simulation techniques because they have massively complex microarchitectures with thousands of processing elements. At present, however, GPU simulators do not have sufficient simulation speed for advanced software and architecture studies. In this study, we propose a new parallel simulation framework and a new parallel simulation technique for improving the simulation speed of GPUs. The proposed framework facilitates multithreaded simulation by exploiting the architectural-level parallelism and execution model parallelism of GPUs. In addition, an error predictive synchronization scheme based on a timing error prediction mechanism is used to minimize the cycle errors and simulator slowdown during parallel simulations. The experimental results obtained using a simulator with the proposed framework showed that the proposed technique provided a speedup of up to 8.9 times compared with an existing single-thread-based GPU simulator on a 16-core machine.

show abstract

“…Wu et al proposed a distributed timing synchronization technique [24] based on predicting future synchronization points. This technique addresses the errors caused by shared data access.…”

Section: Related Workmentioning

confidence: 99%

Parallel GPU Architecture Simulation Framework Exploiting Architectural-Level Parallelism with Timing Error Prediction

Lee

2016

IEEE Trans. Comput.

View full text Add to dashboard Cite

show abstract

On a maximally permissive deadlock prevention policy for automated manufacturing systems by using resource-oriented Petri nets

Chen

et al. 2019

ISA Transactions

View full text Add to dashboard Cite

Highly Efficient and Effective Approach for Synchronization-Function-Level Parallel Multicore Instruction-Set Simulations

Chang

Pai

et al. 2015

IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst.

View full text Add to dashboard Cite

A distributed timing synchronization technique for parallel multi-core instruction-set simulation

Cited by 5 publications

References 38 publications

Parallel GPU Architecture Simulation Framework Exploiting Architectural-Level Parallelism with Timing Error Prediction

Parallel GPU Architecture Simulation Framework Exploiting Architectural-Level Parallelism with Timing Error Prediction

On a maximally permissive deadlock prevention policy for automated manufacturing systems by using resource-oriented Petri nets

Highly Efficient and Effective Approach for Synchronization-Function-Level Parallel Multicore Instruction-Set Simulations

Contact Info

Product

Resources

About