Fast and Correct Load-Link/Store-Conditional Instruction Handling in DBT Systems

Kristien, Martin; Spink, Tom; Campbell, B. K.; Sarkar, Susmit; Stark, Ian; Franke, Björn; Böhm, Igor; Topham, Nigel

doi:10.1109/tcad.2020.3013048

Cited by 3 publications

(1 citation statement)

References 18 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The approach they propose is conceptually intuitive: it consists of emulating the target CPU's (also called vCPU) atomic instructions using the host CPU's atomic instructions. Said so, this seems relatively straightforward, however slight semantic differences between the target and the host memory models lead to generating complex code snippets that are also difficult to ensure correct [14]. But this is only the tip of the iceberg, and many complicated technical details have to be solved, among which parallel target code generation and caching, choice of the next emulated processor to execute, interprocessor communication, etc.…”

Section: Background and Related Workmentioning

confidence: 99%

To Pin or Not to Pin: Asserting the Scalability of QEMU Parallel Implementation

Badaroux

Miroddi²,

Pétrot

2021

2021 24th Euromicro Conference on Digital System Design (DSD)

View full text Add to dashboard Cite

Due to its speed in cross-executing sequential code, dynamic binary translation is the unchallenged technology for full system-level simulation. Among the translators, QEMU has become the de facto solution. It introduced parallel host execution of the target cores a few years ago for the ARM instruction set architecture and this support is now also available, among others, for RISC-V. Given the popularity of these instruction sets in multi and many-core systems, assessing the scalability of their parallel implementation makes sense. In this paper, we use a subset of the PARSEC benchmark to measure the execution time of QEMU's parallel implementation, to which we added the ability to pin a target processor to a host core or hardware thread. We report the results of a wealth of experiments we performed on a 16-core/32-thread x86-64 SMP machine. They show that the support of parallelism in QEMU scales well, and that, somewhat counter intuitively, pinning does not improve performance.

show abstract

Section: Background and Related Workmentioning

confidence: 99%

To Pin or Not to Pin: Asserting the Scalability of QEMU Parallel Implementation

Badaroux

Miroddi²,

Pétrot

2021

2021 24th Euromicro Conference on Digital System Design (DSD)

View full text Add to dashboard Cite

show abstract

A System-Level Dynamic Binary Translator Using Automatically-Learned Translation Rules

Jiang,

Liang,

Dong

et al. 2024

2024 IEEE/ACM International Symposium on Code Generation and Optimization (CGO)

View full text Add to dashboard Cite

Design of Intelligent Decision System via Computer Aided Translation Software Evaluation under the Background of Internet of Things

Xiang

Zeng

Sun

et al. 2022

ACM Trans. Asian Low-Resour. Lang. Inf. Process.

View full text Add to dashboard Cite

The identification and classification of professional terms of machine translation are studied in this work, to improve the accuracy and professionalism of computer aided translation (CAT) software. Firstly, the current situation and related fields of machine translation are analyzed to summarize the difficulties and shortcomings in machine translation. Secondly, the concept of term is introduced to conduct targeted research on the imbalance problem of terminology classification and recognition in machine translation. Thirdly, a term recognition model based on integrated recognition method is proposed. Finally, the classification accuracy and recall rate of the model are verified using the method of confusion matrix in experiments. The results demonstrate that in comparison of the recall rate, classification accuracy, and f value in different fields, the classification accuracy of network terms by the hybrid method combining the over-sampling method and under-sampling method is the highest of 77%, that of sports terms is the lowest of 71%, and that of economic terms is 74%. Among the recall rate, accuracy rate and f value, the recall rate is the highest, reaching more than 80%, especially for economic terms of 91%. The combination of over-sampling and under-sampling performs better than the under-sampling with playback and under-sampling without playback in terms of term recognition and classification in different fields. Through the classification results before and after integration, it is obvious that the integration of each base classifier not only effectively improves the classification accuracy of terms, but also greatly improves the recall rate. This term recognition model can help CAT software in improving the recognition accuracy of term translation, which has certain practical effects and provides reference for research in related fields.

show abstract

Fast and Correct Load-Link/Store-Conditional Instruction Handling in DBT Systems

Cited by 3 publications

References 18 publications

To Pin or Not to Pin: Asserting the Scalability of QEMU Parallel Implementation

To Pin or Not to Pin: Asserting the Scalability of QEMU Parallel Implementation

A System-Level Dynamic Binary Translator Using Automatically-Learned Translation Rules

Design of Intelligent Decision System via Computer Aided Translation Software Evaluation under the Background of Internet of Things

Contact Info

Product

Resources

About