2008
DOI: 10.1016/j.sysarc.2008.01.008
|View full text |Cite
|
Sign up to set email alerts
|

A predecoding technique for ILP exploitation in Java processors

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0
1

Year Published

2009
2009
2020
2020

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 8 publications
0
4
0
1
Order By: Relevance
“…More precisely, its delay accounted for about 33.8% of the critical path delay, approximately 49.4% of which was consumed in the logic to identify the source register operands. Also, we found that we could increase the operating frequency by up to 16.6% if we remove the dependence logic from the critical path.…”
Section: Introductionmentioning
confidence: 89%
See 1 more Smart Citation
“…More precisely, its delay accounted for about 33.8% of the critical path delay, approximately 49.4% of which was consumed in the logic to identify the source register operands. Also, we found that we could increase the operating frequency by up to 16.6% if we remove the dependence logic from the critical path.…”
Section: Introductionmentioning
confidence: 89%
“…Another use of the pre-decoding is for avoiding the repeated decoding the same instruction. The authors of Reference [16] proposed a hardware folding technique that dynamically transforms Java bytecodes groups into RISC instructions, storing them in a cache to enable reuse.…”
Section: Related Workmentioning
confidence: 99%
“…The peripheral performs static translation per superblock, using a modification [5] of the OPEX algorithm [3]. The core folding algorithm maintains a queue of recently fetched Java bytecodes and it tries to find folding groups, even if they are nested.…”
Section: Translation Techniquementioning
confidence: 99%
“…The hardware peripheral performs stack folding of Java bytecode sequences based on OPEX [3], a recursive stack folding algorithm, which translates bytecodes to RISC instructions. The acceleration hardware is placed in the processor bus and it is managed as a peripheral, in contrast with other solutions which put the hardware between the instruction cache and the processor [5]. This placement is more flexible and up to a certain point independent of the specific microarchitecture.…”
Section: Introductionmentioning
confidence: 99%
“…Η ομαδοποίηση αυτή είναι βασισμένη στον αλγόριθμο OPEX [52]. Το περιφερειακό επιτάχυνσης τοποθετείται στο διάδρομο του επεξεργαστή και ελέγχεται απ´ αυτόν, εν αντιθέσει με άλλες λύσεις όπου το υλικό μετάφρασης παρεμβάλλεται μεταξύ κρυφής μνήμης εντολών και επεξεργαστή [88,96]. Η τοποθέτηση αυτή είναι πιο ευέλικτή και μέχρι ενός βαθμού ανεξάρτητη της συγκεκριμένης μικροαρχιτεκτονικής.…”
Section: εισαγωγήunclassified