Proceedings of the Joint Conference on Languages, Compilers and Tools for Embedded Systems: Software and Compilers for Embedded 2002
DOI: 10.1145/513829.513840
|View full text |Cite
|
Sign up to set email alerts
|

Profile guided selection of ARM and thumb instructions

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2003
2003
2021
2021

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 51 publications
(13 citation statements)
references
References 1 publication
0
13
0
Order By: Relevance
“…The Thumb version of an application is on average 30% smaller than its 32-bit ARM counterpart [4]. It should be noted that using Thumb code, with every 32 bits fetched, the processor fetches two Thumb instructions.…”
Section: Mixed-widthmentioning
confidence: 99%
See 1 more Smart Citation
“…The Thumb version of an application is on average 30% smaller than its 32-bit ARM counterpart [4]. It should be noted that using Thumb code, with every 32 bits fetched, the processor fetches two Thumb instructions.…”
Section: Mixed-widthmentioning
confidence: 99%
“…In some cases it is even slightly smaller due to the improved instruction cache behavior of Mixed Code. A more detailed analysis, including a comparison with three other heuristics, is available in [4].…”
Section: Profile-guided Generation Of Mixed Codementioning
confidence: 99%
“…Most THUMB instructions can only access 8 registers although all 16 registers are physically present. The compact THUMB instruction set demonstrates better I-cache power saving but lower performance partially due to the small number of architected regis-ters available [8] 1 . Study in [8] shows that compiling the same program with 16-bit THUMB mode instead of the 32-bit ARM mode can save up to 19% I-cache energy.…”
Section: Introductionmentioning
confidence: 99%
“…Although the hardware can support more registers such as on ARM and MIPS, the ISA bottleneck prevents them from being addressed and utilized. [8] mentions that with a compact ISA (THUMB or MIPS-16), instruction count can increase by 9% to 41%, which not only slows down the execution but eliminates some of the benefits brought about by a compact ISA.…”
Section: Introductionmentioning
confidence: 99%
“…While the AX extensions described in this paper are for the ARM architecture, the idea of Instruction Coalescing and Augmenting instructions can be applied to other dual width processors. In previous work [6] we showed how one could achieve good code size, low energy and high performance using profile guided heuristics at compile time. The techniques described here are orthogonal to the previous techniques and more scalable.…”
Section: Introductionmentioning
confidence: 99%