2020
DOI: 10.48550/arxiv.2008.12243
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A transprecision floating-point cluster for efficient near-sensor data analytics

Abstract: Recent applications in the domain of near-sensor computing require the adoption of floating-point arithmetic to reconcile high precision results with a wide dynamic range. In this paper, we propose a multi-core computing cluster that leverages the fined-grained tunable principles of transprecision computing to provide support to near-sensor applications at a minimum power budget. Our design -based on the open-source RISC-V architecture -combines parallelization and sub-word vectorization with near-threshold op… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 37 publications
0
2
0
Order By: Relevance
“…We performed an experimental evaluation using a cycleaccurate PULP emulator implemented on a Xilinx UltraScale+ VCU118 FPGA board 1 . We used a configuration with 8 cores and 4 FPUs, which is the most energy-efficient for near-sensor applications [17]. We measured execution cycles, instructions, and stalls for each code region employing hardware performance counters.…”
Section: Discussionmentioning
confidence: 99%
“…We performed an experimental evaluation using a cycleaccurate PULP emulator implemented on a Xilinx UltraScale+ VCU118 FPGA board 1 . We used a configuration with 8 cores and 4 FPUs, which is the most energy-efficient for near-sensor applications [17]. We measured execution cycles, instructions, and stalls for each code region employing hardware performance counters.…”
Section: Discussionmentioning
confidence: 99%
“…The FPU architecture is pipelined with a single stage. This architecture, introduced in [10] as 8c4f1p (8 cores, 4 floating-point units, 1 pipeline stage), is the most energy-efficient configuration of PULP; experimental results show that this solution outperforms its main competitors in the domain of embedded processing systems.…”
Section: Introductionmentioning
confidence: 99%