2022 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) 2022
DOI: 10.1109/isvlsi54635.2022.00089
|View full text |Cite
|
Sign up to set email alerts
|

On-Demand Redundancy Grouping: Selectable Soft-Error Tolerance for a Multicore Cluster

Abstract: With the shrinking of technology nodes and the use of parallel processor clusters in hostile and critical environments, such as space, run-time faults caused by radiation are a serious cross-cutting concern, also impacting architectural design. This paper introduces an architectural approach to run-time configurable soft-error tolerance at the core level, augmenting a six-core open-source RISC-V cluster with a novel On-Demand Redundancy Grouping (ODRG) scheme. ODRG allows the cluster to operate either as two f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 10 publications
0
4
0
Order By: Relevance
“…While the tight coupling and integration with the memory system significantly improve the processing performance of parallelizable tasks, these cores can also be used for the same calculation, offering a redundancy copy to detect errors. Extending the hardware, on-demand redundancy grouping (ODRG) [46] combines three cores in the PULP cluster into a triple-core lockstep (TCLS) group. In an 8-core cluster, this allows for two TCLS groups, and 2 remaining individual cores.…”
Section: On-demand Redundancy Grouping (Odrg)mentioning
confidence: 99%
“…While the tight coupling and integration with the memory system significantly improve the processing performance of parallelizable tasks, these cores can also be used for the same calculation, offering a redundancy copy to detect errors. Extending the hardware, on-demand redundancy grouping (ODRG) [46] combines three cores in the PULP cluster into a triple-core lockstep (TCLS) group. In an 8-core cluster, this allows for two TCLS groups, and 2 remaining individual cores.…”
Section: On-demand Redundancy Grouping (Odrg)mentioning
confidence: 99%
“…With a 2-stage pipeline and independent instruction and data memory interfaces, the lightweight core is designed for high area efficiency, while providing reasonable performance for microcontrollers. For reliability, the operation of the three cores is locked together with ODRG [7] into a TCLS configuration. In this default soft-error tolerant mode, the three cores all receive identical inputs, and their outputs are majority-voted, thereby emulating a single, reliable core to the programmer and the system.…”
Section: Soc Architecturementioning
confidence: 99%
“…It includes three Ibex [6] processing cores, by default, operating together in a triple-core lockstep (TCLS) mode. Using on-demand redundancy grouping (ODRG) [7], these cores can be unlocked to increase performance when reliability is not required. Furthermore, the memory is protected with an efficient error correction code (ECC), and scrubbers are added to correct latent errors.…”
Section: Introductionmentioning
confidence: 99%
“…In this paper, we introduce a space-ready multi-core RISC-V-based computing system featuring a Hybrid Modular Redundancy (HMR) approach. We leverage the independent cores available in a multi-core RISC-V cluster for redundant execution in a dynamically runtime configurable manner and introduce Dual-Core Lockstep (DCLS) and Triple-Core Lockstep (TCLS) modes, extending the On-Demand Redundancy Grouping with TCLS configurable under reset presented in [37]. Our design allows each application to configure its reliability setting according to its requirements, possibly decided at runtime.…”
Section: Introductionmentioning
confidence: 99%