2000 IEEE International Symposium on Circuits and Systems. Emerging Technologies for the 21st Century. Proceedings (IEEE Cat No
DOI: 10.1109/iscas.2000.856418
|View full text |Cite
|
Sign up to set email alerts
|

Development of an evaluation model for the design of fault-tolerant solid state mass memory

Abstract: The use of Commercial Off The Shelf (COTS) devices for space applications is growing due the higher cost and lower performance of corresponding dedicated devices. The needed reliability is achieved at the architectural level. This paper faces this problem in the design of a Solid State Mass Memory (SSMM). High-energy particles induce soft and hard errors in DRAMs devices. In our approach, these errors are mitigated at system level rather than at device level. In particular our SSMM is based on a suitable Error… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 5 publications
0
5
0
Order By: Relevance
“…Systemmemory cooperative solutions that DRAM consumers can implement to improve memory reliability identify and/or address memory errors before they impact the system at large. Hardware solutions include rank-level ECC [51,53,54,62,63,71,72,81,87,93,101,105,108], rank-level ECC scrubbing [94,221,279,280,315,[315][316][317][318][319], and bit repair techniques [79,86,[320][321][322][323][324][325][326][327]. Software-based approaches include retiring known-bad memory pages [49,60,83,84,88,103], and predicting failures [328][329][330][331][332][333].…”
Section: Study 4: Improving Memory Reliabilitymentioning
confidence: 99%
See 1 more Smart Citation
“…Systemmemory cooperative solutions that DRAM consumers can implement to improve memory reliability identify and/or address memory errors before they impact the system at large. Hardware solutions include rank-level ECC [51,53,54,62,63,71,72,81,87,93,101,105,108], rank-level ECC scrubbing [94,221,279,280,315,[315][316][317][318][319], and bit repair techniques [79,86,[320][321][322][323][324][325][326][327]. Software-based approaches include retiring known-bad memory pages [49,60,83,84,88,103], and predicting failures [328][329][330][331][332][333].…”
Section: Study 4: Improving Memory Reliabilitymentioning
confidence: 99%
“…ror models 4 ⃝ based on a previous understanding of DRAM errors (e.g., from past experiments or scientific studies). Examples of such error models include: analytical models based on understanding DRAM failure modes (e.g., sources of runtime faults [51,60,149,[371][372][373]), parametric statistical models that provide useful summary statistics (e.g., lognormal distribution of cell data-retention times [276,277,[374][375][376][377][378][379][380], exponential distribution of the time-in-state of cells susceptible to variable-retention time (VRT) [65,94,150,166,367,[381][382][383][384][385][386][387][388][389]), physics-based simulation models (e.g., TCAD [232,374,[390][391][392] and SPICE models [14,59,78,106,109,283,[393][394][395]), and empirically-determined curves that predict observations well (e.g., single-bit error rates…”
Section: Testing Modelingmentioning
confidence: 99%
“…Second, the system designer may make predictions from analytical or empirical error models 4 based on a previous understanding of DRAM errors (e.g., from past experiments or scienti c studies). Examples of such error models include: analytical models based on understanding DRAM failure modes (e.g., sources of runtime faults [21,49,123,[271][272][273]), parametric statistical models that provide useful summary statistics (e.g., lognormal distribution of cell data-retention times [190,191,[274][275][276][277][278][279][280]…”
Section: Testing Modelingmentioning
confidence: 99%
“…To address this disparity, system designers have long since developed techniques for adapting unmodi ed commodity DRAM chips to varying system requirements. Examples include: (1) actively identifying and/or mitigating errors to improve reliability [48][49][50][51][52][53][54][55][56][57][58][59][60][61][62][63][64][65]; (2) exploiting available timing [39,[66][67][68][69][70][71][72] and voltage [73][74][75] margins to reduce memory access latency, power consumption, decrease refresh overheads [22,[76][77][78][78][79][80][81][82]; and (3) mitigating unwanted DRAM data persistence [83][84][85] and read-disturb problems [86][87][88][89][90]. Section 2.1 discusses these proposals in greate...…”
Section: Introductionmentioning
confidence: 99%
“…For a simple series of systems, in which the 's represent the probability of survival at End Of Life (EOL), we get The above equation has an infinite number of solutions and a procedure that yields a unique or limited number of solutions must be used. For this purpose we use a proprietary optimization tool [23] based on the minimization of an Effort Function, as described in [24]. For the reliability evaluation of the SSMM the reliability model shown in Fig.…”
Section: A Solid State Mass Memory Reliability Evaluationmentioning
confidence: 99%