Given a fixed CPU architecture and a fixed DRAM timing specification, there is still a large design space for a DRAM system organization. Parameters include the number of memory channels, the bandwidth of each channel, burst sizes, queue sizes and organizations, turnaround overhead, the memory-controller page protocol, algorithms for assigning request priorities and scheduling requests dynamically, etc. In this design space, we see a wide variation in application execution times; for example, execution times for the SPEC CPU 2000 integer suite on a 2-way ganged Direct Rambus organization (32 data bits) with 64-byte bursts are 10-20% lower than execution times on an otherwise identical configuration that uses 32-byte bursts. These two system configurations are relatively close to each other in the design space; performance differences become even more pronounced for designs further apart.

This paper characterizes the sources of overhead in high-performance DRAM systems and investigates the most effective ways to reduce a system's exposure to performance loss. In particular, we look at mechanisms to increase a system's support for concurrent transactions, mechanisms to reduce request latency, and mechanisms to reduce the "system overhead"--the portion of the primary memory system's overhead that is not due to DRAM latency but rather to factors such as turnaround time, request queueing, and inefficiencies due to read/write request interleaving.

Our simulator models a 2GHz, highly aggressive out-of-order uniprocessor. The interface to the memory system is fully non-blocking, supporting up to 32 outstanding misses at both the level-1 and level-2 caches, with split-transaction busses to all DRAM banks.
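To make the design space concrete, the following minimal C sketch collects the enumerated parameters into a single configuration record and instantiates the 2-way ganged Direct Rambus example described above. This is an illustration only, not the authors' simulator code; all field names, the enum, and the default values (e.g., the queue depth and turnaround cost) are assumptions chosen for readability.

```c
/* Hypothetical sketch of a DRAM-system design point; not the paper's
 * simulator. Field names and default values are illustrative. */
#include <stdio.h>

enum page_protocol { OPEN_PAGE, CLOSED_PAGE };

struct dram_sys_config {
    int num_channels;            /* independent memory channels         */
    int channel_width_bits;      /* data-bus width per channel          */
    int burst_bytes;             /* burst size, e.g. 32 or 64 bytes     */
    int queue_depth;             /* per-controller request-queue size   */
    int turnaround_cycles;       /* bus read<->write turnaround cost    */
    enum page_protocol protocol; /* memory-controller page policy       */
};

int main(void) {
    /* The 2-way ganged Direct Rambus example from the text: two 16-bit
     * Rambus channels operated together as one 32-bit-wide channel. */
    struct dram_sys_config cfg = {
        .num_channels       = 1,
        .channel_width_bits = 32,
        .burst_bytes        = 64,  /* vs. 32 B: ~10-20% lower exec time */
        .queue_depth        = 32,  /* assumed value                     */
        .turnaround_cycles  = 4,   /* assumed value                     */
        .protocol           = OPEN_PAGE,
    };
    printf("channels=%d, width=%d bits, burst=%d B\n",
           cfg.num_channels, cfg.channel_width_bits, cfg.burst_bytes);
    return 0;
}
```

Sweeping such a record over its parameter ranges is one simple way to frame the design-space exploration the paper describes: each combination of fields is one system configuration whose execution time can be measured.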