Performance and power of 1.6 to 9.6Gbps server, desktop, and mobile I/O links in a 1.2V 90nm CMOS test chip implementing equalized voltage-mode and current-mode drivers, TX and RX equalizers, self-biased ring oscillator and LC PLLs, and different RX clocking schemes are compared. The novel combination of voltage-mode driver (equalized or unequalized) and RX equalizer delivers the lowest power (12.1mW/Gbps at 7.2Gbps), offering a low-power option for short-distance links. (Keywords: low power, I/O link, equalizer, voltage-mode driver)
IntroductionThe 3 key metrics in Gigabit I/O designs are performance (data rate), power, and channel length. Different tradeoffs are made depending on system requirements. In server interconnections, the major challenge is to overcome the harsh channel conditions. Sophisticated equalizers are implemented to increase the data rate and distance at the expense of increased power. On the other hand, I/Os for desktop and mobile systems need cheaper cooling solutions, making power optimization the most important design metric. This is especially true for mobile I/Os, where the power is further limited by battery capacities.Transceiver System A high-speed transceiver test chip containing I/O links for server, desktop, and mobile applications was implemented in a 1.2V, 90nm digital CMOS process. The chip contains 2 I/O ports on the 2 sides of a 5mm x 5mm die, with shared circuitry and digital blocks situated in the middle, as shown in Figure 1. Two packages, a land-grid array and a ball-grid array, allow the packaged chip to be placed in a socket or soldered directly on a test board. Figure 2 shows the architecture of each port, which contains 6 differential data lanes and 1 differential clock lane. A halfrate TxClk is generated by a TxPLL, which can be an LC-PLL or a self-biased ring oscillator PLL (SBRO-PLL) shared between the ports, and distributed to all I/Os using CMOS buffers. TX data can be a (2 32 -1) bit PRBS or an externally loaded 140-bit pattern which subsequently passes through a bypass-able 8B/10B encoder. Two TX designs were implemented: the low-power TX with 2-tap equalized voltagemode (V-mode) driver [1], and the high-performance TX with 2-way interleaved 2 to 6-tap programmable MAC (multiplyaccumulate) based FIR TX equalizers and a 6-bit DAC currentmode (I-mode) driver [2]. Both support swings up to 600mV.At RX, a DLL generates 8 global clock phases at 45 o spacings from a half-rate forwarded clock or a SBRO-PLL (labeled RxPLL) locked to a reference (local) clock. The clock phases are distributed to all I/Os where they are phase multiplexed and interpolated to generate the skewed RxClks.