29th VLSI Test Symposium 2011
DOI: 10.1109/vts.2011.5783724
|View full text |Cite
|
Sign up to set email alerts
|

Design and implementation of a time-division multiplexing scan architecture using serializer and deserializer in GPU chips

Abstract: We present the design and implementation details of a time-division demultiplexing/multiplexing based scan architecture using serializer/deserializer. This is one of the key DFT features implemented on NVIDIA's Fermi family GPU (Graphic Processing Unit) chips. We provide a comprehensive description on the architecture and specifications. We also depict a compact serializer/deserializer module design, test timing consideration, design rule and test pattern verification. Finally, we show silicon data collected f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 25 publications
(8 citation statements)
references
References 12 publications
0
8
0
Order By: Relevance
“…More recently [10] describes the practical aspects of implementing a time-division multiplexed TAM for GPUs. They also write a wrapper script to serialize the patterns and their key contribution is in their detailed descriptions of the DFT insertion, rule checking and verification.…”
Section: Previous Workmentioning
confidence: 99%
“…More recently [10] describes the practical aspects of implementing a time-division multiplexed TAM for GPUs. They also write a wrapper script to serialize the patterns and their key contribution is in their detailed descriptions of the DFT insertion, rule checking and verification.…”
Section: Previous Workmentioning
confidence: 99%
“…The scan frequency is limited by thermal and design constraints; the scan chain insertion is not optimized for performance, and therefore the scan frequency is usually limited to a few tens of MHz. Most of the conventional TAM design methods, therefore, rely on either increasing the number of test channels, which is limited by the chip pins or increasing the pin efficiency by techniques such as TDMA and SerDes [15] [16][17] [18].…”
Section: Background and Motivationmentioning
confidence: 99%
“…Time de-multiplexers at the receiving end perform the serial to parallel conversion, and the data is shifted into scan-chains at the scan frequency. Another approach to allow optimal utilization of tester resources was presented in [17] and [18]. In [17], the authors introduced the concept of virtual TAMs to efficiently utilize tester resources.…”
Section: Prior Workmentioning
confidence: 99%
“…Time-division multiplexing is the standard method of maximizing ATE bandwidth utilization when the ATE shift speed is higher than the internal scan chain shift speed and has been successfully applied to combinational scan compression methods [10], [12]. However, for large designs with many scan chains the number of combinational compressor inputs and outputs (I/O) can exceed the number of virtual pins available after time-division; consequently, shift speed must be reduced or compressors must be shared -neither of Details of these advantages are explained in the next section.…”
Section: Chip Level Time-division Multiplexing: Serdes and Pipelinesmentioning
confidence: 99%
“…Even low-cost ATEs can drive test channels around 200MHz whereas internal scan chains operate at a much lower frequency of 10MHZ to 50MHz [10]. Test compression, particularly combinational methods that add long compressor logic paths at either end of scan chains can further widen the gap by requiring even lower internal shift speed.…”
Section: Introductionmentioning
confidence: 99%