. , '9
’Q ~ 9 J."
r .1“ war:-

it
i; ‘ 3,1 . -
Jafﬁj’f:
, ’Ji’i
(-3»:

.n

.213) .,

~ 0
.’3'; a, 1}

‘v a.» '..'., :1: .

v find-ha? ’

~¢
p.

{it (3).":

WWW“

3",. m4

- J94:
«ham; 3!; a

:1- ?” W“
, ,

, “a“? a". V
7‘ W 3'1"“
'4 l
:’ 7;

ll! ‘
”A
A":

I
{13“

. - . . K“ "
. ~,~ “no.“

a

.13; I v»

u ,1, H- ."
(c510 ' 3"" 1.02:“ All
1/
ill/,1”.

“3’s.
«'4

~5-
x.
,-

'_" c} o‘ I.

3'" .';I"':"V',' trm ‘

1"."4’51.;.r":

' I; r 930'}?
L_ 1‘

34;,
W" 91"
7";{ﬁfq/jl
, )gijuJﬁAﬁ
' «bit/ﬁe: ,
(fut? ; I‘

f?
I 'vv’

"'15
I!) 'Mr
V

‘1:
n_u‘

ﬁlial."
“hf. .

 

33. 6.23.
_ 1.,“ A_

‘l
x »

ﬁ}.n1f7¢: “"lqtvlm‘
n16 "Tm ’* ->*‘

yr. a'i' ﬁﬂfﬁ‘géﬁg}
f“

9 ‘ ‘1
. y . . 7h ’ f)
1134);???" ’ ”‘63:; "I
' n ‘ "I § — W71" 5‘
1:"? .‘i-‘,§’:4~i§§:..xw‘? ’ .43? r
#4“ W‘ ’36. g V.

~~—. ._

Jag-5&3:
yﬁm~”"

Jﬁrkﬁti 432W;

4 .,‘§€,g€&::u:: -
éP‘ﬁé‘Mg. 31%?
5d,." 1': :- A: :1“.

43";U§;;‘; 'ﬁ: (at;

rag 51%;“;
4.3;. 3% ‘5'
m igiils

.15“

J31

..

M ‘ ‘

3:2: ”17.;Cﬁs‘
ﬁﬁ~ «wig

a “M":

“Lui‘

‘I?<';U

6:33“ «(If 1*“ 21l-
(:96, 1': «13.5”; #11; v

a?
{563,

‘u \‘g’ﬁh’u'
‘H’ _. .v'hifu“; ;!
_.r )r‘x‘ IL ,
it 3:352}? - "36:

{a
(1.35; I" v!
\.:""n'lx.~:;t:-; :3 ‘
"M‘ . . ,
\Kﬂn‘ﬂf ;.§’é‘.‘1,l‘.“

' ,- .fv ' ’
my).
1

1‘ ' K.
a: A < '
‘ “A"; t;

a.
.r
V} 3

h

I}:
1-.“

1a,-

2 lg " wide!

- h :.-4-
‘f" x
if;

‘93; ‘
‘Siuk

. 'Gvg’ ' ’4'": J
,J an

7 :3.
'ifg‘f’ﬁz 55%;”

4‘43“? 5'} f‘u

M'uﬁ', J52!A

5‘“4:f;-i? W r J ’2: ig"
2"“ " .4; gasﬁw

. {55.
cups

‘m":’;~ﬂ\
::ra.: gr ‘ ‘W
_. 3-3" ._ p- ”3!;
’ 339”“ at? . 325W:

0
\

"HEQN

cm a)

 

 

 

 

 

 

 

 

 

 

 

ll°l°\\\\m\\“mum lllfjlilu

DESIGN OF FAULT-TOLERANT
PROGRAMMABLE LOGIC ARRAYS
FOR YIELD ENHANCEMENT

presented by

Tsin-Yuan Chang

has been accepted towards fulﬁllment
of the requirements for

Ph.D. degree in Electrical Engineering

 

[Zia/77752 ”/57

Major/ofessor

Date Ni’V. é/ /397
7

MS U is ab Afﬁrmative Action/Equal Opportunity Institution 0-12771

 

LIBRARY
Mlchlgan State
Unlveralty

 

 

 

PLACE ll RETURN BOX to roman this checkout from your record.

TO AVOID FINES Mum on or Moro duo duo.

DATE DUE DATE DUE DATE DUE

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

MSU I. An Afﬁrmwvo Action/Equal Opponunuy Instituuon
Wanna

7%,

 

DESIGN OF FAULT-TOLERANT
PROGRAMMABLE LOGIC ARRAYS
FOR YIELD ENHANCEMENT

By

Tsin-Yuan Chang

A DISSERTATION

Submitted to

Michigan State University

in partial fulﬁllment of the requirements
for the degree of

DOCTOR OF PHILOSOPHY
Department of Electrical Engineering

1989

DESIGN OF FAULT-TOLERANT
PROGRAMMABLE LOGIC ARRAYS
FOR YIELD ENHANCEMENT

BY

Tsin-Yuan Chang

Department of Electrical Engineering
Michigan State University

ABSTRACT

The yield (expected percentage of good chips out of a wafer) of integrated
circuits (ICs) has always been crucial to the commercial success of their

manufactm‘e. The technology of ICs evolved from LSI, VLSI, to ULSI in the past two

decades. Multiple layers and scaling techniques make it possible for more than 106
transistors to be put into a single chip. However, as the complexity of digital devices
increases and geometry shrinks, the probability of having faulty components also

increases, thereby lowering the chip yield.

One solution to the low yield problem is to improve manufacturing and testing
processes, but it is very costly and quite difﬁcult to implement within a short time.
Another practical way is the use of fault-tolerant structures, which has been
demonstrated in practice for high density memory chips. The result of fault-tolerant
memory design is a reduction in the capital-required level of shippable product, and
also that redundancy typically improves yields by 1.5 to 5 times.

Programmable Logic Arrays (PLAs) have the advantages of regular structure,

design simplicity, and fast turnaround time. The use of PLAs becomes increasingly

popular for implementing Boolean logic functions and control blocks in the design of
integrated circuit. Due to the fact that complex chips (and in particular
microprocessors) can be efﬁciently implemented using PLAs, a trend towards

manufacturing larger programmable chips is expected.

In this dissertation, a fault-tolerant design for large PLAs is proposed. The
fault-tolerant design achieves a full diagnosability of single and multiple stuck-at
faults, bridging faults, and crosspoint faults. During the manufacturing process, faults
in the PLA can be detected, located, and repaired with the spare lines. When the
PLA is used in ﬁeld, the structure still possesses the easily testable capability. An
automatic layout generator, MRPLA, has also been developed and implemented in
Sun 3/160 for generating the physical layout of the proposed fault-tolerant PLA. In
addition, some important issues such as die size, speed, and yield enhancement are
also addressed in this study. The results of this study show that the yield can be
enhanced signiﬁcantly. A simple, yet efﬁcient optimization method has been

presented to determine the optimal redundancy of various sizes of PLAs.

This study also introduces a PLA structure based on memory cells. The RAM-
Based PLA (RBPLA) allows designers to reprogram the PLA as many times as
needed. A fault-tolerant RBPLA is also presented to electrically repair faults in the

manufacturing process and also in ﬁeld use.

ACKNOWLEDGMENTS

The author wishes to express his sincere appreciation to his major advisor,
Dr. Chin-Long Wey, for the guidance and encouragement given in the course of this
graduate study. He also wishes to thank the dissertation committee members, Dr.
Donnie Reinhard, Dr. Michael Shanblatt, and Dr. Byron Drachman for their valuable
suggestions and comments in his dissertation research. The author also gives thanks

to Dr. Harriett Rigas, and regrets her passing.

The author is especially grateful for the ﬁnancial support from Dr. Wey and the
National Science Foundation under grant No. MIP—8700880. Without these supports,

this research effort would not have been possible.

He would like to acknowledge all the faculty and staff members, and the
students who gave him help and assistance during his studying in the Electrical
Engineering Department at Michigan State University, and many friends, especially

from the Church in Lansing, who showed their support and concern.

Finally, he is very grateful to his family for years of concern, encouragement

and support.

iv

TABLE OF CONTENTS

List of Tables ........................................................................................................... viii
List of Figures .......................................................................................................... ix
Chapter 1 Introduction ........................................................................................ 1
1.1 Problem Statement ................................................................................ 2

1.2 Objectives ............................................................................................. 4

1.3 Physical Failures in VLSI Circuits ....................................................... 5

1.4 Redundancy Architectures .................................................................... 6

1.5 Thesis Organization .............................................................................. 9
Chapter 2 Fault-Tolerant Semiconductor Memories ......................................... 11
2.1 On-Chip Redundancy ............................................................................ 12

2.2 Repair Techniques ................................................................................ 13

2.3 Fault Analysis ....................................................................................... 16

2.4 Fault-Tolerant RAM Design Examples ................................................ 19

2.4.1 A Fault-Tolerant Dynamic RAM ......................................... 19

2.4.2 A Fault-Tolerant Static RAM .............................................. 22

2.5 Discussion and Summary ...................................................................... 25
Chapter 3 Fault-Tolerant Programmable Logic Arrays ................................... 26
3.1 Programmable Logic Arrays ................................................................ 27

3.1.1 PLA Structure and Notation ................................................... 28

3.1.2 Fault Models .......................................................................... 28

3.2 Design of the Repairable PLA ............................................................. 33

3.2.1 Repair Rules ........................................................................... 33

3.2.2 Repairable PLA ...................................................................... 38
3.2.2.1 SISC and Spare Input Bit Lines .................................... 38

3.2.2.2 SOSC and Spare Output Lines ...................................... 41

3.2.2.3 Spare Product Lines ...................................................... 43

3.2.3 Automatic Layout Generator ................................................. 43

3.2.4 Performance ........................................................................... 46

3.2.4.1 Chip Area ...................................................................... 48

3.2.4.2 Propagation Delay Time ............................................... 50

3.3 Design of the Diagnosable PLA ........................................................... 52
3.3.1 Augmented Circuits ............................................................... 52

3.3.1.1 Product Lines’ Shift Register (PSR) ....................... 52

3.3.1.2 Input Lines’ Shift Register (ISR) ............................ 55

3.3.1.3 Extra Power Line Vddl .......................................... 57

3.3.2 Design Evaluation .................................................................. 57

3.4 Summary ................................................................................................ 59
Chapter 4 Fault Diagnosis and Repair Process .................................................. 61
4.1 Locate and Repair Faults in Manufacturing Process ............................ 61
4.1.1 Detect Faults in Augmented Circuits ..................................... 62

4.1.2 Identify and Repair Faults in the AND plane ....................... 64

4.1.3. Identify and Repair Faults in the OR plane .......................... 68

4.1.4 Repair Crosspoint Faults ........................................................ 72

4.2 Fault Diagnosis and Repair Algorithm ................................................. 73
4.2.1 Example 1 .............................................................................. 75

4.2.2 Example 2 .............................................................................. 78

4.2.3 Discussion .............................................................................. 80

4.3 Test Chip in Field Use ........................................................................... 80
4.4 Summary ................................................................................................ 81

vi

Chapter 5 Yield Analysis ...................................................................................... 82

5.1 Yield Model .......................................................................................... 82
5.1.1 Correctable Random Effect Yield, YCRD .............................. 83
5.1.2 Uncorrectable Random Effect Yield, YURD .......................... 86
5.2 Yield Simulation ................................................................................... 86
53 Optimal Redundancy ............................................................................ 91
5.4 Summary ............................................................................................... 94
Chapter 6 Fault-Tolerant RAM-Based PLAs .................................................... 95
6.1 Basic Structure of an RBPLA ............................................................... 96
6.1.1 A DRBPLA Structure ............................................................ 97
6.1.2 An SRBPLA Structure ........................................................... 98
6.2 A Fault-Tolerant SRBPLA Design ....................................................... 102
6.3 Fault Diagnosis and Repair Process ...................................................... 105
6.3.1 Fault Models ........................................................................... 105
6.3.2 Fault Diagnosis and Repair Algorithm ................................... 106
6.4 Summary ............................................................................................... 107
Chapter 7 Conclusions .......................................................................................... 108
7.1 Summary of Major Contributions ......................................................... 108
7.2 Directions for Future Research ............................................................. 109
7.2.1 Fault-Tolerant Design of Folded PLAs ................................ 110
7.2.2 Fault-Tolerant Design of
VLSI/ULSI/WSI Array Structures ....................................... 111
7.2.3 New Yet Low-Yield Technologies ...................................... 112
Appendices .............................................................................................................. 1 13
Bibliography ........................................................................................................... 123

vii

Table 2.1
Table 2.2
Table 3.1
Table 3.2
Table 3.3
Table 3.4
Table 3.5
Table 3.6
Table 3.7
Table 5.1
Table 5.2
Table 5.3
Table 7.1

LIST OF TABLES

Memories Built with Redundancy ...................................................... 14
The Comparison Between Laser and Electrical Programming ........... 15
Cubic Notation .................................................................................... 29
Repair Rules ........................................................................................ 35
Area Overhead in RPIAs ................................................................... 49
Delay Time Penalty of the RPLAs ...................................................... 51
Operations of the PSR ......................................................................... 55
Operations of the ISR .......................................................................... 56
Area Overhead of FTPLAs ................................................................. 57
Yield Simulation for (50,190,67)-PLA ............................................... 92
The Effective Yields for (50,190,67)-PLA ......................................... 93
Yield Simulation for (100,400,100)-PLA ........................................... 94
Yields of LSI GaAs Circuits ............................................................... 112

viii

Figure 1.1
Figure 1.2
Figure 1.3
Figure 1.4
Figure 2.1
Figure 2.2
Figure 2.3
Figure 2.4

Figure 2.5
Figure 2.6
Figure 2.7
Figure 3.1
Figure 3.2
Figure 3.3
Figure 3.4
Figure 3.5
Figure 3.6
Figure 3.7
Figure 3.8
Figure 3.9

Figure 3.10
Figure 3.11
Figure 3.12

LIST OF FIGURES

Learning Curves .................................................................................. 3
Implant Mask Defects ......................................................................... 7
Signiﬁcant and Insigniﬁcant Defects .................................................. 7
Reconﬁguration Architectures ............................................................ 8
Spare Allocation of Redundant Elements ........................................... 17
Fault-Tolerant 64K DRAM ................................................................. 20
Standard and Spare Row Decoders ..................................................... 21
Block Diagram of Major Hardware Components of

Laser Programming System ................................................................ 21
Block Diagram of the 8K x 8 Bit Static RAM .................................... 23
Block Diagram of the Redundancy Conuol Circuit ........................... 23
Laser Diffusion Programmable Devices ............................................. 23
Programmable Logic Array ................................................................ 29
Crosspoint Faults ................................................................................ 31
Stuck-at Faults .................................................................................... 32
Schematic Diagram of a Repairable PLA ........................................... 34
The Repair of S-fault with a Spare Product Line ................................ 37
The Repair of G-fault with a Spare Product Line ............................... 37
Spare Input Selector Circuit (SISC) .................................................... 39
The Programming Procedure of SISC and Spare Input Lines ............ 40
The Programming Procedure of SOSC and Spare Output Lines ........ 42
The Programming Procedure of Spare Product Lines ........................ 44
A Sample RPLA Template ................................................................. 45
MRPLA ................................................................................................ 47

Figure 3.13 Floor Plan of an (sn,sp,sm)-RPLA ...................................................... 48
Figme 3.14 Floor Plan of the PLA ......................................................................... 49
Figure 3.15 Different Allocation Scheme of Spare Lines ...................................... 51
Figure 3.16 A Schematic Diagram of a Fault-Diagnosable PLA ........................... 53
Figure 3.17 A Producr Lines’ Shift Register (PSR) Cell ....................................... 53
Figure 3.18 The Function of Shift Register Cell in Testable PLA Design ............ 54
Figure 3.19 An Input Lines’ Shift Register (ISR) Cell .......................................... 56
Figure 3.20 Fault-Diagnosable PLA ...................................................................... 58
Figure 3.21 An Easily Testable PLA Modiﬁed from the FDPLA ......................... 60
Figure 4.1 A Simpliﬁed Diagram for Fault Diagnosable PLA ............................ 63
Figure 4.2 Identify Input Line Stuck-at Faults and Bridging Faults .................... 66
Figure 4.3 Identify Faults at Product Lines as well as G- and S- Faults .............. 66
Figure 4.4 Identify s-a-l Faults at Output Lines .................................................. 69
Figure 4.5 Identify Bridging Faults; Outputs s-a-l Faults; and

A- and D- faults ................................................................................... 69
Figure 4.6 Examples in the Fault-Diagnosable PLA Design ............................... 74
Figure 5.1 Yields for (50,190,67)-PLA with Redundancy (snsp,sm)=(3,4.2) ............ 88
Figure 5.2 Yield Analysis for (50,190,67)-PLA with (2:2 ................................... 90
Figure 6.1 The Structure of a DRBPLA ............................................................... 97
Figure 6.2 SRBPLA Structure ............................................................................. 99
Figure 6.3 Control Circuit in SRBPLA. ............................................................... 101
Figure 6.4 Fault-Tolerant SRBPLA Scheme ........................................................ 102
Figure 6.5 SI-Cell ................................................................................................. 103
Figure 6.6 P-Cell .................................................................................................. 104
Figure 6.7 Control Signals of the Fault-Tolerant SRBPLA ................................. 104

CHAPTER 1

Introduction

 

As the complexity of digital devices increases and the geometry shrinks, the
probability of having faulty components also increases. The yield of integrated
circuits (expected percentage of good chips from a wafer) has always been crucial to

the commercial success of their manufacture.

One solution to solve low yield problems is to improve manufacturing and
testing processes, but it is very costly and quite difﬁcult to implement within a short
time [57]. Another practical way is the use of fault-tolerant structures [39], which
has been demonstrated in practice for high density memory chips. The only integrated
circuits so far to have exploited fault-tolerant techniques commercially have been
memory chips. This is because memory chips are particularly densely packed and
therefore increasingly vulnerable to defects, and also because a regular memory array
lends itself to a variety of efﬁcient fault-tolerant designs. The result of fault-tolerant
memory design is a reduction in the capital-required level of shippable product, and
also that redundancy typically improves yields by 1.5 to 5 times [56].

During the past few years, Programmable Logic Arrays (PLAs) have become
increasingly common for implementing Boolean logic functions in Very Large Scale
Integration (VLSI) chips. The advantages of regular structure, design simplicity, and
fast turnaround time have played signiﬁcant roles in the manufacturing of large
density PLAs. Due to the fact that complex chips (and in particular microprocessors)

can be efﬁciently implemented using PLAs, a trend towards manufacturing larger

programmable chips is expected. Therefore, the probability of having faulty PLA chips
also increases. The same scenario happens in PLA chips as in memory chips. Low
yield, then, is a potential problem in manufacturing of PLA chips.

In recent years, research has extensively dealt with the fault detection and
test generation of PLAs [6,9,25,33,54,59]. In particular, redundancy techniques have
been successfully applied to PLA testing. While extra logic has been added to PLAs
to implement function-independent tests [9.59] so that the complexity of PLA test
generation can be reduced, other approaches have implemented the added redundancy
with coding techniques, such as m-out-of-n codes and Berger codes [33], to design
totally self-checking (TSC) PLAs. Little emphasis, however, has been devoted to
use redundancy for repairing PLAs.

1.1 Problem Statement

New devices traditionally push against the current technological limits and
often have a very low yield. This situation is only sensible when continuing advances
in processing techniques are likely to ensure a proﬁtable yield level by the time the

device is in volume production.

When a new generation fabrication process is being developed, the rate of
climbing the learning curve is relatively slow, and the initial yield values are typically
quite low (Curve 1 in Figure 1.1) [57]. The slow learning is due to the following
facts: (1) the immature process design and the technology development vehicle are
not centered with respect to process variation; and (2) signiﬁcant yield drOps can be
experienced even in mature processes, because it may take a long time before the
causes are diagnosed and the problems are corrected. The time necessary to bring

the yield above the economically acceptable yield (Yaw) can be on the order of several

months, resulting in loss in revenue and competitive edge.
The learning curve may be improved by (1) increasing the initial yield; (2)

minimizing process and circuit sensitivity to process variation; and (3) characterizing

the most likely failure modes to spwd-up diagnosis. As a result, the yield Yaw as

the desired learning curve shown in Curve 2 of Figure 1.1 [57], can be obtained in a
shorter period of time. In other words, the yield can be enhanced signiﬁcantly if the
manufacturing process and testing process are improved. However, this improvement
requires better factory equipment and better knowledge of design and testing of the
chips, which are very costly and quite diﬁcult to implement. Recently, fault-tolerant
techniques have been widely applied to the newly developed fabrication processes.

 

were [

Yacc /_

 

 

 

 

L

"

 

 

1.2 1.1. Time

Figure 1.1 learning Curves [5‘7].

 

It should be noted that the entire manufacturing process may consist of three
major yield steps that affect the total number of functional integrated circuit products
that are realized [47]. The major steps are: wafer processing yield, probe yield, and
ﬁnal test yield. Wafer processing yield is deﬁned as the percentage of good wafers
that survive the manufacturing process. Probe yield is deﬁned as the percentage of
good chips out of a wafer. Final test yield is the percentage of devices that pass a
ﬁnaltestprogramwhichoccursafterthediehavebeenwirebondedtoaleadframe
and placed inside a package.

Fault-tolerant techniques have been used extensively by semiconductor
manufacttu'ers [39]. The use of redundancy for yield enhancement is not new; the ﬁrst
scheme for redundancy implementation on core memories was published in 1964 [44]

and the ﬁrst practical application in redundant memory design was proposed in 1979
[7]. Currently, more than 15 semiconductor memory manufacturers have
commercially produced various redundant memory chips [27,39]. As the technique of
the fault-tolerant memory design matures, the next logical step is to apply this
technique to PLAs.

1 .2 Objectives

The motivation for incorporating fault-tolerance into PLAs is twofold: yield
enhancement in the manufacturing phase and fault-tolerance in ﬁeld. Both are

achieved by restructuring the links so as to isolate the faulty lines.

This work is centered on the study of the design of fault-tolerant PLAs. The
issues include design-for-repairability, design-for-diagnosability, and design-for-
manufacturability/yield.

Design-for—repairability is the design of a repairable PLA that implements a
reconﬁguration scheme to replace faulty lines by spare lines. Reconﬁguration is
deﬁned as an operation for replacing faulty components with spares while maintaining

the original interconnection structure.

Before the partially defective PLA chips can be repaired, the types and
locations of faults must be precisely identiﬁed, so that the repair process can be
properly and efﬁciently performed. The need for locating and identifying faults led to
the design-for-diagnosability.

Designfor-manrq'acturability/yield is aimed at achieving manufacturable, high-
yield chips. To enhance chip yield of PLAs, spare lines and reconﬁguration circuitry
are built into the chip so that partially defective chips can be repaired. Since spare
lines and reconﬁguration circuitry are also susceptible to defects, too much
redundancy may have a "diminishing return" effect on the chip. Therefore, the amount
of additional redundancy to the PLA is best kept as low as possible. However, if the

amount of redundancy is insufficient, high yield cannot be reached.

Two aspects of fault-tolerance can be identiﬁed: (1) techniques to tolerate
manufacturing defects; and (2) techniques to tolerate failures in ﬁeld. In this work, the
fault-tolerant PLA designs that fulﬁll the above design aspects are investigated. Fault-
tolerant PLA design using laser programming techniques is implemented to tolerate the
manufacturing faults, while fault-tolerant RAM-based PLA design is implemented using
electrical programming techniques to tolerate manufacturing defects and to tolerate
failures in ﬁeld. ’

1.3 Physical Failures in VLSI Circuits

VLSI systems have the following two classes of failures [60]: manufacturing
failures and long-term failures. Manufacturing failures are caused by defects which
depend on the processes and materials; while long-term failures are caused by wear-out
in ﬁeld. Long-term failure mechanisms include break-in lines, shorts between lines, and

degradation or breakdown of active devices.

The manufacturing defects can be divided into two groups: those that affect a
relatively large (global) area of the wafer and those that affect a relatively small (local)
area [48]. Examples of global defects include cracks or scratches in the material,
photolithographic mask misalignment, line dislocations, and major fabrication process
control errors. These defects usually have global and prominent effects on the circuit
behavior and can be detected easily early in the manufacturing phase. Furthermore, for a
ﬁnely tuned and mature fabrication line, major processes control errors, and hence global
defects, can be detected easily and minimized. For the above reasons, localized spot or
point defects are the primary targets for fabrication testing.

Point defects can be classiﬁed into three categories: silicon substrate
inhomogeneities, local surface contarrrinations, and photolithography-related point
defects. The origin of defects from each of these categories involves distinct, usually
complicated and frequently uncontrollable processes. Complete and accurate physical

modeling of point defects inherent in the fabrication process is difﬁcult [48].

Depending on the location, size, and type, a defect may or may not have any
effect on the circuit. Only those signiﬁcant defects which cause faults are considered
in causing faults. For example [48], Figure 1.2 shows that a small point defect in the
implant window of a depletion mode MOS u'ansistor may or may not have any
signiﬁcant effect at the location. Figure 1.3 illustrates how a missing element of a
polysilicon path may or may not have any signiﬁcant effect at the circuit level.

Point defects will cause extra or missing spots of metal, polysilicon, or
diffusion layouts. Extra spots may cause shorts between two layers (metal,
polysilicon, or diffusion), degradation of elements, or extra devices. On the other
hand, missing spots may cause break of a line (metal, polysilicon, or diffusion line),
degradation of elements, or missing devices.

Fault models are extracted from signiﬁcant physical failures, and serve two
purposes: test generation and fault coverage evaluation. A good fault model is one

that is simple to analyze and yet closely represents the behavior of physical faults.

1 .4 Redundancy Architectures

The important criteria for evaluating a reconﬁguration scheme are hardware
overhead, reconﬁguration effectiveness (the probability that an array with a given
number of faulty cells is reconﬁgurable), wiring length after reconﬁguration, time

required for the reconﬁguration procedure, and overall yield and reliability [61].
The redundant designs of VLSI array structures can be classiﬁed by the

following four reconﬁguration schemes [61]: (1) the whole row (and/or column)
bypass (WRB/W CB); (2) single-cell bypass (SCB); (3) interstitial scheme; and (4)
duplicated cell scheme.

The ﬁrst scheme allows for a faulty cell to cause the whole row or column to be
bypassed as shown in Figure 1.4 (a). The control circuitry in the WRB/WCB scheme
is simpler than those in others. However, the utilization of spare cells is inefﬁcient.

To choose the minimum number of spare rows and/or columns that cover all the faulty

 

 

Figure 1.2 Implant Mask Defects [48]:
(a) Non-signiﬁcant Defect; (b) Signiﬁcant Effect.

 

l

 

’ P
.1:

I.
n

 

 

 

 

 

i
D

P
-

. ‘9‘

(II)

 

 

 

 

 

 

Figure 1.3 Signiﬁcant and Insigniﬁcant Defects [48]:
(a) Defect-free Poly Path; (b) Insigniﬁcant Missing Poly;

(c) Insigniﬁcant Missing Poly; (d) Signiﬁcant Missing Poly.

cells is an NP-complete problem [30]. This leads to various heuristic reconﬁguration
algorithms that have been proposed [5,13,20,58,63].

 

   

Figm'e 1.4 Reconﬁguration Architectures [61]:
(a) WCB/WRB Scheme; (b) SCB Scheme;
(0) Interstitial Scheme; ((1) Duplicated Cell Scheme.

 

To increase the utilization rate of spare cells. the single-cell bypass (SCB)
scheme, as shown in Figure 1.4 (b), allows the faulty cells to be passed However,
the utilization rate is dependent on the complexity of the control circuits that include
switches and interconnecting wires. As a result, long interconnection wires after
reconﬁguration are possible if higher utilization is attained.

The interstitial scheme, as shown in Figure 1.4 (c), has spare cells uniformly
distributed into the array and a faulty cell that can only be replaced by its neighboring
spare cells. Since spare cells are adjacent to regular cells, the length of connecting
wires is limited, which thus results in a low time overhead. However, the drawback

9

is that an array may fail due to the lack of spare cells in one area, while there are

unused spares in other area.

The last scheme, indicated in Figure 1.4 (d), is the duplicated cell scheme in
which each regular cell has its own spare cell. It requires a simple reconﬁguration

algorithm and low time overhead, but the area overhead is large.

Since each cell of the array in both memories and PLAs takes a very small
portion of the entire array, the use of scheme (2)-(4) that requires either high
complexity of control circuit, or high percentage of area overhead, is not practical. In
this study, the WRB/W CB scheme is implemented in the design of fault-tolerant
PLA. That is, the faulty lines are repaired and replaced by the spare lines.

1.5 Thesis Organization

This dissertation is organized as follows. Chapter 2 reviews the design of
fault-tolerant semiconductor memories. Two commercial memory chips that
implement the fault-tolerant design are presented. The laser programming
techniques developed in these two examples can be applied to the proposed fault-
tolerant PLA design.

In Chapter 3, a fault-tolerant design of PLAs is proposed. The fault-tolerant
design achieves a full diagnosability of single and multiple stuck-at faults, bridging
faults, and crosspoint faults. During the manufacturing process, faults in the PLA can
be detected, located, and repaired with the spare lines. When the PLAs are used in
ﬁeld, the structure still possesses the easily testable capability. In addition to the
fault-tolerant structure, an automatic layout generator, called MRPLA, is presented
to generate the physical layout of the proposed fault-tolerant design. Some important
issues in a redundant design, such as chip area and propagation delay time, are also
addressed.

Chapter 4 describes the fault diagnosis and repair process for the proposed
fault-tolerant PLA. Two examples will be given to demonstrate that the proposed

10

fault-diagnosable PLA achieves a full diagnosability. In addition, a simple test
process is presented for detecting faults after the chip is packaged and used in ﬁeld.

Chapter 5 analyzes the effects of adding redundancy to the design of fault-
tolerant PLAs. A yield model for this design is presented and simulated. Based on
the yield model a simple, yet efﬁcient optimization method is pr0posed to determine
the optimal redundancy of various sizes of PLAs.

Chapter 6 illustrates a RAM-based PLA (RBPLA) structure that allows the
designers to change the design as many times as needed. In addition, a fault-tolerant
design of the RBPLA is also presented. Faults occurred in either the manufacturing
process or in ﬁeld use can be detected, located, and repaired.

Finally, the last chapter summarizes the work of this dissertation research

and presents suggestions for related future research.

CHAPTER 2

Fault-Tolerant Semiconductor Memories

 

Semiconductor memory has made tremendous contributions to the
revolutionary growth of digital electronics. The cost and space effectiveness of MOS
DRAMs (Dynamic Random Access Memories) has permitted their use in today’s
computers, for example, more than 100M bytes for mainframe and even 1M bytes for
personal computers. MOS SRAM (Static RAM), with low stand-by power, has been
used in small, portable, battery-backed systems [4]. Nonvolatile memories such as
EPROMs (Electrically Programmable Read-Only Memories) and EEPROMs
(Electrically Erasable PROMs) have opened up new areas of applications such as
ﬁeld-programmable microcomputers. Various needs from different systems
applications constitute the driving force toward improved performance/cost and

enhanced functions of semiconductor memories.

Throughout the short history of semiconductor memories, the number of
memory cells in a device has quadrupled approximately every four years [4]. The
device density has been increased from 64K, 256K, to 1M, and will soon to 4M and
16M in market. As device density has increased, improved design and fabrication
methods have been introduced to maintain an adequate yield of good devices per

wafer.

On-chip redundancy techniques have been commercially used to eliminate the
large number of chip failures due to the local defects, and offered yield improvement in
the manufacturing of the commercial memory chips [39]. The result is a reduction in
capital required for wafer fabrication to achieve a desired level of shippable product.
Instead of 1% or 2% of good dice per wafer in early chip yields, the right combination

11

12

of spare bits per die can suddenly make half the wafer good [42]. Basically, on-chip
redundancy techniques take an memory cells as spares. Each device is tested at
wafer probe, and if non-functional cells are found, the device is repaired by replacing
the non-functional cells with the spares. One of the biggest controversies
surrounding on-chip redundancy is whether to make the replacements by blowing
fuses electrically or by laser techniques. The argument will be discussed in Section
2.2.

Before the defective memory cells can be repaired, techniques for diagnosing
the location of the defective cells and efﬁcient spare allocation strategies are needed.
The repair of the on-chip redundancy is generally divided into two phases: diagnosis
to detect and locate all faulty cells, and repair to allocate spares for all faulty cells. In
Section 2.3, existing fault analysis and repair algorithms are reviewed.

Finally, two commercial memory products with on-chip redundancy are
illustrated in Section 2.4. They are: the fault-tolerant 64K DRAM developed by Bell
Laboratories [7], and the 8K x 8 high-performance CMOS SRAM developed by
Hitachi Ltd, Japan [38].

2.1 On-Chip Redundancy

The only integrated circuits so far to have exploited on-chip redundancy

techniques commercially have been memory chips.

At the level of 64K devices, devices are beginning to appear with on-chip
redundancy to increase yields and maintain reasonable manufacturing costs. As
memory density increases and geometries shrink (via device scaling and circuit
innovations) the die size must remain constant for producibility and yield
considerations. As such, defect density becomes a much more important factor than
with lower density devices since a single defect can wipe out a major section of
memory. Process cleanliness becomes more stringent as well [27]. To offset this,

on-chip redundancy allows the defective memory cells to be replaced by the spare

13

cells. Redundancy will become more important and probably mandatory at 256K
DRAM level and even higher level of device density. Table 2.1 summarizes the
memories which utilize on-chip redundant circuitry [27]. The on-chip redundancy has
been commercially implemented to 64K and 256K DRAMs, 16K, 32K, and 64K
SRAMs, and some others. The table also shows that the number of spares is
relatively small comparing with the device size.

On a memory with redundancy, incoming addresses are compared with the
locations of faulty bits; when a match is found, spare bits take over. Substitutions
can be made for individual bits, small clusters or large blocks, or rows or columns.
Spare rows and columns have become the most popular approach because they
represent a reasonable trade-off between yield enhancement and the number of
required elements and associated circuitry.

On-chip redundancy techniques are not free of penalties: spare elements
increase the chip area, and result in performance degradation and productivity loss.
The important attributes for on-chip redundant circuit design are: how much
redundancy to employ; how to apply it; and how much it will affect performance, die
size, and yield. The number of spare rows or columns is subject to several
considerations since spare cells in any form inﬂate die size and reduce the number of
chips per wafer. Furthermore, each spare element demands extra support circuitry
which cannot be repaired. Consequently, too much redundancy reduces overall repair
efﬁciency. The yield improvement factor, the ratio of the yield with redundancy to that
without redundancy, can be plotted as a function .of the yield without redundancy for
different number of spare elements. As more spares are added, the curve is
eventually increased and reaches a point of diminishing returns. Further increases in

the number of spare elements will start reducing the yield improvement factor.

2.2 Repair Techniques

As mentioned previously, the number of programming elements required is an

important consideration in the ﬁnal choice of optimal number of spare elements. On-

14

Table 2.1 Memories Built with Redundancy [27]

 

MANUFACTURER

OKI ELECTRIC
SIEMENS AG

NTT NUSASNIBO
BELL LABS

HITACHI
IBN

IBN
BELL LABS
IBN
INNOS
INTEL
ROSTER
HITACHI
TOSHIBA
TOSHIBA
TOSNIBA
TOSHIBA
INTEL
INTEL
INTEL

INHOS
lNNOS

HOSTER
INTEL

BOSTEN

SEEU

INTEL
MOTOROLA

NTT NUSASNINO

TYPE OF HEHORY

256K ORAN
256K ORAN

256K ORAN
256K ORAN

256K DRAM

288K ORAN
(azxxsl

72x BIPOLAR OPAP

(9Kx9)
64K DNA"

64: ORAN
an: cam
64! ORAN
sax ORAN
sax saan
“KSMN
64x seam
64x SRANICHOSI
54x snwums)
32x SRAM
16! sure

ISK SRAN

ISR SRAM
16V. SRAN

ISR SRAM
128K EEPROH

BAX EPRON

16K EEPRON
32R BIPOLAR PROM
ISR BIPOLA PRON
"BYTE Rm

m

NSN37256
NCA

"CA
NCA

NCA

RCA

NCA

NCA

NCA
IHSZBOO
IZISA
RKAI6A
NCA

NCA

NCA
TCSSSAP.
TCSSGSP
NCA

NCA
l2l67

INSIAOO
INSIAZOIIAZI

"[4167
NCA

NKZTSA

5213
3632
NCNTBISI
NCA

TYPE OF RECUNOANCY
6K CELLS

'SPARE ROUS S COLUMNS

SPARE rows 5 COLUMNS
B SPARE ROWS & 3
SPARE COLUMNS

l SPARE RON S I SPARE
COLUHR

iiSZ BITS WITH a WORD
LINES PER can

IOTH BIT

8 SPARE RUNS S 3
COIUHNS

8 SPARE ROWS 8 8
COLUMNS

a span: nous
4 SPARE COLUMNS
8 SPARE COLUHNS

2 SPARE COLUMNS

Z SPARE ROWS

I SPARE ROW. 2 SPARE
COLUMNS

2 SPARE ROHS S l
SPARE COLUMN

2 SPARE COLUNNS

6 SPARE ROWS. A SPARE
COLUMNS

Z SPARE ROHS. A SPARE
COLUMNS

3 SPARE ROWS

2 SPARE COLUMNS
8 SPARE COLUMNS

A SPARE COLUHNS

A SPARE ROUSaIZS
BYTES

25‘ REDUNOANT MEMORY
HATRIX- 2 COLUMNS

6 SPARE ROWS

A SPARE ROWS

ROWS ANO COLUHNS

COMPLETE RON REOUNOANCY

FOUR IHb MODULES

PROGRAMMING TECHNIQUES NOTE

HIGH VOLTAGE °ULSES AT HAFER
TEST POLY FUSE

”COS REGISTER

POLYSILICON LASER FUSE

HIGH VOLTAGE. POLYSILICON

FUSE

UNKNOWN

UNKNOWN

POLYSILICON LASER FUSES REQUIRES CRITICAL
MECHANICAL
POSITIONING

REQUIRES MAIN OE-
INHIBITING-EXTRA
GATE DELAY

(SAME AS ABOVE)

HIGH VOLTAGEIIZV) PULSES
AT uartn soar, POLY FUSE

HIGH VOLTAGE PULSES AT
HATER SORT. POLY FUSE
LASER PULSE AT HAFEP SORT
POLY FUSE -

LASER ZAP. POLY EUS

LASER ZAP. POLY FUSE
LASER ZAP. POLY FUSE

LASEP PULSE AT HAFER SORT,
POLY FUSE

LASER PULSE AT HAFER SORT,
POLY FUSE

LASER PULSE AT HAFER SORT,
POLY FUSE

LASER PULSE AT HATER SORT.
POLY FUSE

HIGH VOLTAGE PULSES AT
uartn SORT, POLY FUSE

HIGH VOLTAGE PULSES AT
HAFER SORT, POLY FUSES
HIGH VOLTAGE PULSES AT
HAFEP SORT. POLY FUSES
LASER PULSE AT HAFER SORT
HIGH VOLTAGE PULSE AT
HAFER SORT, POLY FUSES
HIGH VOLTAGEIZSV) PULSES
AT HAFER SORT. POLY FUSE
EPROH FUSES

UNKNOWN

UNKNOWN

 

(I) It is said that redundancy increases the

line.

yield by a factor of S to 30 depending on the maturity of the Fab

(2) Host Magnetic Bubble Memories also use redundancy to increase manufacturing yields by means of a "boot loop".

(3) See “A AND Full Hater RON“ by N.Y. rttano at al. 1980 lEEE ISSCC Digest of Technical Papers. Feb. 13-15, 1980.

NCA - NOT COHNERCIALLY AVAILABLE

15

chip storage of the information that identiﬁes the defective cell locations is a key
issue in redundancy. The programming elements used for this purpose fall into two

categories [53]: the laser programmable and the electrical programmable. Table 2.2
lists a comparison between laser programming and electrically fusible links.

Table 2.2 The Comparison between Laser and Electrical Programming [53]

 

Feature ' Laser Approach Electrical Fuses

 

Circuit Layout Links are placed anywhere Links must be accessible to external drives via bonding
pads or additional on-chip circuitry

 

 

Performance Access time of programmed and nonprogrammed Speed is generally adversely affected. particularly if
devices are indistinguishable both row and column redundancy are used.
Reliability Since exploded links ac covered with ﬁnal nitride High reliability requires guard rings around link regions

passivstion layer. reliability is extremer high

 

Area Penalty Ares increase for redundancy is slight -- increase will Area increase is also slight, but may not scale down as
scale down with finer design rules in future devices easily because of layout and reliability concerns

 

Flexibility Performance margins are easily tailored with Layout is not adaptable to unforeseen circuit nwds
”quick fixes“

 

Equipment costs Software development requirements and hence costs initial costs are lower due to relaxed software demands
are lures

 

 

 

 

 

The major advantages of electrical fuse blowing are that the redundancy can be
implemented with minimal initial capital expenditures and that existing test
equipment may be used [1]. Also, electrical fuses offer the simplicity of using an
unmodiﬁed wafer sort machine, but at the cost of requiring each fuse to be connected
to a bulky driver transistor. This extra transistor costs area and limits the number of
fuses that can be used, thereby complicating the circuit design. Electrical fuses could
conceivably be blown inside the memory’s package, opening up the possibility of ﬁeld
' repair. Laser programming presents the obvious advantage of conserving valuable
silicon real estate by eliminating the circuitry associated with blowing electrical fuses,
but it requires the addition of a costly laser to the sort equipment, and precise

alignment as well. However, lasers allow a wider choice of potential fuse materials.

16

Trade-offs between ease of design implementation, up-front capital investment, and
ﬁnal product cost determine whether laser or electrical programming is best for a
particular memory product [1]. As it is shown in Table 2.1, most of redundant
memory designs implement the laser programming techniques that perform "cut" and
"patch" operations in polysilicon links or fuses. Recently, Sandia National
Laboratories [3] have also devised a speedy method of on-chip repair that uses low

power lasers to cut and patch the metal lines.

2.3 Fault Analysis

Before the repair process is performed, fault analysis algorithm is ﬁrst called
upon to determine if there are any catastrophic problems on the chip or more defects
than the spare elements. If not, faults are further analyzed, and the sites of faulty
cells are logged into a fault map. According to the fault map, an efﬁcient spare

allocation of redundant rows and columns is applied to provide the repair solution.
The problem of spare allocation of redundant elements can be speciﬁed as

follows. Consider a rectangular array that consists of M x N cells, as shown in
Figure 2.1, where the dots in the array represent the faulty cells, and 2 spare rows
(Sr=2) and 3 spare columns (80:3) are assigned.

A partially defective chip is said to be repairable if the spare elements can
completely cover all faulty cells; otherwise, it is unrepairable. Therefore, the
objective of the fault analysis algorithm is either to quickly check the unrepairability,
or to provide repair solutions for the repairable devices. More speciﬁcally, if the
unrepairability of a device can be quickly determined, then the costly repair process
can be terminated early. On the other hand, if spare elements can be efﬁciently

utilized to cover the faults, then more devices can be claimed as good ones.

l7

 

N
1234567
0

 

#ri—i
O

 

 

 

 

 

 

 

s = Sc=3

 

Figure 2.1 Spare Allocation of Redundant Elements.

 

The problem of optimal spare allocation has been shown as an NP-complete
problem [30]. As a result, several heuristic algorithms have been proposed and they
are summarized in [20.58.63]. These heuristic algorithms can be classiﬁed into two
categories: row/column selection and unrepairability checking.

The following heuristics are used to select rows or columns for repair:
Broadside [58], Repair-most [58], and fault—driven [13].

The broadside approach employs a crude technique to locate each faulty bit
and to immediately repair it. No optimization is used. Spares are allocated in a very
inefﬁcient fashion, since no overall distribution of faulty bits is considered. This

results in failure to identify a potentially repairable device.

A limited usage of optimization techniques can be found in repair-most [58].
In this technique, row and column fault counts are employed to determine spare
allocations. Repair-most is implemented in a two-stage algorithm: must-repair and
ﬁnal repair. Must-repair determines either a row or a column that must be replaced
by a fault-free spare to repair the maximum number of faulty bits. This process is
iteratively repeated until no more faulty bits are left uncovered in memory by using
spares. This corresponds to a maximization criterion in fault selection; a minimization
in allocation of spares can be accomplished by an initial covering of faulty bits. This

information is supplied to ﬁnal-repair to ﬁnd a balanced time allocation for the desired

18

repair solution. This is accomplished by considering processing time, laser repair
time, and spare utilization [58]. Although this approach gives better results than the
broadside approach, undesirable features, such as an inability to provide repair
solutions for certain devices and no provision for user-deﬁned preferences, are still
left. '

Fault-driven [13] partially avoids the drawbacks of the repair-most approach.
In fault-driven, repair solutions are generated according to user-deﬁned preferences.
Repair is implemented using a two stage analysis: forced-repair and sparse-repair.
Fault-counters are still employed. F creed-repair determines speciﬁc rows or
columns that must be replaced by redundant copies; sparse-repair determines repair
solutions for all remaining faulty bits at completion of forced-repair.

The following hemistics are used to check the unrepairability: Diagonal-test
[5], Maximum-matching [30], Total-faults [20], Fault-count aﬁer Must-repair [58],
and Leading-element-test [63].

The diagonal-test approach is a fast test performed on the bits along the major
diagonal line of the memory. Since all the faulty bits on a diagonal line of a memory
cannot be repaired by the same row or column, if the number of faulty bits on a
diagonal line is greater than the total number of spare rows and columns, the memory
is unrepairable.

Maximum-matching approach uses the aid of graph theory. If the size of the
solution found in the graph is greater than the total number of spare rows and
columns, the memory is unrepairable.

Total-faults approach exploits the fact that the maximum number of faulty bits
that can be repaired by Sr spare rows and Se spare columns is MxSc«r-N><Sr-Schr
If the number of faulty bits in the memory is greater than that maximum number, it is
unrepairable.

Fault-count after Must-repair approach indicates that the total number of

unrepaired faults which can be recovered after Must-repair is complete is 2xSchr,

19

since there can be no more than Sc faults on a row, and no more than Sr faults on a

column.

Leading-element-test approach ﬁnds the ﬁrst faulty bit in the row of the
memory as the Leading-element. Let ISI be the total number of leading elements, and
d be the number of leading elements in which the other non-leading-element faulty
bits appeared at both the same row and column. If ISI or lSl+d is greater than the

total number of spare rows and columns, the memory is unrepairable.

Recently, several other fault-analysis algorithms [8,22,23,24] have also been
proposed to efﬁciently determine the repair solutions.

2.4 Fault-Tolerant RAM Design Examples

This section describes two fault-tolerant memory designs that have been

commercially available.

2.4.1 A Fault-Tolerant Dynamic RAM

Figure 2.2 shows a fault-tolerant 64K DRAM deveIOped in 1979 by Bell
Laboratories [7]. The design employs a total of 16 spare elements, 8 spare rows and
8 spare columns. Two spare rows, complete with decoder and driver circuitry, are
associated with each 16K quadrant, organized as 64 rows by 256 columns. Either one
of these spare rows may replace any one of the 64 main rows in the adjacent quadrant
or may replace each other, if necessary. Four spare columns including decoder and
sense ampliﬁer circuits, are associated with each pair of 16K memory quadrants. Any
one of the spare columns may be used to replace any of the 256 columns in the

adjacent quadrant or any other previously encoded spare column in the same group.

Replacement of a defective memory clement, whether row or column, may be

understood by referring to Figure 2.3, which shows a standard and a spare row

20

 

 

(a)

 

I ouan a 54 news it 256 cous ]

 

 

 

[ t . ]
saa::< smut:s
[ 2
\\< J

I 3 l

 

(b)

Figure 2.2 Fault—Tolerant 64K DRAM [7.52]!
(a) Physical Layout; and (b) Schematic Diagram.

 

21

 

STANDARD DECOOER

‘°,‘° A‘,“ ‘z.‘1 V00 V00 CW

 

 

' ""1 use:
7 ® Pnoonaunaug
LINK
(a)
SPARE DECODER
‘o It; ‘I T ‘a Van

 

 

L'LL'LL'LL'L LLLLL

Figure 2.3 Standard and Spare Row Decoders [52]:
(a) Standard Row Decoder; and (b) Spare Row Decoder.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

STEP a
REPEAT TABLE

I“ ------ "l
Low cosr . I
MEMORY Tesrtn l Lassa PROGRAMMER l
' I
uanowaa: IE 7‘1 couraor. I
canon nun-1n L; A COMPUTER |
' I
I
J l
r— ——————— y I
‘ 1.06pm :
74'. I as: no LASER a '
LENS MOVEMENT I
conrnor. I
l
I
I
l
I

 

 

 

l"
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
l
L.

Figure 2.4 Block Diagram of Majbr Hardware Components of Laser
Programming System [52].

 

22

decoder schematic. The essential difference between the standard and spare decoder
is that the former has half the number of decode transistors of the latter. The identity
of the standard decoder is deﬁned by unique connections of address and address
complement to appropriate decode gates. By contrast, the spare decoder has both
address and complement address tied to the gates of decode transistor pairs.

Disconnection of a faulty memory row is accomplished by exploding the
programmable link between the standard row driver and the row line, using a single
laser pulse.

An "open" link is deﬁned as one having an impedance of greater than 10 M Q,
and this requirement is easily accomplished in practice. The links are 3-um
polysilicon lines, deposited and patterned along with all active transistor gates and
covered with phosphorus glass, as is the balance of the chip prior to metallization.

No separate processing steps are associated with the polysilicon links.

Link opening is done on a commercially available laser trimmer which has been
modiﬁed to include improved positioning accuracy and a TV camera for visual
monitoring and alignment. Functional testing and laser programming are fully
automatic and require no additional wafer handling or manual intervention.
Replacement of each defective row or column takes about 1 s. A chip with no
defective bits requires no laser opening.

Figure 2.4 [52] illustrates a block diagram showing interconnection between
major hardware components of laser programming system. The experiment shows
that the programming time is quite short in the total laser appliance, includes chip

alignment, target detection, and laser movement.

2.4.2 A Fault-Tolerant Static RAM

Figure 2.5 shows a block diagram of a 8K x 8 high-performance CMOS Static
RAM (Hi-CMOS SRAM) developed by Hitachi Ltd., Japan [38]. The SRAM is

fabricated using double polysilicon technology to reduce the memory cell size. The

23

 

CSI'CSZ

‘II

’I
53

0
---q o

O BII PLANE

 

I.
—---1 p
)
2
III

-c-u‘

ME Mom
ARRAV

uamtﬂ 050M ‘
630(1230 M03

8 3mm] 050M *

---------q
—
-

I
I
I
I
I
I
I
I
I
I
I

2222222222]

 

----—-II

 

 

I
I
I
I
I
I man
I
I
I
I
s

Sercn nos
[[ cowon can b4

 

 

 

 

 

 

 

 

SPARE SELECI CIRCUI SPARE SELECT CIRCUII
COLUMN DECOOER . . . CW" DECODER
L‘—

II .3

lO SENSE AMPLIFIER,
IIo BUFFER (no)

 

 

 

 

 

 

 

 

 

 

 

 

I2 ‘12

Figure 2.5 Block Diagram of the 8K x 8 Bit Static RAM [38]-

 

 

 

 

 

 

 

 

 

 

 

 

 

 

r-Couoq ----------------------------
: SPARE ', Icﬂuat
I can. .; MEI-«om cm
I
: CELL
I»--- I. .............
:21 NORMAL
HE'— swncu uos
1
stair? 2222222222222222 r 2'7“ r" -
swm ; IsIl: I'Srl
"05 I rJL ! i {a l SPARE setter
I
g s: l 3352 3C '
I I I
C--- -IT..- .r'

 

 

 

NORMAL COLWN OECOOEFI

 

 

 

Figure 2.6 Block Diagram of the Redundancy Control Circuit [38].

IHIIII‘S‘IC poly ﬁlm
7

 

1 Si

 

 

 

 

 

Figure 2.7 Laser Diffusion Programmable Devices [38].

24

ﬁrst polysilicon layer is used for the gates of the transistors, while the second layer is
for the power supply line. The gate polysilicon length was 2 pm.

In order to improve the manufacturing yield of the SRAM, spare select circuits
are added to the column decoder, as shown in Figure 2.6. The redundancy circuit
utilizes new programmable devices for the column of the SRAM. The on chip
programming is achieved by applying laser pulse to an intrinsic polysilicon ﬁlm having

an n+ diffusion on either side as shown in Figure 2.7 [38]. Before application of the

laser pulses, resistivity between these two n+ layers is on the order of 1010 Q.

When a laser beam is applied to this structure, diffusion takes place from both sides,

and the intrinsic part is converted to an n-type. The resistance drops down to 2K 0.-
This results in electrical conduction between these n+ layers. In this design, an N2

laser-pumped dye laser (wavelength: 0.51 pm, pulse width: 7 as) with a beam energy

of about 107 W/cm2 was applied to intrinsic polysilicon having a 2 um linewidth and a
4 pm intrinsic layer length. In other words, the laser diffusion programmable device
normally stays at the OFF state until it is programmed to the ON state. This
programmable device is referred to Normal-off link.

The key advantages of this redundant circuit that utilizes laser diffusion
programmable devices are [38]: (1) this technique has excellent compatibility with
the Iii-CMOS process: (2) the column spare select scheme causes no access time
delay, and features a brief programming time where only two programmable devices
per spare are needed: (3) only a very small number of transistors per programming
circuit are necessary. Therefore, programming circuit area is minimal; and (4) no
damage to surface passivation insulators was observed. This means that the RAM’s
reliability cannot be affected without the use of extra surface passivation ﬁlms after

the laser process.

25

2.5 Discussion and Summary

On-chip redundancy techniques have been used extensively by semiconductor
manufacturers for fault-tolerant memory designs to enhance the chip yield. This
chapter has reviewed the repair techniques and repair process of fault-tolerant RAM
designs. Faults in a partially defective memory can be detected, located, and
repaired. Due to the similarity of both memory and PLA, the repair techniques and
processes developed for fault-tolerant memories should be able to apply to PLAs.

In the next chapter, the design of fault-tolerant PLA is discussed. Faults in a
partially defective PLA will be detected, located, and repaired.

CHAPTER 3

Fault-Tolerant Programmable Logic Arrays

 

Semiconductor device manufacturers continuously strive to increase chip
complexity, to reduce the speed-power product, to increase chip reliability, and to
produce the most useful and effective devices. As the integrated circuits progress
from LSI, VLSI to ULSI (Ultra Large Semiconductor Integration) technology, a single

chip may contain 106 transistors. The smaller dimension brings greater parasitic
capacitance and higher wiring resistance. This leads the slower signal propagation
time delay through the necessary wiring when random logic design is used. An
example is that the microprocessors lag behind simpler memory chips, if using random
logic design [17]. In other words, routing is crucial in ULSI system. On the other
hand, complexity index is measured by the regularity factor which is an important role
in accomplishing the ULSI. If all the circuits were realized by regular structures, the
regularity factor will be high. Regular structures, such as ROM, RAM, PLA, etc., in
ULSI design, will be used instead of random logic structures [45] which need more
chip area and introduce long propagation time delay due to the necessary routing and

placement.

The only integrated circuits so far to have exploited fault-tolerant techniques
commercially have been memory chips. With redundancy, partially defective memory
chips can be repaired. There are many similarities between memory chips and PLAs
that, at ﬁrst sight, suggest the application of the same redundancy techniques to both
these devices for fault-tolerant capability: in a Field PLA (FPLA), for example, a

26

27

fault in a product term can result in the term being both logically and physically
deleted [49]. This term can be replaced by a fault-free product term supplied by the
spares in a manner similar to replacement of rows and columns in a redundant
memory.

However, there exist unique conditions in the internal structure of a PLA that
severely limit the direct application of memory redundancy techniques. For example,
consider those faults, such as stuck-at faults in the OR plane, that require spare
output lines for repair. The obvious solution is to provide more lines than originally
required. In this case, at least in principle, some spare lines in the redundancy design
may be reserved to repair this type of fault. In practice however, major routing
problems are encountered when the switching of a faulty output signal line to a spare
has been accomplished.

In this chapter, a fault-tolerant design of an alternative regular structure, PLA,
is presented. A fault-tolerant PLA should be designed in such a way that it is fault
diagnosable and repairable during the manufacturing process, and testable in ﬁeld
use. This chapter is organized as follows. The basic structure of a PLA and its fault
models are introduced in the ﬁrst section. In Section 3.2, a repairable PLA (RPLA)
design and its repair rules are presented. Before a defective RPLA can be repaired,
the locations of defects must be precisely identiﬁed. Therefore, a fault diagnosable

PLA design is discussed in Section 3.3. Chapter summary is given in Section 3.4.

3.1 Programmable Logic Arrays

A programmable logic array (PLA) is a two-level AND-OR logic network that
implements the combinational circuits. By adding the storage elements such as
latches and ﬂip-flops, PLA can also realize sequential circuits. PLAs are often used
to implement controller, decoder and other glue logics needed between circuit blocks.
A typical large PLA may have as many as 50 inputs, 67 outputs, and 190 product

terms [32]. Due to the fact that complex chips (and in particular microprocessors)

28

can be efﬁciently implemented when using PLAs, a trend towards manufacturing

larger programmable chips is expected.

3.1.1 PLA Structure and Notation

A typical PLA consists of two planes: AND plane and OR plane. Figure
3.1 (a) shows a PLA implemented by a NOR-NOR structure in NMOS technology.
The PLA contains two input signals A and B, three output signals 0,, 02, and O3 ,

and three product terms P1, P2, and P3. Figure 3.1 (b) illustrates the logic functions

the PLA realized, while Figure 3.1(c) shows the cubic representation of the
personality of the PLA, where the cubic notation is listed in Table 3.1.

3.1 .2 Fault Models

To design a fault-tolerant PLA, it is necessary to consider the physical defects
that are likely to occur in the PLA with the speciﬁc technology being used. Three fault
models are considered for the N OR-NOR PLA structure in NMOS technology:
crosspoint faults, stuck-at faults, and bridging faults [2,41,43,50,54,59].

A crosspoint fault is caused by the unintentional presence or absence of a
transistor. Crosspoint faults can be subdivided into two classes: missing crosspoint
faults and extra crosspoint faults. The former is due to a missing contact at the
crosspoint in the AND plane or the OR plane; the latter is due to the unwanted
presence of a contact at the crosspoint. The crosspoint fault is technological

independence.

It is possible to distinguish four types of crosspoint faults according to the
location of the faults: growth faults, shrinkage faults, disappearance faults, and
appearance faults. A growth fault (or G-fault, for short) is caused by a missing

crosspoint in the AND plane, resulting in the disappearance of an input variable from

29

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

AND plane OR plane
"““ lLIlLILLI V“
r51 ‘ l P a X
4:] 1; 4H, 1 Product
_I-1_ J P2 2 AB .
_:I “II; I; w _ lrnes
"I am
(b2 .
q’1
- A B O1 (32 03
Inputs Outputs
(a)
Logic function of a PLA Cubic notation
Ol =P3 =B 0- 0 0 1
O2=P2+P3=AB+B 10 o 1 0
03 = P1= A - l l l 0
AB 010203
(b) (C)

Figure 3.1 Programmable Logic Array:
(a) NMOS Implementation; (b) Logic Functions; and
(c) Personality in Cubic Notation.

 

Table 3.1 Cubic Notation

 

 

 

 

 

AND plane OR plane
1: connect to complement input connect to output
0' connect to true input does not connect to output
- no connection not used

 

 

 

 

30

a product term (Figure 3.2 (a)). A shrinkage fault (or S-fault) is caused by an extra
crosspoint in the AND plane, resulting in an additional input variable in a Boolean
product term (Figure 3.2 (b)). A disappearance, or D-fault, (appearance, or A-fault),
is due to a missing (extra) crosspoint fault in the OR plane as shown in
Figure 3.2 (c) (Figure 3.2 (d)).

Regardless of the technology used in the actual implementation, the logical
line stuck-at fault model is ﬁequently used. A stuck-at fault is the simplest type of
fault that can occur in a PLA. A stuck-at fault is a line permanently at a logical 1 or 0
state. This can result from the faulty line being opened or shorted to the power line or
Ground line (GND). A stuck-at-O (s-a-0, for short) fault at an input bit line causes a
variable disappearing from an implicant. For instance, as shown in Figure 3.3 (a), the

fault-free logic function in the product line was: B, and now becomes B, because the
input bit line has an s-a-0 fault. Similarly, an s-a-0 fault at product line changes the

function from AB toB asshowninFigure 3.3 (b).

A stuck-at-l (s-a-l) faulty input bit line results in s-a-0 faults at those
product lines which have contacts in the crosspoints with this faulty line
(Figure 3.3 (c)). Similarly, an s-a-l faulty product line causes s-a-l faults at those
output lines which have contacts with that faulty product line (Figure 3.3 (d)).
Finally, the stuck-at faults at the output line cause the output lines to permanently be
the faulty state.

A bridging fault is a short between two adjacent or crossing lines. This fault
forces the same logic value to appear in the bridged lines. A bridging fault may cause
either a logical AND or a logical OR, depending upon the technology being used, of
the bridged Boolean functions in the plane of occurrence. In the MOS technology, a
wired-AND logic is assumed. Bridging faults can occur in both AND and OR planes.
It should be noted that a bridging fault will cause bath true and complement bit lines
to have s-a-0 faults if both lines are shorted.

31

 

 

 

 

 

Missing E
xtra
Vdd—#1 II‘S [Iii/P Vdd—£1“ II‘LTIy—P
A X B E A X B E
Fault-free logic Fault logic Fault-free logic Fault logic
p-_-A'§——>P=A P=X—>P=-AB
(a) (b)
Vdd Vdd
——A 21-32 _-‘-' AB
M55138 _— Extra ﬂL_—
‘E A '\ art A
E73 ——B
O o
F ault-free logic Fault logic Fault-free logic Fault logic
0=X+B —> O=B 0: AB —> 0=AB+A

(c) ((1)

Figure 3.2 Crosspoint Faults:
(a) Growth Fault; (b) Shrinkage Fault:
(c) Disapperance Fault; and (d) Appearance Fault.

 

32

 

 

 

 

Vdd S-A-o 3
an
jg] i/i p1 = z '5 L:_ X
47' 5 _ «I
_l_'l p2 = A T_ B
I; 411

 

 

A Y
Fault-free logic Fault logic 01

Pl 3 A B ' P1 = B Fault-ﬂee logic Fault logic

P2=X ' 122= 1 01=A+B—’ 01:3
(a) (b)

V

Vdd S-A-l S-A-l i
.415] / P =XE %— 2;
_.£1

 

I; ET B

A $7
01

Fault-free logic Fault logic

 

 

 

 

Fault-free logic Fault logic
01 =2. ‘I' B ——’ 01 = 1
P2 = A —. P2 = 0

(C) (d)

Figure 3.3 Stuck-at Faults:
(a) Input Bit Line Stuck-at-O; (b) Product Line Stuck-at-O;
(c) Input Bit Line Stuck-at-l; and (d) Product Line Stuck-at-l.

 

33

3.2 Design of the Repairable PLAs

To avoid complex routing and to repair the faulty PLA, a schematic diagram of
a repairable PLA (RPLA) is shown in Figure 3.4. In this design, two spare selector
circuits are added internally to control the reconﬁguration of the input/output signal
lines. The selectors are: the Spare Input Selector Circuit (SISC) and the Spare
Output Selector Circuit (SOSC). In addition, several spare lines are also augmented
in that design.

To repair a faulty RPLA, a set of repair rules which are based on the fault
models discussed in the previous section, must be established. The repair rules are
summarized in Table 3.2.

3.2.1 Repair Rules

When a stuck-at fault occurs in an input bit line, the line is forced to be either 1
(for s-a-l fault) or 0 (for s-a-0 fault). A spare bit line programmed with appropriate
crosspoints is selected to replace the faulty line, and the faulty line is then
disconnected from the SISC circuit shown in Figure 3.4. However, disconnecting the
faulty line may cause a "floating" logic. For sake of safety, the disconnected faulty
line must be connected to Ground line.

Similarly, a stuck-at faulty output line replaced by a spare output line is
disconnected from the SOSC circuit and connected to Ground line. In addition, the
faulty output line must be disconnected from the pull-up transistor because a
malfunctioning pull-up transistor may cause a short between the power line and the
grounded faulty output line.

An s-a-0 faulty product line does not affect the output functions of other
product terms of the PLA realized. However, an s-a-l faulty product line may
signiﬁcantly interfere with the functions, if the faulty line is not repaired. In addition

to the use of a spare product line to repair an s-a-l faulty line, the faulty line must be

34

 

 

Inputs Outputs
0 2 ® : OE-
Normal OFF Normal ON
Programmable Link Programmable Link
Product SOSC

Figure 3.4 Schematic Diagram of a Repairable PLA.

 

35

Table 3.2 Repair Rules

Fault Type Spare line Faulty Line
Stuck—at fault
Input bit line Input bit line remark 1
Product line Product line remark 2
Output line Output line remark 3
Crosspoint fault
Growth Input bit line remark 1
Product line remark 2
Shrinkage Input bit line remark 1
Product line don’t care
Disappearance Product line don’t care
Output line remark 3
Appearance Product line remark 2
Output line remark 3
Bridging fault
Adjacent
Input bit lines Input bit lines remark 1
Product lines Product lines remark 2
Output lines Output lines remark 3
Crossing
Input and product lines Input bit line remark l
and product line remark 2
Product and output lines Product line remark 2
and output line remark 3

Remarks: 1. Faulty bit line is disconnected from SISC, and is connected to GND.
2. Faulty product line is disconnected from the pull-up transistor, and
connected to GND.
3. Faulty output line is disconnected from the pull-up transistor and from
SOSC, and connected to GND.

36

disconnected from the pull-up transistor and also connected to the Ground line for
safety reason.

For repair of the crosspoint faults, G- and S- faults are repaired by either
spare input lines or spare product lines. Similarly, D- and A- faults are repaired by
either spare output lines or spare product lines. In other words, the spare product
lines can repair all the four types of crosspoint faults. If the crosspoint faults are
repaired by spare input bit lines or spare output lines, the repair process is the same
as the procedure for repairing the stuck-at faults. On the other hand, if the crosspoint
faults are repaired by spare product lines, two cases can be identiﬁed: (1) the repair
of S- and D- faults: and (2) the repair of G- and A- faults.

An S-fault, with an extra crosspoint in a product line of the AND plane,
causes the function realized by the product line to shrink. For example, as shown in

Figure 3.5, an S-fault changes the function from A to AB due to an extra crosspoint
occurring at the true bit line of B. The use of a spare product line programmed with
appropriate crosspoints can repair this fault. Since the function realized by the faulty
line is included by that of the spare line, the former function is then redundant and
does not affect the overall function. Therefore, the faulty line can be retained in the
array. However, in order to remove the possible redundancy for high fault coverage,
we suggest that the faulty line be disconnected and connected to Ground line.

Similarly, D-faults are repaired in the same manner.
Figure 3.6 shows that the function Ps1 =AB, realized by a spare product line

programmed with appropriate crosspoints, is included in P1: A which is realized by
the product line with a G-fault. The use of the spare product line cannot correct the
output function as shown. Therefore, the faulty line must be disconnected, i.e., the
faulty line is disconnected from the pull-up transistor and connected to the Ground
line. Similarly, the A-faults are repaired in the same way.

Bridging faults force the bridged lines to have the same logic. In general, the

adjacent bridging faults are repaired the same as that of stuck-at faults. For the

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

ll ~~ l Spare product line
S-fault, -. L p =
Extra ' 4g- 5. at
N P = AB
4; ' ‘
AND - OR
plane .04 plane
InPuts Output = P31 + P1
=A'ri +A
= A
Figure 3.5 The Repair of S-fault with a Spare Product Line.
Spare product line
I, > Lil . . II . , _
' - - . P = AB
Missing ~ i1; , -. ,_ i; . L 5‘
wk.“ _ g i g _.. . V P A
1-
i; E) . .
AND 2 '2 ' OR
plane .04. -oQ- plane
A B

Inputs

Output = Ps1 + P1: A (Incorrect)

Figure 3.6 Tire Repair of G-fault with a Spare Product Line.

 

38

crossing bridging faults, each bridged line is repaired by the same type of spare line
as illustrated in Table 3.2.

3.2.2 Repairable PLA

A repairable PLA is augmented by adding spare lines and two control circuits,
SISC and SOSC. In addition, two types of programmable links are employed: Normal-
on link and Normal-off link. As suggested by their names, the Normal-on (Normal-
off) link remains at the ON (OFF) state until the link is programmed; it then alters its
state. The programming techniques of Normal-on and Normal-off links have been
discussed in Section 2.4 for the designs of fault-tolerant 64K DRAM’s [7] and Hi-

CMOS 8k x 8 SRAM’s [38].

3.2.2.1 SISC and Spare Input Bit Lines

The SISC is added to the input portion of the conventional PLA between the
input decoder and the AND plane. The SISC, as shown in Figure 3.7 (a), consists of

programmable links and connecting lines with associated circuits.

The SISC circuit operates as follows: prior to the programming of the links, the
input signal line connects to the column line through the Normal-on link as in the
regular operation of a PLA. Since the Normal-off link is in the OFF state, there is no
connection between the input line and the spare input line. When faults are detected
and their faulty lines are located, these faulty lines are disconnected from the inputs
by opening the Normal-on links. These inputs are then switched to connect spare
input lines by programming the Normal-off links to the ON states.

More precisely, the mechanism of the line reconﬁguration is described as the
switches shown in Figure 3.7 (b), where the SI witch (equivalent to Normal-on link)

is closed and the 82 switches (equivalent to Normal-off links) are opened during the

39

normal operation. When the faulty input line b.| is found, for example, suppose that
the spare input bit line P31 is assigned to repair it. This can be accomplished as
shown in ﬁgure 3.7 (c), where the SI switch is opened and the $2 switch in the

lower connecting line is closed. In this case, the path is formed by connecting the

spare input line Ps1 instead of the faulty input line b1.

 

Spare bit lines Input bit lines

srsc P 22 $2 Norma... um.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

@E
{ii SEE: El“ Normal-offlink
“H HP “H ‘H
Input signals
(a)
b ANDplane b AND lane
P31 1 P31 1 p
SI I I L ..L I L L
62—6— 6_<S_ 6— rd—
529’ <f <1 9’ 52 <{ sf ?
6—6— 6—6— 5‘6— 6- ‘
529/ 9’ <{ 9’ ”sf If If 9/
Input signals Input signals
' (b) (C)

Figure 3.7 Spare Input Selector Circuit (SISC):
(a) Schematic Diagram; (b) Normal Operation; and
(c) Line Reconﬁguration.

 

Figure 3.8 (21) illustrates the stepwise operations of the SISC and spare bit
lines. A faulty bit line, as indicated by the dotted line, is repaired by the following
steps (Each step is numbered in sequence). First, the faulty line is disconnected

40

 

  
      

 

 

 
    

 

 

 

    
 
 

 

 

 

 

 
   
      

     

 

 

 

 

 

 

 

 

 

Spare
input bit
lines
Unused Desired
GND
3 0’ 2 a“
,_ Iamxamﬁil ilil'ﬁl
g 8‘] ‘3» Spare product lines gl“ Spare
. product
gt"- 3'2 [Err A 1 “5525?; LC 5'5 lines
ND P am unscrew;
r fill- =
I] 5o __-.,,,,_,,,g;__§ ;.;; AND plane
I- I- ;JI- heaven-remain
Normal product line :- uui'g q
I. :3}. “WW‘W’WMIM Normal product lines
9 “shantwraararii Iran:
I- I— ]- unaware
I: I: I?” L] é (I arraxaeemﬁlwam
1r: < c = \LmrerIarmrInu.
input bit 2; astriszirsfiilar \inputbit
<5— 6_ 6— 6— lines ii 5?? lines
9’ <f r 9’ amgmnargsgrarg
Input signals Input signals
(a) (b)
Normal-offlink El u Normal-on link 0 9

Figure 3.8 The Programming Procedure of SISC and Spare Input Lines:
(a) Line Reconﬁguration; and (b) Physical Layout.

 

41

from the SISC (Step 1), and grounded (Step 2). Then, the desired spare bit line is
disconnected from the GND line (Step 3), (for sake of safety, the unused spare bit
lines are generally grounded), and connected to input signal through the SISC (Step
4). Finally, the crosspoints in the spare line are programmed (Step 5).

Figure 3.8 (b) shows that the width of a pair of spare input bit lines is 32 x,

and the size of the SISC is (85n +11) x 16n 12, where n and s" are the numbers of

input lines and spare input lines, respectively.

3.2.2.2 SOSC and Spare Output Lines

The SOSC is designed in a fashion similar to the SISC. The output portion of
the conventional PLA is modiﬁed by inserting the SOSC between the OR plane core
and the output inverters.

Similar to the stepwise operations of the SISC, Figure 3.9 (a) shows the
programming procedure of the SOSC. F‘u'st, the faulty line is disconnected from the
pull-up transistor (Step 1) and the output line (Step 2), and grounded (Step 3). Then,
the desired spare output line is connected to the SOSC (Step 4) and the pull-up
transistor (Step 5), (for safety, the unused spare output line is disconnected from the
pull-up transistor). Finally, the appropriate crosspoints in the spare output line are
prom-mm

Figure 3.9 (b) shows that the width of a spare output line is 22 x, and the
length of the SOSC is ( 8sm+14 ) A, where Sm is the number of spare output lines and

the extra Ground signal line is 14 x in length.

42

 

      

  
  

      
 

    
 
  
   

     

Vdd . ,3” 'I

1 "'3 ” ﬁg s

u" a §

Spare product line Spare rel; £§EI§

pI'OdUCt ll IthEMm-g‘é‘

§_§~‘-§

OR plane Normal .ﬁimmﬂmmglmm ‘
‘12 as: w. n.
N al product 11 ....-..m.——~—'m_m,_.

orm
product line

GND

WI/I/IﬂﬁWﬁﬁW/I/Atﬁ'l/Iﬂzﬁ

Output inverter 2
Output inveter GND

(a) (b)
O Normal-on link [I Normal-off link

Figure 3.9 The Programming Procedure of SOSC and Spare Output Lines:
(a) Line Reconﬁguration; and (b) Physical Layout.

 

43

3.2.2.3 Spare Product Lines

Figure 3.10 illustrates the stepwise programming procedure of the use of
spare product lines. First, the faulty product line is disconnected from the pull-up
transistor (Step 1), and grounded (Step 2). Then, the desired spare product line is
programmed so that the line is disconnected from the Ground line (Step 3) and
connected to the pull-up transistor (Step 4). Finally, the crosspoints in the spare line
are programmed (Step 5 and 6).

Figure 3.10 (b) shows that a spare product line is 22 x in width.

3.2.3 Automatic Layout Generator

The computer-aided design (CAD) tools play very important roles in VLSI
design. They can reduce the turnaround time and make design changes more quickly.
In this study, an automatic layout generator, MRPLA [10], has been developed and
implemented in Sun 3/ 160 for generating the physical layout of the repairable PLA.

MRPLA requires a template when generating the repairable PLAs. The.
template contains rectangles, or tiles, ﬁlled with mask information. These tiles are
labelled with names that can be called by Mquilt [36] routine to be aligned according
to a certain semi-regular structure which MRPLA will deﬁne.

The ﬁrst step in making a MRPLA template is to design a sample RPLA, as
shown in Figure 3.11. This RPLA should include at least one example of each
possible combination of template cells. With this template, MRPLA will have a
correct example to generate its own larger RPLAs.

Once a template has been designed for a sample RPLA, tiles can be deﬁned
for each cell in the template. There are 12 groups of tiles used in the MRPLA
template. They are

(1) the core of the AND plane;
(2) the core of the OR plane;

 

Vdd Unused spare product line
.L J.

CWdeﬁih ;§§aawa
eebk %

 

 

D re Pro uct line
4tiuuaaa-wnnﬁbﬁi
v v 6v v v
.L .L
1 3* gr r-Lrb
2 Spare output lines

 

a :33:
M

 

ily Pill Hui line

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

tSﬁp£are Input bit lines Output lines
lines
(a)
AND Plane + OR Plane

 

“\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\‘A \-

 

 

 

 

 

 

 

4 ‘ 35 . ............
‘ ‘1 ~ . .2 ' .........
if i ’ 41- A ..... I
1 5 2
(b)
O Normal-on link U Normal-off link

Figure 3.10 The Programming Procedure of Spare Product Lines:
(a) Line Reconﬁguration, and (b) Physical Layout.

 

45

 

 

Figure 3.11 A Sample RPLA Template.

 

(3) the sides of the AND plane;

(4) the sides of the OR plane;

(5) the top and bottom of the AND plane;
(6) the top and bottom of the OR plane;
(7) the tiles between planes; '
(8) the tiles of the spare input lines;

(9) the tiles of the spare output lines;

(10) the tiles of the SISC;

(11) the tiles of the SOSC;

(12) the tiles of the spare product lines.

Each tile in the sample RPLA template, as shown in Figure 3.11, is labelled
by a character. An array with the predeﬁned characters will represent a userodefined
RPLA. For example, consider an RPLA whose personality and spare line allocation
are speciﬁed in Figure 3.12 (a) and (b). According to the predeﬁned labels, a
character array, as shown in Figure 3.12 (c), is generated to describe this RPLA.
Finally, the physical layout of this RPLA is generated by MRPLA and shown in
Figure 3.12 (d).

The MRPLA is currently developed for the RPLA design in NMOS technology.
However, extension to other technology can be readily made. The automatic layout

generator MRPLA is more feasible than generating layout by using a graphic editor.

3.2.4 Performance

According to the physical layout generated by MRPLA, both chip area and

propagation delay time are evaluated.

47

 

2inputs, FffHIiJJj
1 0 0 1 3 product lines, DbbgkkLLN
0- 10 Zoutputs, c1Xe24mmp
- 1 1 l 2 spare input lines, CXxE3 6MMP
l spare product line, ex 1 e 4 4mmp
2 spare output lines BAaOoR

(a) . (b) . (e)

 

   
    

 

lSW-B‘ \‘n ‘5.“ QLN‘wyﬁiﬁ' \\:\\\

IE.-5l DEVI? -- 3...;

unite-ins :mtI-zatstwsmu lg"; §.§, 1E.

  
  
 
  

 

   
  
    
 
 

  
    

   

 

a 55.} 5.3%“: iii:
\ _ ,.'.:t_ u 5.553%le rum: IC‘ u- n::‘ﬁ..~.g.....3...‘-......~
§p3l§=ﬂHi-— gm:::PIE-E g 3351' J:
I XS" ‘d: \ \‘Ut L" \“O‘i igi‘é
II?“ 4" I ”Inga! a "D _. " :3'1‘ 2::
—:-‘ill let IEI‘iBH' E S
:\\‘3\‘!I,3!’L\‘: ’3‘Fk\w SW umP-Illelil-IJNWWWIMIISW

 

       
 

$15,347.95

Air

H

1.3 I

< t

s! I

35'

.24 (‘1?

m- («i
I

 

Nuaﬂsnia Ii
I-H-Il-
*2~sV§k\\\\V:!~;‘IhNEIF;IIS

  
 
 

nn...-mm-.---.......u......--.

I/I/Il/Iﬁi’i’l‘ WIAI

   

\\\\\\\\\\\\VNm“§li§ﬁ§§ltﬁ%d&\

i-E'T Elli“).

Ctk\\\‘~8i:C\QC€i em

  
 

(d)

Figure 3.12 MRPLA: (a) PLA Personality, (b) Spare Line Allocation,
(c) Character Array, and ((1) Physical Layout Generated.

 

48
3.2.4.1 Chip Area

To assess the chip area requirement of the designed RPLA, a floor plan is
shown in Figure 3.13. All dimensions are given in units of lambda. The ﬂoor plan is

sketched according to the actual dimension of the physical layout generated by
MRPLA.

 

 

 

 

 

 

 

 

 

 

 

 

 

a
27 2. Pull-up & links
A .
22$ 2. Spare product lines 228
‘ <— 8mL—D-C mxu<51u
in -
3pm AND GND OR 8pm GND
9 s3: c032- 1.25:
tron 3P 1
165 N
‘30kn n h‘ 1631)“ :416L
srsc 11 +88" 1 i sosc
' 14 + 83m 2.
Vdd Decoder ‘
33 ‘ Ou ut
§ 3] 3. decg’der

 

 

 

 

 

Figure 3.13 Floor Plan of an (sn,sp,sm)-RPLA.

 

For simplicity, let an (sn,sp,sm)-RPLA be denoted as an RPLA with sn spare
input bit lines, sp spare output lines, and 5m spare product lines. Table 3.3 lists the

area overhead for various spare line assignments, where the area of an (sn,sp,sm)-

RPLA is estimated as:
Area of an (sn ,sp ,sm )-RPLA = (30 + 16(n + Sn» x (71 + 8p + 8sn + 223p)

+ (21 + 8m + 223m) x (78 + 8p + 225p + 88m)

49

Following the physical layout generated by the UCB tool MPLA [40], a ﬂoor

plan for a conventional PLA is shown in Figure 3.14. Thus, the area is estimated as:

Original PLA area = 64(2n + m)p + 8(152n + 76m + 49p) + 3724

 

 

 

 

 

 

 

 

 

 

 

 

 

 

A
134)“ Pull-up & links
4F— : z :
8m}. 51
’ AND GND OR GND
3‘ 8p 3.
connec-
non
302;. l6n2. ,‘147v,'
Input decoder T t Output decoder
Vdd 42 it 42 A.
Figure 3.14 FloorPlan of the PLA.
Table 3.3 Area Overhead in RPLAs
original (102,1) ' (29212) ' (2’3’2) '
PLA RPLA RPLA RPLA
n p m Area Area % Area % Area %
50 190 67 2210460 134868 6.10% 209160 9.46% 241346 10.92%
60 200 60 2495564 142564 5.71% 220728 8.84% 255202 10.23%
100 200 100 4104524 189924 4.63% 275768 6.72% 331362 8.07%
100 400 100 8022924 253924 3.16% 400568 4.99% 456162 5.69%

50

3.2.4.2 Propagation Delay Time

The propagation delay time is based on the assumption that the delay time of
a logic gate is directly related to the driving capability of its transistor. While the pull-
up time is limited by the effective load capacitance and the charging current provided
by the pull-up transistor, the pull-down time is determined by the effective load
capacitance and the charging current drained by the pull-down transistor. The pull-up
time is usually longer than the pull-down time in NMOS technology and is commonly
considered as the delay.

An over-simpliﬁed delay time model could be written as T = 1: Chad where x

is a constant determined by the average charging time and the high state output

voltage, and Cload is the effective load capacitance [37]. Eventually, the effective
load capacitance is contributed by the transistor gate capacitance, Cg, and signal path
capacitance, Op. The transistor gate capacitance is due to the oxide interposed

between the gate and the substrate of a pull-down transistor. The signal path
capacitance is deﬁned as the capacitance presented by a signal path of 8 )t that is
approximately equal to the spacing between two product lines, or half of the spacing
between two input lines.

The delay time penalty is deﬁned as the increased delay time for the
redundancy delay. The delay time penalty for the AND plane of the RPLA in Figure
3.15 (a) is 2n / (2n+15.5p) [64]. Similarly, the delay time penalty for the OR plane of
the RPLA is m/(m+15.5p). The delay time penalty for the RPLA is 2n/(2n+15.5p) +
m/(m+15.5p). The detailed derivation of the above delay time penalty can be found in
Appendix 1. Table 3.4 shows the delay time penalty for various sizes of RPLAs.T It
should be noted that the delay penalty can be alleviated by the spare line allocation
shown in Figure 3.15 (b) [64].

51

Table 3.4 Delay Time Penalty of the RPLAs

n p m Delay Time Penalty
50 190 67 5.5%
60 200 60 5.6%
100 200 100 9.2%
100 400 100 4.7%

 

Snare . nuct lines

    

SISC l SOSC

Input Output
(a)
Spare Spare
input output

lines lines

Spare Product Lines

SISC SOSC

 

Input Output
0))

Figure 3.15 Different Allocation Schemes of Spare Lines:
(a) Spares Placed in the Border; and (b) in the Middle.

 

52

3.3 Design of the Diagnosable PLA

The design of RPLAs has shown that partially defective chips can be repaired
without reconﬁguring the external routing [64,65]. The design has enhanced the chip
yield signiﬁcantly [62]. However, before a defective PLA can be repaired, the
location of the defects must be precisely identiﬁed.

In this section, the design of a fault-diagnosable PLA (FDPLA) is presented
to achieve full diagnosability of single and multiple stuck-at faults, bridging faults, and
crosspoint faults. The design of an FDPLA requires that the design must be capable
of detecting, locating, and repairing faults during the manufacturing process, and also

capable of performing chip testing in ﬁeld use.

3.3.1 Augmented Circuits

To fulﬁll the above design requirements, a schematic diagram of a fault
diagnosable PLA (FDPLA) is presented as shown in Figure 3.16. The original PLA
is augmented by control circuits and two shift registers: the input lines’ shift register
(ISR) and the product lines’ shift register (PSR). In addition to the scan signals (Sin

and Sour) and the non-overlapped clock signals ((11 and ()2), ﬁve extra control

signals (Mp, Mlv R, W, and Vdd1) are needed to operate the ISR and PSR.

3.3.1.1 Product Lines’ Shift Register (PSR)

Figure 3.17 shows the schematic diagram of a PSR cell. The number of PSR
cells in the proposed FDPLA is half of the number of product lines, i.e., each PSR
cell’s output (labeled as "Next Value") is shared by two adjacent product lines

through multiplexing. Multiplexing is used because no more than one shift register

cell can ﬁt into the narrow 16 3. width shared by two product lines. The PSR is used

53

 

‘ MIRVW

-Sout

:U,JO “3).:0

     

1: la

I i 2 0
Inputs " Outputs -

Figure 3.16 A Schematic Diagram of a Fault-Diagnosable PLA.

Hort RC (Register Coil)

:‘IC multiplexing Circuit) value

r1 “1
Five“ "{:1 __ _] "=i. I
__..__|

pulp:

E
ﬁr:
(

 

 

 

 

_

ll" Li

 

1r"ll‘i

I. vnﬂuc
T {- ""1

, I

. ﬁg. 42.!

L__.L..L _.___
14,)!!! ‘b‘ "i’a

term—7:4.

 

 

F'-
a}
__u__—J
5-

v~
l

 

CC
(Control
Circuit)

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Figure 3.17 A Product Lines’ Shift Register (PSR) Cell.

 

54

to disable all product lines but one. This allows the effect of a single product line on
the output to be observed.

Consider the shift register cell commonly used for testable PLA design, as
shown in Figure 3.18. When the cell holds a logic 1, the pass transistor is conducting
and therefore the product line is disabled (forced to 0). Conversely, when the cell
holds a logic 0, the product line is enabled. As a result, the shift register cell allows
the product lines to be disabled. This operation is referred to as the "write" operation.

 

 

Sout
p... 1...; _
Sin

Figure 3.18 The Function of Shift Register Cell in Testable PLA Design.

 

In contrast. the proposed PSR possesses both "read" and "write" functions. A
PSR cell is connected with a control circuit (CC) that is used to control all register
cells of the PSR. For simplicity of discussion, let the internal signal lines be labeled
by So-S4, and the external signals be denoted as Mp (multiplexing), R (read), and W

(write). The signal SO (= R+W, where "4" is an "OR" function) is used to control the

pass transistor which connects the register cell (RC) and the multiplexing circuit
(MC). When 80:0, the RC is disconnected from the MC. However, when 80:1, i.e.,

either read or write (but not both), the data transmission is enabled. Table 3.5
illustrates both the internal signals generated by the CC and the corresponding
operations performed for the combinations of external input signals.

55

It should be noted that, if only the i-th cell of the PSR holds a logic 1, then the
assignment (Mp,R,W)=(0,0,1) writes I to the (2i-1)-th product line (or P2“) and all

0’s to the remaining lines, i.e., the assignment only enables P2“ but disables the

remaining product lines. In summary, the proposed PSR not only can read the
contents of the product lines, but also can enable one product line at a time to enhance
the testability and diagnosability.

Table 3.5 Operau'ons of the PSR

4 Operations
Isolate PSR from the PLA
Read ODD numbered product line
Read EVEN numbered product line
Write dataofRC toODD and setEVENtoO
WritedataofRCtoEVENandsetODDtoO

x Invalid Case (RxWatl)

Remark: "ODD" ("EVEN") -- odd (even) numbered product line.
Fi -read;W-writeforPSR.

V-‘Ht-‘CCOE
acct—Hem
act—cucm
XOHOOOU)

HGOOOU)

3.3.1.2 Input Lines’ Shift Register (ISR)

Figure 3.19 shows an ISR cell, where the number of the ISR cells is the same
as that of the inputs. The ISR is operated in a similar fashion as the PSR. The ISR
allows reading the contents of the input bit lines and writing the data held in the RC
to the bit lines. In order to reduce extra external control signals, the read and write
operations in both the ISR and PSR are arranged in opposite ways, i.e., when ISR
writes data to the input lines, the PSR reads the contents of the product lines, and
vice versa. Speciﬁcally, R=1 sets the PSR to the "read" mode and the ISR to the
"write" mode, while W=1 sets the PSR to the "write" mode and the ISR to the "read"
mode. The control circuit (CC) generates the internal signals 85-88 for the MC of the

ISR. Similar to Table 3.5, Table 3.6 illustrates both the internal signals generated by

56

the CC and the corresponding operations performed for the combinations of external
inputs.
Similarly, the ISR can read the contents of each bit line, and also can enable

one bit line at a time to improve the testability and diagnosability.

 

NC (HUI-“91“.”. Next RC (legieter Ceii)
Circuit) “‘0'

 

 

 

 

 

 

 

 

 

 

 

 

 

.41— —i _l_—_j L— —
. a: 3i ﬁrs—j:
_'::_C " ‘L_D._F
IO“ ' ‘{____J+r-Liti —P — — ——l
L- —- —- -;' m.-
|'_' Ts‘s 1
general | gi I
Circuit) ’
R H' a 4: ﬁe 4’2

Figure 3.19 An Input Lines’ Shift Register (ISR) Cell.

 

Table 3.6 Operations of the ISR.

W R MI 85 36 S7 88 Operations

0 0 x 0 0 O 0 Isolate ISR from the PLA

0 l 0 0 1 0 0 Read COMP (complemented) bit line

0 l 1 l 0 0 0 Read TRUE (true) bit line

1 0 0 0 1 1 0 Write data of RC to COMP and set TRUE to 0
1 0 1 l 0 0 1 Write data of RC to TRUE and set COMP to 0
l 1 x x x x x Invalid Case (RxW¢l)

Remarks: "ODD" ("EVEN") -- complemented (true) bit line.
H - write; and W - read for ISR.

57

3.3.1.3 Extra Power Line Vdd1

In order to allow patterns to be applied to input bit lines either through the
input lines 11’s or the ISR cells, an extra power line Vdd1 for the input decoder is

used. If the patterns are applied through 11’s, the Vdd1 is set to a logic 1; otherwise,

a logic 0 is set.

3.3.2 Design Evaluation

Figure 3.20 (a) shows the physical layout of a FDPLA that includes the
original PLA, the spare lines and control circuits for repair, and the added shift
registers for fault diagnosis. According to the ﬂoor plan of Figure 3.20 (b), each

register cell layout of the ISR and PSR takes 16 x 140 1.2 in area, i.e., the augmented
area for FDPLA is:
Augmentedarea=(2n+p)x8x140

Table 3.7 lists the area overhead for the original PLA, FDPLA, and (1,2,1)-RPLA.

Table 3.7 Area Overhead of FTPLAs

Original Augmented PLA
PLA (1,2,1)-RPLA FDPLA FI'PLA
n p m Area Area % Area % %

50 190 67 2210460 134868 6.10% 324800 14.69% 20.80%
60 200 60 2495564 142564 5.71% 358400 14.36% 20.07%
100 200 100 4104524 189924 4.63% 448000 10.91% 15.54%
100 400 100 8022924 253924 3.16% 672000 8.38% 11.54%

58

 

"
‘25
., ”w...
.56" .0'
f

‘I

I. }x\ ... Q 3:
, ‘ $35219 \ a;
_ \‘sri‘sin‘g . :3:
‘Eix ‘. ‘ .
:3:5§3‘::.t§§§¢.::
- *: 5:139:38 s-
eize) *

m...—
4'

4.

.'-.e
...
it
.
i
‘
..

4

8i
.1,-

“e5.
—’ ‘1 ' I
”' €35!
III/ ‘ r '
2’54““
Me's???"

I

41mg”:— '

W
”ma/ff

/
.5”
/

           

'. . 'x' '

‘ ' “3:525 :'

. . -- 3A;- 5“ "e... .—
.-. e, s \my» \mx ﬁrﬁtﬂk M §§“'l‘ :
~~“W . mm~~~~m\

 

1 (l)

ShiFt Register to r

i

 

 

 

 

 

 

 

 

 

 

Vdd and GND ‘ Pull-up t um
Spore Product Line t
3' E-
mu. :3; g 5"” S"
t" 5 ° ~
links E AND DR 2 g
d O
3 .5}
SISC SUSC "_ "“‘q
v
dd Input Decoder Output Decoder aim

 

 

 

 

 

(b)

Figure 3.20 Fault-Diagnosable PLA:
(:1) Physical Layout; and (b) Floor Plan.

 

59

3.4 Summary

Low yield problem usually happens in a newly developed pilot technology. In
order to ensure that large chips are manufactured with a reasonable yield level, a
design of repairable programmable logic arrays (RPLAs) has been proposed, in which
the partially defective chips can be repaired without reconﬁguring the external
routing. However, before a defective RPLA can be repaired in the manufacturing
process, the locations of the defects must be precisely identiﬁed and spare lines have
to be optimally allocated. After packaging, it is desirable that the RPLA chip is easily

testable in ﬁeld use.

In this chapter, a fault-tolerant PLA design has been presented. The design
achieves full diagnosability of single and multiple stuck-at faults, bridging faults, and
crosspoint faults. The detailed diagnosis process will be discussed in the next
chapter.

Although the proposed FDPLA design achieves full diagnosability it requires
7 extra signals (W, R, Mp, Ml, Vdd1, Sin' and Sour) and two sets of shift registers

(ISR and PSR). Extra signals imply the increase of pin overhead. In our

implementation, however, the signals, such as R, M, Vdd1, and S are used only

our
for fault location purpose. Therefore, the signals can be applied or measured by using
the internal pads of the package. In other words, the signals will n0t cause any pin
overhead. Furthermore, as shown in Figure 3.21, the signals W, Mp, and Sin are, in

fact, the only pin overhead which is common to the easily testable PLA design
[25,54].

The results of this study show that the proposed ISR not only can read the
contents of the bit lines, but also can enable the bit lines, one at a time. Due to the
"read" and "write" functions, the proposed shift register is better than the

conventional shift register in its function. In addition, the basic cell’s layout of the

proposed shift register is 16 7t wide and 140 A. long. The proposed shift register cell is

60

smaller than that of the testable PLA design in [59], 16 x 170 2.2, by nearly 18% in

area.

Although the FTPLA design requires 10% to 25% overhead in chip area, the

salient features of detecting, locating and repairing faults in this design have

demonstrated its feasibility.

 

ISR (Read Only, but Not Used)

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

$1321); 134 b2n-l b2n 01 02
I

weeﬂz

I1 12 In

 

P1
P2
1’1
AND R PSR
O . (Write Only)
P - I
P 3
1L— w, and M
3in p

Figure 3.21 An Easily Testable PLA Modiﬁed from the FDPIA.

 

CHAPTER 4

Fault Diagnosis and Repair Process

 

As mentioned previously, the design requirements of a fault-tolerant PLA are:
(1) the PLA should be able to detect, locate, and repair faults during the
manufacturing process; and (2) the PLA must be easily testable when the packaged
PLA is used in ﬁeld. To achieve these goals, a fault diagnosable PLA design and its
modiﬁcation for an easily testable PLA design have been presented in the previous

chapter.

This chapter describes the fault diagnosis and repair process for the fault
diagnosable PLA design. Two examples will be given to demonstrate that the
proposed fault diagnosable PLA achieves a full diagnosability of single and multiple
stucbat, bridging, and crosspoint faults. This chapter also presents a simple test
process for detecting faults in a fault-tolerant PLA that is packaged and used in ﬁeld.

4.1 Locate and Repair Faults in Manufacturing Process

In this section, a fault location and repair process is presented. The faults
include single and multiple stuck-at, bridging, and crosspoint faults. The proposed
process consists of four major steps: (1) detect faults in augmented circuits; (2)
identify and repair faults in the AND plane: (3) identify and repair faults in the OR

plane; and (4) repair crosspoint faults.

61

62

More speciﬁcally, we ﬁrst test the augmented circuits. Augmented circuits are
non-redundant: any faults in the added circuits are considered as fatal, and the repair
of the PLA is unnecessary. Once the augmented circuits have functioned properly, we
locate and repair faults in both planes. To identify faults in the AND plane, we set
the ISR to the "read" mode to observe the status of the input bit lines to locate both
stuck-at and bridging faults. This is followed by setting the ISR to the "write" mode
and the PSR to the "read" mode for reading the contents of the product lines. Then
we can locate the stuck-at and bridging faults at the product lines, and G- and S-
faults.

It should be noted that. in order to precisely locate faults, both stuck-at and
bridging faults must be repaired immediately when they are identiﬁed. Otherwise, a
stuck-at—l faulty bit line, for example, will cause those product lines which have
contacts in the crosspoints to have stuck-at-O faults. This would produce some
difﬁculties in precisely locating the faults and identifying the fault types. Once these
faults have been repaired, the remaining crosspoint faults are repaired by efﬁciently
utilizing the spare lines.

Finally, faults in the OR plane are identiﬁed by setting the PSR to the "write"
mode and observing the output lines. We can also locate both stuck-at and bridging
faults at the output lines, and D- and A- faults. After the stuck-at and bridging faults
are repaired, a spare allocation algorithm is applied to efﬁciently repair the crosspoint
faults.

4.1.1 Detect Faults In Augmented Circuits

The shift register chain used in the proposed FDPLA includes the ISR, PSR,
and some extra register cells for observing the control signals. To test the shift
register chain, we ﬁrst isolate the shift register cells from the multiplexing circuits, as
shown in Figures 3.18 and 3.19, by setting both signals R and W to logic 0’s, i.e.,
S0 = 0. Then, we apply a scan pattern (0101..01) to the shift registers to detect the

63

stuck-at faults. Since the signals generated by the control circuits can be observed
from the additional shift register cells, the control circuits are also fully testable. In
this step, any fault is considered fatal and no further testing is needed for the
unrepairable PLA.

8 Once no fault has been detected in the augmented circuits, the following fault
location procedure for borh planes is utilized A simpliﬁed diagram for the fault
diagnosable PLA, as shown in Figure 4.1, is used to describe the stepwise fault
location procedure.

 

 

 

 

 

 

 

 

C1D]—

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

P4
PSR
AND OR
P e
9:1
p, g
, b1 b4 b2n-1 bar 01 02 03, ,
, —1_
“Bi 432,21 (:3 W i7
I I I 01 02 0m
Ir I2 In

Figure 4.1 A Simpliﬁed Diagram for Fault-Diagnosable PLA.

 

64

4.1 .2 Identify and Repair Faults in the AND Plane

The faults in the AND plane include: (1) stuck-at faults at input bit lines, (2)
bridging faults at adjacent input bit lines, (3) bridging faults between input bit lines
and product lines, (4) stuck-at faults at product lines, and (5) G- and S- faults. The
bridging faults at adjacent product lines are considered in the next step.

Both stuck-at and bridging faults at the input bit lines can be easily tested and
located if the contents of the bit lines can be observed. Speciﬁcally, since both true
and complemented bit lines are arranged in sequence, the contents of the bit lines are
expected to be EP1=(1,0,1,0,...,1,0) for the applied input pattern (11,...Jn)=(1,...,1),
and EPO=(0,1,0,1,...,0,1) for the pattern (0,0,...,0), i.e., EPO is the bitwise complement
of BF]. As such, a stuck-at fault is located at the i-th bit line if EPOi=EPli.

Figure 4.2 illustrates the applied patterns for locating stuck-at faults and
bridging faults. We ﬁrst apply the test patterns through Il'In by setting
Vdd1 = "1"; set the ISR to the "read" mode by assigning (R,W)=(0,1); and write 0’s
to all product lines by shifting all 0’s to the PSR, and by assigning Mp=0. (It should
be noted that if the product line has an s-a-l fault, the product line will never be set
to the 0 state.) Consequently, the contents of bit lines can be read from the ISR,
where assigning MI=1 (0) reads the contents of the true (complemented) bit lines.
The contents are loaded to EPO or EPl for the applied 0’s or 1’s input patterns,
respectively.

Property 1.

If one of the bridged lines has a stuck-at fault, the bridging faults are

equivalent to stuck-at faults.

Proof. Since bridged lines should have the same logic, a bridged line with a stuck-at

fault will force the remaining lines to have the same stuck-at fault. Cl

65

For the case that none of the bridged lines has a stuck-at fault, the following
ﬁve cases can be identiﬁed.

(1) input bit line adjacent bridging fault (Type BRII fault),

(2) product line adjacent bridging fault (Type BRPP fault),

(3) output line adjacent bridging fault (Type BROO fault),

(4) one input bit line bridges to a product line (Type BRIP fault), and
(5) one product line bridges to an output line (Type BRPO fault).

Property 2.
Types BRII and BRIP faults are equivalent to s-a-0 faults.

Proof. For the type BRII, two cases can be identiﬁed here. The ﬁrst case is the
bridging faults between the true bit line and the complemented bit line of the same
input j. The second case is the short between the complemented bit line of input j and
the true bit line of input j+1. In either case, because of no stuck-at faults at the
bridged lines and the wired-AND bridging fault model, the bridged bit lines are
diagnosed as having s-a-O faults. For the type BRIP, since the bridged product lines
do not have stuck-at faults, the assigned 0-value product lines can force the bridged
bit line to be s-a-O, i.e., the BRIP bridging faults at the bridged bit lines are
equivalent to s-a-O faults. U

Both Properties 1 and 2 have shown that the bridging faults are equivalent to
stuck-at faults, and thus they have been located and repaired as stuck-at faults. As
mentioned previously, the stuck-at faults must be repaired immediately once they are
located. After the faults are repaired by the spare lines, the above procedure is
carried out again to assure that neither stuck-at faults nor bridging faults occur at the
assigned spare lines.

Figure 4.3 illustrates the process for locating stuck-at faults and bridging faults
at the product lines, as well as G- and S- faults. These faults can be identiﬁed if we

observe the contents of the product lines. More speciﬁcally, we ﬁrst apply test

 

RWMI
10
11

“CO

PSR

Vdd1=1

    

01 02 0m

 

1 I] l 12 I In
Figure 4.2 Identify Input Line Stuck-at Faults and Bridging Faults.

 

Sout

PSR

Vdd1=0

I112 In

Figure 4.3 Identify Faults at Product Lines as well as G- and S- Faults.

 

67

patterns through the ISR by assigning Vdd1 = "0," set the ISR to the "write" mode
and the PSR to the "read" mode by assigning (R,W)=(1,0); and write all 0’s to the
ISR with Ml=0 (and change to MI=1 later). Then, we expect to have all 1’s in the
PSR. If an unexpected result occurs at the j-th product line, this implies that the line
has an s-a-O fault. .

After the s-a-O faults have been located and repaired, a walking "1" pattern is
applied to the ISR. When only the j-th cell of the ISR holds 1, MI=1 results in a logic
1 in the true bit line (b2j_1), but all 0’s in the remaining bit lines, i.e., this assignment
enables only the bit line sz,1 and disables all other bit lines, and allows us to observe

one bit line at a time. The status of the crosspoints in each enabled bit line are
recorded and form the following matrix.

r7‘11 A12-~°A1r -
A21A22...A2,
App: . (4.1)

5.5.11.5 _.

 

 

where Aij = 0 (1) represents the absence (presence) contact of the crosspoint. It
should be noted that the statuses were observed from the PSR. A PSR cell holding 1
(0) implies that the corresponding crosspoint is absent (present), and thus Aij = 0

(l), i.e., the content of the PSR cell in this step is opposite to the entry Arj-

Property 3.
If the j-th row of the matrix APP contains all 0’s, then the product line Pj is

diagnosed as having an s-a-l fault.

68

Proof. If the j-th row of the matrix APP contains all 0’s, two cases can be identiﬁed:
either all crosspoints of Pi at the AND plane are absent, or Pj has an s-a-l fault.
Since both faults are repaired by spare product lines, in order to simplify the fault

location process, we would rather treat the multiple G-faults in Pj as an s-a-l fault

than distinguish these faults. El

Once the stuck-at faults have been repaired, the crosspoint faults are identiﬁed
as follows. Let matrix AP = [3.5]pr be the personality matrix for the AND array,

where aij’s are deﬁned as the same as Aij in (4.1). The crosspoint faults are located

by XORing both AP andAPF.

CAP=APeAPF=uijeAij1M (4.2)

Any non-zero entry indicates either a G- or S-fault at that position. In fact,
both faults can be precisely distinguished by checking the matrix AP, i.e., let
CAP(i,j)-=1, if aij=1’ the fault is a G-fault; otherwise, it is an S-fault. In order to

efﬁciently utilize the spare lines, both G- and S- faults are not repaired at this time.

4.1 .3 Identify and Repair Faults in the OR Plane

The faults in the OR plane include: (1) stuck-at faults at output lines; (2)
bridging faults at adjacent output lines, adjacent product lines, and crossing product
and output lines; and (3) A- and D- faults.

By conn‘olling the product lines and observing the output lines, the faults in the
OR plane can be detected easily. Figure 4.4 shows that we fust apply the test
patterns to product lines by assigning (Fi,W)=(0,1), i.e., the PSR is set to the "write"
mode and the ISR is the "read" mode. (But the contents read from the ISR are
ignored.) We then assign Vdd1 = "0"; and shift all 0’s to the PSR.

69

 

 

I1 I2 In
Figure 4.4 Identify s-a-l Faults at Output Lines.

 

1112 I.

Figure 4.5 Identify Bridging Faults; Outputs s-a-l Faults; and A- and D- Faults.

 

70

The result is that the internal lines of the outputs (without the inverting
buffers) are expected to be all 1’s, or the output signal lines (with the inverting
buffer) are expected to be all 0’s. Therefore, an unexpected 1 in an output line
indicates that the line has either an s-a-l fault or a BRPO fault.

Property 4.

Type BRPO faults are equivalent to the bridged output lines with s-a-l faults
and the bridged product lines with s-a-0 faults.

Proof. An s-a-l faulty output line can be diagnosed by applying all 0’s to the
product lines. Since the bridged product line has no stuck-at fault, the logic 0 in the
bridged product line will force the bridged output lines to have s-a-l faults. On the
other hand, since the faulty output lines are repaired by disconnecting them from the
SOSC and connecting them to Ground, the bridged product lines are thus forced to be

stuck—at-O. D

Figure 4.5 illustrates the process of locating the s-a-0 faults at outputs; the
bridging faults: BRPP, BROO, and BRPO faults; and A- and D-faults. Similar to the
process shown in Figure 4.3, a walking "1" pattern is applied to the PSR after the s-
a-l faults have been located and repaired. All the product lines but one are disabled
at a time. The status of the crosspoints in each enabled product line are recorded and

form the following matrix.

F—
311812 . . . Blm

B21322"°B2ﬂl
OFF: . . ... . (4.3)

 

 

__13pl apz...npm_

where Bij’s are deﬁned as the same as Aij in (4.1).

71

Property 5.
If the j-th column of the matrix OPP contains all 0’s, then the output line DJ is

diagnosed as having an s-a-0 fault.

Proof. The j-th column containing all 0’s implies that either Oj has an s-a—O fault, or
all crosspoints at Oj are missed. Both faults are repaired by spare output lines. For

simplicity, the multiple D-faults in Oj are diagnosed as Oj with an s-a-0 fault. El

Property 6.
If the j-th row of the matrix OPP contains all 0’s, then the product line Pj is
diagnosed as having an s-a-O fault.

Proof. The j-th row containing all 0’s implies that either the product line Pj has an s-
a-O fault, or all crosspoints of Pi in the OR plane are missed. Similarly, for simplicity,

the multiple D-faults at Pj are diagnosed as Pj with an s-a-O fault. El

Property 7.

Type BRPP faults are equivalent to the bridged product lines with s-a-O faults.

Proof. Recall that a walking "1" pattern was applied after the s-a-l faults at the
product lines had been located and repaired. If we assume that the only "1" is applied
to Pj with 0’s to all other product lines, the wired-AND bridging fault model will force
both Pi and P5 +1 to be bridged to 0. As a result, the product line Pj will not be enabled
as it should be. The outputs are then observed as having all 0’s, i.e., the j-th row of
the matrix OPP contains all 0’s. By Property 6, the product line Pj is diagnosed as
having an s-a-0 fault. Similarly, the same argument can be applied to the other
bridged product lines, and those lines are diagnosed as having s-a-O faults. Cl

72

The type BROO bridging faults are identiﬁed by checking the matrix OPF.
When both the j-th and (j+1)-th columns of the matrix OPF are the same, two cases

are identiﬁed: either having multiple crosspoint faults, or a bridging fault at both
adjacent output lines, Oj and Oj+1° In fact, the BROO bridging faults can be

distinguished by checking if the bit patterns of the j-th column of the matrix OPF is
the same as the bit patterns obtained from ANDing both the j-th and (j+1)-th

columns of the matrix OP.

Once the stuck-at faults and bridging faults have been repaired, the crosspoint
faults are identified as follows. Let matrix OP = [bijlpxm be the personality matrix for

the OR array, where bij is deﬁned the same as Aij in (4.1). The crosspoint faults are

located by XORing both OP and OPP

COP = or 69 OFF = [ bi]. e (4.4)

Bij lpxm
Similarly, any non-zero entry indicates either a D- or A- fault at that
position. Both fault types can be precisely identiﬁed by checking the matrix OP, i.e.,

let COP(i,j)=1, if bij=1, the fault is a D-fault; otherwise, it is an A-fault.

After all the crosspoint faults are located, an algorithm for efficiently utilizing
spare lines is needed.

4.1.4 Repair Crosspoint Faults

According to the repair rules listed in Table 3.2, both G- and S- faults can be
repaired either by spare input bit lines, and/or product lines. Similarly, both D- and A-
faults can be repaired either by spare output lines and/or product lines. By
concatenating both matrices CAP and COP as the fault map, a spare allocation

algorithm developed in [31] can be used to efﬁciently repair crosspoint faults.

73

4.2 Fault Diagnosis and Repair Algorithm

Based on the fault location and repair process, a fault diagnosis and repair
algorithm is summarized in Appendix 2.

Two examples are given in this section to demonstrate the proposed fault
diagnosis and repair process. The results will show that the proposed fault
diagnosable design achieves a full diagnosability of all single and multiple stuck-at
faults, bridging faults, and crosspoint faults.

Consider a fault-tolerant PLA, as shown in Figrne 4.6 (a). The PLA consists
of 4 input lines, 4 product lines, and 3 output lines, i.e., n=4, m=3, p=4, and 128. In
addition, 2 spare input bit lines, 2 spare product lines, and 2 output lines are also
added for the repair of the defective chip, i.e., sn=2, sp=2, and sm=2. The personality

of the PLA can be described by the following cubical representation,

1-01 010
~101 101
10-- 011
00-1 110

Both matrices AP and OP are generated according to the personality of the
fault-free PLA in Figure 4.6 (a) as follows,

01001001 010
AP= 00011001 and OP= 101 (4.5)
01100000 011

10100001 11

74

 

Sout

      
    
 
  
 
 
 

input ShWt Regster

Product
snmr Regster

  

in
o—

t:

M

N
Y.)
3

 

—J 01 02 DJ

Inputs UutPUtS

Input Shmt Regster

5-3

L
P3
3.0:
'50)
0°
S_tz
0.4)
2
.C
m

 

Figure 4.6 Examples in the Fault-Diagnosable PLA Design:
(a) Fault-free PLA; and (b) Faulty PLA.

 

75

4.2.1 Example 1

Consider a faulty PLA, as shown in Figure 4.6 (b), in which the following
faults are applied to the fault-free PLA,

b2 : s-a-O, Pl: s-a-l, 02: s-a—l,
AP(3,3):G-fault, OP(3,1): A-fault, Isl: s-a-l.

When the scan patterns are applied, the observed scan-out patterns match
the applied scan-in patterns. Thus, no fault is detected in the added circuits.

For Step B, after the input patterns (11.12.13.14)=(0,0,0,0) and (1.1.1.1) are

applied, we obtain

EPO=(0,0,0,1,0,1,0,1),
and
EP1=(1,0,1,0,1,0,1,0).

This results in b2 having an s-a—O fault, and the faulty bit line being repaired by the
spare input bit line 131 . The number of spare bit lines becomes sn=1.

In order to diagnose the possible faults in the spare input lines, Step B is
proceeded again, and both EPO and EPl are re-generated as follows,

. EPO=(0,1,0,1,0,1,0,1),
and
EPl=(1,1,1,0,1,0,1,0).

This causes the bit position of b2 (now is 1,1) to have an s-a-l fault, and the line is
then repaired by the spare input line 182. The number of spare input bit lines becomes
sn=0, i.e. no more spare bit lines are available. After the faulty line has been

repaired, Step B is again repeated and both EPO and EPl are ﬁnally obtained as

EPO=(0,1,0,1,0,1,0,1),
and
EP1=(1,0,1,0,1,0,1,0).

This shows that no stuck-at fault is identiﬁed in the bit lines.

76

For Step C, when all 0’s are applied to bit lines bi’s, we get the contents of
the product lines as (P1323334)=(1,1,1,1), which shows no s-a—0 fault in Pr

Therefore, the matrix APP is generated as follows,

00000000
00011001
APF= 01000000

10100001

Since the ﬁrst row contains all 0’s, an s-a-l fault at P1 is identiﬁed. Therefore,
P1 is repaired by the spare product line P31, and the number of spare product lines
becomes sp=1. Once faults are repaired, the fault location process is then repeated to
start from Step B, and the matrix APF is regenerated as

01001001
00011001
Apps. 01000000

10100001

The matrix APP shows that no s-a-l fault at the product lines is identiﬁed.

Finally, we get the matrix CAP as follows,

00000000
00000000
CAP= 00100000
00000000

The matrix indicates that the bit position CAP(3,3) may have either a G- or S-
fault. In fact, the bit position has a G-fault because AP(3,3)=1 in (4.5). This G-fault
will be repaired later.

77

For Step D, 0’s are applied to all product lines, so we get the output lines as
(01,02,03)=(OJ,0). The output line 02 is diagnosed as having an s-a-l fault, and

repaired by a spare output line 081. The number of spare output lines becomes sm=1.

Once the faults are repaired, the entire fault location proceeds to start ﬁem
Step B. We then obtain (01,02,03)=(0,0,0) as expected. This is followed by the

generation of the matrix OFF,
010

101

OPP: 111

110

The matrix shows that neither an s-a-O fault at the output lines and product
lines, nor the bridging faults (BROO), is identiﬁed. Therefore, we generate the matrix
COP as

000
000
COP= 100
000

However, this matrix shows that the bit position COP(3, 1) may have either a
D-fault or an A-fault. In fact, the bit position has an A-fault due to OP(3,1)=0 in
Equation (4.5).

For Step E, if we merge bOth matrices CAP and COP, we get

CAP COP
00000000 000
00000000 000
00100000 100

00000000 000

78

Taking this fault map, the spare allocation algorithm ﬁnds a solution such that
a spare product line is used to repair both crosspoint faults. After the faults are
repaired, the entire location process is then proceeded again. The result shows that
no further fault is identiﬁed, and the process is done.

4.2.2 Example 2

Consider the application of the following bridging faults to the PLA of Figure
4.6 (a), ‘ "

(1) at the bit lines b2 and b3;

(2) at the product line P2 and the output line 02; and

(3) at the product lines P3 and P4, where P4 has also an s-a-O fault.

The fault diagnosis process proceeds as follows. First, when the scan pattern
is applied, the observed scan-out patterns match the applied scan-in pattern. Thus,
no fault is detected in the added circuits.

For Step B, after the input patterns (11,12,13,I4)=(0,0,0,0) and (1,1,l,1) are

applied, we obtain

EPO=(0,0,0,1,0,1,0,1),
and
EP1=(l,0,0,0,l,0,1,0).

This results in both bit lines b2 and b3 having s-a-O faults. The faults are repaired by
the spare input bit lines Isl and 182, respectively. The number of spare bit lines

becomes to sn=0.

In order to diagnose the faults in the spare lines, Step B runs again, both EPO
and EPl are re- generated as follows,

EPO=(0,1,0,1,0,1,0,1),
and
EP1=(1,0,1,0,1,0,1,0).

This shows that no stuck-at fault at the bit lines is identiﬁed.

79

For Step C, when 0’s are applied to all bit lines bi’s, we obtain the contents of
the product lines as (P1323334)=(1,1,0,0), which shows that both P3 and P4 are
diagnosed as having s-a-0 faults. Therefore, both lines are repaired by the spare
product lines P31 and P52, and the number of spare product lines becomes sp=0. The
location process then proceeds again, and (PPPZ,P3,P4)=(1,1,1,1) as expected. Then,

we generate the matrix APP as follows,

01001001
. 00011001

10100001

The matrix shows that no s-a-l fault at the product lines is identiﬁed. Since
both matrices AP and APF are the same, i.e., the matrix CAP is a zero-matrix,
neither a G-fault nor an S-fault at the AND plane is identiﬁed.

For Step D, when 0’s are applied to all product lines, we get the output lines
as (01,02,03)=(0,1,0), i.e. the output line 02 is diagnosed as having an s-a-l fault,
and is repaired by a spare output line 031- The number of spare output lines becomes

sm=1.

Once the faults are repaired, the entire fault location proceeds again to start
from Step B, and (01,02,03)=(0,0,0) as expected. Then, followed by the generation

OfthcmatrixOPF,
010
000

110

80

The matrix shows that no s-a-0 fault at the output lines is identiﬁed. But, an

s-a-O fault is identiﬁed at P2 because of all zero’s in the second row of the matrix
OPF. Therefore, it should be repaired by a spare product line. Unfortunately, since
sp=0 implies that no more spare lines are available for repair, the defective chip is

claimed to be unrepairable. The fault diagnosis process is then terminated.

4.2.3 Discussion

Example 1 shows that a defective chip can be repaired by the assigned spares.
However, the defective chip in Example 2 is unrepairable.

Example 2 also shows that the bridging fault at the adjacent bit lines (or type
BRH) are diagnosed as both bit lines having s-a-0 faults (Property 2). The bridging
fault at the output line and product line (or type BRPO) is diagnosed as the output
line with an s-a-l fault and the product line with an s-a-O fault (Property 4). Finally,
the bridging fault at the adjacent product line, where one bridged product line has an s-
a-O fault, is diagnosed as both product lines with s-a-O faults (Property 1).

4.3 Test Chip in Field Use

Due to lack of controllability and observability, PLA testing, particularly for
large chips, has been recognized as a very difﬁcult problem. To alleviate such a
problem, the design of easily testable PLAs have been popularly implemented
[9,25,59]. The key to the easily testable PLA design is the use of additional
hardware to enable only one product line at a time to increase the testability [54].

In Section 3.4, an easily testable PLA design has been presented and shown
in Figure 3.21. The design modiﬁes the fault diagnosable PLA design, but requires no
additional hardware.

81

That PLA is tested as follows: Basically, a walking one pattern is applied to
the PSR to enable only one product line. For each enabled product line, the use of the
main test pattern and auxiliary patterns proposed in [6] can then detect single and
multiple faults in either the PSR, or the PLA itself.

4.4 Summary

A fault diagnosis and repair algorithm has been presented. The algorithm
shows that the proposed fault diagnosable PLA design achieves a full diagnosability
of single and multiple stuck-at, bridging, and crosspoint faults. A test procedure is
also presented for a packaged PLA chip. It should be noted that the fault-tolerant
PLA is repaired by the use of laser programming techniques. Our diagnosis and
repair process basically implements the scheme that locates stuck-at and bridging

faults and then repairs them.

CHAPTER 5

Yield Analysis

 

This chapter analyzes the effects of adding redundancy to the design of fault-
tolerant PLAs. In theory, a higher probability of repair can be achieved if a larger
number of spares is added. However, since the added redundancy and the associated
circuitry are also susceptible to defects, too much redundancy may have a
"diminishing" effect on the chip. Therefore, it is not always guaranteed that the
additional redundancy improves the overall probe yield.

In this chapter a yield model for the design of fault-tolerant PLAs is presented
and simulated in Section 5.1 and 5.2. Based on the yield model, the optimal
redundancy that provides the maximal chip yield is discussed in Section 5.3.

5.1 Yield Model

Productivity of chips is reduced both by gross imperfections and from faults
caused by random defects in the photolithography and materials [46]. The
productivity of the redundant chip is also strongly dependent upon the percentage of
uncorrectable defect area of the chip. In the design of fault-tolerant PLAs, the defects
in the AND and OR planes can be repaired with the spare lines. Other faults, such as
the defects in decoders, power lines, clock lines, and control circuits are considered as

fatal errors and are not correctable by redundancy. As a result, the net yield with

82

83

redundancy, YNET’ is the product of the imperfection yield YGI’ correctable yield

YCRD’ and uncorrectable yield YURD’ i.e.,
YNET = YGI X Yon!) x Yuno (5.1)

Since the yield YGI depends on a fabrication process which is not available at
this time, for brevity, we consider the relative net yield (Y NET/Y Gl)’ or effective yield

Yeff’ i.e.,

Yeti = YURD X cho (5.2)

For the statistics of the fabrication defects we can adopt one of the models
suggested in the literature, such as Poisson, general negative binomial, or binomial
statistics. In this work, the Polya-Eggenberger distribution, mixed Poisson statistics
using gamma distribution as a mix function [12,28,29,55,62,64], were employed to
deﬁne the random defects for the fabrication of chips. The probability of having x

faults on a chip for this distribution is

I‘(x+a) (2.. la.)x
p = = 5.3
(X x) x! I‘(a) (1+ 2. loom" ( )

 

where a is a clustering parameter that depends on the defect density variation [55]

and A is the average number of faults per chip. In this work, it is assumed that the
correctable defect area of FTPLA includes only the AND/OR arrays and the spare

lines; the remaining chip area is uncorrectable.

5.1.1 Correctable Random Effect Yield, YCRD

The correctable yield, YCRD' is affected by the array yield, Yarray' and the

area penalty for redundancy, where the area penalty is the ratio of the chip area
without redundancy (ANR) to that with redundancy (AR)° The yield YCRD is

expressed as
YCRD = Yarray x ( ANR I AR ) (5’4)

84

In the previous work [62,64], the array yield was deﬁned as the probability
that the number of faults will be less than, or equal to, the total number of spare

lines s, i.e.,

 

Y _ P _ S I‘(x+a) (1 Iran)" (55)
array ' (“8) " x; x! 1"(a) (1+).Ia)“+"

However, the above model ignores the fact that the faults in the AND array can be
repaired only by either spare input or product lines, but not by the spare output line.
Also, the faults in the OR array cannot be repaired by the spare input lines.
Considering the above fact, a more accurate array yield has been proposed recently

[12]. The yield is expressed as

Yarray = P(X1SSAND) P(XZSSOR)

SAND sOR I‘(x1+a) (A1d /a)"1 F(x2+a) (AﬂaY‘Z (5.6)
x11 I‘(a) ( 1+ A1d /oc)°‘+"1 x2! I‘(a) (1+ Azd /a)°‘+"2

x1-0 xz-O

where SAND = sn+sp, Son = sp-l-sm, s = sn+sp+srn ( sn, sp, and 3m are the number of

spare input, product, and output lines, respectively). A1 and A2 are the chip areas of
AND and OR arrays, respectively. We assume that the defect density d and the

cluster parameter a in both arrays are identical.

Recall that, as the fault diagnosis process discussed in the previous chapter,
both stuck-at faults and bridging faults are repaired by the corresponding spares. For
instance, stuck-at faulty product lines must be repaired by spare product lines. On
the other hand, crosspoint faults are optimally repaired by the available spare lines
that are determined by the spare allocation algorithm. In other words, all the faults
are repaired by either spare input, product, or output lines. Therefore, following the
proposed repair rules and fault diagnosis algorithm, a new yield model for the array

yield is presented.

85

A fault-tolerant PLA is said to be repairable if the spare lines can cover all
faults. Otherwise, it is unrepairable. In other words, the PLA is repairable if various
types of spares are sufﬁcient to repair the corresponding types of faulty lines. Let
Pw(sw) be the probability that the number of w-type faulty lines is less than or equal

to sw, where w = n for bit line, w = m for output line, and w = p for product line. The

array yield is then expressed by the product of these probabilities, i.e.,
me = Pn(sn) x Pp(sp) x Pm(sm) (5.7)

In general, the probability that has exactly i faulty product lines out of the total
of p-i-sp product lines can be described by the Binomial Distribution function, i.e.,

Pp(X= i ) = C {”‘p qpi(1-qp)p+sp-i (5.8)

where qp is the failure rate of the product lines. Thus, the probability Pp(sp) is
expressed as

S
Pp(sp) = P (XSsp) = EZC {tsp qpil 1 _ qp )pO-sp-i (5.9)

Similarly, the probabilities Pm(sm) and Pn(sn) are

9’in
Pm(sm) = p (xssm) = 25°C :“sm qmi( 1 - qm )“H'sm'i (5.10)
1
S

n
Pn<sn>=P<xsn>= 2c?"*‘nq,,i( 1 -anzmsn-i (5.11)
i=0

Therefore, substituting the array yield in Equation (5.7) to Equation (5.4), the

correctable random effect yield YCRD' is

YCRD = Pun") x Pp(sp) x Pm(sm) x ANR/AR (5.12)

86

5.1.2 Uncorrectable Random Effect Yield, YURD

The percentage of uncorrectable defect area is one of the key factors in
determining the effectiveness of redundancy. The uncorrectable yield is deﬁned
as [12]

YURD =(1+7.x(AUNc/ASUS)/a)’“ (5.13)

where AUNC and A8le are the uncorrectable defect-susceptible area and the total
defect-susceptible area, respectively. In general, the random defect yield is very

sensitive to the percentage of the uncorrectable defect area. A low percentage of
uncorrectable defect area allows one to go higher levels of integration before the yield
term falls off signiﬁcantly.

5.2 Yield Simulation

Although the proposed fault-tolerant PLA has not yet been fabricated and the
experimental processing data are not available to precisely determine the failure
rates, we may employ the experimental data studied in [48]. This experiment
predicts all faults that are likely to occur in a MOS integrated circuit or subcircuit.

The study shows that, of the original 4800 defects, only 476 actually produce
signiﬁcant faulty behaviors at the circuit level. They can classified as follows: 72
crosspoint faults, 388 stuck-at or bridging faults, and 16 power line faults. Since the
power line faults do not affect the calculation of array yield, they are excluded.
According to our repair rules, most crosspoint faults are efﬁciently repaired by spare
product lines. Thus, it is reasonable to assume that the number of crosspoint faults
repaired by spare product lines is as many as twice that repaired by spare input lines
and by spare output lines. Also, we assume that the stuck-at and bridging faults are
uniformly distributed to each type of line. Based on these assumptions, of the 460

faults, 147 are contributed to bit lines, 166 to product lines, and another 147 to output

87

lines. In other words, the study shows that both bit lines and output lines have the

same failure rate because their structures are virtually the same, i.e., q“ = qm, but the
failure rate qp is nearly 12% higher than q“, i.e., qp = 1.12 q“.
Let qn = qm = q, and thus qp = 1.12 q. The failure rate q is generally obtained

from the statistics in the fabrication process and manufacturing process. In this
study, however, the failure rate is roughly estimated from the following calculation.
The basic concept is that a non-redundant PLA design is conceptually identical to the
redundant PLA design with the spares (sn,sp,sm) = (0,0,0). Therefore, the

probabilities of having any failures in both designs should be the same. In practice,
the former probability can be obtained from Equation (5.3) with x = 0, i.e.,

Pun: P(x=0) = ( 1 + i. / a )3 (5.14)
and the latter probability is the product of YURD' in Equation (5.13), and YCRD with

(sn, sp, sm) = (0.0.0), in Equation (5.12), i.e.,

P = Y R l xY
n c o (sn.sp.sm>- (0,0,0) one

= [Pntowptmrmtm x ANn/AR] x11+wa><AUNCIAsus>W (5.15)
For a redundant design with no redundancy, ANR = AR’ and, by Equations (5.9)-

(5.11), Pn(0) = (1492“, 13mm) = (141)“, and rpm) = (1-1.12q)P, Equation (5.15)

can be written as
PR = [(1-q)2“+m(1-1.12q)P] x[1+(Ua)(AUNc/ASUS)]‘°‘ (5.16)
By equating both PNR in Equation (5.14) and PR in Equation (5.16), we obtain

(la/ctr“ = 11+waxAUNc/Asusn'“ [(ImZMmu-uzqﬁ’] (5.17)

If the parameters a and x, and the area ratio AUNC/ASUS are given, then, we should
be able to solve Equation (5.17) for q.

88

For example, consider a (50,190,67)-PLA; according to the floor plane shown
in Figure 3.13, the area ratio is calculated as (Auuc/Asus) = 0.2038. Let a = 2 [62]

and consider the case of it a 4, Equation (5.17) results in q = 0.00398, i.e., the failure

rates qn = qm = 0.00398 and qp = 0.00446.

The failure rates are subject to the number of average faults. For various
average numbers of faults, Figtu'e 5.1 illustrates the correctable random effect yield

YCRD’ uncorrectable random effect yield YURD’ and the effective yield Yeti for the
(50,190,67)-PLA with (sn,sp,sm) = (3,4,2). For the case of r = 4, YCRD = 88.79%,
YURD = 54.13%, and Yeti = 48.06%. This shows that the chip yield for the redundant

design is much higher than the 11.1% yield for the nonredundant design.

 

 

 

 

 

 

 

 

 

 

% of '° 88.79%
yield Ycr cl
80 .00 ‘1 Y
CRD
60 .00 " 54.13%
. 48.06%
‘10 ' 00 m
Yur'
a Y Y e f’ 9
2° '°° 11.1% °"
\Non-redundancy
o . o oo . . . Y n r
0.00 2.00 L1.00 6.00 3.00 10.0 “=2
Average Number of Faults (it)
Figure 5.1 Yields for (50,190,67)-PLA with Redundancy (sn,sp,sm)=(3,4,2).

 

89

Figtne 5.2 illustrates the effects of adding redundancy. Figure 5.2 (a) plots
the correctable yield YCRD versus the number of spare product lines, where sn=3,

sm = 2, and sp is varied ﬁ'om 0 to 10. The plot shows that, as the number of spare

product lines increases, the array yield Y increases, but the area ratio ANR’AR

array
decreases. As a result, the overall YCRD is increased initially, but decreased as the

number of spare product lines increases. For example, YCRD = 86.10% for s = 2,

p

YCRD = 89.00% for sp = 3, but YCRD = 88.79% for sp = 4.

Figure 5 .2 (b) plots various yield simulations versus the number of spare
product lines. The plots also show that the yield Yeti = 47.93% for (sn,sp,sm) =
(3,3,2), and increases to 48.06% for (sn,sp,sm) = (3,4,2), but drops to 47.82% for
(sn,sp,sm) = (3,5,3). The results show that the additional redundancy may not
improve the overall yield.

Figure 5.2 (c) plots the eﬂ'ective yields for the spare assignments (4,6,4),
(3,4,2), and (2,2,1) versus the average number of faults. For 1. = 0, the effective
yields for (4,6,4), (3,4,2), and (2,2,1) are 83.74%, 89.23%, and 93.81%, respectively.
(It should be noted that the yield does not reach 100% due to the area penalty). For
3.- 1, the yields for the spare assignments are respectively 71.08%, 75.11%, and
77.57%, but for 1. = 10, the yields are respectively 24.36%, 24.33%, and 20.29%. This
implies that less redundancy is better if the average number of faults is smaller, but,
for larger numbers of faults, more redundancy may produce a higher chip yield.

Figure 5.2 has provided signiﬁcant evidence that the additional redundancy

may not improve the yield. This motivates the study of ﬁnding optimal redundancy in
the proposed fault-tolerant PLA design.

90

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

%
100 .0
E
Correctable 9° '°° ‘
Yield, YCRD
60 .00 “
‘10 .00 ‘1
3° ‘°° ‘ Non-redundancyL
ﬂnr
0 . 000 r r v v S
0.00 2.00 “.00 6.00 0.00 10.0 p
(a)
%
100 .0
89.00% 88.79%
on on - YCRD
Effective 6° 0° ‘ 53.35% 54.13% YURD
Yreld, Ye"
48.06%
1° W ‘ 47.98% Yeti
20 00 ~ Non-redundancy
0 000 r r 5 S
0 00 2.00 “0.00 6 00 B 00 10 .0 p
(2 2 1) % (b)
e 9 N94.
(3,4,2) _———>

(4.6.4) °° °a ‘

60 00“

Effective Yields

Y9" Ho '00 7 \

20.00 '1

 

 

 

\ Non-redundancy

 

o o r .
o oo oo zoo H no sea eon in A,

(C)

Figure 5.2 Yield Analysis for (50,190,67)-PLA with 01:2:

(a) Correctable Yield, (it = 4); (b) Effective Yield, (71.: 4); and
(c) Yields for Various Spare Line Assignments.

 

91

5.3 Optimal Redundancy

Integrated circuit manufacturers ﬁnd it highly desirable to be able to predict the
yield loss before a chip is fabricated, and to expect to maximize the probe yield, and
thus maximize proﬁts. In this section, an efficient way to determine the optimal
redundancy in the proposed fault-tolerant PLA design is presented.

As more redundancy is added to redundant PLAs, both yield and productivity
increase; however, the redundant spare lines inflate die size and reduce the number of
chips per wafer. Figure 5.2 has evidently shown that, as the number of spare lines
increases, the array. yield increases, but the area ratio ANR/AR drops. As the
redundancy increases, a point will eventually be reached where optimum yield is
obtained.

The optimal redundancy problem can be expressed by the following nonlinear
integer optimization problem:

Maximize Yeti :- YURD x [ Pn(sn)Pp(sp)Pm(sm) x ANR/AR ]
Subjectto 0<sn52n; 0<spSp; 0<smSm

Finding the parameters s", sp, and 8m is obviously not an easy task, even though the
parameters are integers. However, since the yield calculation is relatively simple, the
optimal solution can be easin formed if the bounds of parameters (sn,sp,sm) are
narrowed.

Consider the Binomial distribution

)N-i

P(X=i)= CNN-q (5.18)

where P(i) is the probability which has exactly 1 faulty lines from the total N lines,
and q is the failure rate. The expected value 11 and standard deviation 0 of this

distribution are u = Nq, and o = 1Nq(l-q) , respectively.

92

An empirical rule is applied to define the upper and lower bounds of the

parameters (3 5,“). Let x . n+3o: the ceiling function [x] is defined as the upper

n’sp’

bound and the ﬂoor function [x] as the lower bound. Therefore,

 

LknJ s sl1 s l'kn'l, where kn= 2nq+3 ‘1 2nq(1-q) , (5.19)
Lka s sp s l'kp'l. where kp=p(1.12q)+3 )Fp(1.12q)(1-1.12q) , (5.20)
[.ka 5 sm sl'kml where km=mq+3V mq(1-q) . (5.21)

Note that the term N in both u and o for the product lines includes the number
of the original and spare product lines, i.e., N = P+sp. However, for simplicity, we will
only consider N a p because the number of spare line is far smaller than p, i.e.,
3p << p. Similarly, both 3n and 8m are omitted in Equations (5.19) and (5.21).

Consider the (50,190,67me with 1:4. In the previous discussion, we have
q=0.00398. Equations (5.19)-(5.21) result in kn=2.29, kp=3.6, and km=1.81, i.e.,

2 S Sn 5 3, 3 5 sp 5 4, l 5 sm 5 2. Therefore, we may exhaustively calculate the

yields for all 8 possible combinations of (s Table 5.1 lists the yield

n,sp,sm).

 

Table 5.1 Yield Simulation for (50,190,67)-PLA.

Redundancy Effective
sn sp sm YCRD YURD Chip Yield
2 3 1 88.34% 53.24% 47.03%
2 3 2 89.45% 53.58% 47.93%
2 4 l 88.13% 53.51% 47.16%
2 4 2 89.24% 53.84% 48.05%
3 3 l 87.88% 53.52% 47.04%
3 3 2 89.00% 53.86% 47.93%
3 4 1 87.67% 53.79% 47.16%
3 4 2 88.79% 54.13% 48.06%

93

calculations. The results show that the chip yield of 48.06% for (3,4,2) is the
maximum. In other words, the assignment (3,4,2) is the optimal redundancy. In order
to verify that the assignment (3,4,2) is indeed optimal, it has been calculated the

yields for all combinations of (sn,sp,sm), where each parameter is varied from 1 to 5.

The results, as listed in Table 5.2, show that the yield of (3,4,2) is, in fact, the highest.

 

Table 5.2 The Effective Yields for (50,190,67)-PLA

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

( 1 S sn,sm,sp S 5 )
sns 1 2 3 4 5
1 36.36% 37.05 36.82 36.51% 36.20%
2 38.15% 38.88 38.64 38.32% 37.99%
3 38.15% 38.88 38.65 38.33% 38.00%
4 37.92% 38.65 38.41 38.09% 37.77%
5 37.66% 38.38338.15 37.84% 37.52%
sns 1 2 3 4 5
1 43.15% 43.97 43.69 43.33 42.96%
2 45.27% 46.14 45.85 45.47% 45.08%
3 45.27% 46.14 45.86 45.47% 45.09%
4 44.99% 45.86 45.58 45.20% 44.82%
5 44.69% 45.55 45.27 44.90% 44.52%
sns 1 2 3 4 _5
1 44.83% 45.68 45.39 45.01% 44.62%
2 47.03% 47.93 47.63 47.23% 46.82%
3 47.04% 47.93 47.64 47.24% 46.83%
4 46.74% 47.64%47.34 46.95% 46.55%
5 46.42% 47.31 47.02 46.63% 46.24%
sns 1 2 3 4 5
1 44.95% 45.80 45.51 45.12% 44.73%
2 47.16% 48.05 47.75 47.35% 46.94%
3 47.16% 48.06 47.76 47.35% 46.95%
4 46.86% 47.76 47.46 47.06% 46.66%
5 46.54% 47.43 47.14 46.75% 46.35%
snS 1 2 3 4 s
1 44.73% 45.57 45.28 44.90% 44.51%
2 46.92% 47.81 47.51 47.11% 46.70%
3 46.92% 47.82 47.52 47.11% 46.71%
4 46.63% 47.52% 47.22 46.83% 46.43%
5 46.31% 47.19 46.90 46.51% 46.11%

 

 

 

 

 

94

Similarly, for the (100,400,100me [18.62.64] with i=4, we obtain
q=0.00237, kn=2.54, kp=4.15, and km=1.70. This results in sn=2 or 3, sp=4 or 5, and

sm=l or 2. The yields are calculated and listed in Table 5.3. The results show that
(3,4,2) is the optimal redundancy.

Table 5.3 Yield Simulation for (100,400,100)-PLA.

Redundancy Effective
sn sp sm YCRD YURD Chip Yield
2 4 1 91.45% 67.12% 61.38%
2 4 2 92.69% 67.30% 62.38%
2 5 1 91.25% 67.24% 61.35%
2 5 2 92.49% 67.41% 62.35%
3 4 1 91.78% 67.27% 61.74%
3 4 2 93.03% 67.44% 62.74%
3 5 1 91.58% 67.39% 61.72%
3 5 2 92.83% 67.56% 62.71%
5.4 Summary

In this chapter, the yield analysis of fault-tolerant PLA and the Optimal spare
lines assignment have been presented. Although the total augmented area overhead
is nearly 10% to 25% over the original PLA, the results of this study show that the
redundant design can enhance the chip yield signiﬁcantly.

An empirical rule has been applied to simplify the optimization problem for
ﬁnding the optimal redundancy assignment. The proposed approach of finding optimal
redundancy is simple and effective.

Chapter 6

Fault-Tolerant RAM-Based PLAs

 

The availability of programmable logic devices based on memory cells now
allows implementation of "soft" hardware [16], i.e., hardware whose functions can be
changed while it resides in the system. With the most current IC component
technologies, once a given logic function is implemented in hardware, changing that
logic is difficult, requiring modifications to printed-circuit board traces, the addition or
replacement of components, and other costly procedures. However, with RAM-
based programmable logic, changes can be made to a system’s logic functions simply
by re-programming the devices.

Recently, the device re-programmability have been exploited in the following
applications. Buffalo Products [16] employs a RAM-based programmable logic
device for the bus and memory interface and control logic in their More Memory
expansion card for PC XT- or AT-compatible systems. In that system, an
installation program analyzes system parameters such as bus width, type of card
slot, available address spaces, etc., and then loads the appropriate conﬁguration
program to match system requirements.

A RAM-based programmable logic device has also been applied to the design
of a ﬂame-grabber board of digital imaging system [16]. Basically, the device
provides the graphics control, and it interfaces a PC-compatible computer with the
video output of such medical equipments as ultrasonic scanners and magnetic

resonance imaging systems. To support different video formats from the varying

95

96

types of medical instruments, several different conﬁguration programs are available
for the device. In this application, each video format is sequentially used for a speciﬁc

medical instrument, i.e., only one format is used at a time.

During the past decade, several re-programmable logic arrays have been
proposed: an electrically programmable and UV erasable implementation of PLA [15],
an electrically programmable and erasable PLA [19], and an alterable PLA [35].
These designs allow PLAs to be reprogrammed repeatedly in the same circuit during
system prototyping and these PIAs can be re-programmed in different circuits. In
this chapter, we focus on the design of RAM-based programmable logic arrays
(RBPLAs). An RBPLA is a PLA that takes RAM cells as its crosspoint contacts.
The RBPLA is typically used as a programmable device controller [35].

Fault-tolerant PLA design implementing laser programming technique has
been presented in Chapter 3 to enhance probe yield in the manufacturing process.
However, after the chip is packaged, faults cannot be repaired. In order to efﬁciently
utilize the spare elements resided in PLA chips and to repair faults which may occur
either in manufacturing process or in ﬁeld, the fault-tolerant PLA design implementing
the electrically programming technique is motivated.

In the next section, an RBPLA structure is presented. Followed by a fault-
tolerant RBPLA design in Section 6.2, and a fault-diagnosis and repair process in
Section 6.3. Finally, a chapter summary is given in Section 6.4.

6.1 Basic Structure of an RBPLA

Two types of RAM cells may be implemented: static cell and dynamic cell.
Although the array size of the dynamic cell approach is much smaller than that of the
static one for the same device density, the need of the control and refresh circuits in
the dynamic approach results in a slower speed than the static one in their

performance.

97

In this section, two RBPLA structures with dynamic and static cell memories
are discussed. The PLA with dynamic cell memory is referred to as DRBPLA for
short, while the one with static cell memory is referred to as SRBPLA.

6.1.1 A DRBPLA Structure

A PLA based on the dynarrric cell memory has been proposed in [35]. The
DRBPLA, as shown in Figure 6.1, allows users to reprogram the PLA as many times
as needed. The PLA consists of two major parts: (1) the basic PLA functions, with
the AND and OR arrays, i.e., the A—cells (corresponding to crosspoints of a
conventional PLA) and input/output lines: and (2) the control logic needed to program
the logic of the PLA and to maintain (refresh) the state of the dynamic storage cells,
such as shift registers, Fl/W control circuits, and the refresh-cells (R-cells).

 

 

lnpub cutout:

Figure 6.1 The Structure of a DRBPLA [35].

 

98

The control and refresh circuits and their operations can be found in [11,35] in
detail. Here, the functions of the shift registers are discussed. Two sets of shift
registers are connected to the AND and OR arrays. We ﬁrst consider the two shift
registers connected to the AND array. For simplicity, the shift register that closes to
the R-cells is denoted as R-SR, while the other one is denoted as A-SR. During the
programming phase [11,35], the R-SR contains data to program the A-cells, while
the A-SR controls the column of A-cells to be programmed. More speciﬁcally, if the
i-th cell of the A-SR holds the only logic 1, the A—cells in the i-th column are
programmed with the data stored in the R-SR. Similarly, during the refreshing phase,
while the A-SR enables the column to be refreshed, the R-SR is used to store the
data. The shift registers connected to the OR array perform the same functions as
discussed above. In summary, the shift registers are used for programming and
refreshing the storage cells.

The shift registers also can be used in a similar fashion as the fault-tolerant
PLA design discussed in Section 3.3 for fault detection and location. Based on the
RBPLA of Figure 6.1 together with minor modiﬁcation, a fault-tolerant DRBPLA
design has been presented in [11], in which partially defective chips can be
electrically repaired in ﬁeld use.

Because the structure and operation of the dynamic cell memory are much
more complicated than those of the static cell memory, and also because both
DRBPLA and SRBPLA have similarities in both fault models and fault-diagnosis
process, for sake of clarity, the detailed SRBPLA design and its operation are
preferably discussed in the next section. Fault models and fault-tolerant structure of

SRBPLA design are presented in Section 6.2.

6.1.2 An SRBPLA Structure

Figure 6.2 illustrates the schematic diagram of a PLA structure based on the

static cell memory. An SRBPLA consists of two major parts: (1) the basic PLA

99

 

Shift Register

C C

C C C

B/I B/I B/l B/l

 

Inputs

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Outputs Sin
(a)
V dd D D D D
Product line
3 _L j E _L
J—L. J—L J
7 ' ] MZ ...L. ..
Ml SRAM Cell -J L] I}
<53 [9 #1 M2 M1
SRAM Cell
Ccell Cce_ll_
(AND plane) (OR plane)
Input line Output line
(b) (c)

Figure 6.2 SRBPLA structure: (a) Scheme Diagram;
(b) and (C) C cells in AND and OR plane.

 

100

functions with the AND and OR arrays, i.e., the C~cells (corresponding to
crosspoints of a conventional PLA) and input/output lines; and (2) the control logic
needed to program the logic of the PLA, such as shift registers, and read/write control
circuits.

Figures 6.2 (b) and 6.2 (c) show the circuit schematic diagrams of the C-cells
in the AND and OR arrays. The basic C-cell structure consists of an SRAM cell and
transistors M1 and M2. The SRAM cell employs a pair of cross-coupled inverters as
the storage ﬂip-ﬂop and a pair of access devices to provide a switchable path for data
into and out of the cell. The cell enabled line S is held low except when cells

connected to it are to be accessed for reading or for writing. Two lines, D and 5',
provide the data path. The transistor Ml corresponds to the crosspoint in a
conventional PLA. The presence (absence) of the crosspoint depends upon the
transistor M2 in the ON (OFF) state which is determined by the state of the SRAM

cell.

The SRBPLA allows designers to assign the output phase, i.e., the output
may take either true or complementary logic realized by an non-inverting or inverting
buffer, referred to as B/I-cell. The B/I-cell is normally in the inverting state (ON-
state) unless it is programmed.

In addition to both arrays, three shift registers are also added with the control
circuitry. For simplicity, the shift registers connected to AND-array and OR-array
are referred to as AND-SR and OR-SR, respectively, while the vertical shift register
is referred to as the S-SR. Figure 6.3 illustrates the control circuitry, where two

signal lines Wm and Rm are used to control the "write" and "read" operations of the C
cell, and the signal F is the output of the gate ORing Wm and Rm as shown. During
the programming phase (or "write" operation), the shift registers AND-SR and OR-
SR contain the data to program the C-cells in a speciﬁc row determined by the S-SR.

More speciﬁcally, if the i-th cell of the S-SR holds the only logic 1, only the i-th row
is enabled and thus the C-cells in the i-th row are programmed with the data held in

101

both AND-SR and OR-SR cells. The above operation is performed by setting Wm=1,
and thus setting F=l to pass the data held in the S—SR cell. On the other hand,
during the "read" operation, the control signal Rm is set to logic 1 and results in F=1
to select the row of C-cells to be read. Finally, during the normal operation, both Wm

and Rm are set to logic 0, and thus results in F=O, to isolate all three shift registers '
from the PLA.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Figure 6.3 Control Circuit in SRBPLA.

 

102

6.2 A Fault-Tolerant SRBPLA Design

Similar to the fault-tolerant PLA design discussed in Section 3.3, the
schematic diagram of a fault-tolerant SRBPLA is illustrated in Figure 6.4, where the
original SRBPLA is augmented by SISC and SOSC, as well as the spare C-cells. In
addition, a column of P-cells are also added in between the AND and the OR arrays.
The P-cells are used to alleviate the problem for the repair of the stuck-at-l faulty

product line discussed in Section 3.2.

 

out

W m
R

\V
D

SISC

 

Inputs

Outputs S

Figure 6.4 Fault-Tolerant SRBPLA Scheme.

 

The SISC and SOSC operations in the SRBPLA are operated in a similar
fashion as those in RPLA discussed in Section 3.2. The basic cell structure of the
SISC and its operation are illustrated in Figure 6.5. The basic cell is referred to as SI-

cell. While Figure 6.5 (a) shows the NMOS implementation of an SI-cell, Figures

103

6.5 (b) and 6.5 (c) describe the operations of the SI-cell. During the normal
operation, as shown in Figure 6.5 (b), (for simplicity, it is referred to as the ON
state), a double throw switch is used to connect the input signal to the input line. The
operation is equivalent to the case that the SRAM cell holds a 1 to turn on the
transistor T1 and to turn off both transistors T2 and T3, as shown in Figure 6.5 (a).
On the other hand, when a faulty input line is detected and located, the switch is
reconﬁgured so that the input line is connected to the spare one and the faulty line is
connected to Ground line. That is, the SRAM cell holds a logic 0 to turn off the
transistor T1 and to turn on both 12 and r3. Similarly, the SO-cell, a basic cell of the
SOSC, is operated exactly the same as the SI-cell

 

 

 

 

 

 

 

 

 

 

 

AND plane
AND plane Spare |
Spare input line mput hne (L .....
Vdd Input line =
r2 |’
G 11" { 13 (b)
SRAM 14 AND 9““
1'12;??7'596111?éziéiifiii::i:;;ai*%?'* Spare
‘35 ian line 0
- Input line -
Input line '='
(a) (C)

Figure 6.5 SI—Cell: (a) NMOS Implementation;
(b) ON—state; and (c) OFF-state.

 

Figure 6.6 illustrates the circuit symbol and the basic cell structure of the
product line link, referred to as P-cell, and its operation. The P-cell is programmed in
the same way as the C-cell. When the SRAM cell holds a 1, Figure 6.6 (c) shows

104

 

AND Product line OR

3 P C811 Plane 'c Plane

;

(a)
(C)

 

Vdd

II
[I

iii}7*?!Ti?’?¥'7”"f‘§j3'i -- Product line

AND 0 0R
Plane {

(b) (d)

 

 

 

 

 

Figure 6.6 P-Cell: (a) Symbol; (b) NMOS Implementation;
(c) ON-state; and (d) OFF-state.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

J:—

 

 

 

 

 

 

l'rmluct linc
S

 

 

SR
ccll

 

"-u

 

 

ILJ

 

 

 

 

 

 

CCL’II C L‘Cll
(AM) planet (OR mm)

 

 

 

 

 

 

 

 

 

 

 

 

l) B Input line D T) Output linc S

Figure 6.7 Control Signals of the Fault-Tolerant SRBPLA.

 

105

that the P-cell is turned to the ON (or Connecting) state during the normal operation.
On the Other hand, when the SRAM cell holds a 0, Figure 6.6 (d) describes that the
P-cell is turned to the OFF (or Disconnecting) state if the faulty product line is
located.

Figure 6.7 shows the schematic diagram of the control circuitry. Similar to the
FDPLA of Figure 3.16, the internal signals RP and Wp are used to control the

operation of reading and writing the input and product lines.

6.3 Fault Diagnosis and Repair Process

This section describes the fault diagnosis and repair process of a fault-tolerant
SRBPLA. The fault models considered here are also the stuck-at, bridging, and
crosspoint faults. Based on the fault models, the SRBPLA is capable of detecting,
locating, and repairing single and multiple faults during manufacturing process and in
ﬁeld use. Due to the similarity of the fault-tolerant designs for both SRBPLA and
FDPLA, the fault models, repair rules, and diagnosis and repair process are almost
the same. The differences are discussed below.

6.3.1 Fault Models

The major difference between the SRBPLA and FDPLA is in their crosspoint
contacts. The SRBPLA takes the SRAM cell as its crosspoint contacts. In this
implementation, any failures of the SRAM cell will affect the function of the crosspoint
faults, and thus they are diagnosed as the crosspoint faults. Similarly, failures may
occur at the 81-, SO-, and P- cells. However, if the failures do not affect the desired
functions for those cells, then it is not necessary to locate them. For example, a fault-
free product line requires the P-cell in the l-state. Any failures that may cause

product line to have a stuck-at- 1 , will not be concerned.

106

The failures diagnosed in the SI-cells (SO-cells) are considered as the input
(output) lines having stuck-at faults. A stuck;at-l faulty control signal S causes the
corresponding row of C-cells to be read or written. On the other hand, a stuck-at-O
faulty signal results in the corresponding row of C-cells being at the idle state, i.e., no
data are read and loaded.

Finally, since the adjacent lines in the SRBPLA are far apart to each other, the
adjacent bridging faults will not be considered in SRBPLAs. That is, only crossing
bridging faults are considered.

6.3.2 Fault Diagnosls and Repair Algorithm

Algorithm II, as illustrated in Appendix 3, summarizes the fault diagnosis and
repair process for the fault-tolerant SRBPLA design. The process for SRBPLA is
similar to that for FDPLA. The only differences are in the memory cells and other
functional cells. After the shift registers are tested, as indicated in Step 3.1, the
state of each cell is initialized by setting 1’s to the C-cells in the AND array and the
spare output part, as well as the 81-, SO-, and P- cells, and 0’s to the C—cells in the
OR array. Then, the stuck-at faulty bit lines are located in the same way as
discussed in FDPLA. In order to ensure that no s-a-l faults occur in the S-lines, we
try to write 0’s to the C-cells in the AND array and 1’s to the OR array. If the i-th S-
line has a s-a—l fault, the later diagnosis process will disclose that the C—cells in the
i-th row contain all 0’s instead of the expected 1’s.

With the above programming in the C—cells, G- and A- faults, stuck-at faults
at the product lines and output lines, and s-a-O at the data line of C—cells can be
diagnosed. However, in order to identify the S- and D- faults, s-a-l faults at the
data line, and s-a-O faults in the product lines and output lines, the contents of the C-
cells are programmed by their complemented states as shown in Step E.1.

107

6.4 Summary

A RAM-based PLA structure is presented to allow designers to reprogram
the logic functions as many times as needed. A fault-tolerant SRBPLA design is also
proposed that can detect, locate, and repair faults in the manufactluing process, and
also in ﬁeld use. A fault-diagnosis and repair algorithm gives the evidence that the
proposed fault tolerant design achieves a full diagnosability of single and multiple
crosspoint faults, bridging, faults, and stuck-at faults.

CHAPTER 7

Conclusions

 

This chapter summarizes the major contribution of this dissertation research

and outlines directions for future research.

7.1 Summary of Major Contributions

The yield of integration circuits (ICs) has always been crucial to the
commercial success of their manufacture. The manufacturing of VLSI products with
increasing complexity is the current trend of IC market. However, as the complexity
increases and the geometry shrinks, the probability of having faulty components also
increases, thereby lowing the chip yield.

One practical way to solve the low yield problem is the use of fault-tolerant
design. The only integrated circuits so far to have exploited fault-tolerant techniques
commercially have been memory chips. This is because memory chips are particularly
densely packed and therefore increasingly vulnerable to defects, and also because a
regular memory array lends itself to a variety of efﬁcient fault-tolerant designs.

Recently, due to the strucrural regularity, design simplicity, and fast
turnaround time, Programmable Logic Arrays (PLAs) have been increasingly popular
for implementing Boolean logic functions for VLSI/ULSI chips. Due to the similar
structure, large PLAs are also increasingly vulnerable to defects. In order to repair
partially defective PLA chips to enhance overall chip yield, a fault-tolerant PLA
design is proposed in this dissertation. Faults in a partially defective PLA chip can

108

109

be detected, located, and repaired. The repair does not affect the external signal
routing. It is the first fault-tolerant structure for PLA. Included in the study are the
structtues of repairable PLAs, and fault-diagnosable PLAs, an automatic layout

generator for fault-tolerant PLAs, and a fault diagnosis and repair process.

Some important issues for the design of fault-tolerant design such as die size,
speed, and yield enhancement, have also been addressed in this study. The results of
this study have shown that the yield can be enhanced significantly for large PLAs.
Another important issues in optimal redundancy and spare allocation have also been
discussed in Chapter 5. A simple, yet efficient optimization method has been
presented to determine the optimal redundancy of various sizes of PLAs.

Finally, a RAM-based PLA structure is proposed with its fault-tolerant
design. Although the PLA structlue taking RAM cells as its crosspoint contacts
consumes more silicon area than the conventional PLA, its re-programmability allows
the designers to change the design as many times as needed. Exploiting the
advantages of the shift registers used for programming memory cells in the RAM-
based PLA structure, the shift registers are also used to detect and locate faults that
occur either in manufacturing processing, or in ﬁeld use. In Chapter 6, the fault-
tolerant SRBPLA design is presented with the fault-diagnosis and repair process.

In summary, this dissertation focuses on fault-tolerant design of large PLAs.
It is novel in the sense that it is the ﬁrst fault-tolerant structure ever proposed for
PLAs. This structure will be helpful in achieving fault diagnosability and repair, and
thus improving chip yields.

7.2 Directions for Future Research

Array logic has been used extensively for structured design of LSI systems.
In particular, the PLA has been proved to be a very effective tool for implementing

multiple output combinational logic functions. In this section, some practical issues in

110

the fault-tolerant design for yield enhancement in array logic are addressed for further
study.

7.2.1 Fault-Tolerant Design of Folded PLAs

The PLA is an architectural structure used to reduce the design effort in
producing VLSI integrated circuits. However, industrial PLAs tend to be sparse and
thus grossly wasteful of chip area [34], i.e., area occupied only by interconnections
and not directly contributing to the implementation of the logic functions. The wasted
area will reduce circuit yield while also degrading the time performance of the PLA by
inuoducing unnecessary parasitics [21]. Folding is a technique which attempts to
reduce the area of a PLA by exploiting its sparsity. In other words, the objective of a
folding technique is to determine permutations of the rows (and/or columns) which
permit a maximal set of column pairs (row pairs) to be implemented in the same
column (row) of the physical logic array. There are many kinds of folding according to
the particular technological implementation and design style followed, e.g., Simple and
Multiple Column Folding, Simple and Multiple Row Folding, and Consu‘ained
Folding. The results of the study in [21] have shown that the chip area can be
reduced as much as 50% if a simple folding technique is applied.

Although the folded PLAs significantly reduce the chip area from the original
PLA, the test of such PLAs is very difﬁcult because the availability of the testable
structure, thereby limiting its applicability. Also, a "cut" is generally applied to a row
(column) shared by a row pair (column pair). The cut may introduce a bridging fault in
that row (column), i.e., the line is not completely "cut". Thus, the low yield problem
has been found in the folded PLA design. Therefore, the marriage of fault-tolerant
design and folding technique is perfectly applied to enhance the chip yield. This leads

to a very interesting topic for future research.

111

7.2.2 Fault-Tolerant Design of VLSI/ULSI/WSI Array Structures

Wafer scale integration (W SI) offers advantages of low assembly cost, high
reliability, and high performance. However, a major problem of the WSI approach is its
low manufacturing yield. Without redundant components and a reconﬁguration
mechanism for replacing defective components, the yield of a wafer scale device is very
likely to approach zero. Therefore, it is necessary to build redundancy and
reconﬁguration mechanisms into a WSI device for improving the yield [61].

Two-level redundancy scheme has been discussed in [61]. While the ﬁrst
level of redundancy, called the local redundancy, can repair the point defects, the
second level of redundancy, called the global redundancy, repairs the cluster defects.
Memory devices with two-level redundancy scheme have been widely used in the
WSI systems. The point defects occurred at the cell array can be repaired by the spare
cells (the use of local redundancy). On the other hand, those point defects occurred at
non-redundant areas, as well as the cluster defects, will cause the redundant memory
to be unrepairable. and this unrepairbale device is thus replaced by the other fault-free

device (the use of global redundancy).

It has been shown that a two-level redundancy scheme can improve the
manufacturing yield signiﬁcantly. The implementation of PLA as controller in WSI has
not yet been considered so far. This is simply because the repair of PLA was not
possible. With the success of this study, the proposed fault-tolerant PLA structure
can be applied to the design of the WSI system to improve the manufacturing yield.
This leads to another research t0pic for further study.

Due to the regular structures, the fault-tolerant design of VLSI array structures
such as memory and PLA, have been considered. The next logical step is to
implement the proposed fault-tolerant design for the design of some other array
structures, such as Storage Logic Arrays (SLAs) [51], or Programmable Array Logics
(PALs) [14].

As the technology moves to ULSI, more than one million transistors can be

incorporated on a single chip. Regularity plays an important role in the ULSI

112

approach. The fault-tolerant design is a practical way to enhance the chip yield before

the manufacturing process is manned

7.2.3 New Yet Low-Yield Technologies

As mentioned previously, it is desirable to incorporate redundant and fault-
tolerance process into any regular suuctures with low yield problems. In general, the
manufacturing yield for a new technology is generally low. It has been reported that
yield of LSI GaAs circuits is very low when the technology is applied to memory
design [26]. Table 7.1 lists the overall yield drops from 52% to 1% for the JFET
technology as the RAM size increases from 1K to 16K. Recently, several
manufacturers have tried to produce programmable logic devices implementing GaAs
technology. Unfortunately, the yields are unreasonably low. Therefore, fault-tolerant
structures should be applied to those regular structures with new technology.

Table 7.1 Yields of LSI GaAs Circuits [26]

 

Circuit Best Wafer Overall lot Size

D-MESFET

16 K RAM 2% 0.5% 7.3 x 7.5 mm
4 K RAM 40% 15% 3.4 x 7.5 mm
1 K RAM 55% 8% N/A
JFET

16 K RAM 3% 1% 8.2 x 5.6 mm
4 K RAM 41% 16% 4.1x 5.0 mm

1 K RAM 71% 52% 2.6 x 2.5 mm

 

 

APPENDICES

APPENDIX 1

Propagation Delay Time

As mentioned in Chapter 3, the propagation delay is based on the assumption
that the delay time of a logic gate is directly related to the driving capability of its
transistor. Under the above assumption, the delay time for a non-redundant PLA
with n inputs, p product terms, and m outputs, namely, a (n,p,m)-PLA, is estimated.
The total effective load capacitance in the AND plane is comprised of the total gate
capacitance in an input line of the AND plane and the worst case signal path
capacitance. The total gate capacitance depends on the number of pull-down
transistor in an input line. Suppose that the number of pull-down transistors in an
input line is g, the total gate capacitance is 909. The worst case signal path
capacitance is pCp because there are p product lines across to the input lines.
Therefore, the total delay time in the AND plane is approximately estimated as

Tn' x{(2n+p)Cp+gCg}
where l: is a constant determined by the average charging time.

For the redundant PLA of Figure 3.15 (a), the worst case signal path includes
the path to utilize the spare input lines. Therefore, the worst case signal path
capacitance is (4n+p)Cp, and the delay time is about

Tr = K [ (4n~l»p)Cp + 9 Co}

In fact, the number of pull-down transistor in the PLA design is determined by
the function implemented. The distribution of the locations of the pull-down transistor
is usually spare. Therefore, it is reasonable to assume that the average number of
pull-down transistors in a column of the AND plane is about the half of the number of

product lines, i.e. g=p/2. The quantities of the gate and signal path capacitances have

114

115

been studied and approximately estimated as CO - 1.77 x 10'14 F and Cp = 0.6084 x

10'15 F, i.e., Co = 290p. Thus, the ratio of the delay times for the PLA with and
without redundancy in the AND plane is

TrlTn = 1+ 2n / (2n+15.5p)

APPENDIX 2

Fault Diagnosis Algorithm for FDPLA

Algorithm 1:

* 11: number of input lines; sn: number of spare bit lines;
* m: number of output lines; sm: number of spare output lines;
* p: number of product lines; sp: number of spare product lines.

* r: number of input bit lines; r=2n.
*

Step A. /* Test the added circuits */

Set R=W=0 to isolate both ISR and PSR from the PLA;
Apply a scan pattern to detect stuck-at faults;
If any faults are detected, GOTO Step UR. l’ Un—repairable */

Step B. /* Stuck-at and bridging faults at the input bit lines */

B.1. Set Vdd1=1, (R,W)=(O,l). /“ ISR - "read" mode */
MP4), and shift all 0’s to PSR. P apply all 0’s to Pi’s */

B.2. Apply patterns (11...,In)=(0,..,0), and (1...,1) to obtain EPO and EPl.

B.3. Flag:=0;
For i=1 to 11 Do
If (EPO(i)=EPl(i)) Then I“ Stuck-at faults at bi */
Begin
if (sn=0) Then GOTO Step UR; /"' Un-repairable */
Repair bi by a spare input bit line;
Flag:=1;
snz=sn-l;
End
If (Flag=l) GOTO Step B.2. /* Check faults in spare lines */

116

Step C.

C.1.

C2.

C3.

C4.

117

/* G-faults, S-faults, and Stuck-at faults at the product lines *I

Set Vdd1=0, (R,W)=(1,0). /* ISR - "write" mode; PSR - "read" mode */
Set MPG and shift all 0’s to ISR. /*.apply all 0’s to bi’s */

Flag:=0;
For 131 to p Do
If (Pi=0) Then /"' s-a-0 fault at Pi */
Begin
If (sp=0) GOTO Step UR; /* Un-repairable */
Repair Pi by a spare product line;
Flag:=1;
s :=s -1;

P P
End

If (F1ag=l) GOTO Step B. /"' Check faults in spare lines *I

P Generate the matrix APF */
For j=l to r Do

Begin
Set bj=l and the others to 0’3;

For i=1 to p Do A(i,j):= complement of Pi;
End
Flag:=0;

For i=1 to p Do
If (the i-th row of APP contains all 0’s) Then
Begin /* s-a-l fault at Pi 1"I
If (sp=0) GOTO Step UR; P Un-repairable */
Repair Pi by a spare product line;
Flag:=1;
spz=spo1;
End
If (F1ag=l) GOTO Step B; /" Check faults in spare lines */

C.5. /* Generate the matrix CAP */

For i=1 to p
For j=l to r
CAP(id'):=a(i.i) 9 A(i.j);

Step D.

D1.

D2.

D.3.

D4.

118

/* D-faults, A-faults, Stuck-at and bridging faults at the output lines, and
bridging faults at the product 1ines*/

Set Vddl=0, (R,W)=(0,1). /* ISR - "read" mode; PSR - "write" mode */
Set MP=0 and shift all 0’s to PSR. /* apply all 0’s to Pi’s */

Flag:=0;
For j=1 to m Do
If (Oj=1) Then /"‘ s-a-l fault at Oj */
Begin
If (sm=0) GOT 0 Step UR; /"' Un-repairable */
Repair Oj by a spare output line;
Flag:=1;
smz=sm-l;
End

If (Flag=1) GOTO Step B. /* Check faults in spare lines */

/"' Generate the matrix OPP */
For j=1 to p Do

Begin
Set Pj=l and the others to 0’s;

For i=1 to m Do BG,i):=Oj;
End
Flag:=0;

For i=1 to 111 Do
If (the i-th column of OPP contains all 0’s) Then
Begin /* s-a-O fault at Oi ‘/
If (sm=0) GOT 0 Step UR; l‘ Un-repairable */
Repair 0i by a spare output line;
Flag:=1;
smz=sm-1;
End
If (Flag=1) GOTO Step B; /* Check faults in spare lines */

D5.

D6.

D7.

Step E.

Step UR.

119

Flag:=0;
For i=1 to p Do
If (the i-th row of OPF contains all 0’s) Then
Begin /* s-a-0 fault at Pi ’/
If (sp=0) GOTO Step UR; /"' Un-repairable */
Repair Pi by a spare product line;
Flag:=1;
sp:=sp-l;
End
If (Flag=1) GOTO Step B; /* Check faults in spare lines *I

/" BROO bridging faults "l
Flag:=0;
If (bit patterns of w adjacent columns in' OPP are the same) Then

If (the bit pattern equals to the pattern of ANDing the
corresponding columns in OP) Then /* BROO Bridging faults */
Begin
Flag:=1;
smz=sm-w;
If (sm < 0) GOTO UR; l‘ Un-repairable */
Repair these w’s bridged lines by spare output lines.
End
Repeat this step until all columns are checked for BROO faults.
If (Flag=1) GOTO Step B. P Check faults in spare lines */

/"' Generate the matrix COP */
For i=1 to p
For j=l to m
COP(i,j):=b(i,j) 6 B(i.j);

/" Spare allocation for crosspoint faults */

Call spare allocation routine to repair crosspoint faults.

If (the required spares do not exceed the available spares) Then
the defective chip is repairable and STOP.

/"' Un-repairable */
The defective chip is not repairable.

 

APPENDIX 3

Fault Diagnosis Algorithm for SRBPLA

Algorithm II:
Step A. /* Test the shift registers */
Same as part A of Algorithm 1, except setting (Rp,Wp,Wm)=(0,0,0).

Step 3. /* Identify Stuck-at and bridging faults at the input bit lines *I
8.1. /* Initialize the state of the SRBPLA */

Set (Rp,Wp,Wm)=(0,0,l), and S-SR=(1,1,...,1);

Load 1’s to the C-cells in the AND arrays, i.e.,

the personality of matrix AP contains all 1’s entries;
Load 0’s to the C-cells in the OR arrays, i.e., OP(i,j)=0;
Load 1’s to the C-cells of the spare output lines; and
Load 1’s to the 81-, SO-, P—cells.

8.2. - 8.4. P Locating the stuck-at faults at b. */
Same as B]. - B3 of Algorithm I except setting (Rp,Wp,Wm)=(0,1,0)

Step C. /* Identify G-faults, and stuck-at faults at the product */

C.1. Set (Rp,Wp,Wm)=(0,0,l), S-SR=(0,0,...,0); AND-SR=(0,0,...,0), and

OR-SR=(0,0,...,0); I" A stuck-at-l fault at j-th S signal will re-program
the j-th row C-cells from 1 to 0 in the AND plane */

C.2. - 0.5. /* Identify stuck-at faults at the product lines and
' generate matrix CAP */

Same as C. l. - C4. of Algorithm I except setting (Rp,Wp,Wm)=(1,0,0)

120

0.6.

C.7.

Step D.

121

/" Identify s-a—O fault at the data path */
Flag:=0;

For i=1 to r Do

If (the i-th column of APP contains all 0’s) Then

Begin /* i-th column has s-a-O fault at signal D *I
If (sn=0) GOTO Step UR; I“ Un-repairable */
Repair bi by a spare input line;
Flag:=1;
s :=sn-l;
End
If (Flag=1) GOTO Step B; /“' Check faults in spare lines */

/"' Generate the matrix CAP */

Same as GS. of Algorithm 1

/‘ Identify A-faults, s-a-l and bridging faults at the output lines, and
bridging faults at the product lines)

D.1. - D.3. /* Generate matrix OPP and identify s-a-l faults at Oi */

Same as D.1. - D.3. of Algorithm I except setting (Rp,Wp,Wm)=(0,1,0)

D.4. /* Stuck-at fault at Oi *l

Flag:=0;
For i=1 to m Do
If (the i-th column of OPP contains all 1’s) Then

368i"
If (sm=0) GOTO Step UR; I" Un-repairable */
Repair Oi by a spare output line;
Flag:=1;
sm:=sm-l;
End
If (Flag=1) GOTO Step B; /"I Check faults in spare lines */

D.5. /* Generate the matrix COP */

Same as D5. of Algorithm I

Step E.

E1.

E2.

E.3.

E4.

122

/* S- and D-faults, s-a-O at outputs */

/* Re-program the state of the AND and OR arrays */
Set (Rp,Wp,Wm)a(0 ,0,1), and S-SR=(1,1,...,1);

Load 0’s to the C-cells in the AND arrays, i.e., AP(i,j)=0;
Load 1’s to the C-cells in the OR arrays, i.e., OP(i,j)=1;

/"' Generate the matrix APP */

Same as C3. of Algorithm I

P Identify s-a-l faults at the data path */

Flag:=0;

For i=1 to r Do

If (the i-th column of APF contains all l’s) Then

Begin I“ i-th column has s-a-l fault at signal D *I
If (sn=0) GOTO Step UR; l’ Un-repairable */
Repair bi by a spare input line;
Flag:=1;
snz=sn-1;

End

If (Flag=1) GOTO Step 8; /* Check faults in spare lines */

/"' Generate the matrix CAP */
For i=1 to p
For j=1 to r
CAP(iJ)==_CAP(i.i) + (add) 9AM»;

E.5. - E.7. /" Generate the matrix OPP and

EB.

Step F.

identify s-a-0 faults at the product and outputs */
Same as D.3. - D5. of Algorithm I

/"' Generate the matrix COPF */
For i=1 to p
For j=1 to m
COP(i,j):= COP(i,j) + (b(i,j) $B(i,i));

/" Spare allocation for crosspoint faults */

Call spare allocation routine to repair crosspoint faults.

If (the required spares do not exceed the available spares) Then
the defective chip is repairable and STOP.

Step UR. /* Un-repairable */

The defective chip is not repairable.

 

BIBLIOGRAPHY

BIBLIOGRAPHY

 

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[3]

[9]

Abbott, R, K. Kokkonen, R. 1. Kung, and R. J. Smith, "Equipping a Line of
Memories with Spare Cells," Electronics, pp. 127-130, July 28, 1981.

Abraham, J. A., and W. K. Fuchs "Fault and Error Models for VLSI," IEEE
Proceedings, Vol. 74, No. 5, pp. 639-654, May 1986.

Albuquerque, N. M., "Cut-and-Patch Lasers Speed Chip Repairs,"
Electronics, pp. 19-20, June 16, 1986.

Asai, S., "Semiconductor Memory Trends," IEEE Proceedings, Vol. 74, No.
12. pp. 1623-1635, Dec. 1986.

Bindels, J. F. M., J. D. Chlipala, F. H. Fisher, T. F. Mantz, R. G. Nelson, and
R. T. Smith, "Cost-Effective Yield Improvement in Fault-Tolerant VLSI
Memory," IEEE International Solid State Circuits Conference Digest, New
York, NY, pp. 82-83, Feb. 1981.

Bozorgui-Nesbat, S., and E. J. McCluskey, "Lower overhead design for
testability of programmable logic arrays," IEEE Trans. on Computers Vol. C-
35, No. 4, pp. 379-383, April 1986.

Cenker, R. P., D. G. Clemons, W. R. Huber, J. B. Petrizzi, F. J. Procyk, and
Trout G. M, "A Fault-Tolerant 64k Dynamic RAM," IEEE Trans. on
Electron Devices, Vol. ED-26, No. 6, pp. 853-860, June 1979.

Chang, M.-F., W. K. Fuchs, and J. H. Patel, "Diagnosis and Repair of
Memory with Coupling Faults," Proceeding of the International Conference
on Computer-Aided Design, Santa Clara, CA, pp. 524-527, Nov. 1988.

Chang, T. Y., "The Design of Testable Programmable Logic Arrays," Master
thesis, Michigan State University, 1987.

124

[10]

[11]

[12]

[13]

[141'

[15]
[16]

[17]

[18]

[19]

[20]

[21].

125

Chang, T. Y., "MRPLA Users Manual," Dept. of Electrical Engineering,
Michigan State University, 1989.

Chang, T. Y., and C. L. Wey, "The Design of Electrically Field-Repairable
Programmable Logic Arrays (EFRPLAs)," Proceeding of 31st Midwest
Symposium on Circuits and Systems, pp. 36-39, St. Louis, MO., Aug. 1988.

Chang, T. Y., and C. L. Wey, "Design of Fault Diagnosable and Repairable
PLA," IEEE Journal of Solid State Circuits, Vol. SC-24, No. 5, pp. 1451-
1454, Oct. 1989.

Day, J. R., "A Fault-Driven Comprehensive Redundancy Algorithm for
Repair of Dynamic RAMs," IEEE Design and Test of Computers, Vol. 2,
No. 3, pp.35-44, June 1985.

Design with Programmable Array Logic, Technical Staff of Monolithic
Memories, Inc., New York, McGraw-Hill, 1981.

EP300, Erasable Programmable Logic Devices, Altera Corp., Santa Clara.

Fawcett, B. K., "Taking Advantage of Reconﬁgurable Logic," Programmable
Logic Guide, High Performance, pp. 17-24, 1989.

Ferry, D., L. A. Akers, and E. W. Greeneich, Ultra Large Scale Integrated
Microelectronics, Prentice Hall Advanced Reference Series, 1988.

Fleisher, H. and L. I. Maissel, "An Introduction to Array Logic," IBM Journal
Research and Development, Vol. 19, pp. 98-109, March 1975.

Fong, E., M. Converse and P. Denham, "An Electrically Reconﬁgurable
Programmable Logic Array Using a CMOS/DMOS Technology," IEEE
Journal Solid State Circuits, Vol. SC-19, No. 12, pp. 1041-1043, Dec. 1984.

Fucks, W. K. ,and S. Y. Kuo, "Spare Allocation/Reconﬁguration for WSI,"
Wafer Scale Integration, Ed. by EB. Swartzlander, Jr., Kluwer Academic
Publishers. PP. 119-191, 1989.

Hachtel, G. D., A. R. Newton, and A. L. Sangiovanni-Vincentelli, "An
Algorithm for Optimal Folding," IEEE Trans. on CAD, Vol. CAD-1, No. 2,
pp. 63-76, April 1982.

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

126

Haddad, R. W., and A. T. Dahblna, "Increased Throughput for the Testing
and Repair of RAMs with Redundancy," Proceeding of the International
Conference on Computer-Aided Design, Santa Clara, CA, pp. 230-233, Nov.
1987.

Hasan, N., and C. L. Liu, "Minimum Fault Coverage .in Reconﬁgurable
Arrays," Proceeding IEEE International Fault-Tolerant Computing
Symposium, PP. 348-353, 1988.

Hemmady, V. G., and S. M. Reddy, "On the Repair of Redundant RAMs,"
Proceeding 26th ACM/IEEE Design Automation Conference, Anaheim, CA,
pp. 710-713, June 1989.

Khakbaz, J ., "A Testable PLA Design with Low Overhead and High Fault
Coverage," IEEE Trans. on Computers, Vol. C-33, No. 8, pp. 743-745, Aug.
1984.

Kirkpatrick, C. 0., "Making GaAs Integrated Circuits," IEEE Proceedings,
Vol. 76, No.7. DP. 792-815, July 1989.

Hnatek, E. R., Semiconductor Memories - an update, PennWell Publishing
Company, 1982.

Koren, I., "A Reconﬁgurable and Fault-tolerant VLSI Multiprocessor
Array," Proceeding 8th Symposium Computer Architecture, pp. 442-451,
May 1981.

Koren, 1., and D. K. Pradhan, "Modeling the Effect of Redundancy on Yield
and Performance of VLSI Systems," IEEE Trans. on Computers, Vol. 036,
No. 3, pp. 344-355, March 1987.

Kuo, S. Y. and W. K. Fuchs, "Efﬁcient Spare Allocation in Reconﬁgurable
Arrays," IEEE Design and Test of Computers, pp. 24—31, Feb. 1987.

Kuo, S. Y., and W. K. Fuchs, "Fault Diagnosis and Spare Allocation for
Yield Enhancement in A Large Reconﬁgurable PLAs," Proceeding IEEE
International Test Conference, Washington DC, pp. 944-951, Sep. 1987.

Law, H.-F. S., and M. Shoji, "PLA Design for BELLMAC-32A
Microprocessor," Proceeding International Conference Circuits and
Computers, pp. 161-164, Feb. 1982.

[33]

[34]

[35]

[36]

[37]

[38]

[39]

[40]

[41]

[42]

[43]

[44]

127

Mak, G. P., J. A. Abraham, and E. S. Davidson, "The Design of PLAs with
Concurrent Error Detection," Proceeding EBB International Test
Conference, Philadelphia, PA, pp. 303-310, Nov. 1982.

Makarenko, D. D., J. Tartar, "A Statistical Analysis of PLA Folding," EEE
Trans. on CAD, Vol. CAD-5, No. 1, pp. 39-51, Jan. 1986.

Marchand, J. F. P., "An Alterable Programmable Logic Array," EEE Journal
Solid State Circuits, Vol. SC-20, No. 5, pp. 1061-1066, Oct. 1985.

Mayo, R. N., and J. K. Ousterhout, "Pictures with Parentheses: Combining
Graphics and Procedures in a VLSI layout Tool," Proceeding 20th
ACM/EEE Design Automation Conference, Miami Beach, FL, pp. 270-276,
June 1983.

Mead, C. A., and L. A. Conway, Introduction to VLSI systems, Addison-
Wesley, Reading, Mass. 1980.

Minato, O., T. Masuhara, T. Sasaki, Y. Sakai, T. Hayashida, K. Nagasawa,
K. Nishimurs, "A Hi-CMOS 8k x 8k bit static RAM," EEE Journal of Solid
State Circuits, Vol. SC-17, No. 5. pp. 793-797 , Oct. 1982.

Moore, W. R., "A Review of Fault-Tolerant Techniques for the
Enhancement of Integrated Circuit Yield," EBB Proceedings, Vol. 74, N o. 5,
pp. 684-698, May 1986.

MPLA User’s Manual, Berkeley VLSI Tools. Department of Electrical
Engineering and Computer Science, University of California at Berkeley.

Ostapko, D. L., and S. J. Hong, "Fault Analysis and Test Generation for
Programmable Logic Arrays," EEE Trans. on Computers, Vol. C-28, No. 9,
pp. 617-626, Sept. 1979.

Posa, J. G.,"What to Do When the Bits Go Out," Electronics, pp. 117-120,
July 28, 1981.

Pradhan, D. K., and K. Son, "The Effects of Untestable Faults in PLAs and a
Design for Testability," Proceeding EEE International Test Conference,
Philadelphia, PA. PP. 359-367, Nov. 1980.

Sack, E. A., R. C. Lyman, and G. Y. Chang, "Evolution of the concept of a
computer on a slice," EEE Proceedings, Vol. 52, pp. 1713-1779, Dec. 1964.

[45]

[46]

[47]

[48]

[49]
[50]

[51]

[52]

[53]

[54]

[55]

[56]

128

Sasaki, 11., "Directions and Strategies to Achieve ULSI," VLSI Technology
Digest, San Diego, CA, pp. 3-6, May 1988.

Schuster, S. B., "Multiple Word/Bit Line Redundancy for Semiconductor
Memories," EBB Journal Solid-State Circuits, SC-13, No.5, pp. 698-703,
Oct. 1978.

Sear, D., "Is Bigger Always Better?," ASIC Technology & News, pp. 14-16,
May 1989.

Shen, J., W. Maly, and F. Ferguson, "Inductive Fault Analysis of MOS
Integrated Circuit," EBB Design and Test of Computers, pp. 13-26, Dec.
1985.

Signetics Field Programmable Logic Arrays, Signetics, CA, Oct. 1977.

Smith, J. E., "Detection of Faults in Programmable Logic Arrays," EEE
Trans. on Computers, Vol. C-28, No. 11, pp. 845-853, Nov. 1979.

Smith, K. F., T. M. Carter, and C. Hunt, "Structured Logic Design of
Integrated Circuits Using the Storage/Logic Array (SLA)," EEE Trans. on
Electron Device, Vol. BD-29, No. 4, pp. 765-776, April 1982.

Smith, R. S., J. D. Chlipala, J. F. M. Bindels, R. G. Nelson, F. H. Fisher, and
T. F. Mantz, "Laser Programmable Redundancy and Yield Improvement in a
64K DRAM," EBB Journal of Solid State Circuits, Vol. SC-16, No. 5, pp.
506-513, Oct. 1981.

Smith, R. T., "Using a Laser Beam to Substitute Good Cells for Bad,"
Electronics, pp. 131-134, July 28, 1981.

Somenzi, F., and S. Gai, "Fault Detection in Programmable Logic Arrays,"
EBB Proceedings, Vol. 74, No. 5, pp. 655-668, May 1986.

Stapper, C. H., A. N. McLaren and M. Dreckmann, "Yield Model for
Productivity Optimization of VLSI Memory Chips with Redundancy and
Partially Good Product," IBM Journal Research Development, Vol. 24, pp.
398-409, May 1980.

Stewart, D. M., "Production Test and Repair of 256k Dynamic RAMs with
Redundancy," Proceeding EBB International Test Conference, Philadelphia,
PA, pp. 471-474, Oct. 1983.

 

[57]

I [58]

[59]

[601 °

[61]

[62]

[63]

[64]

[65]

129

Strojwas, A. J., "Design for Manufacturability and Yield," Proceeding 26th
ACM/EBB Design Automation Conference, Las Vegas, NV, pp. 454-459,
June 1989.

Tarr, M., D. Boudreau, and R. Murphy, "Defect Analysis SyStem Speeds
Test and Repair of Redundant Memories," Electronics, pp. 175-179, Jan. 12,

' 1984.

Treuer, R., H. Fujiwara, and V. K. Agarwal, "Implementing a Built-In Self-
test PLA Design," EBB Design and Test of Computers, pp. 37-48, April
1985.

Walker, D. M. H., Yield Simulation for Integrated Circuits, Kluwer Academic
Publishers, Boston, 1987.

Wang, M., M. Culter, and S. Y. H. Su, "Reconﬁguration of VLSI/WSI Mesh
Array Processors with Two-Level Redundancy," EBB Trans. on
Computers, Vol. C-38, No. 4, pp. 547-554, April 1989.

Wey, C. I... "On yield Considerations for the Design of Redundant
Programmable Logic Arrays," EBB Trans. on CAD, Vol. CAD-7, No. 4, pp.
528-535, April 1988.

Wey, C. L., and F. Lombardi, "On the Repair of Redundant RAM’s," EBB
Trans. on CAD, Vol. CAD-6, No. 2, pp. 222-231, March 1987.

Wey, C. L., T. Y. Chang and M. K. Vai, "On the Design of Fault-Tolerant
Programmable Logic Arrays," Proceeding International Computer
Symposium. Taiwan. pp- 298-304, Dec. 1986.

Wey, C. L., M. K. Vai, and F. Lombardi, "On the Design of a Redundant

Programmable Logic Array (RPLA)," EBB Journal of Solid-State Circuits,
Vol. 8022, No.1. pp. 114-117, Feb. 1987.

 

I‘IIC

HIGRN STATE UNIV. LIBRARIES
I IIIIIIIIIIIIIIIIIIIIIII”IIIIIIIIIIIIIHI
31293013966530