Professional Documents
Culture Documents
1 Introduction
Testing in parallel the cores in a System-On-a-Chip (SOC)
helps reduce test application time; power dissipation imposes
strict limits to this parallelism as overheating the chip might
result in damaging the cores under test. Not only does average power dissipation during testing need to be reduced consequently to avoid overheating the chip, but furthermore particular attention has to be devoted to peak power dissipation issues
as exceeding power thresholds may result in burning the chip.
During test mode, filling in the scan chain with test data requires shifting the bits one by one into each chain, thus creating
increased switching activity in the flip-flops. The rippling effect in a scan chain reflects into the circuit, resulting in a large
number of unnecessary transitions at the circuit lines, unless
costly additional control is utilized.
In traditional approaches, the stimulus inserted to the scan
chain is one and the same as the test vector; similarly, the collected test response to be transmitted to the ATE is identical
to the circuit response captured in the scan cells. Modification of the scan chain through logic gate insertion between the
The work of the first author is supported through a graduate fellowship by
IBM.
2 Previous Work
Numerous methodologies that aim at test power reduction
in a scan-based environment have been proposed recently. The
utilization of externally controlled gates [1] to mask out the
transitions originating from the scan chain have been shown to
reduce test power drastically; however performance degrada-
test
stimulus
cell 5
cell 4
cell 3 cell 2
cell 1
inserted
stimulus
00110
Transition1
Transition2
cell 5
cell 4
cell 3 cell 2
cell 1
cell 4
cell 3 cell 2
cell 1
00001
Transition
scan in
scan in
cell 5
test
1
response
cell 4
cell 3 cell 2
cell 5
cell 1
scan out
1
test
1
response
tion due to gate delays inserted on critical paths may be unacceptable. Several techniques have been employed during the
test application process; appropriate primary input assignment
during shift cycles [2] helps reduce transition propagation from
scan chain to the circuit under test. The effectiveness of such
techniques is limited, as typically circuits are controlled mostly
by scan chains rather than primary inputs.
Several techniques aim at reducing transitions in the scan
chain. In [3], the test cubes are compacted and padded so
as to minimize the number of scan chain transitions; even
though test power is reduced, test application time may suffer a bit of deterioration in comparison to the random padding
case as padding for minimum power typically results in a few
additional test vectors. To minimize the switching activity in
the scan chain, two techniques, test vector ordering and scanlatch ordering, are proposed in [4]; however, the computational complexity of these techniques limits their applicability. Whetsel proposes [5] a general scheme for partitioning the
scan chain into several smaller scan chains so as to have only
one of the small chains active at a time; rippling occurs in only
one small chain, thus reducing the test power dissipated. A
numerous set of variations [6, 7] on this basic scheme advocated by Whetsel in [5] is pursued by Bonhomme et al. The
instantiation of the general scheme advocated by Whetsel [5]
to partitioning into two scan chains specifically is illustrated in
[6, 7]. Bonhomme et al examine this particular instance in the
context of ATE controlled scan-based environments in [6] and
continue their efforts by focusing on the same bipartitioning
instance in the context of LFSR produced tests in [7].
In [8], scan chain modification through the insertion of logic
gates between the scan cells is proposed to reduce the scan
chain transitions associated with the test stimuli inserted. The
test vectors are analyzed to identify the location and the type of
logic to be inserted to the scan path. Even though considerable
scan-in test power reductions are achieved, scan-out test power
is not handled, limiting the effectiveness of the technique in
overall test power reduction. Furthermore, the technique delivers only average power reductions; peak power considerations
are overlooked.
3 Motivation
The rippling that originates from the scan chain reflects into
more transitions inside the circuit, causing increased power
dissipation unnecessarily. Scan chain flip-flops frequently tog-
observed
response
00000
Transition3
scan out
gle due to transitions embedded in the test data; figure 1 illustrates how the transitions between the test data bits flip the
scan cells. The transition between the two rightmost bits of
the test vector flips all but the rightmost scan cell, whereas the
other transition in the test stimulus vector flips the two leftmost
cells. The transition embedded in the test response on the other
hand flips the three rightmost cells.
The scan chain transitions during the shift of the test data
can be reduced through scan chain modifications. Appropriate
insertion of logic gates between scan cells enables the insertion
of a new stimulus and the observation of a new response with
fewer transitions embedded. For instance in figure 2, insertion
of an inverter between the third and the fourth scan cells helps
reduce the number of transitions from 9 down to 4 during the
shift cycles.
Through various scan chain modifications, numerous test
data transformations can be implemented, each having a certain impact on the test data transitions. These modifications
consist of logic gate insertion between scan cells as well as
inversion of capture paths; such modifications involve no gate
delay introduction to the critical paths of the circuit. It is essential to identify the relationship between a particular scan chain
modification and the transformation function implemented by
this modification; identification of this relationship in turn
helps pinpoint the optimal scan chain modification that would
lead to the transformed test data with the fewest possible transitions.
4 System Overview
In this section, the characteristics for possible scan chain
modifications are explored to enable the implementation of a
computationally efficient methodology; a divide and conquer
scheme is presented wherein various scan chain fragments can
be independently considered for possible modification. The
modifications allowed in the scan chain are judiciously selected so as to enable independence among various scan chain
fragments; the impact of the insertion of different gates on
test data transformation is analyzed. The variety of the scan
chain modifications is increased through insertion of inverters
in capture cycles; not only is the number of possible scan chain
modifications increased, but furthermore peak power violations during capture cycles can be effectively handled through
the appropriate capture path inversions.
cell i+1
cell i1
...
...
cell i
cell i1
...
scan out
scan in
...
scan out
scan in
bit i
bit i-1
ai 1
ai+1 ai+2 ai 1 ai+1 ai+2
ai
Figure 3. Impact of XOR gates on the transitions between test data bits
system
test
system test
capture/test
MUX
normal/test
MUX
normal/test
MUX
F1
F2
F3
F4
F5
F6
F7
F8
Gates
XOR,XOR
XNOR,XNOR
INV,INV
XOR,XNOR
XNOR,XOR
INV,-,INV
-,-
ai+1
ai+1
ai+1
ai+1
ai+1
ai+1
ai+1
ai+1
ai+1
Test bit
ai
ai ai+1
0
(ai ai+1 )
0
ai
ai ai+1
0
(ai ai+1 )
0
ai
ai
ai
ai
ai
ai
ai
a0i
a0i
a0i
a0i
ai
1
1
1
1
1
1
1
1
1
000
00
--0-0
-0
000
001
00
-+
-+
0+
-0
-0
0+
00
010
00
++
++
0+
+0
+0
0+
00
Impact
011 100
00
++00
++0+0
+0
0+0
+0
0000
00
101
++
00
++
+0
0+
+0
0+
00
110
-+
00
-+
-0
0+
-0
0+
00
111
-00
--0
0-0
000
Table 1. The transition-preserving modifications and their impact on the transitions between test data bits
5 Algorithmic Framework
In this section the possible scan chain modifications are presented. Average and peak power issues are considered separately; two different algorithms, both based on test data analysis, are provided.
Table 1 lists the transformations associated with various
scan chain modifications along with the impact on the test data
transitions. The second column denotes the gates to be inserted between the elements of the left pair and the gates to
be inserted between the elements of the right pair, in that order. The third column shows the test data that needs to be inserted to the scan chain to bring in the three actual test vector
bits into the scan cells. The information provided in the third
column constitutes the transformed test response data as well.
Notably, for all the entries in table 1, the rightmost test bit,
ai 1 , either remains intact or is simply negated, thus ensuring
the transition-preserving property for all such modifications.
The distinct local impact of every modification on the transitions within a block of test data is demonstrated in the same
table; for all eight combinations of three test data bits, the effect of implementing a modification on the transitions between
the first two and the last two bit positions in the block, respectively, is provided. A transition that is removed is denoted by
+, a transition created due to the modification of the scan
chain, by -; 0 denotes no impact. For instance, inserting 2
XOR gates results in removing both transitions when the test
ai+1
ai+1
ai+1
ai+1
ai+1
ai+1
ai+1
ai+1
ai+1
Test bit
ai
ai ai+1
0
(ai ai+1 )
0
ai
ai ai+1
0
(ai ai+1 )
0
ai
ai
ai
ai
ai
ai
ai
a0i
a0i
a0i
a0i
ai
1
1
1
1
1
1
1
1
1
Circuit
s713
s953
s1423
s5378
s9234
s13207
s15850
s35932
Area cost(%)
5.8
2.8
5.7
3.8
2.6
3.3
4.7
4.2
8 Experimental Results
The proposed average and peak test power reduction
schemes have been applied to several fully-scanned circuits in
ISCAS89 [9]. The test vectors that are used to compute the test
power reductions achieved by the proposed methodologies are
generated by ATALANTA [10].
Table 3 demonstrates the average test power reductions
based on test data transformations; the reduction in the total
number of transitions in the scan chain, and the increase in
area overhead are provided. Overall test power reductions of
10% to 20% are achieved with small area cost of typically less
than 5%.
Peak power reductions along with the associated area costs
are reported in table 4; every time the peak power reduction
algorithm is executed, a target peak power level is selected in
advance. Of course, no guarantees can be extended for achieving arbitrary peak power thresholds. From the experimental
data, it is seen that a 20% peak power reduction is attained for
only the circuit s713 whereas a 5% reduction is achieved for
all the circuits. The proposed methodology achieves 10% reduction in peak power for six of the eight benchmarks. As the
target peak power reduction is increased, the area cost becomes
larger; since the number of test cycles wherein power violation
occurs is increased, the number of modifications implemented
in the scan chain goes up, resulting in larger area overhead.
The considerable average power reductions attained by the
methodology we propose enable the application of manufacturing tests with no overheating of the chip. Furthermore, controlling the peak power below certain thresholds eliminates the
danger of burning the chip during test. Reliable test of circuits
is enabled by the average and peak power reduction method-
Circuit
s713
s953
s1423
s5378
s9234
s13207
s15850
s35932
Average
Target: 5%
Area
3.4
2.4
4.7
3.4
2.2
3.1
4.0
3.4
3.3
Target: 10%
Area
4.1
2.9
5.6
N/A
2.7
3.5
N/A
3.9
3.8
Target: 15%
Area
5.6
N/A
5.8
N/A
N/A
N/A
N/A
4.1
5.2
Target: 20%
Area
5.9
N/A
N/A
N/A
N/A
N/A
N/A
N/A
5.9
9 Conclusion
To enable parallelism when testing cores in an SOC design, the average and peak test power associated with the cores
should be reduced. The proposed methodology achieves this
reduction by slightly modifying the scan chain based on the
analysis of the given test set. The suggested modifications in
the scan chain enable the decomposition of the test set into
blocks as a modification in a scan chain fragment impacts only
the transitions within a single block. Each test data block is
analyzed individually to identify the modification that leads
to the optimal transformation of the block; subsequently, the
modification is implemented for the corresponding scan chain
fragment. The consequent computationally efficient algorithm
achieves significant test power reductions.
The proposed methodology achieves test power reductions
through insertion of logic gates on the test paths. In addition
to logic gate insertions between scan cells which offer numerous transformations for both stimuli and responses, particular
capture paths are inverted, resulting in an increased number of
Circuit
s713
s953
s1423
s5378
s9234
s13207
s15850
s35932
[6] alone
45.5
40.3
51.9
38.0
42.6
39.1
53.9
47.9
Improvement
15.0
12.7
11.1
10.5
13.8
7.6
14.1
10.2
transformations for test responses. Since no gate delays are introduced to the critical paths, no interference with the normal
operation of the circuit exists.
To demonstrate the efficacy of the proposed approach, we
have applied it on several ISCAS85 and ISCAS89 benchmark
circuits. The experimental results indicate that significant average and peak test power reductions are achieved with slight
additional area cost. This area cost is negligible compared to
the test application time saved due to the increased parallelism
among core tests that the proposed methodology enables.
References
[1] H. J. Wunderlich and S. Gerstendorfer, Minimized power consumption for scan based BIST, in ITC, pp. 8594, 1999.
[2] T. Huang and K. Lee, An input control technique for power
reduction in scan circuits during test application, in ATS, pp.
315320, 1999.
[3] R. Sankaralingam, R. R. Oruganti and N. A. Touba, Adapting
scan architectures for low power operation, in VTS, pp. 3540,
2000.
[4] V. Dabholkar, S. Chakravarty, I. Pomeranz and S. M. Reddy,
Techniques for minimizing power dissipation in scan and
combinational circuits during test application, IEEE TCAD,
vol. 17, n. 12, pp. 13251333, 1998.
[5] L. Whetsel, Adapting scan architectures for low power operation, in ITC, pp. 863872, 2000.
[6] Y. Bonhomme, P. Girard, L. Guiller, C. Landrault and
S. Pravossoudovitch, A gated clock scheme for low power
scan testing of logic ICs or embedded cores, in ATS, pp. 253
258, 2001.
[7] Y. Bonhomme, P. Girard, L. Guiller, C. Landrault, S. Pravossoudovitch and H.J. Wunderlich, A modified clock scheme for
a low power BIST test pattern generator, in VTS, pp. 306311,
2001.
[8] O. Sinanoglu, I. Bayraktaroglu and A. Orailoglu, Test Power
Reduction Through Minimization of Scan Chain Transitions,
in VTS, pp. 161169, 2002.
[9] F. Brglez, D. Bryan and K. Kozminski, Combinational Profiles
of Sequential Benchmark Circuits, IEEE ISCAS, vol. 14, n. 2,
pp. 19291934, May 1989.
[10] H. K. Lee and D. S. Ha, On the Generation of Test Patterns
for Combinational Circuits, Technical Report 12-93, Department of Electrical Eng., Virginia Polytechnic Institute and State
University.