Optimal Control of Rapid Thermal Annealing in A Semiconductor Process

Journal of Process Control 14 (2004) 423430
www.elsevier.com/locate/jprocont
Optimal control of rapid thermal annealing

in a semiconductor process q
R. Gunawan, M.Y.L. Jung, E.G. Seebauer, R.D. Braatz *
Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, 600 South Mathews Avenue,
93 Roger Adams Laboratory, Box C-3, Urbana, IL 61801-3792, USA
Received 9 May 2003; received in revised form 21 July 2003; accepted 21 July 2003
Abstract
This study focuses on the optimal control of rapid thermal annealing (RTA) used in the formation of ultrashallow junctions
needed in next-generation microelectronic devices. Comparison of dierent parameterizations of the optimal trajectories shows that
linear proles give the best combination of minimizing junction depth and sheet resistance. Worst-case robustness analysis of the
optimal control trajectory motivates improvements in feedback control implementations for these processes. This is the rst time
that the eects of model uncertainties and control implementation inaccuracies are rigorously quantied for RTA.
2003 Elsevier Ltd. All rights reserved.
Keywords: Optimal control; Batch control; Uncertainty analysis; Robustness analysis; Microelectronics processes; Semiconductor processing; Rapid
thermal annealing
1. Introduction The state-of-the-art in post-implant annealing em-

ploys a lamp-based rapid thermal annealing (RTA). Fig.
Moores law requires a continued shrinkage of feature 1 shows a typical RTA spike anneal program, which
sizes in microelectronic devices. For example, advanced consists of a stabilization step at constant temperature
CMOS devices will require junction depths between 13 (650 C), followed by a spike anneal comprising a
and 22 nm in the source and drain extension region by linear heating step at a constant rate (100 C/s)
the year 2005 according to the 2001 International reaching a maximum temperature (1000 C), and -
Technology Roadmap for Semiconductors. The current nally a radiative cooling step at a initial rate of several
technology for formation of such ultrashallow junctions tens of degrees per second. In the literature, there exists
depends on ion implantation of dopant, such as boron, conicting experimental evidence on the ecacy of using
into silicon. Although the junction depth can be made high heating rates (up to 400 C/s) in the spike anneal
shallower by reducing the implant energy, the eective- prole to reduce TED [1,2]. However, recent results [3,4]
ness of this approach is limited by the need to anneal out tend to conrm the benet of using high heating rates.
the point and/or extended defects generated by ion im- The results also suggest that the reduction in the junc-
plantation. Silicon self-interstitial defects can mediate tion depth comes at the expense of an undesired increase
the diusion of dopants during the annealing process, in the sheet resistance. The trade-o in reducing the
which leads to a signicant increase of the junction junction depth without sacricing the sheet resistance
depth. This phenomenon is known as transient en- presents a major roadblock in transistor miniaturization
hanced diusion (TED). and motivates a careful optimization of the post-implant
annealing temperature program. For this reason, con-
siderable eorts have been put forth in the design of the
optimal post-implant annealing programs to produce
q
A preliminary abbreviated version of this manuscript was accepted the desired junction depth while achieving low sheet
for presentation at the IFAC Symposium on Advanced Control of resistance (see [5] and references therein).
Chemical Processes Hong Kong, 2003.
* In the literature, there have been extensive develop-
Corresponding author. Tel.: +1-217-333-5073; fax: +1-217-333-
5052. ments in the control of rapid thermal processing (RTP)
E-mail address: braatz@uiuc.edu (R.D. Braatz). to provide temperature tracking and wafer thermal
0959-1524/$ - see front matter 2003 Elsevier Ltd. All rights reserved.
doi:10.1016/j.jprocont.2003.07.005
424 R. Gunawan et al. / Journal of Process Control 14 (2004) 423430
1200 reaction networks including boron activation and in-

Spike Anneal terstitial clustering. The model comprises coupled con-
1000 tinuity equations (i.e., mass balances) for each species
Temperature (oC)
800 and Poissons equation to incorporate the electrical eld

Stabilization eect on the charged species. The general continuity
600 equation is
400 oCi oJi
Gi 1
200
ot ox
where Ci denote the concentration, Ji is the ux, and Gi
0
0 10 20 30 40 50 60 70 is the net generation rate of species i. The ux Ji includes
Time (s)
terms describing the Fickian diusion and the electric
eld drift motion. The electric eld arises from the
Fig. 1. Typical rapid thermal anneal temperature program, which spatial charge imbalance according to Poissons equa-
consists of a stabilization step and a spike anneal (i.e., a fast linear tion:
heating step starting 650 C, followed by a natural cool down step).
o2 w Qx
2
uniformity (see [6] and references therein). However, the ox2 e
design of the optimal RTA program that minimizes where e denote the dielectric constant and the charge
TED has adopted an ad hoc approach through trial and density Qx is given by
error and heuristics [5], due to incomplete understanding X
of TED mechanisms and correspondingly undependable Qx p n ci C i 3
i
models for describing dopant diusion and activation. A
recent comprehensive TED modeling in conjunction with p and n denoting the hole and electron concentra-
with the maximum a posteriori estimation provided not tions, respectively, and ci denoting the average charge of
only a correlative model but also a model for a priori species i (e.g., a species A which possesses a charge dis-
predictive purposes [7,8]. This paper focuses on the de- tribution of 30% A2 and 70% A has an average charge
sign of the spike anneal program that optimizes the of 30% 2 + 70% )1 )0.1). The charge distributions
junction depth subject to a constraint on the sheet re- adhere to FermiDirac statistics. The concentrations p
sistance. A model-based optimal control using the and n are assumed to be in thermal equilibrium at all
aforementioned TED model was formulated utilizing times.
dierent parameterizations of the optimal trajectory to The generation term Gi includes the formation and
elucidate the true optimal annealing program. The op- annihilation rates due to the boron activation and
timal annealing program provides the temperature set- clusters formation and dissociation reactions. The boron
points for lower level RTP/RTA controllers. activation reactions provide a pathway between mobile
In addition, nite time control policies such as the interstitial boron Bi to and from immobile activated
spike annealing programs can result in control perfor- boron (i.e., substitutional boron, Bs ):
mance that is highly sensitive to model uncertainties and
control implementation inaccuracies [9]. Worst-case ro-
bustness analysis of the resulting optimal trajectory
quanties the performance degradation with respect to
model parameter uncertainties and control implemen-
tation inaccuracies [10]. Such analysis can provide
guiding directions in improving the TED model In addition, the intermediate (Bs Sii ) also acts as a
(through additional experimentations) and in the selec- nucleation center for mixed boronsilicon clusters (see
tion of the RTA control instrumentation (sensors and below). The rates of reactions follow reactant-limited
actuators) and strategies. This is the rst time that the rate expressions, where the reaction rate constants ad-
eects of model uncertainties and control implementa- here to the Arrhenius law.
tion inaccuracies are rigorously quantied for RTA. The above view of free Bi as the main pathway of
boron diusion agrees with computational density
functional theorem (DFT) calculations [1113] and ex-
2. Transient enhanced diusion model perimental results [14,15]. In 1999, two simultaneous
DFT-based studies appeared [16,17] providing evidence
2.1. Model formulation for the intermediate (Bs Sii ) as the main pathway of
boron diusion, which was supported by subsequent
TED arises from reactiondiusion processes con- publications [1820]. This led to an explicit debate re-
sisting of Fickian diusion, electrical drift motion, and garding the dominant mechanism of boron diusion in
R. Gunawan et al. / Journal of Process Control 14 (2004) 423430 425
silicon. However, our Monte Carlo simulations using a simple picture is often inadequate. For example, a
maximum likelihood parameters [21,22] showed that the survey [27] of pre-factors for gas desorption from
diusion of boron observed in TED is most likely me- semiconductor surfaces shows that, while the average
diated by motion of free Bi . value indeed lies near 1013 s1 , only ten percent of in-
Clusters of interstitial atoms have been shown to dividual cases fall within an order of magnitude of this
form during TED [23,24]. There exists evidence sup- range. The story is roughly similar for hopping diusion
porting the formation of clusters consisting of pure on semiconductors [28]. Although the Si Debye fre-
boron [24], pure silicon [23], and mixed boronsilicon quency will be employed in the present work, clearly the
[25]. During thermal annealing, the clusters can act as use of a priori estimates of pre-factors requires caution.
reservoir (during stabilization step) and source (during Determinations of the activation energies have also
heating step) for mobile interstitial boron and silicon. proven dicult. Nevertheless, experimental and ab initio
The formation and dissociation of pure interstitial DFT estimates of the activation energies are scattered
clusters follow the reactions throughout the literature. For many of the activation
f energies, the published values show signicant variation.
I I I2 For example, experimental results for the Si self-inter-
d
g
stitial suggest diusivities that vary by more than 10
Im1 I Im orders of magnitude [29]. The reliability of the experi-
d
mental results is questionable because the mobile spe-
where I denotes the interstitial (boron and silicon) and cies, including many point and extended defects, are
the indices m denote the sizes of the clusters. The cluster dicult to observe because they exist at low concen-
formation and growth rates assume diusion-limited tration. Ab initio calculations based on density func-
kinetics in agreement with much of the literature (see for tional theory (DFT) have also proven problematic.
example [26]). On the other hand, the dissociation rate Most of the DFT calculations are valid only at 0 K.
follows a rst-order kinetic expression with a rate con- However, the assumed mechanism may change at high
stant according to the Arrhenius law. temperatures. The expressions employed here tacitly
The following reaction network describes the forma- assume a rate law based on a transition state formula-
tion and dissociation of mixed boronsilicon clusters: tion, in which an atom attempts to cross a well-dened
g^ potential barrier at a frequency comparable to a lattice
Bp1 Siq Bi Bp Siq vibrational frequency. Such formulations are usually
d^
g^
applied under conditions at modest temperatures where
Bp Siq1 Sii Bp Siq kT is much less than the barrier height. At 1050 C,
d^
however, kT 0:11 eV, which approaches the same
where p, q are integers larger than or equal to 1. The magnitude as several barriers in Table 1. Under such
formation/growth and dissociation rates of mixed clus- conditions, it becomes questionable whether the poten-
ters again follow diusion-limited and rst-order kinet- tial barrier remains well dened. Other possible high-
ics, respectively, as in the pure cluster dynamics. temperature mechanisms have been discussed for other
Detailed description of the model is available elsewhere kinds of point defects by Van Vechten [30]. Another
[8]. problem with DFT calculations is that they typically do
not account for the eects of entropy, which can change
2.2. Parameter estimates the pre-exponential factor by orders of magnitude [31].
Table 1 lists the maximum a posteriori estimates for
The TED model requires a set of pre-exponential the most important TED energetics [8]. Maximum a
factors and activation energies associated with the dif- posteriori estimation takes a Bayesian approach which
fusivities and kinetic rate constants for the boron acti- combines experimental data with a priori information,
vation reaction and cluster dissociation dynamics. in this case, from the maximum likelihood estimation
Theoretical and experimental works of TED have paid applied to published experimental and/or DFT values.
little attention to the pre-exponential factors, focusing Fig. 2 presents the after-anneal experimental data from
instead on the activation energies. In fact, pre-factors various RTA programs along with the corresponding
are notoriously dicult to estimate a priori. In simple simulation proles using the TED model in conjunction
kinetic models of elementary reactions, pre-factors rep- with the MAP estimates. In addition, Fig. 3 shows the
resent modied attempt or collision frequencies. How- agreement of the TED simulations using the MAP es-
ever, in solid there exists a distribution of vibrational timates with the experimental observations compiled
frequencies, and thus dening a single attempt frequency from the literature [32,33], and with the Sematech curve,
is dicult. One commonly used estimate is the Debye which consolidates the best junction depth and sheet
frequency, which for Si is about 6 1012 s1 . However, it resistance pairs that can be achieved through conven-
is well known from other branches of kinetics that such tional spike annealing.
Table 1
Maximum a posteriori estimates of TED energetics
TED parameters MAP estimate Std. dev.
Ediff;Bi (boron diusivity) 0.336 0.013
Ediff;Si2 (Si diusivity) 0.72 0.03
i
Eko (kick-in/kick-out) 0.439 0.041
Eki (kick-in/kick-out) 0.50 0.03
Edis (kick-in/kick-out) 0.547 0.005
E2;B (cluster dissociationpure boron) 1.806 0.023
E2 (cluster dissociationpure Si) 1.4 0.03
E3 (cluster dissociationsize 3) 2.184 0.018
E4 (cluster dissociationsize 4) 3.055 0.002
Elarge (cluster dissociationlarge Si) 3.7 0.1
Elarge;mix (cluster dissociationlarge mixed) 3.5 n/aa
a
Energy is from ML estimation of a single published value.
10
22
resistance. (Performance variables can also include
junction abruptness, prole stability to subsequent
Boron Concentration (cm-3)
SIMS
10
21
process steps, and others.) Shallower junction depths
Simulation
give improved short channel performance to minimize
10
20
leakage current, while low sheet resistance reduces
power consumption. There are many avenues to reduc-
10
19
ing the junction depth, e.g., low implant energy [34]
combined with RTA with high ramp-up rates [2,4] and
10
18
alternative doping technology such as plasma inversion
ion implantation or plasma gas-immersion laser doping
10
17 [35]. Nevertheless, RTA currently provides the most
0 10 20 30 40 50 60 70
dependable and economical path to satisfy the current
Depth (nm) needs of ultrashallow junctions.
Fig. 2. Comparison of experimental and simulated boron proles us-
This paper presents a model-based control formula-
ing the TED model in conjunction with the MAP parameters. The tion for designing the optimal temperature program that
junction depth is dened as the depth from the surface at which the minimizes the junction depth while maintaining a suit-
total boron concentration reaches 1018 atoms/cm3 : on the order of 50 able sheet resistance. The optimization variable is the
nm in the present cases. RTA temperature prole, in particular, the heating and
cooling proles and the annealing (maximum) temper-
ature. The optimal control objective chosen here is to
1000 minimize the junction depth while maintaining a satis-
Literature factory sheet resistance, which can be formulated into a
Sheet Resistance R s (/sq)
Simulation constrained optimization problem:

800
Sematech Curve
min Xj 4
600 T t
Rs 6 Rs;max
bmin 6 dT 6 bmax
dt
T 6 Tmax
400
where Xj denotes the junction depth, Rs denotes the sheet

200 resistance, and T t is the RTA temperature trajectory.
The sheet resistance is given by:
0
20 40 60 80 100 1
Junction Depth Xj (nm)
Rs R X=sq 5
q lCCx dx
Fig. 3. Comparison of the junction depth and sheet resistance data where q denotes the carrier charge, lC denotes the
from experimental studies employing various annealing conditions, the mobility (concentration dependent), and Cx is the
Sematech curve, and the TED simulations.
spatial concentration of charge carrier (i.e., activated
dopant). The following formula gives the mobility lC
3. Optimal control formulation for boron [36]:
425:6 2
The performance of a CMOS transistor depends lC 44:9 0:719 cm =V s 6
C
particularly heavily upon the junction depth and sheet 1 2:231017
In this work, it is desired to produce junctions with the and the maximum junction depth increase of
sheet resistance below Rs;max of 350 X/sq. The constraints 1=2
dXj;wc rakLV h k 15
on the temperature gradient, bmin and bmax , describe the
limits for the cooling and heating rates, respectively. On the other hand, the worst-case analysis with re-
Lamp-based RTA can produce heating rates up to 400 spect to the control implementation inaccuracies re-
C/s [2], while recent advances in RTA technology using quires a second-order series expansion:
arc lamps [37] can achieve cooling rates up to 200 C/s
dXj M du duT H du 16
[38]. The maximum temperature of the thermal anneal
Tmax is set to the melting point temperature of silicon at where
1410 C.
oXj
Mi 17
oui uû

4. Worst-case analysis o2 Xj
Hik 18
oui ouk uû
Worst-case analysis [9,10] provides tools for quanti-
because the elements of M that are not on active con-
fying the robustness of the optimal control performance
straints are equal to zero and thus the eect of such
to uncertainties in model parameters and control im-
control uncertainties only shows up in the second-order
plementation. The parametric and control uncertainties
terms. Here the worst-case junction depth increase is
are described as norm bounded perturbations du and dh
given by
such that
n o dXj;wc max jM du duT H duj 19
Eh h : h ^h dh; kW h dhk 6 1 7 dumin 6 du 6 dumax
n o This optimization problem is equivalent to [10]

Eu u : u û du; kW u duk 6 1 8
max k 20
lD N P k
where W h and W u are positive-denite weighting ma-
trices. This formulation includes uncertain parameters where k is any real number, l is the structured singular
lying within a hyperellipsoid value, the perturbation D diagfDr ; Dr ; dc g consists of
n o independent real scalar blocks Dr and a complex scalar
T
Eh;MAP h : h ^h V 1 ^ 2
h h h 6 r a 9 dc , and
2 3
where r2 a denotes chi squared distribution with a 0 0 kw
condence level, which is equivalent to Eq. (7) with N 4 kH 0 kHz 5 21
1=2 T T T
W h 1=raV h for the Euclidean norm [10]. The z H M w z Hz Mz
uncertainty formulation also includes independent with
bounds on each element for the optimal control trajec-
tory w 12dumax dumin 22
umin 6 u 6 umax 10 and
A rst-order series expansion of the junction depth z 12dumax dumin 23
with respect to the MAP parameter estimates gives Upper and lower bounds for k can be eciently com-
dXj Lh ^
h L dh 11 puted using iterative l computations or skewed-l anal-
ysis (for details, see [10]).
where dh belongs to Eh;MAP and L denote the sensitivity
coecients given by

oXj 5. Numerical implementation
Li 12
oh î hh
The TED model was implemented using Alagator
Based on this expansion, the worst-case deviation of the scripts running on a public domain FLOOPS 2000 [39].
junction depth is given by FLOOPS spatially discretizes the continuity equations
dXj;wc max jL dhj 13 to give an implicit system of ordinary dierential equa-
kW h dhk 6 1
tions. To solve the resulting ODEs, the FLOOPS dif-
where the solution to the minimization problem gives ferential equation solver uses the TR/BDF2 [39]
the worst-case parameter uncertainty vector dhwc composite method for time integration, which combines
the trapezoidal rule (TR) and the backward dierentia-
ra
dhwc 1=2
V h LT 14 tion formula (BDF). The advantages of TR/BDF2
kLV h k method are its second-order accuracy and ease of im-
plementation (one-step algorithm). In addition, it also perature of 1111 C, in agreement with experimental
has a desirable stability condition (L-stability) for solv- studies [3,34]. The use of a high annealing temperature
ing sti dierential equations [40]. with fast heating and cooling is needed since the eective
An experimental as-implanted SIMS prole provided activation energy of TED is lower than that for boron
by International Sematech was used as the initial boron activation. Although experimental studies [3,34] have
prole. The ion implantation utilized 2 1015 ions/cm2 suggested using fast heating and cooling rates, the de-
of boron at 0.60 keV with 0 tilt into intrinsic silicon. termination of the maximum annealing temperature is
The total boron was assumed to contain 20% substitu- currently done through trial-and-error experimentation
tional boron and 80% interstitial boron as suggested by and heuristics. In contrast, the optimal control formu-
experimental observations [41,42]. The initial conditions lation using the TED model provides an estimate of the
for silicon interstitials were set to track the total boron annealing temperature, and therefore will reduce the
concentration. The clusters and the Bs Sii complex were number of costly experiments.
assumed not present initially. Boundary conditions at The quadratic parameterization led to the optimal
the surface for all species assumed no ux (i.e., quadratic trajectory with an annealing temperature of
Ji jx0 0) with no surface Fermi level pinning [43]. 1144 C. The use of quadratic proles only gave minimal
The parameterizations of the optimal trajectory in- improvement of the junction depth over the optimal
clude linear and quadratic proles. In the case of linear linear program. In fact, if the optimization problem for
parameterization, the design variables consisted of the linear parameterization was solved using relaxed con-
heating and cooling rates and the annealing tempera- straints on the heating rate and cooling rates to equal
ture, which were solved by extending the golden search those of the quadratic case, the optimal annealing
method [44] to multidimensional problems. The moti-
vation of using this algorithm came from the small
number of optimization variables and the ease of in- 1200
corporating the output constraint arising from the sheet
resistance. 1100 Linear
Quadratic
The quadratic parameterization denes optimal
Temperature ( C)
1000
o
heating and cooling proles with minimum heating and

cooling rates equal to the rates obtained in the linear 900
parameterization. In this case, the maximum of the
heating and cooling rates were relaxed to 1000 and )400 800
C/s, respectively. Again, the optimization used the
700
aforementioned extension of the golden search method.
In addition, only concave quadratic proles (i.e., posi- 600
tive quadratic coecients) were considered in accord to 0 1 2 3 4
the literature evidence showing that concave RTA pro- time (s)
gram was more eective than convex proles in reducing Fig. 4. The optimal spike anneal programs employing linear and
TED [45]. quadratic parameterizations.
The worst-case analysis required computing sensitiv-
ity matrices with respect to the parameters and optimal
control trajectories. The sensitivity coecients were es-
timated using the nite dierence method [46]. The
structured singular value problem in Eq. (20) was 1022
computed using the l-Analysis and Synthesis Matlab Linear Optimal

Boron Concentration (cm-3)
1021 Quadratic Optimal

Toolbox [47]. (Linear Relaxed)
1020
6. Results and discussion
1019
Fig. 4 presents the optimal RTA programs using
linear and quadratic parameterizations of the tempera- 1018
ture trajectory, which give junction depths of 51.3 and
48.2 nm, respectively (shown in Fig. 5), and the same 1017
sheet resistance of 350 X/sq. The optimal linear heating 0 10 20 30 40 50 60 70
Depth (nm)
and cooling rates were 400 and 200 C/s, respectively,
indicating that the optimal RTA program was to heat Fig. 5. Simulations of after-anneal boron proles employing the op-
and cool as quickly as possible to the annealing tem- timal RTA programs.
temperature increased to 1146 C to give a junction 7. Conclusions

depth of 48.2 nm (see Fig. 5). That is, if the same maxi-
mum heating and cooling rates are used, the quadratic This paper has shown that the optimal RTA program
and linear parameterizations lead to the same minimum for minimizing TED while achieving the desired sheet
junction depth. Since heating and cooling rates are the resistance consists of fast linear heating and cooling
true constraints in practice, there is no benet of using proles, as suggested by many experimental studies.
quadratic over linear heating and cooling proles. In The model based optimal control directly computes
addition, the results indicate that the use of even higher the maximum annealing temperature, which avoids the
heating rates than the existing RTA technology (>400 heuristics and trial-and-error approach commonly done
C) will not bring signicant reduction in the TED. in literature and in industrial practice, reducing the
Worst-case analysis was applied to the linear optimal number of experimentations and the cost to identify the
trajectory. The model parameter uncertainties were optimal annealing program. Worst-case analysis on
quantied by the MAP covariance estimate [8]. The the optimal junction depth suggests the need to improve
analysis of control implementation inaccuracies used existing metrology and RTA controllers to minimize
control trajectory perturbations of 5, 10, and 15 C at control implementation inaccuracies.
ve temperatures along the heating and cooling ramps
(660, 800, 950, 1050, 1100, 1112 C). Table 2 presents the
worst-case junction depth increases due to uncertainties Acknowledgements
in model parameters and control implementation.
The analysis indicates that the deviations from the Support from International Sematech and the Na-
optimal junction depth from model parameter uncer- tional Center for Supercomputing Applications (NCSA)
tainties are minimal, whereas the junction depth increase is gratefully acknowledged.
due to control errors are moderate to signicant. The
result suggests that the MAP estimation provided pa-
rameter estimates with sucient accuracy for use in the References
optimal control study. In practice, control errors can
[1] D.F. Downey, S.W. Falk, A. Bertuch, F.S.D. Marcus, Eects of
arise from many sources. Inaccuracies from temperature
fast rapid thermal anneals on sub-keV boron and BF2 ion
measurements using a pyrometer are in the order of 3 implants, Journal of Electronic Materials 28 (1999) 13401344.
15 C. The typical RTA controllers make approximately [2] S. Shishiguchi, A. Mineji, T. Hayashi, S. Saito, Boron implanted
100 temperature measurements every second [48], which shallow junction formation by high-temperature/short-time/high-
translates to a corrective control action every 4 C at a ramping-rate (400 C/sec) RTA, in: Symposium on VLSI Tech-
heating rate of 400 C/s. Temperature non-uniformity nology, Kyoto, Japan, 1997. Paper 7B-3.
[3] G. Mannino, P.A. Stolk, N.E.B. Cowern, W.B. de Boer, A.G.
across the wafer on average can cause control error of 10 Dirks, F. Roozeboom, J.G.M. van Berkum, P.H. Woerlee, N.N.
C. Another source of control implementation errors Toan, Eect of heating ramp rates on transient enhanced diusion
which is not easily quantied as a temperature devia- of ion-implanted silicon, Applied Physics Letters 78 (2001) 889
tion, is the rounding of the temperature trajectory at the 891.
top of spike annealing due to physical limitations. Ac- [4] G. Gelpey, K. Elliot, D. Camm, S. McCoy, J. Ross, D.F.
Downey, E.A. Arevalo, Advanced annealing for sub-130 nm
counting for all sources of control inaccuracies, the junction formation, in: 201st Meeting of the Electrochemical
control implementation errors could easily exceed 15 C Society, Philadelphia, PA, 2002. Abstract 735.
at any given time. From the worst-case analysis, control [5] S.C. Jain, W. Schoenmaker, R. Lindsay, P.A. Stolk, S. Decoutere,
errors of this magnitude can lead to junction depth in- M. Willander, H.E. Maes, Transient enhanced diusion of boron
crease as high as 10 nm, which is in similar magnitude to in Si, Journal of Applied Physics 91 (2002) 89198941.
[6] T.F. Edgar, S.W. Butler, W.J. Campbell, C. Pfeier, C. Bode, S.B.
the junction depth increase from the TED itself. This Hwang, K.S. Balakrishnan, J. Hahn, Automatic control in
nding indicates that existing metrology and controllers microelectronics manufacturing: practices, challenges, and possi-
for implementing RTA programs need improvement as bilities, Automatica 36 (2000) 15671603.
future junction depth requirements necessitate further [7] M.Y.L. Jung, E.G. Seebauer, New physics for modeling transient
reduction of junction depth, and thus smaller control enhanced diusion in RTP, in: Proceedings of the Rapid Thermal
and other Short-Time Processing Technologies, vol. 2000-9, ECS,
errors. Toronto, Canada, 2000, pp. 1520.
[8] R. Gunawan, M.Y.L. Jung, E.G. Seebauer, R.D. Braatz, Max-
imum a posteriori estimation of transient enhanced diusion
energetics, AIChE Journal 49 (2003) 21142123.
Table 2 [9] D.L. Ma, S.H. Chung, R.D. Braatz, Worst-case performance
Worst-case analysis of junction depth deviations (in nm) with respect analysis of optimal batch control trajectories, AIChE Journal 45
to uncertainties in model parameters and control implementation (1999) 14691476.
[10] D.L. Ma, R.D. Braatz, Worst-case analysis of nite-time control
dh jduj 6 5 C jduj 6 10 C jduj 6 15 C
policies, IEEE Transactions on Control Systems Technology 9
0.13 1.89 5.15 9.78 (2001) 766774.
[11] C.S. Nichols, G.F.A. van de Walle, S.T. Pantelides, Mechanisms [30] J.A. Van Vechten, Activation enthalpy of recombination-
of dopant impurity diusion in silicon, Physical Review B 40 enhanced vacancy migration in Si, Physical Review BCon-
(1989) 54845496. densed Matter 38 (1988) 99139919.
[12] J. Zhu, T.D. de la Rubia, L.H. Yang, C. Mailhiot, G.H. Gilmer, [31] J.A. Van Vechten, C.D. Thurmond, Entropy of ionisation and
Ab initio pseudopotential calculations of B diusion and pairing temperature variation of ionisation levels of defects in semicon-
in Si, Physical Review B 54 (1996) 47414747. ductors, Physical Review B (Solid State) 14 (1976) 35393550.
[13] J. Zhu, Ab initio pseudopotential calculations of dopant diusion [32] A. Agarwal, H.-J. Gossmann, A.T. Fiory, Eect of ramp rates
in Si, Computational Materials Science 12 (1998) 309318. during rapid thermal annealing of ion implanted boron for
[14] N.E.B. Cowern, G.F.A. van de Walle, D.J. Gravesteijn, C.J. formation of ultra-shallow junctions, Journal of Electronic
Vriezema, Experiments on atomic-scale mechanisms of diusion, Materials 28 (1999) 13331339.
Physical Review Letters 67 (1991) 212215. [33] A. Agarwal, H.-J. Gossmann, A.T. Fiory, Ultra-shallow junctions
[15] N.E.B. Cowern, G.F.A. van de Walle, P.C. Zalm, D.J. Oostra, by ion implantation and rapid thermal annealing: spike-anneals,
Reactions of point defects and dopant atoms in silicon, Physical ramp rate eects, Material Research Society Symposium Proceed-
Review Letters 69 (1992) 116119. ings 568 (1999) 1930.
[16] W. Windl, M.M. Bunea, R. Stumpf, S.T. Dunham, M.P. [34] A. Agarwal, Ultra-shallow junction formation using conventional
Masquelier, First-principles study of boron diusion in silicon, ion implantation and rapid thermal annealing, in: International
Physical Review Letters 83 (1999) 43454348. Conference on Ion Implantation Technology, Tyrol, Austria,
[17] B. Sadigh, T.J. Lenosky, S.K. Theiss, M.-J. Caturla, T.D. de la 2000, pp. 293299.
Rubia, M.A. Foad, Mechanism of boron diusion in silicon: an [35] E.C. Jones, E. Ishida, Shallow junction doping technologies for
ab initio and kinetic monte carlo study, Physical Review Letters ULSI, Materials Science and Engineering R 24 (1998) 180.
83 (1999) 43414344. [36] B.V. Zeghbroeck, Principles of Semiconductor Devices, 2002.
[18] P. Alippi, L. Colombo, P. Ruggerone, Energetics and diusivity of Available from <http://ece-www.colorado.edu/~bart/book/>.
atomic boron in silicon by density-functional-based tight-binding [37] B. Peuse, A. Rosekrans, J. Gelpey, P. Dick, Dynamic temper-
simulations, Computational Materials Science 22 (2001) 4448. ature uniformity of RTP using single zone illumination, in: 1st
[19] P. Alippi, L. Colombo, P. Ruggerone, A. Sieck, G. Seifert, T. International Rapid Thermal Processing Conference, 1993,
Frauenheim, Atomic-scale characterization of boron diusion in p. 443.
silicon, Physical Review B 64 (2001) 075207. [38] Vortek Industries Ltd., private communications, 2002.
[20] J.-W. Jeong, A. Oshiyama, Atomic and electronic structures of a [39] M.E. Law, S.M. Cea, Continuum based modeling of silicon
boron impurity and its diusion pathways in crystalline Si, integrated circuit processing: An object oriented approach,
Physical Review B 64 (2001) 235204. Computational Materials Science 12 (1998) 289308.
[21] M.Y.L. Jung, R. Gunawan, R.D. Braatz, E.G. Seebauer, Pair [40] R.E. Bank, W.M. Coughran Jr., W. Fichtner, E.H. Grosse, D.J.
diusion and kick-out: relative contributions to diusion of boron Rose, R.K. Smith, Transient simulation of silicon devices and
in silicon, Journal of Electrochemical Society, submitted for circuits, IEEE Transactions on Electron Devices ED-32 (1985)
publication. 19922007.
[22] R. Gunawan, M.Y.L. Jung, R.D. Braatz, E.G. Seebauer, Para- [41] M.J. Caturla, M.D. Johnson, T.D. de la Rubia, The fraction of
meter sensitivity analysis applied to modeling transient enhanced substitutional boron in silicon during ion implantation and
diusion and activation of boron in silicon, Journal of Electro- thermal annealing, Applied Physics Letters 72 (1998) 27362738.
chemical Society, in press. [42] H. Kobayashi, I. Nomachi, S. Kusanagi, F. Nishiyama, Lattice
[23] P.A. Stolk, H.J. Gossmann, D.J. Eaglesham, D.C. Jacobson, C.S. site location of ultra-shallow implanted B in Si using ion beam
Raerty, G.H. Gilmer, M. Jaraiz, J.M. Poate, H.S. Luftman, T.E. analysis, in: K.S. Jones, M.D. Giles, P. Stolk, J. Matsuo, E.C.
Haynes, Physical mechanisms of transient-enhanced dopant Jones (Eds.), Si Front-End ProcessingPhysics and Technology
diusion in ion-implanted silicon, Journal of Applied Physics 81 of Dopant-Defect Interactions III, Materials Research Society,
(1997) 60316050. Inc., Warrendale, PA, 2001, p. J5.3.
[24] E.J.H. Collart, A.J. Murrell, M.A. Foad, J.A. van den Berg, S. [43] M.Y.L. Jung, R. Gunawan, R.D. Braatz, E.G. Seebauer, Surface
Zhang, D. Armour, R.D. Goldberg, T.S. Wang, A.G. Cullis, T. fermi level pinning: An electrical valve in transient enhanced
Clarysse, W. Vandervorst, Cluster formation during annealing of diusion, in: Materials Research Society Spring Meeting, San
ultra-low-energy boron-implanted silicon, Journal of Vacuum Francisco, CA, 2001. Presentation J4.21.
Science and Technology B 18 (2000) 435439. [44] W.H. Press, B.P. Flannery, S.A. Teukolsky, W.T. Vetterling,
[25] T.E. Haynes, D.J. Eaglesham, P.A. Stolk, H.J. Gossmann, D.C. Numerical Recipes in C: The Art of Scientic Computing,
Jacobson, J.M. Poate, Interactions of ion-implantation-induced Cambridge University Press, New York, NY, 1992.
interstitials with boron at high concentrations in silicon, Applied [45] Z. Nenyei, C. Grunwald, W. Lerch, J. Niess, D.F. Downey, R.
Physics Letters 69 (1996) 13761378. Ostermeir, Competitive process analysis and some new optimiza-
[26] K.J. Laidler, Chemical Kinetics, Harper & Row, New York, NY, tion methods in RTP, in: Proceedings of the 6th International
1987. Conference on Advanced Thermal Processing of Semiconductors,
[27] Z. Wang, E.G. Seebauer, Estimating pre-exponential factors for Kyoto, Japan, 1998.
desorption from semiconductors: consequences for a priori [46] A. Varma, M. Morbidelli, H. Wu, Parametric Sensitivity in
process modeling, Applied Surface Science 181 (2001) 111120. Chemical Systems, Cambridge University Press, Melbourne,
[28] E.G. Seebauer, M.Y.L. Jung, Surface diusion on metals, Australia, 1999.
semiconductors and insulators, Chapter II, in: H.P. Benzel [47] G.J. Balas, J.C. Doyle, K. Glover, A. Packard, R. Smith, l-
(Ed.), Landolt-B ornstein Numerical Data and Functional Rela- Analysis and Synthesis Toolbox, The MathWorks, Inc., Natick,
tionships, Vol. III/42A, Springer Verlag, New York, 2001. MA, 2001.
[29] D. Eaglesham, Dopants defects and diusion, Physics World 8 [48] P.J. Timans, Mattson Technology, private communications,
(1995) 4145. 2002.

Optimal Control of Rapid Thermal Annealing in A Semiconductor Process

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Optimal Control of Rapid Thermal Annealing in A Semiconductor Process

Uploaded by

Copyright:

Available Formats

Journal of Process Control 14 (2004) 423430

Optimal control of rapid thermal annealing

1. Introduction The state-of-the-art in post-implant annealing em-

1200 reaction networks including boron activation and in-

800 and Poissons equation to incorporate the electrical eld

Simulation constrained optimization problem:

where Xj denotes the junction depth, Rs denotes the sheet

n o This optimization problem is equivalent to [10]

heating and cooling proles with minimum heating and

computed using the l-Analysis and Synthesis Matlab Linear Optimal

1021 Quadratic Optimal

temperature increased to 1146 C to give a junction 7. Conclusions

You might also like