Professional Documents
Culture Documents
n
e e
e
n
e
n
i
i
data
n
i
i
data
, (2)
where n (= 126) is the sample size. Table 1 shows that the average errors were not much different between Case 0
and Case 1, 5.51% and 5.34%, respectively. In terms of computational cost, Cases 0 and 1 took almost the same
CPU time per GENESIS run since the only difference is the initial design point.
3. Probabilistic Modeling of Optimization Error
With multiple structural optimization runs available, we can obtain a data driven model of the optimization
error by fitting a probability distribution. We use the maximum likelihood estimation (MLE) method for the
distribution fit
8
. In MLE, we find a vector of distribution parameters, , to maximize the likelihood function, l(),
which is a product of the probability density function, f, over the sample data x
i
(i = 1, , n),
( ) ( ). ;
1
i
n
i
x f l
(3)
The quality of fit is checked via the
2
goodness-of-fit test
8
, which is essentially a comparison of histograms
between the data and the fit. The test results will be given in terms of the p-value. A p-value near one implies a good
fit and a small chance that the data is inconsistent with the distribution. Conversely, a small p-value implies a poor
fit and a high chance that the data is inconsistent with the distribution.
Considering the one-sidedness of the optimization error, we selected the Weibull distribution
8, 9
, which is
defined by a shape parameter, , and a scale parameter . The probability density function (PDF) of the Weibull
distribution is
'
<
,
_
,
_
0 0
0 exp
) (
1
x if
x if
x
x
x f
. (4)
Once we obtain the fit parameters, and , estimates of mean and standard deviation of e can be calculated from
, )
1
(
1
)
2
( 2 ,
1
2
2
'
1
]
1
,
_
fit fit
(5)
where is the gamma function.
4. Direct Fit of Probabilistic Model to Optimization Error
A straightforward approach to finding a probabilistic model of the optimization error is to fit a model
distribution to e, calculated from Eq. 1. This approach is denoted a direct fit. The Weibull model was fitted to Case 0
and Case 1, and the results are summarized in Table 2. p-values of the
2
test indicated a poor fit for Case 0, while
the fit was acceptable for Case 1 with a 5% confidence level. Figure 2 compares histograms of the optimization
error, e, with the predicted frequencies from the fitted Weibull models. It is seen that the error distribution has a
mode near zero and decreases rapidly for large error. The Weibull fits give reasonable descriptions of the error
distribution for both Case 0 and Case 1, although the
2
test implied an unsatisfactory fit for Case 0.
The average errors estimated from the fit,
fit
ds x s h s g x f
2 1 2 1
(8)
Note that the optimization difference x is easily calculated from W
s
1
and W
s
2
that are readily available. Then, we can
fit Eq. 8 to the optimization differences via MLE.
The indirect fit was performed using the Weibull distribution on the pair of Cases 0 and 1. Recall that
relatively large perturbations (multiplication factors between 0.1 1.9) were applied to the initial design point. The
large perturbation was intended to reduce dependence (correlation) of W
s
between Case 0 and Case 1. The
2
test on
the optimization difference indicated a reasonable fit with a p-value of 0.5494. From the indirect fit, we estimate the
mean and standard deviation of the optimization error of each of the two cases involved. Table 3 shows that the
estimates of mean error by the indirect fit,
fit