Professional Documents
Culture Documents
Lecture 10/10
P Calculation of gradients: sensitivity analysis <Analytical, numericaland semi-analytical methods <Stable perturbation intervals <Accuracyproblems P Multicriterion problems <Pareto-optimality <Bound formulation P Response surface methods < Numericalexperiments < The curse of dimensionality To Read: PArora:8.3 PErik Lund:Design sensitivity analysis of finite element discretized structures PJuhani Koski: Multicriterion structural optimization PRasmussen& Lund: An information management system for the formulation and solution ofmultidiscipinary optimization problems PMorten Johansen et.al.: Response surface approximations for shape optimization of a 3Dmanifold
Optimization methods
- almost always rely on gradients
Search methods (feasible directions, penalty, etc):
x( k + 1) = x(k ) + s( k ) s( k ) = f ( g i (x( k ) ))
Gradients
The gradient is a vector of partial derivatives
g g g g i = i , i , .. , i xn x1 x2
P If you know the analytical expressions for gi, then you can differentiate analytically. This is called the analytical method and is always the best and most efficient option, but unfortunately not possible in most cases. P If gi is purely numerically defined, then you can compute the gradient numerically. This takes time and introduces truncation and round-off errors, but it may be the only option. This is called overall finite difference . P If you know some things about the nature of gi but not the full expressions, then it may be possible to work analytically some of the way. This is called the semi-analytical method .
g i g i (x x j ) x j xj
gi g i ( x + x j ) gi (x x j ) x j 2 x j
g(x)
g(x+ )x)
g g x x for x 0
so we should use a small )x. The size, however is limited by the finite accuracy of the computer.
The cost of forward (or backward) FD is one additional analysis per design variable. The cost o f central FD is two additional analyses for each variable.
Accuracy considerations
Choosing the perturbation size
g x
Choosing the right value of )x is a compromise between round-off errors and truncation errors.
Stable interval
g x
Truncation errors
Round-off errors
)x Depending on the nature of the problem, the stable interval may be large, small or even non-existing.
J ohn Rasmussen, I nstitute of Mechanical Engineering, Aalborg University, 2001
, f=
f +f
e
Ku = f
Where K is a stiffness matrix, f is the load vector, and u is the vector of unknown displacements. We imagine that the problem depends on design variables x = { x1, x2, .. , xn}, and we want to find the sensitivity w.r.t. xj.
K u f u+ K = x j x j x j
We rearrange to find the displacement sensitivitites:
u f K = u x j x j x j
Ku = f
u f K = u x j x j x j
We are solving a system with the same K as before, so we can re-use the same factorization. We do not have to perform the time-consuming operation K 6LDLT again!
f K and xj x j
analytically, then we are performing analytical sensitivity analysis. If we compute
f f K K and x j x j x j x j
then we are using the so-called semi-analytical method. Notice that the FD computation of these terms is not very time consuming. It is only proportional to the problem size.
Ku = f
Notice that this is not necessarily a finite element problem. This type of sensitivity analysis will apply to any numerical problem where the main operation is solution of a large system of equations. The method even applies if the system of equations is nonlinear, and the equations to find sensitivities still become linear. For large systems of equations, sensitivity analysis is much cheaper than ordinary analysis because we do not have to factorize again. This means that sensitivity analysis can be a very cheap way of getting a lot of information about the problem.
Case Story
Semi-analytical DSA in auto design A young engineering intern at Ford in Michigan was given the assignment of DSA of a car body skeleton. He modeled the frame with beam elements and used the semianalytical method to find sensitivities. His results turned out to be completely wrong, but nobody could figure out why. In fact, the more accurately he modeled the frame, the more wrong were the results.
Case Story
Simple model example
The problem is easier to understand if we make a simpler example:
n beam elements M L
We wish to study the sensitivity of the end deflection w.r.t. the beam length, L. Notice that this problem is solved exactly within the Bernoulli-Euler theory for any number of elements, n. It is very simple to compute the analytical solution to this problem.
J ohn Rasmussen, I nstitute of Mechanical Engineering, Aalborg University, 2001
Case Story
Error dependency on number of elements
It turns out that the error grows with the square of the number of elements! Even a small number of elements makes the result completely wrong. The explanation is that beam problems are inherently ill-conditioned because they have both displacements and rotations in the u vector and both forces and moments in the f vector. But why does the problem get worse when we increase the number of elements?
Error
Case Story
Why does a large number of elements cause problems?
To understand why the problem is related to the number of elements, we must study the nature of the pseudo-load vector for the beam problem.
u f K K = u x j x j x j
Pseudo-load vector
The resulting tip deflection is the difference between the contributions from all the forces in different directions. The more elements, the more forces. If all the forces are a little bit wrong, then the resulting deflection is a small difference between two large erroneous numbers. J ohn Rasmussen, I nstitute of Mechanical Engineering, Aalborg University, 2001
Case Story
What are the possible solutions? There are basically two ways of eliminating this problem: 1. Eliminate errors in the pseudo-load vector. This means more analytical work. 2. Do not introduce FD errors until after the equatio n solution has been done -> overall finite difference.
Case Story
When to watch out for erroneous sensitivities
P When the numerical problem is ill-conditioned. This happens often if the physical problem has very different properties in different directio ns. A beam, for instance has a high axial stiffness and a lo w lateral stiffness. P When the problem involves properties with different dimensions, for instance
< Forces and moments < Displacements and rotations < Pressures and velocities
Multicriterion Problems
- when you cannot decide
A well-posed design optimization problem has only one objective function. However, it can sometimes be difficult to decide which is the right one. In many cases, we have several properties that we would like to improve simultaneously. If we do so, we have not posed the problem correctly, and it will have many solutions. But it is a real issue in engineering design that you often cannot decide which is the best formulation before you see the final solutions.
Flexibility
Any bicycle on the curve is optimal w.r.t. weight and flexibility. But which one is really the best?
J ohn Rasmussen, I nstitute of Mechanical Engineering, Aalborg University, 2001
Pareto optimality
A curve like this is called a Pareto curve. If we had three objectives instead of two, it would be a surface. For a multicriterion problem to be Pareto-optimal, is must fulfil the following requirement: It is not possible to decrease any objective function further without increasing at least one of the others.
Flexibility
Knowing the Pareto cur ve or sur face is extremely useful in practical design, but it can be difficult to obtain; each point on the curve corresponds to solving one optimization problem.
PMinimize the mean value: PMinimize the RMS: PMinimize the maximum value:
g
j=1
2 0j
f (g 0 j ) = max(g j 0 )
Min/max problems
Minimizing the maximum is an important applicatio n in many fields, for instance
174 MPa
111 MPa
PMinimizing the maximum stress in a structure to reduce the risk of material failure PMaximizing the minimum eigenfrequency to remove a vibration problem PMaximizing the minimum pressure in a flow to remove cavitation PMinimizing the maximum velocity in a flow to make it as uniform as possible PMinimizing the maximum actuation force in a machine to allow use of smaller and cheaper components
Min/max problems
- bound formulatio n Min/max problems are very easy to solve because they can be bound formulated:
Minimize max( g 0 j ( x )) is converted to Minimize Subject to g 01( x) g 02 ( x) . . g 0n (x )
This usually creates a problem with many constraints, and it is best solved by a subproblem method, typically SLP. Convex programming based on dual methods will not work because there are too many constraints.
Numerical Experiments
- in response surface methods. The basic idea of response surface methods is to conduct a number of (numerical) experiments and use the results to learn how the problem behaves. To do this efficiently requires planning. It is easier to understand the concept through the use of an example: Optimization of a diffuser (Jens I. Madsen et al.):
Diffuser Optimization
- by response surface methods We parameterize the design boundary by a polynomial functio n:
y1 y2
y = a 4 x 4 + a3 x 3 + a2 x 2 where a4 = 1 1 1 y1 y2 + 2 4 18 5 1 a3 = y 1 + y 2 2 6 3 1 a2 = 3 y 1 y 2 + 4 9
Diffuser Optimization
Feasible domain We want to constrain the problem to monotonic wall shapes. Having the polynomial function defined, it is possible to derive that the set of monotonic wall shapes form a near-elliptical regio n in the design space.
All the dots inside the feasible domain are places where we could potentially compute function values, i.e., loss of energy. But it is expensive. We assume that the objective function is a quadratic polynomial in y1 and y2, and we want to identify the coefficients. Statisticalmethods can be used to identify that outof the candidate points, the highlighted ones will determine the coefficients with best reliability. So we choose those 10 points and perform CFD analyses to find the energy loss.
Diffuser Optimization
Identified objective function
The identified objective function is analytical and the problem can be optimized by many methods, for instance in a spreadsheet. It turns out that the optimum is rather flat, so many shapes have almost the same energy dissipation.
Assignments
Loose ends, examination, and a practical problem
Last chance to get help with previous assignments that you have not yet completed EMSD: Start defining your examination problem and let us discuss it. All others: Problem 8.8 (use a spreadsheet for the solution)