Professional Documents
Culture Documents
2.2 Derivative
2.2.1 Derivative
A function f (x) is said to be differentiable x * if the following limit exists:
f (x * + h) f (x * )
lim h0
. The value of the limit is called the derivative at
h
x * denoted by f (1) (x * ). nth derivative is denoted as f (n) (x * ).
2.2.2 Affine approximation
A function f (x) that is differentiable in the interval [a,b] can be expressed as a
sum of derivative based affine approximation and an error term as follows:
f (x) = f (a) + f (1) (a)(x a) + r(x).
!###"###
$
A( x )
The error terms r(x) tends to zero as x tends a. This follows directly from the
definition of derivative.
4
x=a
x =
x=b
x=a
x =
x
x=b
Geometric interpretation:
Differentiability ensure that there is point ( , f ( )) where
the tangent to the function is parallel to the line connecting the points
(a, f (a)) and (b, f (b)).
1 (2)
1 (m )
f (a)(b a)2 + ! +
f ( )(b a)m
2!
m!
x3
2)
2
4
2x + 3x
3) x 3 = o(x 2 )
4) x = o(1)
= o(x)
x2
xn
f (x) =
f (x)
f (x)
f (x)
x
x
x
1
2
n
f (x1 + h, x2 ,, xn ) f (x1 , x2 ,, xn )
f (x) = lim
h0
x1
h
Sometimes we write x f (x) for clarity.
Hessian
{ xx f (x)
=F(x)}i, j
2
=
f (x)
xi x j
10
g[x,d
] ( ) = dk
k=1
f (x + d)
xk
2
g[x,d
f (x + d)
] ( ) = d j dk
x j xk
j=1 k=1
n
11
12
Recap:
Let f (x) be an 2 times continously differentiable function
on an interval [ a,b ]. Then for h = b a, we have
h (1)
h 2 (2)
f (b) = f (a) + f (a) +
f (a + h)
1!
2!
for some [0,1].
Form I:
f (y) = f (x) + (y x)T f (x) + (y x)T F(x + (y x))(y x)
Form II:
2.8.2
Let g(x) be function from ! n to ! (vector input-scalar output) and let A
be and n m matrix. Then for the function h(y) = g(Ay), the gradient
is given by y h(y) = A T x g(x) x=Ay , and the Hessian is given by
yy h(y) = A T xx g(x)A x=Ay
14
xm
x1
Jacobian: J h ( x) =
#
$
#
15