
CS 229: Machine Learning

Problem Set 0

William Ma

July 20, 2017


1 Question 1
1a Part a
Given $f(x) = \frac{1}{2} x^T A x + b^T x$, where $A$ is a symmetric matrix and $b \in \mathbb{R}^n$ is a vector, we can calculate $\nabla_x f(x)$ by taking the partial derivative with respect to each $x_k$:
\begin{align*}
\frac{\partial f(x)}{\partial x_k}
&= \frac{\partial}{\partial x_k} \Big[ \frac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} x_i A_{ij} x_j + \sum_{i=1}^{n} b_i x_i \Big] \\
&= \frac{\partial}{\partial x_k} \frac{1}{2} \Big[ \sum_{i \neq k} \sum_{j \neq k} A_{ij} x_i x_j + \sum_{i \neq k} A_{ik} x_i x_k + \sum_{j \neq k} A_{kj} x_k x_j + A_{kk} x_k^2 \Big] + \frac{\partial}{\partial x_k} \sum_{i=1}^{n} b_i x_i \\
&= \frac{1}{2} \sum_{i \neq k} A_{ik} x_i + \frac{1}{2} \sum_{j \neq k} A_{kj} x_j + A_{kk} x_k + b_k \\
&= \frac{1}{2} \sum_{i=1}^{n} A_{ik} x_i + \frac{1}{2} \sum_{j=1}^{n} A_{kj} x_j + b_k \\
&= \sum_{i=1}^{n} A_{ik} x_i + b_k
\end{align*}

Now we can see that $\nabla_x f(x) = Ax + b$, where the final step above uses the symmetry of $A$ ($A_{kj} = A_{jk}$) to combine the two sums.
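As a quick numerical sanity check (not part of the derivation), a finite-difference gradient of $f$ should match $Ax + b$. The sketch below assumes NumPy; the random symmetric matrix, vector, and seed are arbitrary illustrations.

import numpy as np

rng = np.random.default_rng(0)
n = 5
M = rng.standard_normal((n, n))
A = (M + M.T) / 2                      # a random symmetric A
b = rng.standard_normal(n)
x = rng.standard_normal(n)

f = lambda v: 0.5 * v @ A @ v + b @ v  # f(x) = (1/2) x^T A x + b^T x

# central differences along each coordinate approximate the gradient
eps = 1e-6
num_grad = np.array([(f(x + eps * e) - f(x - eps * e)) / (2 * eps)
                     for e in np.eye(n)])

print(np.allclose(num_grad, A @ x + b, atol=1e-5))  # expected: True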

1b Part b
Given that $f(x) = g(h(x))$, where $g : \mathbb{R} \to \mathbb{R}$ is differentiable and $h : \mathbb{R}^n \to \mathbb{R}$ is differentiable, we can expand $\nabla_x f(x)$ componentwise to arrive at the solution.

\[
\frac{\partial f(x)}{\partial x_k} = \frac{\partial}{\partial x_k} g(h(x))
\]
By invoking the chain rule,
\[
\frac{\partial f(x)}{\partial x_k} = g'(h(x)) \frac{\partial}{\partial x_k} h(x)
\]
Combining these back into a vector,
\[
\nabla_x f(x) =
\begin{bmatrix}
g'(h(x)) \frac{\partial}{\partial x_1} h(x) \\
\vdots \\
g'(h(x)) \frac{\partial}{\partial x_n} h(x)
\end{bmatrix}
= g'(h(x)) \nabla_x h(x)
\]
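As an illustrative check of the chain rule formula (assuming NumPy; the choices $g = \sin$ and $h(x) = x^T x$ are arbitrary examples), the finite-difference gradient of $g(h(x))$ should match $g'(h(x)) \nabla_x h(x)$.

import numpy as np

rng = np.random.default_rng(1)
n = 4
x = rng.standard_normal(n)

g, g_prime = np.sin, np.cos        # example g : R -> R and its derivative
h = lambda v: v @ v                # example h(x) = x^T x, with gradient 2x
f = lambda v: g(h(v))

eps = 1e-6
num_grad = np.array([(f(x + eps * e) - f(x - eps * e)) / (2 * eps)
                     for e in np.eye(n)])

print(np.allclose(num_grad, g_prime(h(x)) * 2 * x, atol=1e-5))  # expected: True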

1c Part c
Given $f(x) = \frac{1}{2} x^T A x + b^T x$, where $A$ is a symmetric matrix and $b \in \mathbb{R}^n$ is a vector, we can calculate the Hessian as follows.
From Part a, $\frac{\partial f(x)}{\partial x_k} = \sum_{i=1}^{n} A_{ik} x_i + b_k$, so the $(l, k)$ entry of the Hessian is
\begin{align*}
\frac{\partial^2 f(x)}{\partial x_l \, \partial x_k}
&= \frac{\partial}{\partial x_l} \Big[ \sum_{i=1}^{n} A_{ik} x_i + b_k \Big] \\
&= A_{lk}
\end{align*}

Thus, $\nabla_x^2 f(x) = A$.
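The same kind of numerical check works here (assuming NumPy; the random symmetric $A$ is only an illustration): finite differences of the gradient $Ax + b$ from Part a should recover $A$.

import numpy as np

rng = np.random.default_rng(2)
n = 5
M = rng.standard_normal((n, n))
A = (M + M.T) / 2                  # a random symmetric A
b = rng.standard_normal(n)
x = rng.standard_normal(n)

grad = lambda v: A @ v + b         # gradient from Part a

# differencing the gradient along e_l gives column l of the Hessian,
# which equals row l because A is symmetric
eps = 1e-6
num_hess = np.array([(grad(x + eps * e) - grad(x - eps * e)) / (2 * eps)
                     for e in np.eye(n)])

print(np.allclose(num_hess, A, atol=1e-5))  # expected: True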

1d Part d
Given $f(x) = g(a^T x)$, where $g : \mathbb{R} \to \mathbb{R}$ is continuously differentiable and $a \in \mathbb{R}^n$ is a vector, we can calculate $\nabla_x f(x)$ using the results from Parts a and b:

\begin{align*}
\nabla_x f(x) &= g'(a^T x) \, \nabla_x (a^T x) \\
&= g'(a^T x) \, a
\end{align*}

However, for the Hessian, we have to expand, apply the chain rule to each term, and then recombine the entries into a matrix.
\begin{align*}
\frac{\partial^2 f(x)}{\partial x_i \, \partial x_j}
&= \frac{\partial}{\partial x_i} \frac{\partial}{\partial x_j} g(a^T x) \\
&= g''(a^T x) \Big( \frac{\partial}{\partial x_i} \sum_{k=1}^{n} a_k x_k \Big) \Big( \frac{\partial}{\partial x_j} \sum_{l=1}^{n} a_l x_l \Big) \\
&= g''(a^T x) \, a_i a_j
\end{align*}
\[
\nabla_x^2 f(x) =
\begin{bmatrix}
g''(a^T x) a_1 a_1 & \cdots & g''(a^T x) a_1 a_n \\
\vdots & \ddots & \vdots \\
g''(a^T x) a_n a_1 & \cdots & g''(a^T x) a_n a_n
\end{bmatrix}
= g''(a^T x) \, a a^T
\]

Thus, $\nabla_x^2 f(x) = g''(a^T x) \, a a^T$.
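A numerical sketch of both formulas (assuming NumPy; $g = \exp$ is an arbitrary example, so $g' = g'' = \exp$): finite differences of $f(x) = g(a^T x)$ should match $g'(a^T x)\,a$ and $g''(a^T x)\,aa^T$.

import numpy as np

rng = np.random.default_rng(3)
n = 4
a = rng.standard_normal(n)
x = rng.standard_normal(n)

f = lambda v: np.exp(a @ v)        # g = exp, so g' = g'' = exp

eps = 1e-4
I = np.eye(n)
num_grad = np.array([(f(x + eps * ei) - f(x - eps * ei)) / (2 * eps) for ei in I])
# second-order central differences approximate each Hessian entry
num_hess = np.array([[(f(x + eps * (ei + ej)) - f(x + eps * (ei - ej))
                       - f(x + eps * (ej - ei)) + f(x - eps * (ei + ej))) / (4 * eps**2)
                      for ej in I] for ei in I])

s = np.exp(a @ x)                  # g'(a^T x) = g''(a^T x) = exp(a^T x)
print(np.allclose(num_grad, s * a, atol=1e-4))               # expected: True
print(np.allclose(num_hess, s * np.outer(a, a), atol=1e-3))  # expected: True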

2 Problem 2
2a Part a
Proof. Given $z \in \mathbb{R}^n$ and $A = zz^T$, we have $A \in S_+^{n \times n}$ if $A = A^T$ and $x^T A x \geq 0$ for all $x \in \mathbb{R}^n$.

For symmetry,
\[
A^T = (zz^T)^T = (z^T)^T z^T = zz^T = A
\]
Thus, $A = A^T$.

For the quadratic form, for any $x \in \mathbb{R}^n$,
\[
x^T A x = x^T z z^T x = (x^T z)(x^T z)^T = (x^T z)^2 \geq 0
\]
Thus, since $A = A^T$ and $x^T A x \geq 0$, $A \in S_+^{n \times n}$.
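A small numerical illustration (assuming NumPy; $z$ and the test vectors are random examples): $zz^T$ should be symmetric, and every sampled quadratic form $x^T (zz^T) x$ should be nonnegative.

import numpy as np

rng = np.random.default_rng(4)
n = 5
z = rng.standard_normal(n)
A = np.outer(z, z)                    # A = z z^T

print(np.allclose(A, A.T))            # symmetry: expected True
xs = rng.standard_normal((1000, n))   # many random test vectors x
quad = np.einsum('ij,jk,ik->i', xs, A, xs)  # x^T A x for each row x of xs
print(np.all(quad >= -1e-12))         # nonnegative up to round-off: expected True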

2b Part b
Given that $z \in \mathbb{R}^n$ is a non-zero vector and $A = zz^T$, the null space of $A$ consists exactly of the vectors orthogonal to $z$, since
\[
Ax = zz^T x = z(z^T x) = 0
\]
only when $z^T x = 0$ (because $z \neq 0$). The set $\{x \in \mathbb{R}^n : z^T x = 0\}$ is a subspace of dimension $n - 1$, so the null space of $A$ has dimension $n - 1$. Using the rank-nullity theorem, the rank of $A$ is $1$.
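A small check of the rank claim (assuming NumPy; the specific $z$ and the orthogonal test vector are arbitrary examples):

import numpy as np

z = np.array([1.0, -2.0, 3.0, 0.5])   # any non-zero vector
A = np.outer(z, z)                    # A = z z^T

print(np.linalg.matrix_rank(A))       # expected: 1
x = np.array([2.0, 1.0, 0.0, 0.0])    # orthogonal to z, since z^T x = 0
print(A @ x)                          # expected: the zero vector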

2c Part c
Proof. Given $A \in S_+^{n \times n}$ and an arbitrary $B \in \mathbb{R}^{m \times n}$, first check symmetry:
\[
(BAB^T)^T = (B^T)^T A^T B^T = BAB^T
\]
Thus, $BAB^T = (BAB^T)^T$.

Next, for any $x \in \mathbb{R}^m$,
\[
x^T BAB^T x = (x^T B) A (x^T B)^T
\]
Since $A \in S_+^{n \times n}$, $y^T A y \geq 0$ for every $y \in \mathbb{R}^n$. We can simply let $y = (x^T B)^T = B^T x$ for $(x^T B) A (x^T B)^T \geq 0$ to be true. Thus, since $BAB^T = (BAB^T)^T$ and $x^T BAB^T x \geq 0$, $BAB^T \in S_+^{m \times m}$.
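A numerical spot check (assuming NumPy; $A$ is built as $CC^T$ purely to obtain a random PSD matrix): the eigenvalues of $BAB^T$ should be nonnegative up to floating-point error.

import numpy as np

rng = np.random.default_rng(5)
n, m = 5, 3
C = rng.standard_normal((n, n))
A = C @ C.T                           # a random symmetric PSD matrix
B = rng.standard_normal((m, n))

M = B @ A @ B.T
print(np.allclose(M, M.T))                      # symmetry: expected True
print(np.all(np.linalg.eigvalsh(M) >= -1e-10))  # eigenvalues >= 0: expected True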

3 Problem 3
3a Part a
Proof. Given that $A$ is diagonalizable, so that $A = T \Lambda T^{-1}$ with $\Lambda$ diagonal, and that $t^{(i)} \in \mathbb{R}^n$ is the $i$-th column of $T$,
\[
A t^{(i)} = T \Lambda T^{-1} t^{(i)}
\]
Since $T^{-1} T = I$, multiplying $T^{-1}$ by $t^{(i)}$, the $i$-th column of $T$, always returns $e_i$, the $i$-th column of the identity matrix. Thus,
\[
A t^{(i)} = T \Lambda T^{-1} t^{(i)} = T \Lambda e_i = \lambda_i T e_i = \lambda_i t^{(i)}
\]
Thus, $A t^{(i)} = \lambda_i t^{(i)}$, where $(t^{(i)}, \lambda_i)$ is an eigenvector/eigenvalue pair of $A$.
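A sketch of a numerical check (assuming NumPy; $T$ and $\Lambda$ are random illustrations): build $A = T \Lambda T^{-1}$ and confirm each column of $T$ is an eigenvector with the matching diagonal entry of $\Lambda$ as its eigenvalue.

import numpy as np

rng = np.random.default_rng(6)
n = 4
T = rng.standard_normal((n, n))       # a random T is invertible with probability 1
lam = rng.standard_normal(n)          # diagonal entries of Lambda
A = T @ np.diag(lam) @ np.linalg.inv(T)

for i in range(n):
    t_i = T[:, i]
    print(np.allclose(A @ t_i, lam[i] * t_i))   # expected: True for every i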

3b Part b
Proof. Given that $A$ is symmetric, so that $A = U \Lambda U^T$ with $U$ orthogonal and $\Lambda$ diagonal, and that $u^{(i)} \in \mathbb{R}^n$ is the $i$-th column of $U$,
\[
A u^{(i)} = U \Lambda U^T u^{(i)} = U \Lambda U^{-1} u^{(i)}
\]
since $U^T = U^{-1}$ for an orthogonal matrix. We can use the result from Part a to get $A u^{(i)} = \lambda_i u^{(i)}$, where $(u^{(i)}, \lambda_i)$ is an eigenvector/eigenvalue pair of $A$.
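The symmetric case can be checked numerically as well (assuming NumPy; the random symmetric matrix is only an illustration): np.linalg.eigh returns the eigenvalues and an orthogonal $U$, and each column of $U$ should satisfy $A u^{(i)} = \lambda_i u^{(i)}$.

import numpy as np

rng = np.random.default_rng(7)
n = 4
M = rng.standard_normal((n, n))
A = (M + M.T) / 2                     # a random symmetric matrix

lam, U = np.linalg.eigh(A)            # A = U diag(lam) U^T with U orthogonal
print(np.allclose(U.T @ U, np.eye(n)))  # U is orthogonal: expected True
print(np.allclose(A @ U, U * lam))      # A u_i = lam_i u_i for every column: expected True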

3c Part c
Proof. Given $A \in S_+^{n \times n}$ with $A = U \Lambda U^T$ as in Part b, and an eigenvalue $\lambda_i$ of $A$, for any $x \in \mathbb{R}^n$,
\[
0 \leq x^T A x = x^T U \Lambda U^T x = (U^T x)^T \Lambda (U^T x)
\]
Since $U$ is invertible, $y = U^T x$ ranges over all of $\mathbb{R}^n$ as $x$ does, so for every $y$ we have $y^T \Lambda y = \sum_{j=1}^{n} \lambda_j y_j^2 \geq 0$ because $\Lambda$ is diagonal. Choosing $y = e_i$ gives $\lambda_i \geq 0$.
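Numerically (assuming NumPy; the PSD matrix is built as $CC^T$ only for illustration), the eigenvalues returned by np.linalg.eigvalsh should all be nonnegative up to round-off.

import numpy as np

rng = np.random.default_rng(8)
n = 5
C = rng.standard_normal((n, n))
A = C @ C.T                           # a random symmetric PSD matrix

lam = np.linalg.eigvalsh(A)           # eigenvalues of A in ascending order
print(np.all(lam >= -1e-10))          # expected: True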
