Organizational Issues
Assessment:
CS189
Homework 40%
Implementation and application of methods. (Kaggle)
Mathematical/reinforcement of concepts.
Seven total.
Late policy: 5 slip days total. That's it.
Midterm 20%
(Thursday, March 19, in the lecture slot.)
Final Exam 40%
Assessment:
CS289A (plus a project)
Homework 40%
Midterm 20%
Final Exam 20%
Final Project 20%
(due Friday, May 1. Proposal due Friday, April 3.)
(Real) Prerequisites:
Math53 (vector calculus); Math54 (linear algebra); CS70 (discrete
math, probability); CS188 (more probability, decision theory).
No screens in lectures. (To see why, google "laptops in class".)
Ethics:
Discussion of homework problems with other students is encouraged.
All homeworks must be written individually (including programming
components).
Please read the department policy on academic dishonesty. We will be
actively checking for plagiarism.
Texts
Trevor Hastie, Robert Tibshirani, Jerome Friedman.
The Elements of Statistical Learning: Data Mining, Inference, and Prediction.
Second Edition. Springer Series in Statistics.
Learn
Use experience to improve performance:
data, queries, interaction, experiments
Statistical issues are central.
Systems
Computational issues are also central.
Algorithms, optimization.
Problems
Methods
Concepts
Classification
[Figures omitted. Sources: microsoft.com, apple.com, ESL, ISLR.]
Regression
[Figures omitted. Source: ESL.]
Density Estimation
[Figures omitted. Source: ESL.]
Dimensionality Reduction
[Figures omitted. Source: ESL.]
Clustering
[Figures omitted. Source: ESL.]
Methods
Linear classifiers: Perceptron
Support vector machines
Gaussian class conditionals
Logistic regression
Naive Bayes
Linear discriminant analysis
Linear regression
Decision trees, regression trees
Ensemble methods
Neural networks
Nearest neighbor
Principal components analysis
k-means clustering
[Slide annotations: Classification; Regression; Probabilistic modeling.]
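To make the first method on this list concrete, here is a minimal sketch of the classic perceptron update rule in plain Python. The function names (`train_perceptron`, `predict`) and the toy data are hypothetical, chosen only for illustration; the slides do not prescribe an implementation. The sketch assumes labels in {+1, -1} and linearly separable data.

```python
def train_perceptron(points, labels, max_epochs=100):
    """Perceptron updates on (point, label) pairs; labels must be +1 or -1."""
    w = [0.0] * len(points[0])  # weight vector
    b = 0.0                     # bias term
    for _ in range(max_epochs):
        mistakes = 0
        for x, y in zip(points, labels):
            score = sum(wi * xi for wi, xi in zip(w, x)) + b
            if y * score <= 0:  # misclassified, or exactly on the boundary
                w = [wi + y * xi for wi, xi in zip(w, x)]
                b += y
                mistakes += 1
        if mistakes == 0:  # a full pass with no updates: stop
            break
    return w, b


def predict(w, b, x):
    """Return +1 or -1 depending on the side of the hyperplane."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) + b > 0 else -1


# Hypothetical toy data: two linearly separable groups in the plane.
points = [(2.0, 1.0), (3.0, 2.5), (2.5, 3.0),
          (-1.0, -0.5), (-2.0, -1.5), (-0.5, -2.0)]
labels = [1, 1, 1, -1, -1, -1]

w, b = train_perceptron(points, labels)
predictions = [predict(w, b, x) for x in points]
```

On separable data like this the loop stops as soon as one full pass makes no mistakes; on non-separable data it simply runs for `max_epochs` passes.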
Concepts
Optimization.
Convexity.
(Stochastic) gradient methods.
Newton's method.
Controlling complexity:
Bias-variance/approximation-estimation trade-off.
Regularization.
Priors.
Practical issues:
Train/validate/test. Over-fitting.
Resampling methods.
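As a rough illustration of the optimization items above, the following sketch compares plain gradient descent with Newton's method on a one-dimensional convex objective. The objective f(w) = log(1 + exp(-w)) + 0.5 * w^2 (an L2-regularized logistic loss on a single example) is an assumption chosen for illustration, as are the step size, tolerance, and function names; none of them come from the slides.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def grad(w):
    # First derivative of f(w) = log(1 + exp(-w)) + 0.5 * w**2
    return -sigmoid(-w) + w


def hess(w):
    # Second derivative of f; always at least 1, so f is strictly convex
    s = sigmoid(w)
    return s * (1.0 - s) + 1.0


def gradient_descent(w, step=0.5, tol=1e-10, max_iter=1000):
    """Fixed-step gradient descent; returns (minimizer, iterations used)."""
    for k in range(max_iter):
        g = grad(w)
        if abs(g) < tol:
            return w, k
        w -= step * g
    return w, max_iter


def newton(w, tol=1e-10, max_iter=1000):
    """Newton's method: rescale each step by the inverse second derivative."""
    for k in range(max_iter):
        g = grad(w)
        if abs(g) < tol:
            return w, k
        w -= g / hess(w)
    return w, max_iter


w_gd, steps_gd = gradient_descent(5.0)
w_nt, steps_nt = newton(5.0)
```

Both routines reach the same minimizer (near w = 0.4), but Newton's method uses curvature information and needs fewer iterations on this problem. The quadratic term plays the role of a regularizer: it keeps the second derivative bounded away from zero.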
Linear regression
Optimization
Linear Classification revisited
Logistic regression
Linear Discriminant Analysis
Support vector machines
Memory-based/Instance-based learning
k-nearest-neighbor
Properties of high-dimensional spaces
distance learning
Efficient indexing and retrieval methods
Decision trees
Classification and regression trees
Random Forests
Boosting
Neural networks / Deep Learning
Multilayer perceptrons
Variations such as convolutional nets
Examples and applications
Unsupervised methods
Clustering
Density estimation
Dimensionality reduction
Applications: Collaborative filtering, etc.
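To give a flavor of the memory-based methods in this outline, here is a minimal sketch of k-nearest-neighbor classification in plain Python. It assumes Euclidean distance and a simple majority vote; the function name `knn_predict` and the toy data are hypothetical. It scans every training point for each query, which is exactly the cost that efficient indexing and retrieval methods aim to reduce.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Majority vote among the k training points closest to `query`."""
    # Rank training indices by Euclidean distance to the query point.
    order = sorted(
        range(len(train_points)),
        key=lambda i: math.dist(train_points[i], query),
    )
    votes = Counter(train_labels[i] for i in order[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy data: two well-separated groups in the plane.
train_points = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.0),
                (6.0, 6.0), (6.5, 7.0), (7.0, 6.0)]
train_labels = ["a", "a", "a", "b", "b", "b"]

pred_near_a = knn_predict(train_points, train_labels, (1.2, 1.4))
pred_near_b = knn_predict(train_points, train_labels, (6.4, 6.2))
```

There is no training step: the "model" is the stored data, and all the work happens at query time.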