Professional Documents
Culture Documents
Products
Emily Fox & Carlos Guestrin
Machine Learning Specialization
University of Washington
2015 Emily Fox & Carlos Guestrin Machine Learning Specialization
Where we see
recommender systems
Information overload
100 Hours a Minute Browsing is history
What do I care about? -Need new ways
to discover content
viewers videos
Recommendations combine
global & session interests
Recommendations form
coherent & diverse sequence
6
2015 Emily Fox & Carlos Guestrin Machine Learning Specialization
Friend recommendations
Limitation:
-No personalization
Product info
Classifier
No
Other info
Pros:
- Personalized:
Considers user info & purchase history
- Features can capture context:
Time of the day, what I just saw,
- Even handles limited user history: Age of user,
13
2015 Emily Fox & Carlos Guestrin Machine Learning Specialization
Limitations of classification approach
User info
Yes!
Purchase history
Product info
Classifier
No
Other info
Matrix C:
store # users who bought both items i & j
- (# items x # items) matrix
Result:
-Drowns out other eects
-Recommend based on popularity
18
2015 Emily Fox & Carlos Guestrin Machine Learning Specialization
Normalize co-occurrences:
Similarity matrix
Jaccard similarity: normalizes by popularity
- Who purchased i and j divided by
who purchased i or j
Users
Rating
X= Rows index movies
Columns index users
Movies
R
X
L
=
But we dont
know topics of
users and movies
27
2015 Emily Fox & Carlos Guestrin Machine Learning Specialization
Matrix factorization model:
Discovering topics from data
Xij known for black cells
Xij unknown for white cells
Rating
X= Rows index movies
Columns index users
Parameters Many ecient algorithms
of model for factorization
Only use observed values to estimate topic vectors Lu and Rv
Use estimated Lu and Rv for recommendations
28
2015 Emily Fox & Carlos Guestrin Machine Learning Specialization
Limitations of matrix factorization
Cold-start problem
- This model still cannot handle a new user or movie
Recall
# liked & shown
# liked
Precision
# liked & shown
# shown
Recall
# liked & shown
# liked
Precision
# liked & shown
# shown
Recall =1
Precision = 1
recall
42
2015 Emily Fox & Carlos Guestrin Machine Learning Specialization
Which Algorithm is Best?
For a given precision, want recall as large as possible
(or vice versa)
One metric: largest area under the curve (AUC)
Another: set desired recall and maximize precision
(precision at k)
precision
recall
43
2015 Emily Fox & Carlos Guestrin Machine Learning Specialization
Summary of
recommender systems