Professional Documents
Culture Documents
LDA
● Algorithm
– Initialize topic assignments randomly
– For each iteration
● For each document
– For each word
● Re-sample a topic for that word, given all the other words
3 2 1 3 1
commission trade nbn abc union
3 2 1 3 1
commission trade nbn abc union
3 2 1 3 1
commission trade nbn abc union
….
….
re-sampling
3 ? 1 3 1
commission trade nbn abc union
….
Re-assigning a topic to “trade”
● Topic 1 Topic 2 Topic 3
3 ? 1 3 1
commission trade nbn abc union
Re-assigning a topic to “trade”
● Topic 1 Topic 2 Topic 3
3 ? 1 3 1
commission trade nbn abc union
Area of rect: how much doc likes the topic * how much the topic likes the word
3 ? 1 3 1
commission trade nbn abc union
3 ? 1 3 1
commission trade nbn abc union
….
Pick a topic for “trade”
3 1 1 3 1
commission trade nbn abc union
….
Twitter LDA
● algorithm