Professional Documents
Culture Documents
Strategy 3: First
Phrase Mining then Topic
Modeling
Techniques
Phrase mining and document segmentation
Topic model inference with phrase constraint
Raw
freq.
True
freq.
90
80
[vector machine]
95
[support vector]
100
20
Phrase
Collocation Mining
Many of these measures can be used to guide the agglomerative phrasesegmentation algorithm
Topic 2
information retrieval
feature selection
social networks
web search
search engine
information
extraction
machine learning
semi supervised
large scale
support vector
machines
question answering
active learning
web pages
face recognition
Topic 1
Topic 2
10