$$P(T_j \mid U) = \frac{1}{Z(U)} \exp\big(F(T_j, U)\big)$$
problem : approach : research program: operation classification : summary
Ph.D. Thesis Proposal February 9th, 2005
39
Unsupervised learning and active learning
1. Train an initial classifier from human-labeled data
2. Apply the current classifier to an unlabeled operation
   (Unsupervised learning) If the confidence is high, add this instance and the predicted label to the training set
   (Active learning) If the confidence is low, ask a human to label this instance, then add it to the training set
3. Train a new classifier on all labeled data (both machine-labeled and human-labeled)
Steps 2 and 3 can be iterated
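
The loop above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the classifier used in this work: the one-dimensional features, the centroid-based `train` and `predict_proba`, the `oracle` callback standing in for the human annotator, and the 0.9 confidence `threshold` are all hypothetical. The high-confidence branch is what the slide calls unsupervised learning (often also called self-training).

```python
import math
from typing import Callable, Dict, List, Tuple

Example = Tuple[float, str]  # (feature, label); a 1-D toy feature, assumed for illustration


def train(labeled: List[Example]) -> Dict[str, float]:
    """Toy classifier: one centroid per label (stand-in for any real model)."""
    sums: Dict[str, List[float]] = {}
    for x, y in labeled:
        s = sums.setdefault(y, [0.0, 0.0])
        s[0] += x
        s[1] += 1
    return {y: s[0] / s[1] for y, s in sums.items()}


def predict_proba(model: Dict[str, float], x: float) -> Dict[str, float]:
    """Softmax over negative squared distances to each centroid."""
    scores = {y: -(x - c) ** 2 for y, c in model.items()}
    m = max(scores.values())  # subtract the max for numerical stability
    exps = {y: math.exp(s - m) for y, s in scores.items()}
    z = sum(exps.values())
    return {y: e / z for y, e in exps.items()}


def semi_supervised_round(
    labeled: List[Example],
    unlabeled: List[float],
    oracle: Callable[[float], str],
    threshold: float = 0.9,  # hypothetical cut-off between "high" and "low" confidence
) -> Tuple[Dict[str, float], List[Example], int]:
    """One pass over steps 2 and 3; returns (new model, labeled set, human queries)."""
    model = train(labeled)
    labeled = list(labeled)
    queries = 0
    for x in unlabeled:
        probs = predict_proba(model, x)
        label, conf = max(probs.items(), key=lambda kv: kv[1])
        if conf >= threshold:
            labeled.append((x, label))      # machine-labeled (self-training branch)
        else:
            labeled.append((x, oracle(x)))  # human-labeled (active learning branch)
            queries += 1
    return train(labeled), labeled, queries  # step 3: retrain on all labeled data


# Usage: two seed examples, three unlabeled points, and a simulated annotator.
seed = [(0.0, "a"), (10.0, "b")]
pool = [0.5, 9.0, 5.05]
model, labeled, queries = semi_supervised_round(
    seed, pool, oracle=lambda x: "a" if x < 5 else "b"
)
```

In this toy run only the point near the decision boundary (5.05) falls below the threshold and is sent to the annotator; the other two are labeled by the classifier itself. Calling `semi_supervised_round` again with the returned labeled set and a fresh pool iterates steps 2 and 3.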
Classifier confidence score
1. Difference between the probabilities of the first-ranked and second-ranked labels
2. The entropy of the classifier output
   High entropy = low confidence
$$H(T) = \sum_j p(T_j \mid U_i) \log \frac{1}{p(T_j \mid U_i)}$$
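
Both confidence scores can be computed directly from the classifier's output distribution. The sketch below is illustrative: the function names and the example distributions are made up, and the natural logarithm is an assumption, since the slide does not state the base.

```python
import math
from typing import Sequence


def margin_confidence(probs: Sequence[float]) -> float:
    """First-rank probability minus second-rank probability; larger = more confident.

    Assumes at least two classes.
    """
    top = sorted(probs, reverse=True)
    return top[0] - top[1]


def entropy(probs: Sequence[float]) -> float:
    """H = sum_j p_j * log(1 / p_j), in nats (base is an assumption).

    Zero-probability terms are skipped, following the convention 0 * log(1/0) = 0.
    """
    return sum(p * math.log(1.0 / p) for p in probs if p > 0.0)


# Hypothetical classifier outputs over three labels.
peaked = [0.90, 0.05, 0.05]  # one label clearly dominates
flat = [0.40, 0.35, 0.25]    # labels are nearly tied

print(margin_confidence(peaked), margin_confidence(flat))
print(entropy(peaked), entropy(flat))
```

The peaked distribution has the larger margin and the smaller entropy, so both scores rank it as the more confident prediction. Entropy reaches its maximum, log of the number of labels, when the output is uniform.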