Overview of Bayes Classification Methods
Harshad M. Kubade
Assistant Professor,
Department of Information Technology, PIET, Nagpur, India
ABSTRACT

The Bayes methods are popular in classification because they are optimal. To apply a Bayes method, the prior probabilities and the distributions of the patterns for each class must be known. A pattern is assigned to the class with the highest posterior probability. The three main methods under the Bayes classifier are Bayes theorem, the Naive Bayes classifier and Bayesian belief networks.

Keywords: Bayes theorem, Bayesian belief networks, Naive Bayes classifier, prior probability, posterior probability
I. INTRODUCTION

The Bayes classifier is popular because it reduces the average probability of error. Bayes classification methods require the prior probabilities and the distributions of the classes. The posterior probability is calculated, and the test pattern is assigned to the class with the highest posterior probability. The following are the various Bayes classification methods.

II. Bayes theorem

Bayes theorem expresses the posterior probability of a hypothesis M given an observation t in terms of the likelihood P(t | M), the prior probability P(M) and the evidence P(t):

P(M | t) = P(t | M) P(M) / P(t)

With P(t | M) = 0.6, P(M) = 0.2 and P(t) = 0.4, the probability that the patient has malaria given the observation t is

P(M | t) = (0.6 × 0.2) / 0.4 = 0.3
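The arithmetic above can be checked directly. The following is a minimal Python sketch, with the likelihood, prior and evidence taken from the example; the function name is only illustrative.

def posterior(likelihood, prior, evidence):
    # Bayes theorem: P(M | t) = P(t | M) * P(M) / P(t)
    return likelihood * prior / evidence

# Values from the malaria example: P(t | M) = 0.6, P(M) = 0.2, P(t) = 0.4
print(posterior(likelihood=0.6, prior=0.2, evidence=0.4))  # 0.3, up to floating-point rounding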
III. Naive Bayes classifier

The Naive Bayes classifier is a probabilistic classifier based on Bayes theorem. It assumes that the presence (or absence) of a particular feature of a class is unrelated to the presence (or absence) of any other feature, given the class variable.

The advantage of the Naive Bayes classifier is that it requires only a small amount of training data to estimate the parameters for classification. The probability model for a classifier is the conditional model

P(C | F1, ..., Fn)

over a class variable C conditioned on the feature variables F1, ..., Fn. Using Bayes theorem, this can be written as

P(C | F1, ..., Fn) = p(C) p(F1, ..., Fn | C) / p(F1, ..., Fn)

The naive conditional independence assumption is that each feature Fi is independent of every other feature Fj (j ≠ i), given the class:

p(Fi | C, Fj) = p(Fi | C)

and so the joint model can be expressed as

p(C, F1, ..., Fn) = p(C) p(F1 | C) p(F2 | C) p(F3 | C) · · · p(Fn | C) = p(C) ∏ p(Fi | C)

with the product taken over i = 1, ..., n. This means that under the above independence assumptions, the conditional distribution over the class variable C can be expressed as

P(C | F1, ..., Fn) = (1/Z) p(C) ∏ p(Fi | C)

where Z is a scaling factor dependent only on F1, ..., Fn, i.e., a constant if the values of the feature variables are known.
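To make the model and the scaling factor Z concrete, here is a minimal Python sketch for two classes and three binary features; the class names, priors and per-feature likelihoods are illustrative values, not taken from the paper.

priors = {"c1": 0.6, "c2": 0.4}            # p(C), illustrative values
likelihoods = {
    "c1": [0.9, 0.2, 0.7],                 # p(Fi = 1 | C = c1), i = 1..3
    "c2": [0.3, 0.8, 0.4],                 # p(Fi = 1 | C = c2)
}

def class_posteriors(features):
    # P(C | F1, ..., Fn) = (1/Z) * p(C) * prod_i p(Fi | C)
    unnormalized = {}
    for c, prior in priors.items():
        p = prior
        for fi, p_fi in zip(features, likelihoods[c]):
            # p(Fi | C) is p_fi when the feature is present, 1 - p_fi otherwise
            p *= p_fi if fi == 1 else 1.0 - p_fi
        unnormalized[c] = p
    z = sum(unnormalized.values())          # the scaling factor Z
    return {c: p / z for c, p in unnormalized.items()}

print(class_posteriors([1, 0, 1]))          # c1 comes out far more probable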
The Naive Bayes classifier combines this probability model with a decision rule. One common rule is to pick the hypothesis that is most probable; this is known as the maximum a posteriori or MAP decision rule. The corresponding classifier is the function "classify" defined as follows:

classify(f1, ..., fn) = argmax over c of p(C = c) ∏ p(Fi = fi | C = c)
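A sketch of the MAP rule follows, repeating the illustrative tables from the previous snippet so that it stands alone. Because Z is the same for every class, the unnormalized scores are enough to find the argmax.

priors = {"c1": 0.6, "c2": 0.4}
likelihoods = {
    "c1": [0.9, 0.2, 0.7],  # p(Fi = 1 | C = c1)
    "c2": [0.3, 0.8, 0.4],  # p(Fi = 1 | C = c2)
}

def classify(features):
    # MAP rule: argmax_c p(C = c) * prod_i p(Fi = fi | C = c)
    scores = {}
    for c, prior in priors.items():
        p = prior
        for fi, p_fi in zip(features, likelihoods[c]):
            p *= p_fi if fi == 1 else 1.0 - p_fi
        scores[c] = p
    return max(scores, key=scores.get)

print(classify([1, 0, 1]))  # 'c1'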
IV. Bayesian belief networks

A Bayesian belief network represents the joint probability distribution over variables X1, ..., Xn as a product of local distributions, one per node:

P(X1, X2, ..., Xn) = ∏ P(Xi | Parents(Xi))

with the product taken over i = 1, ..., n. If node Xi has no parents, its local probability distribution is said to be unconditional; otherwise it is conditional. If the value of a node is observed, then the node is said to be an evidence node. Each variable is associated with a conditional probability table which gives the probability of this variable for the different values of its parent nodes. The value of P(X1, ..., Xn) for a particular set of values of the variables can be easily computed from the conditional probability tables.

Consider the five events M, N, X, Y, and Z. Each event has a conditional probability table. The events M and N are independent events, i.e., they are not influenced by any other events. The conditional probability table for X shows the probability of X for the different values of its parent nodes.
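The full structure over the five events is not recoverable here, so the following Python sketch assumes a hypothetical network (M and N have no parents, X depends on M and N, and Y and Z each depend on X) with made-up table entries, purely to illustrate how a joint probability is read off the conditional probability tables.

# Hypothetical CPTs for binary events; structure and numbers are
# illustrative assumptions, not taken from the paper.
p_m = 0.3                       # P(M), unconditional: M has no parents
p_n = 0.6                       # P(N), unconditional: N has no parents
p_x = {(True, True): 0.9, (True, False): 0.5,
       (False, True): 0.4, (False, False): 0.1}   # P(X | M, N)
p_y = {True: 0.7, False: 0.2}   # P(Y | X)
p_z = {True: 0.8, False: 0.3}   # P(Z | X)

def prob(event_true, p_true):
    # P(event) for a binary event, given P(event = True)
    return p_true if event_true else 1.0 - p_true

def joint(m, n, x, y, z):
    # P(M, N, X, Y, Z) = P(M) P(N) P(X | M, N) P(Y | X) P(Z | X)
    return (prob(m, p_m) * prob(n, p_n) * prob(x, p_x[(m, n)])
            * prob(y, p_y[x]) * prob(z, p_z[x]))

print(joint(True, False, True, True, False))  # 0.3 * 0.4 * 0.5 * 0.7 * 0.2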