Professional Documents
Culture Documents
HC MY
Ni dung trnh by
Khi nim Cc loi thut ton Cc v d hc my Quy trnh gii quyt bi ton bng phng php my hc Biu din d liu Cc thut ton my hc
Khi nim Hc my
Hc my l mt phng php to ra cc chng trnh my tnh bng vic phn tch cc tp d liu. Hc my c lin quan ln n thng k, v c hai lnh vc u nghin cu vic phn tch d liu, nhng khc vi thng k, hc my tp trung vo s phc tp ca cc gii thut trong vic thc thi tnh ton
ng dng Hc my
My truy tm d liu. Chn on y khoa. Pht hin th tn dng gi. Phn tch th trng chng khon. Phn loi cc chui DNA, nhn dng ting ni v ch vit, dch t ng, chi tr chi v c ng r-bt (robot locomotion).
Hc c gim st -- trong , thut ton to ra mt hm nh x d liu vo ti kt qu mong mun. Mt pht biu chun v mt vic hc c gim st l bi ton phn loi: chng trnh cn hc (cch xp x biu hin ca) mt hm nh x mt vector ti mt vi lp bng cch xem xt mt s mu d_liu - kt_qu ca hm . Hc khng gim st -- m hnh ha mt tp d liu, khng c sn cc v d c gn nhn.
5
Hc na gim st : kt hp cc v d c gn nhn v khng gn nhn sinh mt hm hoc mt b phn loi thch hp.
Hc tng cng : trong , thut ton hc mt chnh sch hnh ng ty theo cc quan st v th gii. Mi hnh ng u c tc ng ti mi trng, v mi trng cung cp thng tin phn hi hng dn cho thut ton ca qu trnh hc.
6
Chuyn i -- tng t hc c gim st nhng khng xy dng hm mt cch r rng. Thay v th, c gng on kt qu mi da vo cc d liu hun luyn, kt qu hun luyn, v d liu th nghim c sn trong qu trnh hun luyn.
Hc cch hc -- trong thut ton hc thin kin quy np ca chnh mnh, da theo cc kinh nghim gp.
Cc v d hc my
Cc v d hc my (tip)
10
Cc v d hc my (tip)
11
Cc v d hc my (tip)
12
Quy trnh hc my
13
14
Cc thut ton hc
Bayes (Mitchell, 1996). Cy quyt nh (Fuhr et al, 1991). Vc-t trng tm (Centroid- based vector) (Han v Karypis, 2000). k-lng ging gn nht (Yang, 1994). Mng nron (Wiener et al, 1995). Support vector machines (Joachims, 1998).
15
16
i vi d liu phi cu trc th phi biu din bng d liu c cu trc. Biu din d liu bng M hnh thng tin khng gian-Vector
17
18
Cho vn bn D = Khi tt c u ngh hai i mnh nht ng Nam sp sa vo hai hip ph th bt ng ci u vng ca L Cng Vinh i ln tch tc mang v chic cp AFF cho i tuyn Vit Nam... Gi s b t in bao gm: Th_thao, Bng_, i_tuyn, ng_Nam_, Cp_AFF, Vit_Nam Th vn bn D c biu din bng phng php tn sut l: D = (0,0,1,1,1,1)
19
Cc gi tr wij c tnh da trn tn s (hay s ln) xut hin ca thut ng trong vn bn. Gi fij l s ln xut hin ca thut ng ti trong vn bn dj, khi wij c tnh bi mt trong ba cng thc:
Cc thut ton my hc
Bayes (Mitchell, 1996). Cy quyt nh (Fuhr et al, 1991). Vc-t trng tm (Centroid- based vector) (Han v Karypis, 2000). k-lng ging gn nht (Yang, 1994). Mng nron (Wiener et al, 1995). Support vector machines (Joachims, 1998).
22
23
nh l Bayes cho php tnh xc sut xy ra ca mt s kin ngu nhin A khi bit s kin lin quan B xy ra. Xc sut ny c k hiu l P(A|B), v c l "xc sut ca A nu c B".
24
Bayes (tip)
Xc sut xy ra A ca ring n, khng quan tm n B. K hiu l P(A) v c l xc sut ca A Xc sut xy ra B ca ring n, khng quan tm n A. K hiu l P(B) v c l "xc sut ca B". Xc sut xy ra B khi bit A xy ra. K hiu l P(B|A) v c l "xc sut ca B nu c A".
Khi bit ba i lng ny, xc sut ca A khi bit B cho bi cng thc:
25
26
V d: (tip)
Xc sut P(A): Xc sut rng anh ta chi tennis (bt k Ngoi tri nh th no v Gi ra sao) Xc sut P(B ): Xc sut rng Ngoi tri l nng v Gi l mnh P(B|A): Xc sut rng Ngoi tri l nng v Gi l mnh, nu bit rng anh ta chi tennis P(A|B): Xc sut rng anh ta chi tennis, nu bit rng Ngoi tri l nng v Gi l mnh
27
P(A|B) => Gi tr xc sut c iu kin ny s c dng d on xem anh ta c chi tennis hay khng? P(A)=8/12, P(B|A)=1/2 Trong trng hp: A l Anh ta khng chi tennis P(A)=4/12, P(B|A)=1/2
28
30
31
income student redit_rating c buys_comput high no fair no high no excellent no high no fair yes medium no fair yes low yes fair yes low yes excellent no low yes excellent yes medium no fair no low yes fair yes medium yes fair yes medium yes excellent yes medium no excellent yes high yes fair yes medium no excellent no
32
Mt sinh vin tr vi mc thu nhp trung bnh v mc nh gi tn dng bnh thng s mua mt my tnh hay khng?
Lp hun luyn: C1:buys_computer = yes C2:buys_computer = no D liu cn phn loi: X = (age <=30, Income = medium, Student = yes Credit_rating = Fair)
33
P(Ci):
34
P(X|buys_computer = yes) = 0.222 x 0.444 x 0.667 x 0.667 = 0.044 P(X|buys_computer = no) = 0.6 x 0.4 x 0.2 x 0.4 = 0.019 P(X|buys_computer = yes) * P(buys_computer = yes) = 0.028 P(X|buys_computer = no) * P(buys_computer = no) = 0.007
P(X|Ci)*P(Ci) :
36
37
38
39
40
41