
ECE 8443 – Pattern Recognition

LECTURE 14: SUFFICIENT STATISTICS

• Objectives:
Sufficient Statistics
Dimensionality
Complexity
Overfitting
• Resources:
DHS – Chap. 3 (Part 2)
Rice – Sufficient Statistics
Ellem – Sufficient Statistics
TAMU – Dimensionality

• URL: .../publications/courses/ece_8443/lectures/current/lecture_14.ppt
14: SUFFICIENT STATISTICS
DEFINITION
• Direct computation of p(D|θ) and p(θ|D) for large data sets
is challenging (e.g., neural networks).
• We need a parametric form for p(x|θ) (e.g., Gaussian).
• Gaussian case: computation of the sample mean and
covariance, which was straightforward, contained all the
information relevant to estimating the unknown population
mean and covariance.
• This property exists for other distributions.
• A sufficient statistic is a function s of the samples D that
contains all the information relevant to a parameter, θ.
• A statistic, s, is said to be sufficient for θ if p(D|s,θ) is
independent of θ:

$$p(\theta \mid s, D) = \frac{p(D \mid s, \theta)\,p(\theta \mid s)}{p(D \mid s)} = p(\theta \mid s)$$
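As a quick illustration of this definition (an added, standard example): for n independent Bernoulli(θ) trials with s = Σₖ xₖ, every dataset with s successes is equally likely once s is known:

$$p(D \mid s, \theta) = \frac{p(D \mid \theta)}{p(s \mid \theta)} = \frac{\theta^{s}(1-\theta)^{n-s}}{\binom{n}{s}\theta^{s}(1-\theta)^{n-s}} = \binom{n}{s}^{-1},$$

which does not depend on θ, so s is sufficient for θ.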
14: SUFFICIENT STATISTICS
FACTORIZATION THEOREM

• Theorem: A statistic, s, is sufficient for θ, if and only if
p(D|θ) can be written as:

$$p(D \mid \theta) = g(s, \theta)\,h(D)$$

• There are many ways to formulate sufficient statistics
(e.g., define a vector of the samples themselves).
• Useful only when the function g() and the sufficient
statistic are simple (e.g., sample mean calculation).
• The factoring of p(D|θ) is not unique:

$$g'(s, \theta) = f(s)\,g(s, \theta), \qquad h'(D) = h(D)/f(s)$$

• Define a kernel density invariant to scaling:

$$\bar{g}(s, \theta) = \frac{g(s, \theta)}{\int g(s, \theta)\,d\theta}$$
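For instance (an added example), the theorem factors the likelihood of n i.i.d. Poisson(θ) samples cleanly, exposing s = Σₖ xₖ as a sufficient statistic:

$$p(D \mid \theta) = \prod_{k=1}^{n} \frac{e^{-\theta}\theta^{x_k}}{x_k!} = \underbrace{e^{-n\theta}\theta^{s}}_{g(s,\theta)}\;\underbrace{\prod_{k=1}^{n}\frac{1}{x_k!}}_{h(D)}$$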
14: SUFFICIENT STATISTICS
GAUSSIAN DISTRIBUTION
n 1 1 t 1
p( D |  )   exp[  ( x k   )  ( x k   )]
d 2 12
k 1( 2 )  2
1 1 n t 1 t 1 1
 exp[       2  x k  x t
k  xk ]
d 2 12
( 2 )  2 k 1

t 1  
n t 1 n
 exp[          x k  ]
2  k 1 
1 1 n t 1
 exp[   x k  x k ]
d 2 12
( 2 )  2 k 1
• This isolates the  dependence in the first term, and
hence, the sample mean is a sufficient statistic.
• The kernel is: ~ 1 1  1 1 
g( 
ˆ n , )  exp[ (   
ˆ n )t   (   
ˆ n) ]
1
12 2 n 
( 2 )d 2

n
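A minimal numerical sketch of this result (illustrative code; the covariance and datasets are made up): if two datasets of equal size share the same sample mean, their log-likelihood difference should not depend on μ, since μ enters only through the first factor.

```python
import numpy as np

rng = np.random.default_rng(0)
d, n = 3, 50
Sigma = np.array([[2.0, 0.3, 0.0],
                  [0.3, 1.0, 0.2],
                  [0.0, 0.2, 1.5]])          # known covariance
Sigma_inv = np.linalg.inv(Sigma)

def log_likelihood(D, mu):
    """log p(D | mu) for i.i.d. N(mu, Sigma) samples (rows of D)."""
    diff = D - mu
    quad = np.einsum('ki,ij,kj->', diff, Sigma_inv, diff)
    n, d = D.shape
    return (-0.5 * quad
            - 0.5 * n * d * np.log(2 * np.pi)
            - 0.5 * n * np.log(np.linalg.det(Sigma)))

D1 = rng.normal(size=(n, d))
D2 = rng.normal(size=(n, d))
D2 += D1.mean(axis=0) - D2.mean(axis=0)      # force identical sample means

# The gap below equals log h(D1) - log h(D2) and must not change with mu.
for mu in [np.zeros(d), np.ones(d), rng.normal(size=d)]:
    print(log_likelihood(D1, mu) - log_likelihood(D2, mu))
```

All three printed differences are identical: with Σ known, the μ-dependence of the likelihood is carried entirely by the sample mean.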
14: SUFFICIENT STATISTICS
EXPONENTIAL FAMILY
• This can be generalized:

$$p(x \mid \theta) = \alpha(x) \exp\!\left[a(\theta) + b(\theta)^t c(x)\right]$$

and:

$$p(D \mid \theta) = \exp\!\left[n\,a(\theta) + b(\theta)^t \sum_{k=1}^{n} c(x_k)\right] \prod_{k=1}^{n} \alpha(x_k) = g(s, \theta)\,h(D)$$
• Examples:
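For reference, two standard members of this family, written in the α(x), a(θ), b(θ), c(x) form above:

$$\text{Exponential: } p(x \mid \theta) = \theta e^{-\theta x} \;\Rightarrow\; \alpha(x) = 1,\; a(\theta) = \ln\theta,\; b(\theta) = -\theta,\; c(x) = x$$

$$\text{Poisson: } p(x \mid \theta) = \frac{e^{-\theta}\theta^{x}}{x!} \;\Rightarrow\; \alpha(x) = \frac{1}{x!},\; a(\theta) = -\theta,\; b(\theta) = \ln\theta,\; c(x) = x$$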
14: PROBLEMS OF DIMENSIONALITY
DIRECTIONS OF DISCRIMINATION
• If features are statistically independent, in theory we can
get excellent performance.
• Recall the Bayes error rate for a two-class multivariate
normal problem:

$$P(e) = \frac{1}{\sqrt{2\pi}} \int_{r/2}^{\infty} e^{-u^2/2}\,du$$

where r² is the squared Mahalanobis distance:

$$r^2 = (\mu_1 - \mu_2)^t \Sigma^{-1}(\mu_1 - \mu_2)$$

• For conditionally independent features:

$$r^2 = \sum_{i=1}^{d} \left(\frac{\mu_{i1} - \mu_{i2}}{\sigma_i}\right)^2$$

• The most useful features are those for which the difference
of the means is large with respect to the standard deviation.
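A short sketch (with hypothetical per-feature separations) shows how each conditionally independent feature adds to r² and drives P(e) down; it uses the identity (1/√(2π))∫ₐ^∞ e^{−u²/2} du = ½ erfc(a/√2).

```python
import math

def bayes_error(r):
    # P(e) = (1/sqrt(2*pi)) * integral from r/2 to inf of exp(-u^2/2) du
    return 0.5 * math.erfc((r / 2) / math.sqrt(2))

# hypothetical per-feature mean differences and standard deviations
mean_diff = [2.0, 1.0, 0.5, 0.25]
sigma     = [1.0, 1.0, 1.0, 1.0]

r2 = 0.0
for i, (dm, s) in enumerate(zip(mean_diff, sigma), start=1):
    r2 += (dm / s) ** 2          # independent features add to r^2
    r = math.sqrt(r2)
    print(f"d={i}  r={r:.3f}  P(e)={bayes_error(r):.4f}")
```

Each added feature lowers the error, but features with small (μᵢ₁ − μᵢ₂)/σᵢ contribute little, which is the point of the final bullet above.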
14: PROBLEMS OF DIMENSIONALITY
COMPUTATIONAL COMPLEXITY
• “Big Oh” notation used to describe complexity:
if f(x) = 2 + 3x + 4x², f(x) has computational complexity O(x²).
• Recall:

$$g(x) = -\frac{1}{2}(x - \hat{\mu})^t \hat{\Sigma}^{-1}(x - \hat{\mu}) - \frac{d}{2}\ln 2\pi - \frac{1}{2}\ln|\hat{\Sigma}| + \ln P(\omega)$$

The costs of obtaining each term from n training samples are,
respectively, O(dn) for μ̂, O(nd²) for Σ̂⁻¹, O(1) for the
constant, O(d²n) for ln|Σ̂|, and O(n) for P(ω).
• Watch those constants of proportionality (e.g., O(nd²)).
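A minimal sketch of where these costs arise (illustrative code, assuming the class-conditional Gaussian model above): the expensive estimates are computed once per class at training time, while each evaluation of g(x) is a quadratic form costing O(d²).

```python
import numpy as np

def train_class(X, prior):
    """Estimate per-class quantities from n x d training data X."""
    mu_hat = X.mean(axis=0)                  # sample mean: O(dn)
    Sigma_hat = np.cov(X, rowvar=False)      # sample covariance: O(nd^2)
    sign, logdet = np.linalg.slogdet(Sigma_hat)
    return mu_hat, np.linalg.inv(Sigma_hat), logdet, np.log(prior)

def discriminant(x, mu_hat, Sigma_inv, logdet, log_prior):
    """g(x) for one class; the quadratic form is O(d^2) per test point."""
    d = x.shape[0]
    diff = x - mu_hat
    return (-0.5 * diff @ Sigma_inv @ diff
            - 0.5 * d * np.log(2 * np.pi)
            - 0.5 * logdet
            + log_prior)

# usage with made-up data:
rng = np.random.default_rng(1)
X = rng.normal(size=(100, 4))
print(discriminant(rng.normal(size=4), *train_class(X, prior=0.5)))
```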


• If the number of data samples is inadequate, we can
experience overfitting (which implies poor generalization).
• Hence, later in the course, we will study ways to control
generalization and to smooth estimates of key parameters
such as the mean and covariance (see textbook).
