You are on page 1of 26

Chapter 1

Overview of Pattern Recognition

Introduction to Pattern Recognition


What is Pattern?
something exhibiting certain characteristics (regularity)
something able to serve as a model (Greek Beauty, Kyoto Beauty)
something representing a concept of what was observed or experienced (Sticky dish)
In engineering, usually represented as a set of measurements or observations in vector or mat
rix forms
Examples:

Introduction to Pattern Recognition


Pattern Recognition
to recognize given information (usually measures or observation) via information reduction, infor
mation mapping, or information labeling process.
Usually involves the concepts of classification, recognition, or description.

Definition
Classification
assigns input data into one or more of C pre-specified classes based on the extracted features.

Description
an alternative to classification where a conceptual or structural description of the input pattern is desire
d.

Recognition
ability to classify , even identify in many cases.

Preprocessing
filtering and transformation of raw input to minimize noise and to extract rather reliable features.

Postprocessing
Refining the classification or recognition results by utilizing properties of the pattern classes

Introduction to Pattern Recognition


Definition
Desirable Feature
distinct characteristics of data (measurements or observations).
allows representation of high dimensional raw measurements in a lower dimensional space
while preserving essential information, i.e., reducing the redundancy.
provides a good separable space for pattern classification.
examples: mean, variance, distribution type, etc.
desirable feature selection is the most essential one for success.
feature selection is a problem dependent task (the concept of tuning).

Noise
undesirable components in data
distortions or errors in raw input data (preprocessing can increase the distortion)
errors in feature extraction, errors in training data
More Classification Errors
Searching for Noise Robust Features

Introduction to Pattern Recognition


Major Four types of pattern recognition approaches
Tree-structure (or decision) based pattern recognition

Just imagine how to classify apples from oranges; usually intuitional tasks
But very powerful in many applications
Still rather hard to generalize the algorithm, i.e., training data dependency
Recommended to design ones own decision based pattern recognizer before imple
menting the main PR algorithm as a practice.

Statistical pattern recognition


statistical or decision theoretic approach
parametric and non-parametric methods; Most engineering tasks
Easy to generalize and usually produces rather good results for new unknown inputs:
Why?
Not easy to understand the inner operation of the PR algorithm

Introduction to Pattern Recognition


Major Four types of pattern recognition approaches
Syntactic pattern recognition
structural analysis on interrelationships or interconnections of features
requires quantification and extraction of structural information and then assesses structural similarit
y of patterns;
Very powerful for bio-pattern, character recognition, and NLP

Neural Network based Pattern recognition


Somehow a block box approach
involves neural networks for pattern recognition
(Given input Trained multilayer NN Classified output)
Almost impossible to analyze the internal decision behaviors; Hard to improve

Fuzzy Pattern recognition


Based on the Fuzzy theory
Not easy to implement but motion compensation for cameras can be an example.

Introduction to Pattern Recognition


Mappings in an abstract representation of pattern generation/classificati
on/interpretation systems

p4
class i
p1
class j

class
k
Class membership or
description space: C

m1

p2

m2

p3

m3

pattern space: P

Observation or
measurement space: F

Introduction to Pattern Recognition


Pattern recognition procedures

Preprocessing
Usually the first step after acquiring raw data from which pattern recognitio
n is performed.
Usually involves:
Noise reduction; Filtering
Adjusting or deduction of the average value of data (In many cases, including normal
ization)
Segmentation of region (object) of interest from the acquired raw data
Processing of the raw data and mapping the results into different representation dom
ains
Various transforms can be used such as Wavelet, Fast Fourier transforms

Introduction to Pattern Recognition


Pattern recognition procedures

Preprocessing noise reduction in speech signals


Primary signal

Noise reference signal

After spectral subtraction

After noise reduction

Introduction to Pattern Recognition


Pattern recognition procedures

Preprocessing noise reduction in image


Before noise reduction

After noise reduction

Introduction to Pattern Recognition


Pattern recognition procedures

Preprocessing segmentation
Region Tree

Object Tree

Root

R1

Region
Segmentation

R3

R2
R4
R5
R8

R9
R12

Body

TV

Jacket

Hair

Face

Torso

R7

R6
R10

R 11
R13

Introduction to Pattern Recognition


Pattern recognition procedures
Feature extraction
needs to be computationally feasible by manual or by automated procedure
s
Feature vector and feature space
Features are formed as a d-dimensional vector, called the feature vector.
Feature vectors are typically used in statistical and neural network based pattern reco
gnition.

Observation
environment

Feature extraction
process

X (feature vector)
X (string)
P (structure)

Input data preprocessing/feature extraction

Introduction to Pattern Recognition


Pattern recognition procedures

Pattern classification (Assuming we already have pre-trained patterns)


action of classifying the raw data into one of the pre-determined classes or patterns
What can we do if a raw datum from a totally unknown class is met?
Signal (information)
Data acquisition
Measure mi
Preprocessing and
enhancement
Feature extraction

Classification

Description

diagram of typical pattern recognition systems

Selection problem of features

An example of fish classification [textbook, Figure 1.1]

Selection problem of features


Which feature is better for good classification between salmo
n and sea bass classes?

Histogram of length feature for salmon and sea


bass classes

Histogram of lightness feature for salmon and sea


bass classes

Changing the position of the threshold affects the classification error!

Representation of Feature Vectors


Scatter Plot
plot of feature vectors in feature space
difficult to visualize feature vectors with greater than 3-D.

A two-class scatter plot

Some Terminologies
Classifiers
partitions feature space into class-labeled decision regions.
Separable or inseparable
If inseparable, any compromise is possible and allowed?

A decision region
represents the corresponding class in feature space.

Discriminant function
computes the class-likelihood value of a feature (vector) with respect to a certain class.

Adaptive or Self-learning .vs. Fixed

Classification Rule
Classification on features can be done based on compari
ng a set of discriminant functions
Decision rules: Assign
gm x gi x

to class

(region

i 1,2, , c and i m

Likelihood, Nearest, Measure, Similarity, .

Rm

) if

Classification Rule
Decision boundary divides decision regions in feature space where

g k x gl x
Linear
discriminant
functions
R1
R2

Decision
boundary

R3

Quadratic
discriminant
function
R1

R2

arbitrary
discriminant
function
R1
R2

R2

Decision
region

R3

R2
R1

R4

Classification Rule
Which decision boundary is the best linear, quadra
tic or arbitrary decision boundary?

Classification Rule
Generalization problem
For a robust classification performance with test data, the decision boundary needs to
be conformed in a robust way.
So, the training samples should represent the nature of the underlying data successful
ly
Sufficient number of training data?
The more, the better. But not practical if too much.
Experimentally, or intuitively, we have to decide the number with some
restraints such as small number of available data.
The classifiers should be sufficiently trained! (Not always possible!)
There are some tricky methods to recover the shortage of the training data but sufficie
nt training data are better!
More insight for the tasks, better pattern recognizers!

Development Procedure of PR systems


Step 1: Study the classes of patterns under consideration to develop possible characteri
zations. (ideation, logic process)
Step 2: Determine the availability of feature/measurement data
Step 3: Consider constraints on desired system performance,
computational resources, and the implementation cost (Engineering)
Step 4: Consider the availability of training data
Step 5: Consider the availability of suitable and known PR technologies
Step 6: Develop a PR system simulation
Step 7: Train the system
Step 8: Simulate system performance
Step 9: Iterate among the above steps until desired performance is
achieved.
If still not successful, then ?

Development Procedure of PR systems

Procedure of designing a pattern classifier

Typical pattern classifier

Summary
Relationship of pattern recognition to other areas

(Adaptive) signal processing and systems


training and learning of adaptive systems

Artificial Intelligence, Expert System


Optimization/estimation theory
parameter estimation towards optimization of systems

Fuzzy sets
Structural Modeling
Formal language, ..

Summary
Pattern recognition applications
Image preprocessing, segmentation and analysis, coding
Seismic /weather/hurricane analysis and forecast
Radar, Sonar signal classification/analysis
Face, Gesture recognition
Speech/Audio recognition/understanding, Speaker and language recognitio
n
Fingerprint, DNA identification
Character (letter or number) recognition
Handwriting analysis; Electronic Signature for e-commerce
Human bio-signals such as ECG and EEG signal analysis/understanding
Medical diagnosis (tumor, mammogram, germ, virus, cancer,.)

Others to be mentioned
Terms

Novel Patterns (New unknown input)


Supervised, Unsupervised, Reinforced(-ment) Learning
Good Features

Translation, Rotation, Scale Invariant


Noise, Environment Robust
Statistically Independent (No Redundancy)

Authors are saying in p.12 that: Although the pattern classification techniques presented in this
book cannot substitute for domain knowledge, they can be helpful in making the feature values
less sensitive to noise.

You might also like