
Computer Vision and Image Processing (CoSc4151)
Chapter six
Image compression
Jimma University
Jimma Institute of Technology
For Computer Science students
Abel W.
Image Compression

Image compression addresses the problem of reducing the amount of data required to represent a digital image with no significant loss of information.

CV & IP 02 2
Applications that require image compression are many and varied, such as:

1. Internet,
2. Businesses,
3. Multimedia,
4. Satellite imaging,
5. Medical imaging

Image Compression

Images take a lot of storage space:

 A 1024 × 1024 image at 32 bits per pixel requires 4 MB.
 One minute of 640 × 480 video at 24 bits per pixel and 30 frames per second requires about 1.54 GB.

Many bytes take a long time to transfer over slow connections. Suppose we have a 56,000 bps connection:

 4 MB will take almost 10 minutes.
 1.54 GB will take almost 66 hours.
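The figures above can be verified with simple arithmetic. A sketch in plain Python, using binary units (1 MB = 2^20 bytes):

```python
# Verify the storage and transfer-time figures quoted above.
def image_size_bytes(width, height, bits_per_pixel):
    """Raw size of an uncompressed image in bytes."""
    return width * height * bits_per_pixel // 8

# 1024 x 1024 at 32 bits/pixel -> 4 MB (1 MB = 2**20 bytes)
image = image_size_bytes(1024, 1024, 32)
print(image / 2**20)              # 4.0 MB

# 640 x 480, 24 bits/pixel, 30 frames/s, 60 s -> about 1.54 GB
video = image_size_bytes(640, 480, 24) * 30 * 60
print(round(video / 2**30, 2))    # 1.54 GB

# Transfer times over a 56,000 bps connection
bps = 56_000
print(round(image * 8 / bps / 60, 1))    # 10.0 minutes
print(round(video * 8 / bps / 3600, 1))  # 65.8 hours
```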

Storage problems, plus the desire to exchange images over the Internet, have led to a large interest in image compression algorithms.

Compression algorithms remove redundancy
If more data are used than is strictly necessary, then we say that there is
redundancy in the dataset.
Data redundancy is not an abstract concept but a mathematically quantifiable entity. If n1 and nc denote the number of information-carrying units in two data sets that represent the same information, the relative data redundancy RD of the first data set (n1) can be defined as

RD = 1 - 1/CR    (1)

where CR is the compression ratio, defined as

CR = n1/nc    (2)

where n1 is the number of information-carrying units used in the uncompressed dataset and nc is the number of units in the compressed dataset. The same units should be used for n1 and nc; bits or bytes are typically used.
When nc << n1, CR takes on a large value and RD approaches 1. Larger values of CR indicate better compression.
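Equations (1) and (2) can be sketched directly; the function names here are illustrative, not from any standard library:

```python
def compression_ratio(n1, nc):
    """CR = n1 / nc -- eq. (2): uncompressed units over compressed units."""
    return n1 / nc

def relative_redundancy(n1, nc):
    """RD = 1 - 1/CR -- eq. (1)."""
    return 1 - 1 / compression_ratio(n1, nc)

# Example: a 4 MB image compressed to 1 MB
print(compression_ratio(4, 1))    # 4.0
print(relative_redundancy(4, 1))  # 0.75 -> 75% of the original data was redundant
```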
Entropy
Entropy in information theory is the measure of the
information content of a message.

 Entropy gives the average number of bits per pixel required to encode an image:

H = - Σ_{i=0}^{2^b - 1} P(i) log2 P(i)

where b is the number of bits per pixel.
 Probabilities are computed by normalizing the histogram of the image:

P(i) = h_i / n

where h_i is the frequency of occurrence of grey level i and n is the total number of pixels in the image.
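A minimal sketch of the entropy computation, assuming the histogram is given as a plain list of gray-level counts:

```python
import math

def entropy(histogram):
    """H = -sum P(i) * log2 P(i), with P(i) = h_i / n."""
    n = sum(histogram)
    h = 0.0
    for count in histogram:
        if count:                 # skip empty bins: 0 * log 0 contributes 0
            p = count / n
            h -= p * math.log2(p)
    return h

# A two-level image with equal counts needs 1 bit/pixel on average
print(entropy([50, 50]))    # 1.0
# A constant image carries no information
print(entropy([100, 0]))    # 0.0
```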

A general algorithm for data compression and image
reconstruction
Input image f(x,y) → Source encoder → Channel encoder → Channel → Channel decoder → Source decoder → Reconstructed image f'(x,y)

(The source encoder performs data-redundancy reduction; the source decoder performs the reconstruction.)
An input image is fed into the encoder, which creates a set of symbols from the input data. After transmission over the channel, the encoded representation is fed to the decoder, where a reconstructed output image f'(x,y) is generated. In general, f'(x,y) may or may not be an exact replica of f(x,y). If it is, the system is error-free or information-preserving; if not, some level of distortion is present in the reconstructed image.
Redundancy information and data
 Data is not the same thing as information.
 Data is the means with which information is
expressed.
 The amount of data can be much larger than the
amount of information.
 Redundant data doesn't provide additional
information.
 Image coding or compression aims at reducing the
amount of data while keeping the information by
reducing the amount of redundancy.

Types of redundancy

Three basic types of redundancy can be identified in a single image:
1) Coding redundancy
2) Interpixel redundancy
3) Psychovisual redundancy

Different Types of Redundancy
Coding Redundancy
 Some gray levels are more common than others.
Inter-pixel Redundancy
 The same gray level may cover
a large area.
Psycho-Visual Redundancy
 The eye can only resolve about
32 gray levels locally.

Coding redundancy
Our quantized data is represented using code-words.
The code-words are ordered in the same way as the intensities that they
represent;
 thus the bit pattern 00000000, corresponding to the value 0,
represents the darkest points in an image and the bit pattern
11111111, corresponding to the value 255, represents the brightest
points.
If the size of the code-word is larger than is necessary to
represent all quantization levels, then we have coding redundancy

Coding redundancy – Example
(Huffman coding)
rk         Pr(rk)   Code 1   l1(rk)   Code 2   l2(rk)
r0 = 0     0.19     000      3        11       2
r1 = 1/7   0.25     001      3        01       2
r2 = 2/7   0.21     010      3        10       2
r3 = 3/7   0.16     011      3        001      3
r4 = 4/7   0.08     100      3        0001     4
r5 = 5/7   0.06     101      3        00001    5
r6 = 6/7   0.03     110      3        000001   6
r7 = 1     0.02     111      3        000000   6
Lavg = Σ_{k=0}^{7} l2(rk) Pr(rk) = 2.7 bits/pixel

Using eq. (2), the resulting compression ratio CR is 3/2.7, or 1.11. Thus approximately 10 percent of the data resulting from the use of code 1 is redundant. The exact level of redundancy is

RD = 1 - 1/1.11 = 0.099
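The average code length and the resulting compression ratio can be reproduced from the table above:

```python
# Probabilities and code-2 lengths from the table above
probs    = [0.19, 0.25, 0.21, 0.16, 0.08, 0.06, 0.03, 0.02]
lengths2 = [2, 2, 2, 3, 4, 5, 6, 6]

# Average length of the variable-length code
l_avg = sum(l * p for l, p in zip(lengths2, probs))
print(round(l_avg, 2))   # 2.7 bits/pixel

# Code 1 is a fixed 3-bit code, so eq. (2) gives:
cr = 3 / l_avg
rd = 1 - 1 / cr          # eq. (1)
print(round(cr, 2))      # 1.11
print(round(rd, 3))      # 0.1 (the quoted 0.099 comes from rounding CR to 1.11 first)
```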
Image compression
Reversible (lossless)
 no loss of information.
 The image after compression and decompression is identical to
the original image.
 Often necessary in image analysis applications.
 The compression ratio is typically 2 to 10 times.

Non-reversible (lossy)
 loss of some information.
 It is often used in image communication, compact cameras,
video, www, etc.
 The compression ratio is typically 10 to 30 times.

Data compression

 Data compression implies sending or storing a smaller number of bits. Although many methods are used for this purpose, in general these methods can be divided into two broad categories: lossless and lossy methods.

Data compression methods
Image Coding and Compression

LOSSLESS COMPRESSION METHODS

 No loss of data: the decompressed image is exactly the same as the uncompressed image.
 Required for medical images or any images used in courts.
 Lossless compression methods typically provide about a 10% reduction in file size for complex images.

Cont’d…

 Lossless compression methods can provide substantial compression for simple images.
 However, lossless compression techniques may be used for both preprocessing and post-processing in image compression algorithms to obtain the extra 10% compression.

1. Huffman Coding
 The Huffman code, developed by D. Huffman in
1952, is a minimum length code
 This means that given the statistical distribution of
the gray levels (the histogram), the Huffman
algorithm will generate a code that is as close as
possible to the minimum bound, the entropy

Cont’d…
 The method results in an unequal (or variable)
length code, where the size of the code words can
vary
 For complex images, Huffman coding alone will
typically reduce the file by 10% to 50% (1.1:1 to
1.5:1), but this ratio can be improved to 2:1 or 3:1
by preprocessing for irrelevant information
removal

Cont’d…
The Huffman algorithm can be described in five steps:
1. Find the gray level probabilities for the image by finding the
histogram
2. Order the input probabilities (histogram magnitudes) from
smallest to largest
3. Combine the smallest two by addition
4. GOTO step 2, until only two probabilities are left
5. By working backward along the tree, generate code by
alternating assignment of 0 and 1
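The five steps above can be sketched with a binary min-heap. This is a generic illustration of the algorithm, not a production encoder, and the exact bit patterns it produces depend on how ties between equal probabilities are broken:

```python
import heapq
from itertools import count

def huffman_code(probabilities):
    """Build a Huffman code for a {symbol: probability} mapping.

    Steps 2-4: repeatedly pop and merge the two least probable nodes.
    Step 5: prepend 0/1 to the codes on each side of every merge,
    which is equivalent to walking back up the tree.
    """
    tiebreak = count()   # unique counter so the heap never compares dicts
    heap = [(p, next(tiebreak), {sym: ""}) for sym, p in probabilities.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        p1, _, codes1 = heapq.heappop(heap)   # smallest probability
        p2, _, codes2 = heapq.heappop(heap)   # second smallest
        merged = {sym: "0" + c for sym, c in codes1.items()}
        merged.update({sym: "1" + c for sym, c in codes2.items()})
        heapq.heappush(heap, (p1 + p2, next(tiebreak), merged))
    return heap[0][2]

# The six-symbol source used in the reduction table further below
probs = {"a2": 0.4, "a6": 0.3, "a1": 0.1, "a4": 0.1, "a3": 0.06, "a5": 0.04}
code = huffman_code(probs)
for sym in probs:
    print(sym, code[sym])
```

The individual bit strings may differ from a hand-built table (ties can be broken either way), but the code is prefix-free and its average length, 2.2 bits/symbol for this source, is the Huffman minimum.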

Huffman Coding
• Huffman coding is the most popular technique for
removing coding redundancy.
• Unique prefix property
• Instantaneous decoding property
• Optimality
• JPEG

Huffman coding
Huffman coding assigns shorter codes to symbols that
occur more frequently and longer codes to those that occur
less frequently. For example, imagine we have a text file that
uses only five characters (A, B, C, D, E).
Before we can assign bit patterns to each character, we
assign each character a weight based on its frequency of use.
In this example, assume that the frequency of the characters is as shown below.


Huffman coding
A character’s code is found by starting at the root and
following the branches that lead to that character. The code
itself is the bit value of each branch on the path, taken in
sequence.

Final tree and code
Encoding
Let us see how to encode text using the code for our five
characters. Figure 15.6 shows the original and the
encoded text.


Huffman encoding
Decoding
The recipient has a very easy job in decoding the data
it receives. Figure 15.7 shows how decoding takes
place.

Huffman decoding
Symbol   Probability   1      2      3      4      Code
a2       0.4           0.4    0.4    0.4    0.6    1
a6       0.3           0.3    0.3    0.3    0.4    00
a1       0.1           0.1    0.2    0.3           011
a4       0.1           0.1    0.1                  0100
a3       0.06          0.1                         01010
a5       0.04                                      01011

Example

2. Run-length encoding

 Run-length encoding is probably the simplest method of compression. It can be used to compress data made of any combination of symbols.
 It does not need to know the frequency of occurrence of symbols and can be very efficient if data is represented as 0s and 1s.

The general idea behind this method is to replace
consecutive repeating occurrences of a symbol by
one occurrence of the symbol followed by the
number of occurrences.
The method can be even more efficient if the data
uses only two symbols (for example 0 and 1) in its
bit pattern and one symbol is more frequent than the
other.
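A minimal sketch of this idea, assuming the input is a plain string of symbols (a real bit-level implementation would pack the counts more compactly):

```python
def rle_encode(data):
    """Replace each run of a repeated symbol with (symbol, run length)."""
    runs = []
    i = 0
    while i < len(data):
        j = i
        while j < len(data) and data[j] == data[i]:
            j += 1                     # extend the current run
        runs.append((data[i], j - i))
        i = j
    return runs

def rle_decode(runs):
    """Inverse: expand each (symbol, count) pair back into a run."""
    return "".join(symbol * count for symbol, count in runs)

encoded = rle_encode("AAAABBBCCDAA")
print(encoded)   # [('A', 4), ('B', 3), ('C', 2), ('D', 1), ('A', 2)]
assert rle_decode(encoded) == "AAAABBBCCDAA"
```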

Run-length encoding

Run-length encoding example

Run-length encoding

Run-length encoding for two symbols

Image coding and compression

Image coding
  How the image data can be represented.

Image compression
  Reducing the amount of data required to represent an image.
  Enabling efficient image storing and transmission.

LOSSY COMPRESSION METHODS

 Our eyes and ears cannot distinguish subtle changes. In such cases, we can use a lossy data compression method.
 These methods are cheaper: they take less time and space when it comes to sending millions of bits per second for images and video.

Lossy Compression Methods
 Lossy compression methods are required to achieve
high compression ratios with complex images
 They provide tradeoffs between image quality and degree of compression, which allows the compression algorithm to be customized to the application
 Several methods have been developed using lossy
compression techniques. JPEG (Joint Photographic
Experts Group) encoding is used to compress pictures and
graphics, MPEG (Moving Picture Experts Group)
encoding is used to compress video, and MP3 (MPEG
audio layer 3) for audio compression.

Lossy compression
 With more advanced methods, images can be
compressed 10 to 20 times with virtually no visible
information loss, and 30 to 50 times with minimal
degradation
 Newer techniques, such as JPEG2000, can achieve
reasonably good image quality with compression
ratios as high as 100 to 200
 Image enhancement and restoration techniques can
be combined with lossy compression schemes to
improve the appearance of the decompressed image

Cont’d…
 In general, a higher compression ratio results in a
poorer image, but the results are highly image
dependent – application specific
 Lossy compression can be performed in both the
spatial and transform domains. Hybrid methods use
both domains.

Gray-Level Run Length Coding
 The RLC technique can also be used for lossy
image compression, by reducing the number of
gray levels, and then applying standard RLC
techniques
 As with the lossless techniques, preprocessing by
Gray code mapping will improve the compression
ratio

Lossy Bit-plane Run Length Coding

a) Original image, 8 bits/pixel, 256 gray levels
b) Image after reduction to 7 bits/pixel, 128 gray levels
Lossy Bit-plane Run Length Coding (cont’d)

c) Image after reduction to 6 bits/pixel, 64 gray levels
d) Image after reduction to 5 bits/pixel, 32 gray levels

Exercise: calculate the compression ratio.

Fidelity criteria
When lossy compression techniques are employed, the decompressed image will not be identical to the original image. In such cases, we can define fidelity criteria that measure the difference between these two images.
A good example of an objective fidelity criterion is the root-mean-square (RMS) error between an input and output image. For any values of x and y, the error e(x,y) can be defined as:

e(x,y) = f'(x,y) - f(x,y)

The total error between the two images is:

Σ_{x=0}^{M-1} Σ_{y=0}^{N-1} [f'(x,y) - f(x,y)]

The root-mean-square error e_rms is:

e_rms = [ (1/MN) Σ_{x=0}^{M-1} Σ_{y=0}^{N-1} [f'(x,y) - f(x,y)]² ]^(1/2)

Image compression – JPEG encoding
An image can be represented by a two-dimensional
array (table) of picture elements (pixels).
A grayscale picture of 307,200 pixels is represented
by 2,457,600 bits, and a color picture is represented
by 7,372,800 bits.

 In JPEG, a grayscale picture is divided into blocks of 8 × 8 pixels to decrease the number of calculations because, as we will see shortly, the number of mathematical operations for each picture is the square of the number of units.
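The pixel and bit counts above (for a 640 × 480 picture), and the motivation for small blocks, can be checked with a quick sketch; the quadratic cost model is the one stated above and is illustrative only:

```python
# Check the bit counts quoted above for a 640 x 480 picture
pixels = 640 * 480
print(pixels)        # 307200
print(pixels * 8)    # 2457600 bits for grayscale at 8 bits/pixel
print(pixels * 24)   # 7372800 bits for color at 24 bits/pixel

# Why small blocks help: if transforming a unit of k pixels costs on the
# order of k**2 operations (as stated above), many 8 x 8 blocks are far
# cheaper than treating the whole picture as one unit.
def transform_cost(total_pixels, block_pixels):
    blocks = total_pixels // block_pixels
    return blocks * block_pixels ** 2

print(transform_cost(pixels, 8 * 8))    # 4800 blocks * 64**2 = 19660800
print(transform_cost(pixels, pixels))   # one big unit: 94371840000
```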
Video compression – MPEG encoding

The Moving Picture Experts Group (MPEG) method is used to compress video. In principle, a motion picture is
a rapid sequence of a set of frames in which each frame
is a picture. In other words, a frame is a spatial
combination of pixels, and a video is a temporal
combination of frames that are sent one after another.
Compressing video, then, means spatially compressing
each frame and temporally compressing a set of frames.
Spatial compression
The spatial compression of each frame is done with
JPEG, or a modification of it. Each frame is a picture
that can be independently compressed.
Temporal compression
In temporal compression, redundant frames are
removed. When we watch television, for example, we
receive 30 frames per second. However, most of the
consecutive frames are almost the same. For example, in
a static scene in which someone is talking, most frames
are the same except for the segment around the
speaker’s lips, which changes from one frame to the
next.
Audio compression

 Audio compression can be used for speech or music. For speech we need to compress a 64 kbps digitized signal, while for music we need to compress a 1.411 Mbps signal. Two categories of techniques are used for audio compression: predictive encoding and perceptual encoding.

Predictive encoding
In predictive encoding, the differences between samples are
encoded instead of encoding all the sampled values. This type
of compression is normally used for speech. Several standards
have been defined, such as GSM (13 kbps), G.729 (8 kbps), and
G.723.1 (6.4 or 5.3 kbps). Detailed discussions of these
techniques are beyond the scope of this book.
Perceptual encoding: MP3
The most common compression technique used to create CD-
quality audio is based on the perceptual encoding technique.
This type of audio needs at least 1.411 Mbps, which cannot be
sent over the Internet without compression. MP3 (MPEG audio
layer 3) uses this technique.
Any questions?

