You are on page 1of 47

Digital Signal Processing:

An Introduction and Some Examples of its


Everyday Use
Ardik Wijayanto
ardik@eepis-its.edu

Target
Sampling
FFT
Filter

Slide 2

Materi Dalam 1 Semester

Pendahuluan (1 TM)
Dasar DSP : Sinyal dan Sistem (2 TM) + 1 Tugas
ADC dan Segala Aspeknya (1 TM)
Persamaan Beda Koef Linier Konstan (2 TM)
Konvolusi (1 TM) + 1 Tugas
Transformasi Laplace (1 TM)
Transformasi Z (1 TM)
Filter Digital dan Segala Aspeknya (2 TM) + 1 Tugas
Transformasi Fourier (1 TM) + 1 Tugas

TM = Tatap Muka

DSP ???

Contents
What is DSP?
What is DSP used for?
Speech & Audio processing
Image & Video processing
Adaptive filtering

DSP Devices and Architectures


Summary & Conclusions

Slide 5

What is DSP?
Digital Signal Processing the processing
or manipulation of signals using digital
techniques

Input
Signal

ADC
Analogue
to Digital
Converter

Digital
Signal
Processor

Slide 6

DAC
Digital to
Analogue
Converter

Output
Signal

PSD - Ardik / Bima

PSD - Ardik / Bima

PSD - Ardik / Bima

What is DSP Used For?

And much more!


Slide 10

Slide 11

The world is filled with signals


2-D signals:

1-D signals:

Seismic vibrations
EEG and EKG
Speech
Sonar
Audio & music
Dow-Jones averages

Photographs
Medical images
Radar
Satellite data
Fax
Fingerprints

And of course there are 3-d signals (video, volumetric data sets)
and beyond
Slide 12

And we want to process them


Recognize whats in a signal
Target detection
Speech recognition
Image analysis

Predict a future value of the


signal
Stock market prediction

Interpolate missing values


of a signal

Enhance a signal
Image contrast enhancement

Compress a signal
Faster transmission
Less storage space

Synthesize a realistic example of


a signal
Speech synthesis
Image texture generation

Conceal lost video packets

Restore a signal that has


been degraded
Noise removal
Echo cancellation

Choose specific input signals to


control a process
Chemical process control
Aerodynamic control
Slide 13

PSD - Ardik / Bima

Slide 15

Speech Processing

Speech coding/compression
Speech synthesis
Speech recognition
Slide 16

Some Properties of Speech

The blue--- s---p--o---------t i-s--on--the-- k--ey a---g--ai----n------

oo
in
blue
e
ee
o
s
kin
in
in
inagain
spot
key
key

Slide 17

Some Properties of Speech


Vowels

oo in blue

o in spot

ee in key

Quasi-periodic
Relatively high signal power
Consonants

s in spot

k in key

Non-periodic (random)
Relatively low signal power
Slide 18

e in again

Speech Coding
TRAU

MSC

64 kbits/s
22.8 kbits/s
BSC
13 kbits/s
BTS

Slide 19

Speech Coding Linear Prediction


Try to predict the current sample value;
Transmit the prediction error.
s(n)
A(z)

se(n)

d(n)

d(n)
+

sr(n)

+
se(n)

Slide 20

A(z)

Speech Coding Vocoder


Encoder
Original Speech
Analysis:
Voiced/Unvoiced decision
Pitch Period (voiced only)
Signal power (Gain)

Decoder

Pitch
Period

Signal Power
Pulse Train

V/U
Vocal Tract
Model

Random Noise

LPC-10:

Slide 21

Synthesized Speech

Text-to-Speech Synthesis
Input
text

To be or
not to be
that is the
question

Tu bee awr
nawt tu bee
dhat iz dhe
kwestchun

Text
normalization

Parsing

expands
abbreviations
dates, times,
money..etc

semantic &
syntactic parts
of speech
analysis of text

Prosody
rules
Apply word
stress, duration
and pitch

Waveform
generation
Phonetic-toacoustic
transformation

phonetic form

Pronunciation
phonetic description
of each word, dictionary
with letter-to-sound
rules as a back up

Synthesized
speech

Text-to-speech synthesis sounds very natural these days.


Slide 22

Speech Synthesis Applications

Speaking clocks
Spoken (variable) announcements
Talking emails + talking heads for mobile
Synthesis of location-based information
(e.g. traffic information)
Interactive systems (e.g. catalogue ordering,
Yellow Pages, ...)
Slide 23

Speech/Speaker Recognition
Speech Recognition What has been spoken?
Speaker dependent Recognition system trained
for a particular persons voice.
Speaker independent Recognition system
expected to deal with a wide variety of speakers.

Speaker Recognition Who has spoken?


Not easy
Sometimestherearenogapsbetweenwords.
Sometim esthereareg aps inthe mid dleofwords.

Accents, dialects and Stress eggsist.


Slide 24

Speech Recognition System

Phoneme
models

speech

Feature
extraction

Phoneme
recognition

Word
pronunciation

Word
recognition

Semantic
knowledge

Sentence
recognition

Syntactic
knowledge

Slide 25

decision

Dialogue
knowledge

Digital Audio
Standard music CD:

Sampling Rate: 44.1 kHz


16-bit samples
2-channel stereo
Data transfer rate = 21644,100 = 1.4 Mbits/s
1 hour of music = 1.43,600 = 635 MB

Slide 26

Audio Coding (Contd)


Key standards:
MPEG: Layers I, II, and III (MP3); AAC.
used in DAB, DVD

Dolby AC3, Dolby Digital, Dolby Surround.

Typical bit rates for 2-channel stereo:


64kbits/s to 384 kbits/s.

Subband- or transform-based, making use


of perceptual masking properties.
Slide 27

Audio Coding (Contd)


Typical 3/2 multichannel stereo configuration:
Surround
Right

Right

Centre
Surround
Left

Left

5.1 channels (3/2) with LFE channel:


Left, Right, Centre,
Left Surround, Right Surround,
Low Frequency Effects (LFE) (Reduced Bandwidth).
LFE loudspeaker can, in general, be placed anywhere in the
listening room.
Slide 28

Audio Coding Masking


Auditory Masking:
Spectral: Strong frequency components mask weaker
neighbouring frequency components.
Temporal: Strong temporal events mask recent and
future events.
Spectral Masking

Temporal Masking
SPL/dB

SPL/dB

freq/kHz

10ms

Slide 29

160ms

time

Masking Example

60

dB

50

40

30

20

10
200

300

400

500
Hz

Slide 30

600

700

800

Image/Video
Still Image Coding:
JPEG (Joint Photographic Experts Group):
Discrete Cosine Transform (DCT) based

JPEG2000: Wavelet Transform based

Video Coding:
MPEG (Moving Pictures Experts Group):
DCT-based,
Interframe and intraframe prediction,
Motion estimation.

Applications: Digital TV, DVD, etc.

Slide 31

JPEG Example
Original

JPEG (4:1)

JPEG (100:1)

Slide 32

Video compression: Example of a


packet loss & need to interpolate:
Packet loss means
a horizontal slice of
data is missing
The decoder holds
over the slice from
the previous frame
to conceal the loss.
Because of camera
pan, slice from
previous frame
doesnt line up well
Interpolating the missing data in the current frame using the data from
Slide
33
above & below would likely provide
better
visual concealment of the loss

Example: contrast enhancement


Original magnetic resonance
brain scan

Contrast enhancement using


histogram equalization

Slide 34

ECE161C: DSP II

Image processing and computer vision


Topics include:
image formation: cameras, radiometry,
and color
2D DSP, Discrete Cosine Transform
Filtering, Edge detection
Multiscale representations, texture
Least squares model fitting, motion
Statistics
Principal components and face detection
Video compression
Slide 35

Quantization of 24-bit true color


down to 8-bit color:
24-bit color original

8-bit color quantized version

Slide 36 the quantization banding on the right


A little random dithering would help mask

ECE 172A Introduction to


Intelligent Systems
Main topics covered in the course
include:
1.
2.
3.
4.
5.
6.
7.
8.

Introduction to Intelligent Systems and


Sensor-based Robots
Model-Based approach in perception
Image segmentation
Edge Detection
Region growing
Texture analysis
Object recognition and image
understanding
Extraction of 3-dimensional cues: passive
and active approaches

Input Image

Project (about 5 weeks long):


1.
2.
3.

Vehicle Detection & Re-identification


Person Detection and Tracking
Robust Image Classification
Slide 37

Image Classified as a
Day Image

Adaptive Filtering
Self-learning: Filter coefficients adapt in response
to training signal.
d(n)

x(n)

W(z)

e(n)

y(n)

Filter update: Least Mean Squares (LMS) algorithm


w(n 1) w(n) 2e(n)x(n)
Slide 38

Adaptive Filtering Applications


Echo cancellation (telephone lines)
Used in modems (making Internet access possible!!)

Acoustic echo cancellation


Hands-free telephony

Adaptive equalization
Active noise control
Medical signal processing
e.g. foetal heart beat monitoring

Slide 39

Some Other Application Areas


Image analysis, e.g:
Face recognition,
Optical Character Recognition (OCR);

Restoration of old image, video, and audio signals;


Analysis of RADAR data;
Analysis of SONAR data;
Data transmission (modems, radio, echo
cancellation, channel equalization, etc.);
Storage and archiving;
Control of electric motors.
Slide 40

DSP Devices & Architectures


Selecting a DSP several choices:
Fixed-point;
Floating point;
Application-specific devices
(e.g. FFT processors, speech recognizers,etc.).

Main DSP Manufacturers:


Texas Instruments (http://www.ti.com)
Motorola (http://www.motorola.com)
Analog Devices (http://www.analog.com)

Slide 41

Typical DSP Operations


Filtering
Energy of Signal
Frequency transforms

L 1

y ( n)

ai x(n i )
i 0

Pseudo C code
for (n=0; n<N; n++)
{
s=0;
for (i=0; i<L; i++)
{
s += a[i] * x[n-i];
}
y[n] = s;
}

Slide 42

Traditional DSP Architecture


X RAM

ai

x(n-i)

Y RAM

Multiply/Accumulate

Accumulator

y(n)

N.B. Most modern DSPs have more advanced features.


Slide 43

DSP at EPSON

Energy-saving Firmware
EPSON Scotland Design Centre develops a
broad range of technologies to minimize
power consumption and maximize cost
effectiveness in mobile DSP applications.

Slide 44

SDC Core Skills


DSP

Speech

Audio

Mobile

Services

Baseband processing

Administration

System modelling

Speech compression

MP3

Firmware design

Speech Recognition

Other digital audio

Channel coding

CAD Tools

System Integration

Speech synthesis

Performance
Assessment

AMR Coding

Computer
&
Networking

CPU (Oak, ARM)


H/w & S/w
Co-design

Speech enhancement
Speech Testing

System on Chip (SoC)

Slide 45

SDC Firmware Development


Algorithm
Definition
Floating-point
and
Fixed-point
Co-Simulation

COSSAP
Matlab ...
Behavioural,
RTL, Logic ...

Co-Design
Implementation

Co-Verification MCU, DSP ...

Product Development

With Barcelona and Tokyo


Design Centres

Slide 46

Summary & Conclusions


DSP used in a wide range of everyday applications
Looked at:
Speech coding; Speech synthesis & recognition;
Image/Video;
Adaptive filtering.

Other areas include:

Image analysis (e.g. face recognition, OCR, etc.);


RADAR/SONAR;
Data transmission and reception;
And many more..!!
Slide 47

You might also like