1 Copyright © 1998 Berkeley Design Technology, Inc. 2 © 2000 Berkeley Design Technology, Inc.
Today's DSP "Killer Apps"
DSP Tasks for Microprocessors
TMS32010 Filter Code
Features Common to Most DSP Processors
• Hardware support for managing numeric fidelity:
  - Shifters
  - Guard bits
  - Saturation

Single instruction with parallel moves:
    mac x0,y0,a x:(r0)+,x0 y:(r4)+,y0

• Other operations (e.g., saturation, rounding) typically take multiple cycles

Multi-instruction sequence:
    mov *r0,x0
    mov *r1,y0
    mpy x0,y0,a
    add a,b
    mov y0,*r2
    inc r0
    inc r1
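
The guard-bit and saturation support mentioned above can be sketched in C. This is a minimal illustration, not code from the slides: the 64-bit accumulator stands in for an accumulator with guard bits, and `saturate32` and `mac_saturated` are hypothetical helper names. A processor without saturation hardware would have to spend extra instructions on the clamping step.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: clamp a wide accumulator to the signed 32-bit range,
 * as saturation hardware would do instead of letting the value wrap around. */
static int32_t saturate32(int64_t acc)
{
    if (acc > INT32_MAX) return INT32_MAX;
    if (acc < INT32_MIN) return INT32_MIN;
    return (int32_t)acc;
}

/* Multiply-accumulate n pairs of 16-bit samples.
 * The 64-bit accumulator plays the role of guard bits (an assumption of this
 * sketch): intermediate sums may exceed 32 bits without overflowing, and the
 * result is saturated only once, at the end. */
static int32_t mac_saturated(const int16_t *x, const int16_t *h, int n)
{
    assert(n >= 0);
    int64_t acc = 0;
    for (int i = 0; i < n; i++)
        acc += (int32_t)x[i] * (int32_t)h[i];
    return saturate32(acc);
}
```

Summing three full-scale products exceeds the 32-bit range, so the sketch returns `INT32_MAX` rather than a wrapped value.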
Execution Control
Specialized I/O for DSP
Computational demands: Multiple parallel execution units, hardware acceleration of common DSP functions
Numeric fidelity: Accumulator registers, guard bits, saturation hardware
High memory bandwidth: Harvard architecture, support for parallel moves
Predictable data access patterns: Specialized addressing modes, e.g., modulo addressing, bit-reversed addressing

Execution-time locality: Hardware-assisted zero-overhead looping, specialized instruction caches, streamlined interrupt handling
MAC-centricity: Single-cycle multiplier(s) or MAC unit(s), MAC instruction
Streaming data: No data cache; powerful DMA
Real-time constraints: Few dynamic features, on-chip RAM instead of cache
Standards: Rounding, saturation
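
The two specialized addressing modes named above, modulo addressing and bit-reversed addressing, can be illustrated in C. This is a sketch under assumptions: `modulo_next` and `bit_reverse` are hypothetical names, and the functions compute in software what a DSP address generation unit typically does as part of the memory access.

```c
#include <assert.h>

/* Modulo addressing: advance a circular-buffer index by one and wrap at len.
 * This keeps a delay line in a fixed block of memory with no data copying. */
static unsigned modulo_next(unsigned idx, unsigned len)
{
    assert(len > 0 && idx < len);
    return (idx + 1 == len) ? 0 : idx + 1;
}

/* Bit-reversed addressing: reverse the low `bits` bits of idx.
 * Radix-2 FFTs produce or consume data in this order. */
static unsigned bit_reverse(unsigned idx, unsigned bits)
{
    unsigned r = 0;
    for (unsigned b = 0; b < bits; b++) {
        r = (r << 1) | (idx & 1u);  /* move the lowest bit of idx into r */
        idx >>= 1;
    }
    return r;
}
```

For an 8-point FFT (3 address bits), index 1 (binary 001) maps to 4 (binary 100), and index 3 (011) maps to 6 (110).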
[Figure: diagram labeled "Memory" and a chart with vertical axis from 0 to 12; recoverable labels: ARM7TDMI, value 7]

• Other 2nd-generation processors: Analog Devices ADSP-218x

    move #Xaddr,r0
    move #Haddr,r4
    rep #Ntaps
    mac x0,y0,a x:(r0)+,x0 y:(r4)+,y0
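
For readers unfamiliar with the assembly, the rep/mac sequence above can be read roughly as the C loop below. This is a sketch under assumptions, not a cycle-accurate model: it uses 16-bit integer samples and a 64-bit accumulator rather than the processor's native word sizes, it loads operands before multiplying instead of overlapping the loads with the previous multiply, and `fir_mac_loop` is a hypothetical name.

```c
#include <assert.h>
#include <stdint.h>

/* Sum of products over ntaps samples and coefficients.
 * r0 and r4 mirror the address registers in the assembly; the post-increment
 * corresponds to the (r0)+ and (r4)+ addressing in the parallel moves. */
static int64_t fir_mac_loop(const int16_t *x, const int16_t *h, int ntaps)
{
    assert(ntaps > 0);
    int64_t a = 0;              /* accumulator "a" */
    const int16_t *r0 = x;      /* move #Xaddr,r0 */
    const int16_t *r4 = h;      /* move #Haddr,r4 */
    for (int i = 0; i < ntaps; i++) {   /* rep #Ntaps */
        int16_t x0 = *r0++;     /* x:(r0)+,x0 */
        int16_t y0 = *r4++;     /* y:(r4)+,y0 */
        a += (int32_t)x0 * y0;  /* mac x0,y0,a */
    }
    return a;
}
```

On the DSP, the loop control, both loads, both pointer updates, and the multiply-accumulate are folded into the single repeated instruction; in C each appears as a separate operation.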
Third Generation (1995):
Ex: Motorola DSP56301, TI TMS320C541

Fourth Generation (1997-2000):
Ex: TMS320C6201/6701, LSI401Z, MMX Pentium
[Figure: two 32-bit input registers, each split into two 16-bit halves, feeding paired +, −, and × units]

• Each instruction performs lots of work
• Algorithms, data organization must be amenable to data-parallel processing
  - Programmers must be creative, and sometimes pursue alternative algorithms
  - Reorganization penalties can be significant
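
The idea of treating one 32-bit register as two independent 16-bit values can be emulated in plain C. The sketch below is an illustration only: `pack16x2` and `add16x2` are hypothetical names, and the addition wraps within each lane rather than saturating, which is an assumption of this example and not a statement about any particular instruction set.

```c
#include <assert.h>
#include <stdint.h>

/* Pack two 16-bit values into one 32-bit word (high lane, low lane). */
static uint32_t pack16x2(uint16_t hi, uint16_t lo)
{
    return ((uint32_t)hi << 16) | lo;
}

/* Add the two 16-bit lanes of a and b independently.
 * Each lane is masked so that a carry out of the low lane cannot
 * spill into the high lane. */
static uint32_t add16x2(uint32_t a, uint32_t b)
{
    uint32_t lo = (a + b) & 0x0000FFFFu;             /* low lane, carry dropped */
    uint32_t hi = ((a >> 16) + (b >> 16)) & 0xFFFFu; /* high lane */
    return (hi << 16) | lo;
}
```

SIMD hardware performs both lane additions in one operation; the emulation shows why the data must already sit side by side in the register, which is where the reorganization penalties come from.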
SIMD Challenges
• Loss of generality
  - Each iteration of a loop processes N elements (typically 4 ≤ N ≤ 8)
  - Amplified if loops are unrolled for speed
• High program memory usage
  - Re-arranging data for SIMD processing
  - Merging partial results
  - Loop unrolling
• Often, only fixed-point supported

High-Performance GPPs with SIMD
• Most high-performance GPPs targeting desktop applications are superscalar architectures
  - Pentium, PowerPC
• Often have many dynamic features to accelerate performance, enable higher clock speeds
  - Sophisticated, multi-level caches
  - Branch prediction
  - Speculative execution
• Most offer SIMD extensions to increase performance on DSP and multimedia applications (audio, video)
  - MMX/SSE, AltiVec
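
Two of the SIMD challenges listed above, loops that process N elements per iteration and the need to merge partial results, show up even in a scalar C emulation. The sketch below assumes N = 4 and uses the hypothetical names `LANES` and `dot_lanes`; it is not tied to MMX, SSE, or AltiVec.

```c
#include <assert.h>
#include <stdint.h>

#define LANES 4  /* assumed SIMD width N for this sketch */

/* Dot product computed the way a SIMD loop would organize it:
 * each iteration handles LANES elements and keeps one partial sum per lane. */
static int64_t dot_lanes(const int16_t *x, const int16_t *h, int n)
{
    assert(n >= 0);
    int64_t partial[LANES] = {0, 0, 0, 0};
    int i = 0;

    /* Main loop: only whole groups of LANES elements fit. */
    for (; i + LANES <= n; i += LANES)
        for (int k = 0; k < LANES; k++)
            partial[k] += (int32_t)x[i + k] * h[i + k];

    /* Merge the per-lane partial results into one sum. */
    int64_t sum = 0;
    for (int k = 0; k < LANES; k++)
        sum += partial[k];

    /* Leftover elements when n is not a multiple of LANES. */
    for (; i < n; i++)
        sum += (int32_t)x[i] * h[i];

    return sum;
}
```

The merge step and the leftover loop are extra code that a plain scalar loop does not need, which is one source of the higher program memory usage.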
• These processors can often execute DSP tasks faster than DSP processors
• So why do people still use DSPs?
  - Price
  - Power consumption

• GPPs designed for embedded applications are starting to address DSP needs
• Embedded GPPs typically don't have the advanced features that affect execution-time predictability, so are easier to use for DSP
• e.g., TMS320C27xx
Hybrid DSP/Microcontrollers
Advantages, Disadvantages
[Figure: chart by processor generation, log-scale vertical axis from 0.1 to 1000; application marker: Speech]

Generation   Year   Processor     Rating     Chart value
1st gen      1982   TMS32010      5 MIPS     0.5
2nd gen      1987   DSP56001      13 MIPS    4
3rd gen      1995   TMS320C54x    50 MIPS    13
4th gen      2000   TMS320C6201   250 MHz    123
[Figure: the same chart by processor generation, with application markers Speech, “2G” Wireless, and Audio]
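Processor DSP Speed: BDTImarks (Higher is Better)
[Figure: chart by processor generation (1st gen 1982, 2nd gen 1987, 3rd gen 1995, 4th gen 2000), log-scale vertical axis from 0.1 to 1000; values 0.5, 4, 13, 123; application markers Speech, “2G” Wireless, Audio, and “3G” Wireless]

Execution Times: Complex Block FIR Filter Benchmark (lower is better)
High-Performance DSPs vs High-Performance CPU
[Figure: chart with vertical axis in microseconds, from 0 to 16; bar values not recoverable]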
• DSP processor performance has increased by a factor of about 150x over the past 15 years (~40% per year)
• Multi-issue architectures dominate the field of new high-performance processors
  - But conventional DSPs still make up most of the volume shipping today
• General-purpose processors are increasingly tackling DSP, providing competition for dedicated DSP processors
• Users of processors for DSP will have an expanding array of choices
• Compiler-friendliness is an increasingly important factor...
  - ... as time-to-market pressures increase and applications become larger
• Selecting processors requires careful, application-specific analysis (see www.BDTI.com)

http://www.BDTI.com: Collection of BDTI's papers on DSP processors, tools, and benchmarking
http://www.eg3.com/dsp: Links to other good DSP sites
comp.dsp: Usenet group
Microprocessor Report: For info on newer DSPs
DSP Processor Fundamentals, BDTI: Textbook on DSP processors
Or, join BDTI...We're hiring!