Professional Documents
Culture Documents
slide 2
Outline
Classification
ILP Architectures
Data Parallel Architectures
Process level Parallel Architectures
Issues in parallel architectures
Cache coherence problem
Interconnection networks
Anshul Kumar, CSE
slide 3
Outline
Classification
ILP Architectures
Flynns
[66]
Data Parallel
Architectures
Fengs
[72]
Process level
Parallel Architectures
Hndlers
[77]
Modern
(Sima, Fountain & Kacsuk)
Issues in parallel
architectures
Cache coherence problem
Interconnection networks
Anshul Kumar, CSE
slide 4
Flynns Classification
Architecture Categories
SISD
SIMD
MISD
MIMD
slide 5
SISD
IS
IS
DS
slide 6
SIMD
P
IS
DS
C
P
DS
slide 7
MISD
IS
IS
DS
M
IS
IS
DS
slide 8
MIMD
IS
IS
DS
M
IS
IS
DS
slide 9
Fengs Classification
16K
MPP
256
bit slice
length 64
STARAN
PEPE
IlliacIV
C.mmP
16
1
1
PDP11
IBM370
16
32
word length
CRAY-1
64
slide 10
Hndlers Classification
< K x K , D x D , W x W >
control
data
word
dash degree of pipelining
TI - ASC
CDC 6600
C.mmP
PEPE
Cray-1
<1, 4, 64 x 8>
<1, 1 x 10, 60> x <10, 1, 12> (I/O)
<16,1,16> + <1x16,1,16> + <1,16,16>
<1 x 3, 288, 32>
<1, 12 x 8, 64 x (1 ~ 14)>
slide 11
Modern Classification
Parallel
architectures
Data-parallel
Function-parallel
architectures
architectures
slide 12
Vector
Associative
architectures
And neural
SIMDs
Systolic
architectures
architectures
slide 13
Thread level
Parallel Arch
Process level
Parallel Arch
(MIMDs)
Distributed
Memory
MIMD
Shared
Memory
MIMD
slide 14
Outline
Classification
ILP Architectures
Data Parallel Architectures
Pipelining
Process level
Parallel Architectures
VLIW
Superscalar
Issues in parallel
architectures
Cache coherence problem
Interconnection networks
Anshul Kumar, CSE
slide 15
Pipelining
Simple multicycle design :
resource sharing across cycles
all instructions may not take same cycles
IF
RF EX/AG M
WB
slide 16
Hazards in Pipelining
Procedural dependencies => Control hazards
conditional and unconditional branches, calls/returns
slide 17
Pipeline Performance
T
S stages
Frequency of interruptions - b
CPI = 1 + (S - 1) * b
Time = CPI * T / S
Anshul Kumar, CSE
slide 18
Fetch
memory
Unit
FU
FU
FU
Register file
multi-operation instruction
slide 19
Fetch
memory
Unit
and issue
unit
Multiple instruction
FU
FU
FU
Register file
Funtional Unit
slide 20
slide 21
FU
FU
Register file
Instruction encoding
Scalability: Access time, area, power consumption
sharply increase with number of register ports
Anshul Kumar, CSE
slide 22
Preserving the
sequential
consistency of
exception
processing
slide 23
Outline
Classification
ILP Architectures
Data Parallel Architectures
Process level Parallel Architectures
SIMD Processors
Issues in parallel
Vectorarchitectures
Processors
Associative
Processors
Cache coherence
problem
Systolic Arrays
Interconnection networks
Anshul Kumar, CSE
slide 24
Vector Processors
Uni-processors with vector instructions
Associative Processors
SIMD like processors with associative memory
Systolic Arrays
Application specific VLSI structures
Anshul Kumar, CSE
slide 25
B11 B12 0 0 0 0
B B B 0 0 0
21 22 23
slide 26
T=0
B31
A23
A22
A31
B21
A12
A21
A11
B11
B12
Outline
Classification
ILP Architectures
Data Parallel Architectures
Process level Parallel Architectures
Issues in parallel architectures
MIMD Processors
Cache coherence
problem
- Shared
Memory
- Distributed
Interconnection
networks Memory
Anshul Kumar, CSE
slide 28
Function-parallel
architectures
Thread
level PAs
Process
level PAs
(MIMDs)
Distributed
Memory
MIMD
Shared
Memory
MIMD
slide 29
MIMD Architectures
Design Space
Extent of address space sharing
Location of memory modules
Uniformity of memory access
slide 30
Outline
Classification
ILP Architectures
Users
perspective
Data Parallel
Architectures
Architects perspective
Process level Parallel Architectures
Issues in parallel architectures
Cache coherence problem
Interconnection networks
Anshul Kumar, CSE
slide 31
slide 32
Concurrent
control flow
Functional or
logic program
Vector/array
operations
Concurrent
tasks/processes/threads/objects
With shared variables
or message passing
Anshul Kumar, CSE
Relationship between
programming model
and architecture ?
slide 33
slide 34
Outline
Classification
ILP Architectures
Coherence Protocols
Bus or directory based
Data Parallel -Architectures
- Invalidate or update
Process level Parallel Architectures
- Definition of states
Issues in parallel architectures
Cache coherence problem
Interconnection networks
Anshul Kumar, CSE
slide 35
slide 36
Outline
Classification
ILP Architectures
Data Parallel Architectures
Process level Parallel Architectures
Switching and control
Issues in parallel
architectures
Topology
Cache coherence problem
Interconnection networks
Anshul Kumar, CSE
slide 37
Interconnection Networks
Architectural Variations:
Topology
Direct or Indirect (through switches)
Static (fixed connections) or Dynamic (connections
established as required)
Routing type store and forward/worm hole)
Efficiency:
Delay
Bandwidth
Cost
Anshul Kumar, CSE
slide 38
Books
D. Sima, T. Fountain, P. Kacsuk, "Advanced Computer
Architectures : A Design Space Approach", Addison Wesley, 1997.
M.J. Flynn, "Computer Architecture : Pipelined and Parallel
Processor Design", Narosa Publishing House/ Jones and Bartlett,
1996.
D.A. Patterson, J.L. Hennessy, "Computer Architecture : A
Quantitative Approach", Morgan Kaufmann Publishers, 2002.
K. Hwang, "Advanced Computer Architecture : Parallelism,
Scalability, Programmability", McGraw Hill, 1993.
H.G. Cragon, "Memory Systems and Pipelined Processors",
Narosa Publishing House/ Jones and Bartlett, 1998.
D.E. Culler, J.P Singh and Anoop Gupta, "Parallel Computer
Architecture, A Hardware/Software Approach", Harcourt Asia /
Morgan Kaufmann Publishers, 2000.
slide 39