Professional Documents
Culture Documents
Circuits
Jan M. Rabaey
AAnantha
Design
Perspective
Chandrakasan
Borivoje Nikolic
Semiconductor
Memories
December 20, 2002
Digital Integrated Circuits2nd
Memories
Chapter Overview
Memory Classification
Memory Architectures
The Memory Core
Periphery
Reliability
Case Studies
Digital Integrated Circuits2nd
Memories
Random
Access
Non-Random
Access
SRAM
FIFO
DRAM
LIFO
Non-Volatile
Read-Write
Memory
EPROM
2
E PROM
Read-Only Memory
Mask-Programmed
Programmable (PROM)
FLASH
Shift Register
CAM
Memories
Read access
Write cycle
WRITE
Write access
Data valid
DATA
Data written
Memories
SN2 2
SN2 1
M bits
S0
Word 0
Word 1
Word 2
Storage
cell
Word 0
A0
Word 1
A1
Word 2
Storage
cell
AK 2 1
WordN2 2
WordN2 2
WordN2 1
WordN2 1
K 5 log2N
Input-Output
(M bits)
Intuitive architecture for N x M memory
Too many select signals:
N words == N select signals
Input-Output
(M bits)
Decoder reduces the number of select signals
K = log2N
Memories
AK
AK1 1
AL 2 1
Bit line
Row Decoder
2L 2 K
Word line
M.2K
Sense amplifiers / Drivers
A0
AK2 1
Column decoder
Amplify swing to
rail-to-rail amplitude
Selects appropriate
word
Input-Output
(M bits)
Memories
Blocki
BlockP 2 1
Row
address
Column
address
Block
address
Block selector
Global
amplifier/driver
I/O
Advantages:
1. Shorter wires within blocks
2. Block address activates only 1 block => power savings
Digital Integrated Circuits2nd
Memories
Z-address
buffer
X-address
buffer
Transfer gate
Column decoder
Sense amplifier and write driver
CS, WE
buffer
I/O
buffer
x1/x4
controller
Y-address
buffer
[Hirose90]
X-address
buffer
Memories
Commands
Comparand
Priority Encoder
CAM Array
2 words3 64 bits
9
2 Validity Bits
Mask
Address Decoder
I/O Bufers
Contents-Addressable Memory
Memories
Row Address
Column Address
RAS
Address
Bus
Address
Address transition
initiates memory operation
CAS
RAS-CAS timing
DRAM Timing
Multiplexed Adressing
SRAM Timing
Self-timed
Memories
BL
BL
VDD
WL
WL
WL
BL
WL
BL
WL
BL
WL
GND
Diode ROM
MOS ROM 1
MOS ROM 2
Memories
MOS OR ROM
BL[0]
BL[1]
BL[2]
BL[3]
WL[0]
V DD
WL[1]
WL[2]
V DD
WL[3]
V bias
Pull-down loads
Memories
WL[0]
GND
WL [1]
WL [2]
GND
WL [3]
BL [0]
BL [1]
BL [2]
BL [3]
Memories
Polysilicon
Metal1
Diffusion
Metal1 on Diffusion
Memories
Programmming using
the Contact Layer Only
Polysilicon
Metal1
Diffusion
Metal1 on Diffusion
Memories
BL [1]
BL [2]
BL [3]
WL [0]
WL [1]
WL [2]
WL [3]
Memories
Programmming using
the Metal-1 Layer Only
No contact to VDD or GND necessary;
drastically reduced cell size
Loss in performance compared to NOR ROM
Polysilicon
Diffusion
Metal1 on Diffusion
Memories
Programmming using
Implants Only
Polysilicon
Threshold-altering
implant
Metal1 on Diffusion
Digital Integrated Circuits2nd
Memories
BL
rword
WL
Cbit
cword
Memories
BL
CL
r bit
WL
r word
cbit
cword
Memories
WL
K cells
Memories
f pre
Precharge devices
WL [0]
GND
WL [1]
WL [2]
GND
WL [3]
BL [0]
BL [1]
BL [2]
BL [3]
Memories
Non-Volatile Memories
The Floating-gate transistor (FAMOS)
Floating gate
Gate
Source
Drain
G
tox
tox
n+
Substrate
Device cross-section
n+_
Schematic symbol
Memories
0V
10 V 5 V 20 V
S
Avalanche injection
25 V
S
5V
0V
D
Removing programming
voltage leaves charge trapped
2 2.5 V
S
5V
D
Programming results in
higher V T .
Memories
A Programmable-Threshold Transistor
0-state
1-state
ON
DV T
OFF
V WL
V GS
Memories
FLOTOX EEPROM
Gate
Floating gate
I
Drain
Source
2030 nm
V GD
-10 V
10 V
n1
n1
Substrate
p
10 nm
FLOTOX transistor
Fowler-Nordheim
I-V characteristic
Memories
EEPROM Cell
BL
WL
VDD
Memories
Flash EEPROM
Control gate
Floating gate
erasure
n1 source
programming
n1 drain
p-substrate
Memories
Flash
Digital Integrated Circuits2nd
EPROM
Courtesy Intel
Memories
12 V
BL 0
array
BL 1
0V
WL 0
0V
WL 1
open
Digital Integrated Circuits2nd
open
Memories
BL 0
BL 1
6V
12 V
WL 0
0V
WL 1
6V
0V
Memories
BL 0
1V
5V
WL 0
0V
WL 1
1V
BL 1
0V
Memories
Gate
Unit Cell
ONO
Gate
Oxide
FG
Source line
(Diff. Layer)
Courtesy Toshiba
Memories
Word lines
Active area
STI
Memories
Memories
DYNAMIC (DRAM)
Periodic refresh required
Small (1-3 transistors/cell)
Slower
Single Ended
Digital Integrated Circuits2nd
Memories
Q
M1
BL
M4
Q
M6
M3
BL
Memories
V DD
M4
BL
Q= 0
M5
V DD
Cbit
M1
Q= 1
V DD
BL
M6
V DD
Cbit
Memories
1
0.8
0.6
0.4
0.2
0
0
0.5
1 1.2 1.5 2
Cell Ratio (CR)
2.5
Memories
Q= 1
M1
BL = 1
M6
Q= 0
VDD
BL = 0
Memories
Memories
6T-SRAM Layout
VDD
M2
M4
Q
M1
M3
GND
M5
BL
M6
WL
BL
Memories
V DD
RL
M1
M2
M4
BL
Memories
SRAM Characteristics
Memories
BL 2
WWL
WWL
RWL
M3
X
M1
CS
M2
RWL
V DD 2 V T
X
BL 1
BL 2
V DD
DV
V DD 2 V T
Memories
3T-DRAM Layout
BL2
BL1
GND
RWL
M3
M2
WWL
M1
Memories
WL
Read 1
WL
M1
CS
V DD 2 V T
X GND
V DD
BL
V DD /2
V
sensing
CBL
Memories
Memories
V(1)
V PRE
DV(1)
V(0)
Sense amp activated
Word line activated
Memories
M 1 word
line
Field Oxide
n+
Poly
SiO2
Inversion layer
induced by
plate bias
Cross-section
Diffused
bit line
Polysilicon
gate
Polysilicon
plate
Layout
Memories
Memories
Cell plate
Cell Plate Si
Capacitor Insulator
Refilling Poly
Transfer gate
Isolation
Storage electrode
Si Substrate
Trench Cell
Digital Integrated Circuits2nd
Stacked-capacitor Cell
Memories
Bit
Bit
Bit
Bit
Word
CAM
M4
CAM
Word
Word
CAM
Bit
CAM
M8
M9
M6
M7
S
M3
Match
int
M5
S
M2
M1
Memories
CAM
SRAM
ARRAY
ARRAY
Input Drivers
Address
Tag
Hit
R/W
Data
Memories
Periphery
Decoders
Sense Amplifiers
Input/Output Buffers
Control / Timing Circuitry
Memories
Row Decoders
Collection of 2M complex logic gates
Organized in regular and dense fashion
(N)AND Decoder
NOR Decoder
Memories
Hierarchical Decoders
Multi-stage implementation improves performance
WL 1
WL 0
A 0A 1 A 0A 1 A 0A 1 A 0A 1
A 2A 3 A 2A 3 A 2A 3 A 2A 3
A1 A0
A0
A1
A3 A2
A2
A3
Memories
Dynamic Decoders
Precharge devices
GND
VDD
GND
WL 3
VDD
WL 3
WL 2
WL 2
VDD
WL 1
WL 1
V DD
WL 0
WL 0
VDD
A0
A0
A1
A1
A0
A0
A1
A1
Memories
A0
S0
S1
S2
A1
S3
Advantages: speed (tpd does not add to overall memory access time)
Only one extra transistor in signal path
Disadvantage: Large transistor count
Memories
A1
A1
D
Number of devices drastically reduced
Delay increases quadratically with # of sections; prohibitive for large decoders
Solutions: buffers
progressive sizing
combination of tree and pass transistor approaches
Memories
V DD
V DD
WL 0
R
V DD
V DD
WL 1
f
V DD
WL 2
f
V DD
Memories
Sense Amplifiers
V
tp = C
---------------Iav
large
make V as small
as possible
small
s.a.
input
output
Memories
M4
y
M1
bit
SE
M2
Out
bit
M5
Directly applicable to
SRAMs
Digital Integrated Circuits2nd
Memories
PC
BL
V DD
BL
V DD
EQ
WL i
M3
M1
SE
V DD
M4
M2
M5
2
x
2
x
x
SE
SE
SRAM cell i
Dif.
x Sense2
x
Amp
V DD
Output
SE
Output
(a) SRAM sensing scheme
Memories
BL
VDD
SE
SE
Memories
Charge-Redistribution Amplifier
V ref
VL
M2
M3
M1
Clarge
VS
C small
Transient Response
2.5
Concept
2.0
VS
V in
1.5
1.0
0.5
0.0
0.0
VL
V ref5 3V
1.00
2.00
time(nsec)
3.00
Charge-Redistribution Amplifier
V
EPROM
DD
SE
Load
M4
Out
V casc
M3
Cascode
device
Cout
Ccol
WLC
Column
decoder
M2
BL
WL
Digital Integrated Circuits2nd
M1
CBL
EPROM
array
Memories
Single-to-Differential Conversion
WL
BL
x
Cell
Dif.
S.A.
2
x
1
2
V ref
Output
Memories
L1
L0
V DD
R0
R1
SE
BLL
CS
CS
BLR
CS
Dummy cell
SE
CS
CS
CS
Dummy cell
Memories
2
BL
BL
1
BL
BL
t (ns)
t (ns)
reading 1
reading 0
3
EQ
WL
2
SE
1
t (ns)
control signals
Memories
Voltage Regulator
VDD
Mdrive
VDL
VREF
Equivalent Model
Vbias
VREF
Mdrive
VDL
Digital Integrated Circuits2nd
Memories
Charge Pump
V DD
CLK
VB
M1
B
2V DD 2 V T
V DD 2 V T
0V
Cpump
M2
V load
Cload
V load
0V
Memories
DRAM Timing
Memories
RDRAM Architecture
Bus
Clocks
Data
bus
Column
Row
k3 l
memory
array
demux
packet dec.
demux
packet dec.
Memories
A0
DELAY
td
A1
DELAY
td
A N2 1
DELAY
td
ATD
ATD
Memories
Memories
100
CD(1F)
CS(1F)
Q S(1C)
10
VDD (V)
Q S 5 CS VDD /2
V smax5 QS/(C S 1 CD )
4K
From [Itoh01]
Memories
substrate Adjacent BL
a-particles
WL
leakage
CS
electrode
Ccross
Memories
EQ
WL 1
WL 0
BL
WL
CWBL D
CWBL
WL D
WL 1
WL 0
BL
CBL
C
Sense
Amplifier
CBL
Memories
Folded-Bitline Architecture
WL 1
BL
WL 0 C
WBL
WL 0
WL D
WL D
CBL
C
BL
WL 1
x
C
CBL
Sense
EQ Amplifier
x
CWBL
Memories
Transposed-Bitline Architecture
Ccross
BL 9
BL
SA
BL
BL 99
(a) Straightforward bit-line routing
Ccross
BL 9
BL
SA
BL
BL 99
(b) Transposed bit-line architecture
Digital Integrated Circuits2nd
Memories
V DD
BL
n1
SiO2
1
1
2
2
1
1
2
2
Memories
Yield
Memories
Redundancy
Row
Address
Redundant
rows
:
Redundant
columns
Memory
Array
Column Decoder
Fuse
Bank
Column
Address
Memories
Error-Correcting Codes
Example: Hamming Codes
e.g. B3 Wrong
with
1
1
=3
Memories
Memories
CHIP
nCDE V INT f
selected miact
C PT V INT f
I DCP
n
ROW
DEC
PERIPHERY
From [Itoh00]
Memories
0.13m m CMOS
Ileakage
900n
700n
500n
Factor 7
300n
0.18m m CMOS
100n
0.00
.600
1.20
1.80
VDD
Memories
low-threshold transistor
V DDL
sleep
V DD,int
sleep
V DD,int
SRAM
cell
SRAM
cell
sleep
SRAM
cell
SRAM
cell
SRAM
cell
V SS,int
SRAM
cell
I ACT
Current (A)
21
10
I AC
22
10
102 3
Cycle time : 150 ns
T 5 75 C,S
24
10
I DC
25
10
26
10
15M
64M
255M
1G
4G
15G
64G
1.0
0.8
0.16
0.13
Capacity (bit)
3.3
2.5
2.0
1.5
1.2
0.40
0.32
0.24
0.19
From [Itoh00]
Memories
Case Studies
Programmable
Logic Array
SRAM
Flash
Memory
Memories
Main difference
ROM: fully populated
PLA: one element per minterm
Note: Importance of PLAs has drastically reduced
1. slow
2. better software techniques (mutli-level logic
synthesis)
But
Memories
GND
GND
V DD
GND
GND
GND
GND
V DD
X0
X0
X1
AND-plane
X1
X2
X2
f0
f1
OR-plane
Memories
Dynamic PLA
f AND
V DD
GND
f OR
f OR
f AND
V DD
X0
X0
X1
X1
AND-plane
X2
X2
f0
f1 GND
OR-plane
Memories
f AND
tpre teval
f AND
f OR
f OR
Memories
PLA Layout
AndPlane
VDD
x0 x0 x1 x1 x2 x2
Pullupdevices
Digital Integrated Circuits2nd
OrPlane
GND
f0 f1
Pullupdevices
Memories
4 Mbit SRAM
Hierarchical Word-line Architecture
Global word line
Sub-global word line
Local
word line
Block group
select
Block
select
Local
word line
Memory cell
Block 0
Block 1
Block
select
Block 2...
Memories
Bit-line Circuitry
Block
select
Bit-line
load
ATD
BEQ
Local WL
Memory cell
B/T
B/T
CD
CD
CD
I/O line
Sense amplifier
I/O
I/O
Memories
I /O
ATD
SEQ
Block
select
ATD
BEQ
BS
SA
BS
Vdd
I/O Lines
GND
SA
SEQ
SEQ
SEQ
SEQ
SEQ
Vdd
DATA
Dei
SA, SA
GND
DATA
BS
Data-cut
Memories
SGD
WL31
WL0
SGS
Block0
Block1023
BLT0
BLT1
Data Caches
(10241 32)3 8
Block1023
Sense Latches
(10241 32)3 8
I/O
Block0
Sense Latches
(10241 32)3 8
Data Caches
(10241 32)3 8
From [Nakamura02]
Memories
108
106
104
102
Result of 4 times
program
0V 1V 2V 3V 4V
Vt of memory cells
(a)
1V
2V
3V
4V
Vt of memory cells
Evolution of thresholds
100
0V
Final Distribution
From [Nakamura02]
Memories
Charge pump
2kB Page buffer & cache
10.7mm
11.7mm
Digital Integrated Circuits2nd
From [Nakamura02]
Memories
Technology
Technology
0.13m
0.13m p-sub
p-sub CMOS
CMOStriple-well
triple-well
1poly,
1poly,1polycide,
1polycide, 1W,
1W,2Al
2Al
Cell
0.077m2
Cellsize
size
0.077m2
Chip
125.2mm2
Chipsize
size
125.2mm2
Organization
Organization 2112
2112xx8b
8bxx64
64page
pagexx1k
1kblock
block
Power
Power supply
supply 2.7V-3.6V
2.7V-3.6V
Cycle
50ns
Cycle time
time
50ns
Read
25s
Readtime
time
25s
Program
Programtime
time 200s
200s //page
page
Erase
2ms
Erasetime
time
2ms// block
block
From [Nakamura02]
Memories
Memories
From [Itoh01]
Memories
From [Itoh01]
Memories
Memories