Professional Documents
Culture Documents
INTRODUCTION
In Recent years, wireless sensor networks (WSNs) have been evolving at an accelerated
pace. To build WSNs, the Zigbee protocol, in which medium access control layer and
physical layer are defined by IEEE 802.15.4, has been generally used. Because the Zigbee
protocol has low data rate and power specifications, its use can prolong battery life. This
feature makes the Zigbee protocol preferred over other technologies such as 802.11. And
Bluetooth. In addition, most Zigbee systems-on-chips (SoCs) support a number of power
modes including a standby mode that occupies the system 99.9% of the time to maximize the
battery life.
Thus, standby power reduction is extremely important for minimizing the power
consumption of Zigbee SoCsstandby power consumption becomes more critical as the
process technology scales down because the leakage current increases exponentially with the
scaling threshold voltage (Vt ) and the gate oxide thickness. To ensure that Zigbee SoCs can
operate properly after returning to the active mode, the logic states containing hardware
calibration, hardware configuration, and network routing information should be preserved
before entering the standby mode. Data preservation is also required to achieve smooth power
mode transition between standby mode and the active mode. Thus, retention flip-flops (RFFs)
aroused in many Zigbee SoCs for storing the logic states, and several types of RFFs have
been widely researched
applications that require long battery life and secure networking (Zigbee networks are
secured by 128 bit symmetric encryption keys.) Zigbee has a defined rate of 250 Kbit/s, best
suited for intermittent data transmissions from a sensor or input device.
Zigbee is a low-cost, low-power; wireless mesh network standard targeted at wide
development of long battery life devices in wireless control and monitoring applications.
Zigbee devices have low latency, which further reduces average current. Zigbee chips are
typically integrated with radios and with microcontrollers that have between 60-256 KB flash
memories. Zigbee operates in the industrial, scientific and medical (ISM) radio bands: 2.4
GHz in most jurisdictions worldwide; 784 MHz in China, 868 MHz in Europe and 915 MHz
in the USA and Australia. Data rates vary from 20 Kbit/s (868 MHz band) to 250 Kbit/s (2.4
GHz band). The Zigbee network layer natively supports both star and tree networks, and
generic mesh networking. Every network must have one coordinator device, tasked with its
creation, the control of its parameters and basic maintenance. Within star networks, the
coordinator must be the central node. Both trees and meshes allow the use of Zigbee routers
to extend communication at the network level. Zigbee builds on the physical layer and media
access control defined in IEEE standard 802.15.4 for low-rate WPANs. The specification
includes four additional key components: network layer, application layer, Zigbee device
objects (ZDOs) and manufacturer-defined application objects which allow for customization
and favour total integration. ZDOs are responsible for a number of tasks, including keeping
track of device roles, managing requests to join a network, as well as device discovery and
security. Zigbee is one of the global standards of communication protocol formulated by the
relevant task force under the IEEE 802.15 working group. The fourth in the series, WPAN
Low Rate/Zigbee is the newest and provides specifications for devices that have low data
rates, consume very low power and are thus characterized by long battery life. Other
standards like Bluetooth and IrDA address high data rate applications such as voice, video
and LAN communications.
Zigbee protocols are intended for embedded applications requiring low power
consumption and tolerating low data rates. The resulting network will use very small amounts
of power individual devices must have a battery life of at least two years to pass Zigbee
certification.
Typical application areas include
Industrial control
Embedded sensing
Building automation
Zigbee is a low-cost, low-power, wireless mesh networking standard. The low cost allows
the technology to be widely deployed in wireless control and monitoring applications, the low
power-usage allows longer life with smaller batteries, and the mesh networking provides high
reliability and larger range. The Zigbee Alliance, the standards body which defines Zigbee,
also publishes application profiles that allow multiple OEM vendors to create interoperable
products. The current list of application profiles either published or in the works are:
Home Automation
Telecommunication Applications
IEEE 802.11 and the Wi-Fi Alliance. The Zigbee 1.0 specification was ratified on 14
December 2004 and is available to members of the Zigbee Alliance. Most recently, the
Zigbee 2007 specification was posted on 30 October 2007. The first Zigbee Application
Profile, Home Automation, was announced 2 November 2007.
For non-commercial purposes, the Zigbee specification is available free to the general public.
An entry level membership in the Zigbee Alliance, called Adopter, costs US$ 3500 annually
and provides access to the as-yet unpublished specifications and permission to create
products for market using the specifications.
Zigbee operates in the industrial, scientific and medical (ISM) radio bands; 868 MHz in
Europe, 915 MHz in countries such as USA and Australia, and 2.4 GHz in most jurisdictions
worldwide. The technology is intended to be simpler and less expensive than other WPANs
such as Bluetooth. Zigbee chip vendors typically sell integrated radios and microcontrollers
with between 60K and 128K flash memory, such as the free scale MC13213, the Ember
EM250 and the Texas Instruments CC2430. Radios are also available stand-alone to be used
with any processor or microcontroller. Generally, the chip vendors also offer the Zigbee
software stack, although independent ones are also available.
"In the US, as of 2006, the retail price of a Zigbee-compliant transceiver is approaching
$1, and the price for one radio, processor, and memory package is about $3. Comparatively,
the price of consumer-grade Bluetooth chips is now under $3. In other countries the prices are
higher. For example in the UK, (March 2009) the one-off cost to a hobbyist for a barebones
Zigbee surface-mount transceiver IC varies from 5 to 9, with pre-assembled modules
around 10 more (excluding aerials).
The first stack release is now called Zigbee 2004. The second stack release is called
Zigbee 2006, and mainly replaces the MSG/KVP structure used in 2004 with a "cluster
library". The 2004 stack is now more or less obsolete.Zigbee 2007, now the current stack
release, contains 2 stack profiles, stack profile 1 (simply called Zigbee), for home and light
commercial use, and stack profile 2 (called Zigbee Pro). Zigbee Pro offers more features,
such as multi-casting, many-to-one routing and high security with Symmetric-Key Key
Exchange (SKKE), while Zigbee (stack profile 1) offers a smaller footprint in RAM and
flash. Both offer full mesh networking and work with all Zigbee application profiles
Zigbee 2007 is fully backward compatible with Zigbee 2006 devices: a Zigbee 2007
device may join and operate on a Zigbee 2006 network and vice versa. Due to differences in
routing options, Zigbee Pro devices must become non-routing Zigbee End-Devices (ZEDs)
on a Zigbee 2006 or Zigbee 2007 network, the same as Zigbee 2006 or Zigbee 2007 devices
must become ZEDs on a Zigbee Pro network. The applications running on those devices
work the same regardless of the stack profile beneath them.
Zigbee Coordinator (ZC): The most capable device, the Coordinator forms the root of
the network tree and might bridge to other networks. There is exactly one Zigbee
Coordinator in each network since it is the device that started the network originally (the
Zigbee LightLink specification also allows operation without a Zigbee Coordinator,
making it more usable for over-the-shelf home products). It stores information about the
network, including acting as the Trust Center & repository for security keys.
Zigbee Router (ZR): As well as running an application function, a Router can act as
an intermediate router, passing on data from other devices.
Zigbee End Device (ZED): Contains just enough functionality to talk to the parent
node (either the Coordinator or a Router); it cannot relay data from other devices. This
relationship allows the node to be asleep a significant amount of the time thereby giving
long battery life. A ZED requires the least amount of memory, and therefore can be less
expensive to manufacture than a ZR or ZC.
transmit periodic beacons to confirm their presence to other network nodes. Nodes may sleep
between beacons, thus lowering their duty cycle and extending their battery life. Beacon
6
intervals depend on data rate; they may range from 15.36 milliseconds to 251.65824 seconds
at 250 Kbit/s, from 24 milliseconds to 393.216 seconds at 40 Kbit/s and from 48 milliseconds
to 786.432 seconds at 20 Kbit/s.
However, low duty cycle operation with long beacon intervals requires precise timing, which
can conflict with the need for low product cost. In general, the Zigbee protocols minimize
the time the radio is on, so as to reduce power use. In beaconing networks, nodes only need to
be active while a beacon is being transmitted. In non-beacon-enabled networks, power
consumption is decidedly asymmetrical: Some devices are always active, while others spend
most of their time sleeping. Except for the Smart Energy Profile 2.0, Zigbee devices are
required to conform to the IEEE 802.15.4-2003 Low-Rate Wireless Person Area Network
(LR-WPAN) standard. The standard specifies the lower protocol layersthe physical layer
(PHY), and the Media Access Control portion of the data link layer (DLL). The basic channel
access mode is "carrier sense, multiple access/collision avoidance" (CSMA/CA). That is, the
nodes talk in the same way that humans converse; they briefly check to see that no one is
talking before they start, with three notable exceptions. Beacons are sent on a fixed timing
schedule and do not use CSMA. Message acknowledgments also do not use CSMA. Finally,
devices in beacon-enabled networks that have low latency real-time requirements may also
use Guaranteed Time Slots (GTS), which by definition do not use CSMA.
network key provided by the trust centre (through the initially insecure channel) to
communicate.
Thus, the trust centre maintains both the network key and provides point-to-point security.
Devices will only accept communications originating from a key provided by the trust centre,
except for the initial master key. The security architecture is distributed among the network
layers as follows:
The MAC sub layer is capable of single-hop reliable communications. As a rule, the
security level it is to use is specified by the upper layers.
The network layer manages routing, processing received messages and being capable
of broadcasting requests. Outgoing frames will use the adequate link key according to the
routing, if it is available; otherwise, the network key will be used to protect the payload
from external devices.
The application layer offers key establishment and transport services to both ZDO and
applications. It is also responsible for the propagation across the network of changes in
devices within it, which may originate in the devices themselves (for instance, a simple
status change) or in the trust manager (which may inform the network that a certain
device is to be eliminated from it). It also routes requests from devices to the trust center
and network key renewals from the trust centre to all devices. Besides this, the ZDO
maintains the security policies of the device.
(868 MHz)
CSMA-CA channel access Yields high throughput and low latency for low duty cycle
devices like sensors and controls
10
appliance. By 2010, regulations were in place in most developed countries restricting standby
power of devices sold to one watt.
Standby power is electrical power used by appliances and equipment while switched off or
not performing their primary function, often waiting to be activated by a remote controller.
That power is consumed by internal or external power supplies, remote control receivers, text
or light displays, circuits energized when the device is plugged in even when switched off,
etc. Power can be saved by disconnecting such devices, causing at worst only inconvenience.
While this definition is inadequate for technical purposes, there is as yet no formal definition;
an international standards committee is developing a definition and test procedure. The term
is often used more loosely for any device that continuously must use a small amount of power
even when not active; for example a telephone answering machine must be available at all
times to receive calls, switching off to save power is not an option. Timers, powered
thermostats, and the like are other examples. An uninterruptible power supply could be
considered to be wasting standby power only when the computer it protects is off.
Disconnecting standby power proper is at worst inconvenient; powering down completely, for
example an answering machine not dealing with a call, renders it useless.
11
CHAPTER 2
LITERATURE SURVEY
2.1 TRANCEIVER FOR ZIGBEE APPLICATIONS
A single-chip 2.4-GHz CMOS radio transceiver with integrated baseband processing
according to the IEEE 802.15.4 standard is presented. The transceiver consumes 14.7 mA in
receive mode and 15.7 mA in transmit mode. The receiver uses a low-IF topology for high
sensitivity and low power consumption, and achieves 101 dBm sensitivity for 1% packet
error rate. The transmitter topology is based on a PLL direct-modulation scheme.
Optimizations of architecture and circuit design level in order to reduce the transceiver power
consumption are described. Special attention is paid to the RF front-end design which
consumes 2.4mA in receive mode and features bidirectional RF pins. The 5.77 mm2 chip is
implemented in a standard 0.18- mCMOStechnology. The transmitter delivers +3 dBm into
the 100- differential antenna port. A fully integrated 2.4-GHz transceiver for wireless sensor
network applications based on the IEEE 802.15.4 standard is presented. The IC is fabricated
in a 0.18- m CMOS process and consumes 14.7 mA and 15.7 mA in receive and transmit
mode, respectively. Special attention is paid to design techniques to lower the current
consumption. A receiver RF front-end design is described where a new topology in
conjunction with a stacked LNA configuration leads to a 2.4-mA supply current. The RF
front-end exhibits a 6-dB noise figure and a sensitivity of 101 dBm for 1% packet error rate.
The transmitter topology is improved by using a PLL direct-modulation scheme and
amplifying the LO signal using an efficient push-pull PA. The fully differential RF port
shared between the LNA and PA avoids the antenna switch and, therefore, the associated
insertion loss. A high-level ESD robustness of 6 kV HBM is achieved even at the RF pins.
Low-drop voltage regulators are integrated allowing operation on a single 1.8 V to 3.6 V
supply voltage. The only external components needed are five capacitors for internal supply
voltage decoupling and crystal oscillator loading.
12
standard
Zigbee
Bluetooth
Wi-Fi
Application focus
Cable replacement
System resource
4 KB - 32KB
250 KB+
16 MB+
Battery life(days)
10-10000+
1-7
Network size
Unlimited
32
Maximum data
rate(KB/S)
20 - 250
720
11000+
13
Transmission
range(meters)
1-100+
1-10+
1-100
Success metrics
Reliability, Power,
Cost
Cost , Convenience
Speed , Flexibility
2.3
DC-DC
CONVERTER
FOR
INCREASING
SLEEP-MODE
EFFICIENCY
Battery current is a key parameter that decides the runtime of a portable electronic
system. For low power applications like IEEE 802.15.4 and Zigbee wireless network, the
average battery current drain approximates the sleep mode current drain, since significantly
more time is spent in sleep than in active usage. This paper proposes substituting a DC-DC
converter for a low drop-out (LDO) regulator in the sleep mode power chain, such that the
current drawn from the battery would be less than the actual current drained by the load. The
battery current saving, and hence battery runtime extension, is estimated to be around 35%
based on the analysis of a 65 nm CMOS IEEE 802.15.4/Zigbee low power wireless systemon-chip (SoC) model, whose parameters are extracted from state-of-theart industrial products
and experimental data from advanced nanometer processes. proposes using a DC-DC
converter in the sleep mode power chain of low power electronic system to reduce battery
current.
Compared to the best case scenario of conventional LDO-based power chain
structure, it reduces the battery current by as much as 35% based on the analysis of a leakage
dominated 65 nm CMOS IEEE 802.15.4/ Zigbee wireless SoC model, whose parameters are
extracted from state-of-the-art industrial products and experimental data. However, does not
detail the design of the ultra-light-load DC-DC converter with said efficiency, since
integrated DC-DC converters with similar output power profile (1-100 W) with above 80%
efficiency using all-digital PFM control has already been reported.
RF wake up (PRFW) scheme instead of time-based scheme to wake up node that indeed
needs to wake up. The PRFW scheme is enabled by a PRFW hardware module sensing radio
signal from other nodes. Analysis of energy consumption and delay shows that the power
saving capability and wake-up delay can be improved by using the PRFW scheme.
We have presented a passive RF wake-up (PRFW) scheme for wireless sensor
networks, which helps to achieve the balance between energy saving and delay. The hardware
architecture of note equipped with a PRFW module that supports the PRFW scheme is
described. By sensing energy from the radio signals, the PRFW module provides a interrupt
to the MCU. The MCU detects the preamble and wakes up RF transceiver when wake-up
signal is indeed to wake itself up. The analysis results show that, based on the PRFW, nodes
can be woken up timely when it is necessary to wake up with the aid of extremely low power.
consider this fully integrated transceiver consumes the lowest energy among different 2.4
GHz CMOS Zigbee transceivers published.
16
the power and takes up half the area. Using this scheme for an LSI chip, 20-MHz operation at
1.0 V and only a few nA standby current was achieved with 0.5-m CMOS technology.
Moreover, this scheme is effective for high-speed and low-power operation in quartermicrometer and finer devices. We have developed a new 1-V high-speed MTCMOS circuit
scheme for power-down application circuits. The main feature is a small, fast, low-power
balloon circuit for preserving data. A balloon DFF is not only three times faster than a
conventional MTCMOS DFF, but it consumes half the power and takes up half the area. This
is a very efficient circuit scheme for high-performance power-down application circuits using
MTCMOS technology. Moreover, it is effective for high-speed and low-power operation in
sub-quarter-micrometer and finer devices. Consequently, this scheme brings smart power
management to LSIs.
storage transistors show similar standby power savings of over 5000x, demonstrating
effectiveness for many process generations. This design uses the PMOS slave latch pass
transistors M5 and M6 and the PMOS M5 and M6 reduce the maximum clock frequency,
evident by the ST and STN nodes at VDD = 0.8 V. Finally, reducing the standby storage latch
power supply voltage VDDTG can be used to limit the drain to bulk band to band tunnelling
(IZENER) leakage component, which is increasingly limiting for low standby power modes.
19
CHAPTER 3
EXISTING SYSTEM
3.1 BALLOON CIRCUIT RETENTION FLIP-FLOP
Fig. 3.1 shows the concept of the new MTCMOS circuit for preserving data during the
sleep period. It involves the use of a memory circuit, which is always powered, and a switch.
We call the memory circuit a balloon circuit because of its shape and because we blow
up the memory with data at the beginning of the sleep mode to preserve it and let the data
out at the end to restore it. This circuit is connected to node A of the MTCMOS logic circuit.
This scheme has two states according to the circuit modes; sleep and active. In the sleep
mode, the balloon circuit preserves data using the memory circuit and a leakage current from
the memory circuit to the logic circuit is cut off by the switch .In the active mode, the balloon
circuit does not add to the load at the node because it is separated from the node by the switch
During the transitional period between these two states, the Switch becomes on-state to read
data from the node or restore data to the node.
3.1.1 OPERATION
20
21
INV4 (or INV3) generates a high VDD,IO level through M2 (or M1). A high VDD,core is
transferred as (VDD,coreVt_thick) (Vt_thick is Vt of the thickoxide transistor) to the
retention latch through M6 (or M5), after which the access transistor that transferred the high
VDD,core is turned off, avoiding the creation of a dc current path. During read access, the
access transistors supply the correct state to the VDD,core domain. In the standby mode,
VDD,core is turned off, and the clock is asserted low before the access transistors are turned
off; as a result, the retention latch supplied by VDD,IO is isolated by the access transistors
and is able to hold data. The SRAM-based RFF effectively reduces the area overhead by
eliminating additional control codes for power mode transition and embedding the retention
latch into the slave latch. The standby power is also reduced, as the leakage paths in the
standby mode (I1, I2, I3, and I4) include the thick-oxide transistors M1M6.
However, several problems can occur because the retention latch is not on the data-tooutput path. After write access for the high VDD,core level is performed through M5 when
CK is asserted high, M5 is turned off, and node X is left floating temporarily as CK becomes
low. Node X is vulnerable to glitch-creating noise such as cross-talk because it is floating
until the value of node X drops to (VDD, coreVt_thick). During the transition from the
standby mode to the active mode, data in the retention latch need to be transferred to nodes X
and Y; however, if the node value of slv_left is high, X becomes (VDD,coreVt_thick) owing
to the threshold drop, and a dc current will flow through the p and nMOS in INV2, possibly
inducing an incorrect output. To check the read-error rate of the SRAM-based RFF, 10 000
Monte Carlo simulations were performed with the Vt variation. In the simulation node, X was
set to the initial value of low, and the temperature and VDD,core were set to 0 C and 1.1 V
to model the worst condition. The simulation results indicate that 0.12% of SRAM-based
RFFs generate an incorrect output. The read-error rate increases with technology scaling
because the voltage margin for correct operation, [(VDD,coreVt_thick)(VDD,core)/2], is
reduced when VDD,core scales down. This implies that it is difficult to implement an SRAMbased RFF with deep submicrometer process technology.
The master latch and clocking are conventional circuits implemented in the thin gate
transistors. The setup and hold times for the MSFF are the same as for an entirely thin gate
design, since the timing critical path passes from input D to output Through only the fast thin
gate transistors. The slave latch is comprised of thick gate, high Transistors M1-M6.
Transistors M1-M4 forms the thick gate flip-flop slave latch and M5-M6 provides differential
write access when the clock is asserted high. NMOS isolation transistors M5 and M6 are
23
controlled by the VDD power supply of the thin gate circuitry. Unlike the schemes shown in
the thick gate latch is always used for the slave latch storage during both active and standby
modes.
When the VDD power supply is gated off by driving ENN to VDDTG transistor
leakage currents pull VDD toVSS and isolate the thick gate from the thin gate circuits as
transistors M5 and M6 cut-off. The circuit logic state is retained by the un-gatedVDDTG
power supply, while the low leakage thick-gate transistors minimize standby power.
Transistors M5 and M6 supply the correct state to the thing ate circuit domain, which also
contains all combinational logic, as it powers up. The clock must be asserted low before and
during power-down, as well as while the supply voltage is restored, to avoid writing the thick
gate latch. Consequently, for NMOS M5-M6 the entire clock tree can reside in the VDD
domain, allowing that leakage component to be completely suppressed. It is frequently easier
to gate VSS rather than VDD since the latter uses NMOS rather than PMOS supply gating
transistors.
In addition to the greater drive current of the former, it is also easier to provide sufficient
gate overdrive VGS Vt by using a high voltage on the NMOS transistor MG2. This supply
gating transistor is ideally a higher Vt thick gate device as well. In this case, when the power
is gated off, VSS will rise to VDD.
24
VDD,core is collapsed by turning off the voltage regulator; in this case, the slave latch is
isolated by turning off the access transistors (M1 and M2) and operates as a retention latch to
hold logic states by VDD,IO that is always turned on. It is important to note that the sizes of
the transistors on the data-to-output path need to be determined carefully. There are two
considerations, standby power and write-ability. As the standby power is consumed by only a
slave latch, INV1 and INV2 should be sized to minimize the sub threshold leakage current
that is generally proportional to the width of the transistor. However, in deep sub micrometer
process technology, Vt sharply decreases with a decrease in the channel width, and this is
called the inverse narrow width effect (INWE). This effect results in an exponential increase
in the leakage current, making the INWE increasingly important with technology scaling. The
second consideration is write-ability.
The write operation is performed by pulling down slv_left or slv_right because the master
latch always transmits a low signal to the slave latch in the proposed RFF. The pull-down
network of slv_right (slv_left) consists of M1 and M3 (M2 and M4) and the nMOS in INV6
(INV5). However, the pull-up network of slv_right (slv_left) composed of two-stacked
pMOS transistors in INV2 (INV1) is simultaneously turned on if the data to be transferred is
different than the retained data. Thus, the pull-down network (three-stacked nMOS
transistors) should be much stronger than the pull-up network (two-stacked pMOS transistors
in INV1 or INV2). The pull-down network is designed to be 10 times stronger than the pullup network in the proposed RFF. To check the write operation, 1 000 000 Monte Carlo
simulations were performed with a high VDD,IO of 2.7 V at a high temperature of 85 C to
consider the worst condition. The results show no failures and thus, reliable write operation is
verified. The proposed RFF does not require an additional retention latch nor does it require
the write/read operations necessary in SRAM-based RFFs. As all standby leakage paths (I1,
I2, I3, and I4) in the proposed RFF consist of thick-oxide transistors, the standby leakage
current will be less than that of an SRAM-based RFF
Further reduction in the standby leakage in paths I3 and I4 can be achieved by arranging
the access transistors (M1 and M3, and M2 and M4) in a stacked structure, and the inverters
(INV1 and INV2) in the slave latch are designed using stacked p/nMOS transistors, allowing
the standby leakage of I1 and I2 to also be reduced. Because the retention latch in the
proposed RFF is located on the data-tooutput path, and level-up conversion from the
VDD,core domain to the VDD,IO domain is achieved through an nMOS passtransistor levelconversion scheme, the problems of threshold drop in the SRAM-based RFF are resolved
with only a degradation in speed. However, the speed degradation is not a problem in this
26
target application because the digital logic in Zigbee SoCs operates at a low frequency of
approximately 20 MHz. Thus, the proposed RFF can be appropriately applied to the Zigbee
SoCs and also implemented using deep submicrometer process technology. In the standby
mode, logic states are preserved in a slave latch composed of thick-oxide transistors utilizing
an I/O supply voltage (VDD,IO) that is always turned on.
To use both the core supply voltage (VDD, core) and VDD,IO domains without incurring
problems due to the threshold drop, as in anSRAM-based RFF, an nMOS pass-transistor
level-conversion scheme is embedded into the data-to-output path. Because the power
management scheme using the proposed RFF enables the voltage regulator generating
VDD,core to be turned off in the standby mode, the standby power consumption can be
significantly reduced.
The proposed RFF also reduces the area overhead because it does not require an
additional retention latch, a level-up converter, control signals for power mode transition, and
power switches.
3.4.2 DISADVANTAGES
27
3.4.3 APPLICATIONS
28
CHAPTER 4
PROPOSED SYSTEM
4.1 RETENTION FLIP-FLOPS
To achieve higher density and performance and lower power consumption, CMOS
technology has been scaled down for several decades. Supply voltage (VDD) has been scaled
down in order to keep the power consumption under control. Hence, the transistor threshold
voltage (Vth) has to be commensurately scaled to maintain a high drive current and achieve
performance improvement. However, the threshold voltage scaling results in the substantial
increase of the sub threshold leakage current. Therefore, as a result of technology scaling,
leakage power is becoming a major contributor to total power consumption. Leakage current
is present in both active and standby modes of operation. In order to reduce the standby
leakage power consumption, circuits can be put in the power-down mode by switching-off
the power during the standby mode. The power can be switched-off by switching-off either
the supply voltage (VDD) or ground (GND) of the circuit. The transistors switching-off the
power are called sleep transistors. For switching-off the VDD (gated-VDD), PMOS sleep
transistors are required between the real VDD and virtual VDD of the circuit. However, for
switching-off the GND (gated-GND), NMOS sleep transistors are used between the real
GND and virtual GND of the circuit. Since the sleep transistors are fairly large, gated-GND is
preferred to gated-VDD because the NMOS sleep transistors can be much smaller than the
PMOS sleep transistors.
The state of a circuit which is stored in flip-flops may be lost in the power-down
mode. Therefore, in power down applications where the state of the circuit is required to be
preserved, shadow or balloon latches that state of the circuit during the power-down mode.
The balloon latch and some switches which are not in the critical paths use high threshold
voltage (high-Vth) transistors to reduce their leakage power, since they are always powered.
This conventional scheme requires extra data-preserving latches (balloon latches) and
complicated timing for transferring data back and forth between balloon latches and flip-flops
on any transition from power-down to active mode and vice versa. To overcome these
problems, several data-retention schemes have been proposed for power-down applications.
However, in some of the previous techniques, depending on the internal state of flip-flops in
the power-down mode, there can be sneak leakage paths due to interaction between gates
29
which are always powered and gates which are powered down. In leakage feedback flip-flop
has been proposed that eliminates all possible sneak leakage paths during the power-down
mode.
The internal clock and data gating circuitry is only composed of one inverter and one
PMOS transistor. The inverter of the clock-gating circuitry may not be required if the clock is
already buffered through the the clock distribution network. In the sleep mode, the PMOS
transistor pulls up the clock input node of the flip-flop to VDD. Therefore, the data input is
also automatically gated by the transmission gate at the input of the flip-flop. Moreover, the
loop of the cross coupled inverters in the master latch is closed to retain the data stored in the
master latch. In this scheme only the inverter that generates signal CKB is required to be
always powered, and therefore, can be implemented using high-Vth transistors. The dataretention TGFF (DR-TGFF) and the balloon-TGFF are designed in the 70nm CMOS
technology node, using Berkeley Predictive Technology Models. The TGFF without any
sleep transistor or data-retention circuitry and with all low-Vth transistors is also designed for
comparison
30
This power management scheme also has the same disadvantage as the power
management scheme in Fig.4.2 in that the voltage regulator for the retention logic (VR3)
must be turned on during the standby mode. Thus, a power management scheme to turn off
the voltage regulator in the standby mode is required.
current path created by the level difference between VDD,core and VDD,IO. The proposed
RFF achieves level-up conversion from VDD,core to VDD,IO by an embeddednMOS passtransistor level-conversion scheme using a low-only signal-transmitting technique.
The proposed level-converting RFF is shown in Fig. 3. The master latch is also based
on a cross-coupled-inverter latch as the DFF, and an additional data transmission path (M3
and M1) is embedded. Both thin- and thick-oxide transistors are used, and the core logic and
retention logic are supplied by VDD, core and VDD,IO respectively. Level up conversion
from the VDD, core domain in the master latch to the VDD, IO domain in the slave latch is
achieved through an nMOS pass-transistor level-conversion.
by turning off the voltage regulator; in this case, the slave latch is isolated by turning off the
access transistors (M1 and M2) and operates as a retention latch to hold logic states by
VDD,IO that is always turned on. It is important to note that the sizes of the transistors on the
data-to-output path need to be determined carefully. There are two considerations, standby
power and write-ability. As the standby power is consumed by only a slave latch, INV1 and
INV2 should be sized to minimize the sub threshold leakage current that is generally
proportional to the width of the transistor. However, in deep sub micrometer process
technology, Vt sharply decreases with a decrease in the channel width, and this is called the
inverse narrow width effect (INWE). This effect results in an exponential increase in the
leakage current, making the INWE increasingly important with technology scaling.
Power optimization
APPLICATIONS
On chip networks
Processing units
CHAPTER 5
HARDWARE REQUIREMENT
highly specific. Some examples are the simple operational amplifiers and timers, and the
more complex FM stereo decoders and single-chip FM radios.
35
There has been a trend towards fabricating the more commonly used analogue circuits
into single chip form. An example of this is FM radio receiver, which is a fairly complex
circuit when fabricated from discrete components. A FM radio receiver can now be
constructed from a FM radio chip, an audio amplifier chip and a few discrete passive
components.
5.1.2
(AND & OR gates). They are commercially available in families of devices which take their
name from the fabrication method used to manufacture the devices from different families are
not readily compatible in the same circuit. The more common types of logic integrated
circuits are typically represented in each family of devices like TTL, Schottky TTL, CMOS
and the new high speed CMOS. The CMOS family devices have a very low power
consumption that makes them very popular for many applications where very high speeds are
not required.
5.1.3
Computer integrated circuits are devices, which form the active components of a computer
system. They are often used in conjunction with digital integrated circuits, which provide a
glue logic function. Computer integrated circuits can be functionally divided into
microprocessors, memory devices and peripheral control devices.
36
Personal entertainment systems such as portable MP3 players and DVD players
Digital electronics compress and decompress video, even at high definition data rates,
Speed. Signals can be switched between logic 0 and logic 1 much quicker within a chip
than they can between chips. Communication within a chip can occur hundreds of times
faster than communication between chips on a printed circuit board. The high speed of
circuits on-chip is due to their small sizesmaller components and wires have smaller
parasitic capacitances to slow down the signal
Power consumption. Logic operations within a chip also take much less power. Once
again, lower power consumption is largely due to the small size of circuits on the chip
smaller parasitic capacitances and resistances require less power to drive them.
Smaller physical size. Smallness is often an advantage in itselfconsiders portable
televisions or handheld cellular telephones.
Lower power consumption. Replacing a handful of standard parts with a single chip
reduces total power consumption. Reducing power consumption has a ripple effect on the rest
38
of the system: a smaller, cheaper power supply can be used; since less power consumption
means less heat, a fan may no longer be necessary; a simpler cabinet with less shielding for
electromagnetic shielding may be feasible, too.
Reduced cost. Reducing the number of components, the power supply requirements,
cabinet costs, and so on, will inevitably reduce system cost. The ripple effect of integration is
such that the cost of a system built from custom ICs can be less, even though the individual
ICs cost more than the standard parts they replace. Understanding why integrated circuit
technology has such profound influence on the design of digital systems requires
understanding both the technology of IC manufacturing and the economics of ICs and digital
systems.
39
CHAPTER 6
SOFTWARE REQUIREMENT
Verification Tool
ModelSim 6.4c
Synthesis Tool
Xilinx ISE 9.1
6.1 MODELSIM
ModelSim SE - High Performance Simulation and Debug
ModelSim SE is our UNIX, Linux, and Windows-based simulation and debugs
environment, combining high performance with the most powerful and intuitive GUI in the
industry.
What's New in ModelSim SE?
Improved FSM debug options including control of basic information, transition table
and warning messages. Added support of FSM Multi-state transitions coverage (i.e.
coverage for all possible FSM state sequences).
The dataflow window can now compute and display all paths from one net to another.
Enhanced code coverage data management with fine grain control of information in
the source window.
Some IEEE VHDL 2008 features are supported including source code encryption.
Added support of new VPI types, including packed arrays of struct nets and variables.
Code Coverage
Integrated debug
System C Option
Windows 32-bit
Intuitive GUI for efficient interactive or post-simulation debug of RTL and gate-level
designs
Merging, ranking and reporting of code coverage for tracking verification progress
All ModelSim products are 100% standards based. This means your investment is
protected, risk is lowered, reuse is enabled, and productivity is enhanced
supported. The memory window allows memories to be pre-loaded with specific or randomly
generated values, saving the time-consuming step of initializing sections of the simulation
merely to load memories. All functions are available via the command line, so they can be
used in scripting.
Advanced Code Coverage
The ModelSim advanced code coverage capabilities deliver high performance with
ease of use. Most simulation optimizations remain enabled with code coverage. Code
coverage metrics can be reported by-instance or by-design unit, providing flexibility in
managing coverage data. All coverage information is now stored in the Unified Coverage
Data Base (UCDB), which is used to collect and manage all coverage information in one
highly efficient database. Coverage utilities that analyze code coverage data, such as merging
and test ranking, are available.
The coverage types supported include:
Branch coverage: expressions and case statements that affect the control flow of the
HDL execution
Condition coverage: breaks down the condition on a branch into elements that make
the result true or false
Expression coverage: the same as condition coverage, but covers concurrent signal
assignments instead of branch decisions
43
base platform comprises a robust set of well-integrated, tested, and targeted elements that
enable customers to immediately start a design. These elements include:
FPGA silicon
ISE Design Suite design environment
Third-party synthesis, simulation, and signal integrity tools
Reference designs common to many applications, such as memory interface and
configuration designs.
Development boards that run the reference designs
A host of widely used IP, such as Gig E, Ethernet, memory controllers, and PCIe.
Domain-Specific Platform
The next layer in the targeted design platform hierarchy is the domain-specific
platform. Released from three to six months after the base platform, each domain specific
platform targets one of the three primary Xilinx FPGA user profiles (domains):the embedded
processing developer, the digital signal processing (DSP) developer, or the logic/connectivity
developer. This is where the real power and intent of the targeted design platform begins to
emerge. Domain-specific platforms augment the base platform with a predictable, reliable,
and intelligently targeted set of integrated technologies, including:
Higher-level design methodologies and tools
Domain-specific embedded, DSP, and connectivity IP
Domain-specific development hardware and daughter cards
Reference designs optimized for embedded processing, connectivity, and DSP
Operating systems (required for embedded processing) and software
Every element in these platforms is tested, targeted, and supported by Xilinx and/or
our ecosystem partners. Starting a design with the appropriate domain-specific platform can
cut weeks, if not months, off of the user's development time.
Market-Specific Platform
45
With the breadth of advances and capabilities that the Virtex-6 and Spartan-6
programmable devices deliver coupled with the access provided by the associated targeted
design platforms, it is no longer feasible for one design flow or environment to fit every
designer's needs. System designers, algorithm designers, SW coders, and logic designers each
represent a different user-profile, with unique requirements for a design methodology and
associated design environment. Instead of addressing the problem in terms of individual fixed
tools, Xilinx targets the required or preferred methodology for each user, to address their
specific needs with the appropriate design flow. At this level, the design language changes
from HDL (VHDL/Verilog) to C, C++, MATLAB software, and other higher level
languages which are more widely used by these designers, and the design abstraction moves
up from the block or component to the system level. The result is a methodology and
complete design flow tailored to each user profile that provides design creation, design
implementation, and design verification. Indicative of the complexity of the problem, to fully
understand the user profile of a logic designer, one must consider the various levels of
expertise represented by this demographic. The most basic category in this profile is the
push-button user who wants to complete a design with minimum work or knowledge.
The push-button user just needs good-enough results. Contrastingly, more advanced
users want some level of interactive capabilities to squeeze more value into their design, and
the power user (the expert) wants full control over a vast array of variables. Add the
traditional ASIC designers, tasked with migrating their designs to an FPGA (a growing trend,
given the intolerable costs and risks posed by ASIC development these days), and clearly the
imperative facing Xilinx is to offer targeted flows and tools that support each user's
requirements and capabilities, on their terms. The most recent release of the ISE Design Suite
includes numerous changes that fulfil requirements specifically pertinent to the targeted
design platform. The new release features a complete tool chain for each top-level user
profile (the domain-specific personas: the embedded, DSP, and logic/connectivity designers),
including specific accommodations for everyone from the push-button user to the ASIC
designer.
The tighter integration of embedded and DSP flows enables more seamless integration
of designs that contain embedded, DSP, IP, and user blocks in one system. To further enhance
productivity and help customers better manage the complexity of their designs, the new ISE
47
Design Suite enables designers to target area, performance, or power by simply selecting a
design goal in the setup. The tools then apply specific optimizations to help meet the design
goal. In addition, the ISE Design Suite boasts substantially faster place-and-route and
simulation run times, providing users with 2X faster compile times. Finally, Xilinx has
adopted the FLEXnet Licensing strategy that provides a floating license to track and monitor
usage.
6.3 XILINX ISE DESIGN TOOLS
Xilinx ISE is the design tool provided by Xilinx. Xilinx would be virtually identical
for our purposes.
There are four fundamental steps in all digital logic design. These consist of:
1. Design The schematic or code that describes the circuit.
2. Synthesis The intermediate conversion of human readable circuit description to
FPGA code (EDIF) format. It involves syntax checking and combining of all the
separate design files into a single file.
3. Place & Route Where the layout of the circuit is finalized. This is the
translation of the EDIF into logic gates on the FPGA.
4. Program The FPGA is updated to reflect the design through the use of
programming (.bit) files. Test bench simulation is in the second step. As its
name implies, it is used for testing the design by simulating the result of
driving the inputs and observing the outputs to verify your design. ISE has the
capability to do a variety of different design methodologies including:
Schematic Capture, Finite State Machine and Hardware Descriptive Language
(VHDL or Verilog).
Level (RTL). Verilog supports all of these levels. However, this handout focuses on only the
portions of Verilog which support the RTL level.
Verilog is one of the two major Hardware Description Languages (HDL) used by
hardware designers in industry and academia. VHDL is the other one. The industry is
currently split on which is better. Many feel that Verilog is easier to learn and use than
VHDL. As one hardware designer puts it, I hope the competition uses VHDL. VHDL was
made an IEEE Standard in 1987, and Verilog in 1995. Verilog is very C-like and liked by
electrical and computer engineers as most learn the C language in college. VHDL is very
most engineers have no experience. Verilog was introduced in 1985 by Gateway Design
System Corporation, now a part of Cadence Design Systems, Inc.s Systems Division. Until
May, 1990, with the formation of Open Verilog International (OVI), Verilog HDL was a
proprietary language of Cadence. Cadence was motivated to open the language to the Public
Domain with the expectation that the market for Verilog HDL-related software products
would grow more rapidly with broader acceptance of the language. Cadence realized that
Verilog HDL users wanted other software and service companies to embrace the language
and develop Verilog-supported design tools.
CHAPTER7
RESULTS
SIMULATION RESULT
49
OBSERVATIONS: If we give the data as 1000 we have to retain that data in flip flop in
standby mode. Whenever the circuit goes to active mode from standby mode we have to get that data.
So, the output must have 1000. By simulation we can see that result.
CHAPTER 8
CONCLUSION
The proposed MULTI BIT RFF achieves ultralow-standby power by adopting a power
management scheme to use VDD, IO for data retention and to turn off the voltage regulator in
50
the standby mode. In addition, the retention latch in the proposed RFF is composed of a
stacked structure with thick-oxide transistors to reduce the standby leakage current.
REFERENCES
[1] W. Kluge et al., A fully integrated 2.4-GHz IEEE 802.15.4-complianttransceiver for
Zigbee applications, IEEE J. Solid-State Circuits,vol. 41, no. 12, pp. 27672775, Dec. 2006.
[2] J.-S. Lee, Y.-W. Su and C.-C. Shen, A comparative study of wireless protocols:
Bluetooth, UWB, Zigbee, and Wi-Fi, IEEE Wireless Commun., vol. 14, no. 4, pp. 7078,
Nov. 2007.
51
[3] J. Hu, W. Liu, W. Khalil, and M. Ismail, Increasing sleep-mode efficiency by reducing
battery current using a DC-DC converter, in Proc. IEEE Int. MWSCAS, Aug. 2010, pp. 53
56.
[4] Y.-I. Kwon, S.-G. Park, T.-J. Park, K.-S. Cho and H.-Y. Lee, An ultralow-power CMOS
transceiver using various low-power techniques for
LR-WPAN applications, IEEE Trans. Circuits Syst. I, vol. 59, no. 2,pp. 324336, Feb. 2012.
[5] M. Albano and S. Chessa, Data centric storage in Zigbee wireless sensor networks, in
Proc. IEEE ISSCC, Jun. 2010, pp. 330335.
[6] S. Shigemitsu, S. Mutoh, Y. Matsuya, Y. Tanabe, and J. Yamada,A 1-V high-speed
MTCMOS circuit scheme for power-down application circuits, IEEE J. Solid-State Circuits,
vol. 32, no. 6, pp. 861869,Jun. 1997.
[7] L. T. Clark, M. Kabir, and J. E. Knudsen, A low standby power flip-flop with reduced
circuit and control complexity, in Proc. IEEE CICC,Sep. 2007, pp. 571574.
[8] V. Yuban and S. V. Kosonocky, Low power integrated scan-retention mechanism, in
Proc. ISLPED, Aug. 2002, pp. 98102.
[9]H.Mahmoodi-Meimand and K. Roy, Data-retention flip-flops for power-down
applications, in Proc. ISCAS, vol.2. May. 2004 pp. 67780.
[10] J. Seomun and Y. Shin, Design and optimization power-gated circuits with autonomous
data retention, IEEE Trans. Very Large Scale Integer.
(VLSI) Syst., vol. 19, no. 2, pp. 227236, Feb. 2011.
[11] S. Mutoh, T. Douseki, Y. Matsuya, T. Aoki, and J. Yamada, 1V high speed digital circuit
technology with 0.5m multi-threshold CMOS, in
Proc. IEEE Int. ASIC Conf. Exhibit, Sep. 1993, pp. 186189.
[12] P. Meinerzhagen, O. Anderson, B. Mohammadi, Y. Sherazi, A. Burg,and J.N.Rodrigues,
A 500 fW/bit 14 fJ/bit-access 4kb standard-cell based sub-VT memory in 65nm CMOS, in
Proc. ESSCIRC, Sep. 2012,pp. 321324.
[13] N. B. Kothari, T. S. B. Sudarshan, S. Gurunarayanan, and R. A. Chandrasekhar, SoC
design of a low power wireless sensor network node for Zigbee systems, in Proc. Int. Conf.
Adv. Comput.Commun., Dec. 2006, pp. 462466.
[14] P. Zhao et al., Low-power clocked-pseudo-nMOS flip-flop for level conversion in dual
supply systems, IEEE Trans. Very Large Scale Integr.(VLSI) Syst., vol. 17, no. 9, pp. 1196
1202, Sep. 2009.
[15] L. A. Akers, The inverse-narrow-width effect, IEEE Electron DeviceLett., vol. 7, no. 7,
pp. 419421, Jul. 1986.
52
[16] A. Luo, S. Yin, L. Leibo, R. Zhou, and S. Wei, Low-power low-cost implementation of
IEEE 802. 15. 4 in WiSN SoC design, in Proc. Int.Conf. WICOM, Sep. 2010, pp. 14.
[17] G. Gerosa et al., A 2.2 W, 80 MHz superscalar RISC micro processor,IEEE J. SolidState Circuits, vol. 29, no. 12, pp. 14401452, Dec. 1994.
APPENDIX
SINGLE BIT PROPOSED RFF
module SD_Topmodule(Data,Cntrl,S1,S2,S3,S4,S5,Out,Outb);
input Data,Cntrl;
53
inout S1,S2,S3,S4,S5;
output Out,Outb;
SD_TX_Gate M1(
.A(Cntrl),
.IN(Data),
.OUT(S1)
);
SD_TX_Gate M2(
.A(Cntrl),
.IN(S3),
.OUT(S1)
);
SD_NotGate M3(
.A(S1),
.B(S2)
);
SD_NotGate M4(
.A(S2),
.B(S3)
);
SD_AndGate M5(
.A(S2),
.B(S3),
.C(S4)
);
SD_AndGate M6(
.A(S2),
.B(S3),
.C(S5)
);
SD_NotGate M7(
.A(S5),
.B(S4)
);
SD_NotGate M8(
.A(S4),
.B(S5)
);
SD_NotGate M9(
.A(S4),
.B(Outb)
54
);
SD_NotGate M10(
.A(S5),
.B(Out)
);
endmodule
.A(Cntrl),
.IN(S3),
.OUT(S6)
);
SD_NotGate M3(
.A(S1),
.B(S2)
);
SD_NotGate M4(
.A(S2),
.B(S3)
);
SD_AndGate M5(
.A(S2),
.B(S3),
.C(S4)
);
SD_NotGate M9(
.A(S4),
55
.B(Outb)
);
SD_NotGate M10(
.A(S5),
.B(Out)
);
endmodule
56