Vedic Multiplier9

8.2.5 IEEE 32-Bit Floating Point Multiplier
Figure 8.16 Black box view of IEEE 32-bit floating point multiplier

(i) Description

Port    Description
a       Input data, 32 bit
b       Input data, 32 bit
out     Output data, 32 bit
flow    Output data, 1 bit

(ii) Device utilization summary

Selected device: xc6vlx75tl-1Lff784

Table 8.5 Device utilization summary for IEEE 32-bit floating point multiplier

Logic utilization                   Used    Available   Utilization
Number of slice LUTs                786     46,560      1%
Number of occupied Slices           345     11,640      2%
Number of bonded IOBs               97      232         41%
Average Fanout of Non-Clock Nets    4.14
Figure 8.17 RTL diagram of IEEE 32-bit floating point Multiplier
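
The data path behind the black box above can be illustrated with a small behavioural model. The following Python sketch is not the design synthesized in this work; it is a simplified model that assumes normalized operands (zeros and subnormals are flushed to zero), truncates instead of rounding, and ignores NaN and infinity. The helper names (`float_to_bits`, `bits_to_float`, `fp32_multiply`) are hypothetical, and the interpretation of the 1-bit `flow` output as an exponent overflow/underflow flag is an assumption.

```python
import struct
from typing import Tuple


def float_to_bits(x: float) -> int:
    """Return the IEEE 754 single-precision bit pattern of x."""
    return struct.unpack(">I", struct.pack(">f", x))[0]


def bits_to_float(bits: int) -> float:
    """Interpret a 32-bit pattern as an IEEE 754 single-precision value."""
    return struct.unpack(">f", struct.pack(">I", bits))[0]


def fp32_multiply(a: int, b: int) -> Tuple[int, int]:
    """Simplified single-precision multiply on raw bit patterns.

    Returns (out, flow). Assumption: flow = 1 means the result exponent
    is out of range; in that case the magnitude bits of out are not meaningful.
    """
    sign = ((a >> 31) ^ (b >> 31)) & 1           # sign is the XOR of the operand signs
    exp_a = (a >> 23) & 0xFF
    exp_b = (b >> 23) & 0xFF
    if exp_a == 0 or exp_b == 0:                 # zero or subnormal: flush to signed zero
        return sign << 31, 0

    man_a = (a & 0x7FFFFF) | 0x800000            # restore the hidden leading 1 (24 bits)
    man_b = (b & 0x7FFFFF) | 0x800000
    product = man_a * man_b                      # 48-bit product; the integer multiplier
                                                 # (a Vedic multiplier in this work) goes here
    exp = exp_a + exp_b - 127                    # add exponents, remove one bias

    if product & (1 << 47):                      # product in [2, 4): normalize
        product >>= 1
        exp += 1

    mantissa = (product >> 23) & 0x7FFFFF        # drop hidden bit, truncate low bits
    if exp <= 0 or exp >= 255:                   # exponent out of range
        return sign << 31, 1
    return (sign << 31) | (exp << 23) | mantissa, 0


out, flow = fp32_multiply(float_to_bits(-2.5), float_to_bits(4.0))
print(bits_to_float(out), flow)
```

The 24 x 24-bit mantissa product is the only wide multiplication in the data path, which is why the choice of integer multiplier dominates the cost of the floating point unit.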
8.2.6 16 Bit Squarer
Figure 8.18 Black box view of 16-bit squarer

(i) Description

Port    Description
a       Input data, 16 bit
b       Input data, 16 bit
r       Output data, 32 bit

(ii) Device utilization summary

Selected device: xc3s500e-5fg320

Table 8.6 Device utilization summary for 16-bit squarer

Logic utilization                                  Used    Available   Utilization
Number of 4 input LUTs                             402     9,312       4%
Number of occupied Slices                          221     4,656       4%
Number of Slices containing only related logic     221     221         100%
Number of Slices containing unrelated logic        0       221         0%
Total Number of 4 input LUTs                       402     9,312       4%
Number of bonded IOBs                              64      232         27%
Average Fanout of Non-Clock Nets                   3.70
Figure 8.19 RTL diagram of the 16 bit Squarer
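
As a rough illustration of the Urdhva Tiryakbhyam ("vertically and crosswise") scheme on which the squarer is based, the Python sketch below computes each result column as the sum of the crosswise bit products plus the carry from the previous column. It is a bit-level behavioural model, not the HDL used in this work; the function names `urdhva_multiply` and `square16` are hypothetical, and it is assumed that both inputs of the squarer carry the same operand.

```python
def urdhva_multiply(a: int, b: int, width: int = 16) -> int:
    """Multiply two unsigned width-bit integers column by column."""
    a_bits = [(a >> i) & 1 for i in range(width)]
    b_bits = [(b >> i) & 1 for i in range(width)]

    result = 0
    carry = 0
    for k in range(2 * width - 1):               # one pass per result column
        column = carry
        # Crosswise products: all bit pairs (i, j) with i + j == k.
        for i in range(max(0, k - width + 1), min(k, width - 1) + 1):
            column += a_bits[i] & b_bits[k - i]
        result |= (column & 1) << k              # low bit of the column sum is the result bit
        carry = column >> 1                      # the rest carries into the next column
    result |= carry << (2 * width - 1)           # final carry is the most significant bit
    return result


def square16(a: int) -> int:
    """16-bit squarer: feed the same operand to both multiplier inputs."""
    return urdhva_multiply(a, a, width=16)


print(square16(0xFFFF))
```

In hardware all crosswise products of a column are generated in parallel, so the model shows the arithmetic of the scheme rather than its timing.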
8.3 LATENCY & THROUGHPUT OF MULTIPLIERS

Latency is taken as the maximum combinational path delay reported in the synthesis report. For a combinational multiplier, throughput is simply the inverse of the latency. Tables 8.7 and 8.8 show the latency and throughput of all the multipliers.

Table 8.7 Latency and throughput of the multipliers

S.No  16 x 16 Multiplier            Latency (ns)                           Throughput (MHz)
1     Vedic Multiplier (U.T.S)      37.336 (22.023 logic, 15.313 route)    26.783
2     Vedic Multiplier (N.S)        23.470 (14.345 logic, 9.125 route)     42.60
3     Modified Booth Multiplier     58.464 (32.705 logic, 25.759 route)    17.10
4     Array Multiplier              69.987 (40.383 logic, 29.60 route)     14.28
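
The relation between latency and throughput used in Table 8.7 can be checked with a few lines of Python. This is only a worked example: the latency values are copied from the table, and the function name `throughput_mhz` is hypothetical. A latency in nanoseconds converts to a throughput in MHz as 1000 / latency.

```python
# Latency values (ns) as reported in Table 8.7.
LATENCY_NS = {
    "Vedic Multiplier (U.T.S)": 37.336,
    "Vedic Multiplier (N.S)": 23.470,
    "Modified Booth Multiplier": 58.464,
    "Array Multiplier": 69.987,
}


def throughput_mhz(latency_ns: float) -> float:
    """Throughput of a combinational block: the inverse of its latency."""
    return 1000.0 / latency_ns                   # 1 / ns = GHz, so scale by 1000 for MHz


for name, latency in LATENCY_NS.items():
    print(f"{name:28s} {latency:8.3f} ns  {throughput_mhz(latency):7.3f} MHz")
```

The computed values agree with the throughput column of the table to within the precision at which the figures are reported there.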

Table 8.8 Latency and throughput of IEEE 32-bit floating point multiplier and 16-bit squarer

S.No.  Target device used     Application using Vedic multiplier (Urdhva Tiryakbhyam)   Latency (ns)                           Throughput (MHz)
1      xc6vlx75tl-1Lff784     IEEE 32-bit floating point multiplier                     21.719 (1.808 logic, 19.911 route)     46.04
2      xc3s500e-5fg320        16-bit squarer                                            37.606 (22.301 logic, 15.305 route)    26.59
8.4 AREA AND POWER OF THE MULTIPLIERS

The area and power of a design can be estimated from the number of LUTs and slices it uses: the fewer LUTs and slices, the smaller the area and the lower the power dissipation. The numbers of slices and LUTs utilized by each multiplier are shown in Table 8.9.

Table 8.9 Slices and LUTs of all the multipliers

S.No  16 x 16 Multiplier            Slices used   LUTs
1     Vedic Multiplier (U.T.S)      409           731
2     Vedic Multiplier (N.S)        122           222
3     Modified Booth Multiplier     432           827
4     Array Multiplier              477           879
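
To make the comparison in Table 8.9 easier to read, the resource counts can be expressed as a percentage reduction relative to a baseline design. The Python sketch below is only a worked example on the figures from the table; the array multiplier is chosen as the baseline for illustration, and the function name `reduction_percent` is hypothetical.

```python
# (slices, LUTs) as reported in Table 8.9.
RESOURCES = {
    "Vedic Multiplier (U.T.S)": (409, 731),
    "Vedic Multiplier (N.S)": (122, 222),
    "Modified Booth Multiplier": (432, 827),
    "Array Multiplier": (477, 879),
}


def reduction_percent(baseline: int, value: int) -> float:
    """Percentage by which value is smaller than baseline."""
    return 100.0 * (baseline - value) / baseline


base_slices, base_luts = RESOURCES["Array Multiplier"]
for name, (slices, luts) in RESOURCES.items():
    print(f"{name:28s} slices: {reduction_percent(base_slices, slices):5.1f}%  "
          f"LUTs: {reduction_percent(base_luts, luts):5.1f}%")
```

These percentages describe resource counts only; they indicate area and power trends in the sense described above, not measured power figures.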
CHAPTER-9 CONCLUSIONS & FUTURE WORK

9.1 CONCLUSION

Through the analysis of the multiplication Sutras of Vedic mathematics, namely the Urdhva Tiryakbhyam and Nikhilam Sutras, new reduced-bit multiplication algorithms have been proposed. These Sutras are algorithms that can reduce the delay, power and hardware requirements of multiplication. The 16-bit Vedic Multiplier based on the Urdhva Tiryakbhyam Sutra is efficient in design and performance when compared to the 16-bit Array and Modified Booth Multipliers. The 16-bit Vedic Multiplier based on the Nikhilam Sutra is an application-specific multiplier that performs multiplication of large numbers effectively. Designing an IEEE single precision floating point multiplier around a Vedic Multiplier based on the Urdhva Tiryakbhyam Sutra makes it efficient for use in various DSP applications. A 16-bit squarer designed using the Urdhva Tiryakbhyam based Vedic Multiplier consumes fewer hardware resources and reduces delay.

9.2 FUTURE WORK

Vedic Mathematics, developed about 2500 years ago, gives us a clue of symmetric computation. It deals with various topics of mathematics such as basic arithmetic, geometry, trigonometry and calculus. All these methods are very efficient as far as manual calculations are concerned. If they are implemented effectively in hardware, computation time will be reduced drastically. It could therefore be possible to implement a complete ALU using Vedic mathematics methods. By using these ancient Indian Vedic mathematics methods, the world can achieve new heights of performance and quality for cutting-edge technology devices.
APPENDIX-A
XILINX FPGA DESIGN FLOW

This section describes the FPGA synthesis and implementation stages typical of the Xilinx design flow.
Fig A1 Xilinx FPGA Design flow [16]
Synthesis
The synthesizer converts HDL (VHDL/Verilog) code into a gate-level netlist represented in terms of the UNISIM component library (a Xilinx library containing basic primitives). By default, Xilinx ISE uses the built-in synthesizer XST (Xilinx Synthesis Technology). Other synthesizers can also be used.
The synthesis report contains much useful information, including a maximum frequency estimate in the "timing summary" section. One should also pay attention to warnings, since they can indicate hidden problems. After a successful synthesis, one can run the "View RTL Schematic" task (RTL stands for register transfer level) to view the gate-level schematic produced by the synthesizer. XST output is stored in NGC format. Many third-party synthesizers (like Synplicity Synplify) use the industry-standard EDIF format to store the netlist.

Implementation

The implementation stage translates the netlist into a placed and routed FPGA design. The Xilinx design flow has three implementation stages: translate, map, and place and route. (These steps are specific to Xilinx; for example, Altera combines translate and map into one step executed by quartus_map.)

Translate

Translate is performed by the NGDBUILD program. During the translate phase, an NGC netlist (or an EDIF netlist, depending on which synthesizer was used) is converted to an NGD netlist. The difference between them is that the NGC netlist is based on the UNISIM component library, designed for behavioural simulation, whereas the NGD netlist is based on the SIMPRIM library. The netlist produced by the NGDBUILD program contains some approximate information about switching delays.
