
Deep Learning (CNN) on FPGA
Group members:
(NC) Ali Ahmad Qureshi
(NC) Ahmed
(NC) Syed Muhammad Saqib
(PC) Muhammad Zeeshan Jilani
Supervisors: Dr. Sajid Gul Khawaja, Lec Aamir Javed
INTRODUCTION

In the modern age, AI applications are growing rapidly, and with them the use of CNNs. When it comes to real-time processing, however, the results are not satisfying, particularly in terms of inference (testing) time.
PROBLEM STATEMENT

Design a hardware accelerator for implementing a CNN (Tiny YOLO as an example case), targeting practical applications such as self-driving cars.
OBJECTIVES

Current objectives are:


 Development of our own custom architecture.
 Interfacing of a USB camera with the ZedBoard.
 Implementation of a complete CNN on hardware.
 Testing, which will be performed on the ZedBoard.
STATE OF ART
 Intel has launched a Xeon processor coupled with an FPGA.
 Microsoft launched Project Brainwave, an acceleration framework on
Azure that can run TensorFlow models, built on Catapult (its FPGA
cloud).
 Xilinx bought DeePhi, a startup deploying DNN models to FPGAs
using techniques such as Deep Compression and Network Pruning
(DNNDK).
 Xilinx invested in TeraDeep, which provides RTL acceleration
code for SoCs.
 Even NVIDIA has open-sourced its hardware blocks, implemented
in Verilog under the codename NVDLA, ready to be deployed.
SYSTEM LEVEL DESIGN
SYSTEM LEVEL DESIGN(INTERNAL WORKING)
PROGRESS
We started our design in C++ and implemented all the components of neural
nets from scratch.
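For illustration, two of those building blocks can be sketched as below. The single-channel layout, function names, and the 0.1 leak factor are assumptions for this sketch, not our exact code; Tiny YOLO's layers use leaky-ReLU activations followed by 2×2 max-pooling.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Leaky ReLU: pass positives through, scale negatives by a small slope
// (0.1 is the usual Darknet value; treat it as an assumption here).
float leaky_relu(float x) { return x > 0.0f ? x : 0.1f * x; }

// 2x2 max-pool with stride 2 over a single H x W channel (H, W even),
// stored row-major in a flat vector.
std::vector<float> maxpool2x2(const std::vector<float>& in,
                              std::size_t H, std::size_t W) {
    std::vector<float> out((H / 2) * (W / 2));
    for (std::size_t r = 0; r < H / 2; ++r)
        for (std::size_t c = 0; c < W / 2; ++c) {
            float m = in[2 * r * W + 2 * c];
            m = std::max(m, in[2 * r * W + 2 * c + 1]);
            m = std::max(m, in[(2 * r + 1) * W + 2 * c]);
            m = std::max(m, in[(2 * r + 1) * W + 2 * c + 1]);
            out[r * (W / 2) + c] = m;
        }
    return out;
}
```

Writing these in plain C++ first let us validate results on the CPU before touching the hardware flow.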
PROGRESS
A single layer of Tiny YOLO was successfully implemented using Vivado HLS, which
converts the C code into RTL logic that can then be imported into Vivado as an
IP core. Shown below is the block design of the first layer of Tiny YOLO.
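Vivado HLS synthesizes ordinary C/C++ annotated with pragmas into RTL. A minimal sketch of what such a layer function can look like follows; the fixed 16×16 input, 3×3 kernel, and pragma placement are illustrative, not our actual top function (pragmas are shown as comments so the sketch also compiles with a plain C++ compiler):

```cpp
#include <cassert>

// Illustrative HLS-style top function for one convolution layer.
// In the real flow, directives like these guide synthesis:
//   #pragma HLS INTERFACE m_axi port=in   (move data over an AXI bus)
//   #pragma HLS PIPELINE II=1             (pipeline the MAC loop)
void conv_layer_top(const float in[16][16], const float k[3][3],
                    float out[14][14]) {
    for (int r = 0; r < 14; ++r) {
        for (int c = 0; c < 14; ++c) {
            float acc = 0.0f;
            // #pragma HLS PIPELINE II=1
            for (int i = 0; i < 3; ++i)
                for (int j = 0; j < 3; ++j)
                    acc += in[r + i][c + j] * k[i][j];
            out[r][c] = acc;
        }
    }
}
```

Fixed array bounds matter here: HLS needs compile-time sizes to decide how much BRAM and how many DSP slices to allocate.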
PROGRESS
The application was written in the Xilinx SDK.
PROGRESS
Camera interfacing is also in progress, using PetaLinux.
PROGRESS
In parallel, a single layer has been implemented in Verilog to optimize the use of
available resources.
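A bit-accurate C++ model of the fixed-point multiply-accumulate datapath that such a Verilog layer maps onto DSP slices is sketched below; the Q8.8 format, 16-bit widths, and saturation behavior here are assumptions for illustration, not the exact RTL:

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Fixed-point dot product mirroring a DSP48-style datapath:
// 16-bit Q8.8 operands multiply into 32-bit Q16.16 products, accumulate
// in a wide register, then rescale to Q8.8 and saturate to 16 bits.
int16_t fixed_dot(const int16_t* a, const int16_t* b, std::size_t n) {
    int64_t acc = 0;                                   // wide accumulator
    for (std::size_t i = 0; i < n; ++i)
        acc += static_cast<int32_t>(a[i]) * b[i];      // Q8.8 * Q8.8 = Q16.16
    acc >>= 8;                                         // rescale to Q8.8
    if (acc > INT16_MAX) acc = INT16_MAX;              // saturate high
    if (acc < INT16_MIN) acc = INT16_MIN;              // saturate low
    return static_cast<int16_t>(acc);
}
```

Modeling the datapath in C++ first makes it possible to diff the Verilog simulation output against a known-good reference, value for value.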
PROGRESS
• Unquantized (32-bit float): mAP = 57
• Quantized to 20 bits: mAP = 48
• Quantized to 18 bits: mAP = 40.1
• Quantized to 16 bits: mAP = 30.8
• Quantized to 14 bits: mAP = 25.9
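The mAP loss tracks the shrinking bit budget: each weight is rounded to the nearest representable fixed-point value, and the rounding error grows as bits are removed. A hedged sketch of quantizing one float weight under an N-bit symmetric fixed-point format (the format parameters are illustrative, not the exact scheme used):

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>

// Quantize x to a signed fixed-point value with `total_bits` bits,
// `frac_bits` of them fractional, then dequantize so the rounding
// error can be compared against the original float.
float quantize(float x, int total_bits, int frac_bits) {
    const float scale = static_cast<float>(1 << frac_bits);
    const int64_t max_q = (1LL << (total_bits - 1)) - 1;  // e.g. 32767 for 16 bits
    int64_t q = static_cast<int64_t>(std::lround(x * scale));
    if (q > max_q) q = max_q;                  // saturate out-of-range values
    if (q < -max_q - 1) q = -max_q - 1;
    return static_cast<float>(q) / scale;      // nearest representable value
}
```

With 8 fractional bits a weight of 0.30 becomes 77/256 ≈ 0.3008; at fewer bits both the rounding step and the clipping range tighten, which is what erodes mAP.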
PROGRESS

[Figure: detection outputs at 20-, 18-, 16-, and 14-bit quantization]
THINGS TO DO NEXT

We have implemented one layer of our CNN on hardware; next we aim to
complete the remaining layers and then move on to the testing phase.
TIMELINE
THANK YOU
