Current Affiliation: Google Deepmind Current Affiliation: University of Oxford and Google Deepmind

ABSTRACT

In this work we investigate the effect of the convolutional network depth on its
accuracy in the large-scale image recognition setting. Our main contribution is
a thorough evaluation of networks of increasing depth using an architecture with
very small (3×3) convolution filters, which shows that a significant improvement
on the prior-art configurations can be achieved by pushing the depth to 16–19
weight layers. These findings were the basis of our ImageNet Challenge 2014
submission, where our team secured the first and the second places in the localisation
and classification tracks respectively. We also show that our representations
generalise well to other datasets, where they achieve state-of-the-art results. We
have made our two best-performing ConvNet models publicly available to facilitate
further research on the use of deep visual representations in computer vision.

1 INTRODUCTION

Convolutional networks (ConvNets) have recently enjoyed great success in large-scale image
and video recognition (Krizhevsky et al., 2012; Zeiler & Fergus, 2013; Sermanet et al., 2014;
Simonyan & Zisserman, 2014), which has become possible due to large public image repositories,
such as ImageNet (Deng et al., 2009), and high-performance computing systems, such as GPUs
or large-scale distributed clusters (Dean et al., 2012). In particular, an important role in the advance
of deep visual recognition architectures has been played by the ImageNet Large-Scale Visual Recognition
Challenge (ILSVRC) (Russakovsky et al., 2014), which has served as a testbed for a few
generations of large-scale image classification systems, from high-dimensional shallow feature encodings
(Perronnin et al., 2010) (the winner of ILSVRC-2011) to deep ConvNets (Krizhevsky et al.,
2012) (the winner of ILSVRC-2012).
With ConvNets becoming more of a commodity in the computer vision field, a number of attempts
have been made to improve the original architecture of Krizhevsky et al. (2012) in a
bid to achieve better accuracy. For instance, the best-performing submissions to ILSVRC-2013
(Zeiler & Fergus, 2013; Sermanet et al., 2014) utilised smaller receptive window size and
smaller stride of the first convolutional layer. Another line of improvements dealt with training
and testing the networks densely over the whole image and over multiple scales (Sermanet et al.,
2014; Howard, 2014). In this paper, we address another important aspect of ConvNet architecture
design – its depth. To this end, we fix other parameters of the architecture, and steadily increase the
depth of the network by adding more convolutional layers, which is feasible due to the use of very
small (3 × 3) convolution filters in all layers.
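
To illustrate why small filters make added depth affordable, the following is a minimal sketch, not taken from the paper's code. It compares a stack of 3 × 3 convolution layers (stride 1) with a single larger filter covering the same input region. The helper names `receptive_field` and `conv_weights` and the channel width `C = 64` are assumptions chosen for illustration only; biases are ignored.

```python
def receptive_field(num_layers, kernel=3, stride=1):
    """Input region (in pixels per side) seen by one output unit of a
    stack of `num_layers` identical convolution layers."""
    rf, jump = 1, 1
    for _ in range(num_layers):
        rf += (kernel - 1) * jump  # each layer widens the field
        jump *= stride             # spacing between adjacent units
    return rf


def conv_weights(num_layers, kernel, channels):
    """Number of weights (biases ignored) in a stack whose layers all
    have `channels` input and output channels."""
    return num_layers * kernel * kernel * channels * channels


C = 64  # hypothetical channel width, for illustration only

# Three stacked 3x3 layers cover the same 7x7 region as one 7x7 layer.
stacked_rf = receptive_field(3, kernel=3)
single_rf = receptive_field(1, kernel=7)

# Weight counts for the two alternatives.
stacked_w = conv_weights(3, 3, C)  # 3 * 9 * C^2 = 27 C^2
single_w = conv_weights(1, 7, C)   # 49 * C^2

print(stacked_rf, single_rf)
print(stacked_w, single_w)
```

Under these assumptions, the stacked variant reaches the same receptive field with roughly 45% fewer weights, while also interleaving more non-linearities if each layer is followed by one.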
As a result, we come up with significantly more accurate ConvNet architectures, which not only
achieve the state-of-the-art accuracy on ILSVRC classification and localisation tasks, but are also
applicable to other image recognition datasets, where they achieve excellent performance even when
used as part of relatively simple pipelines (e.g. deep features classified by a linear SVM without
fine-tuning). We have released our two best-performing models to facilitate further research.
The rest of the paper is organised as follows. In Sect. 2, we describe our ConvNet configurations.
The details of the image classification training and evaluation are then presented in Sect. 3, and the