Deep Convolutional Networks are Hierarchical Kernel Machines

We extend i-theory to incorporate not only pooling but also rectifying nonlinearities in an extended HW module (eHW) designed for supervised learning. The two operations roughly correspond to invariance and selectivity, respectively. Under the assumption of normalized inputs, we show that appropriat...

Full description

Bibliographic Details
Main Authors: Anselmi, Fabio, Rosasco, Lorenzo, Tan, Cheston, Poggio, Tomaso
Format: Technical Report
Language:en_US
Published: Center for Brains, Minds and Machines (CBMM), arXiv 2015
Subjects:
Online Access:http://hdl.handle.net/1721.1/100200
_version_ 1826190982961430528
author Anselmi, Fabio
Rosasco, Lorenzo
Tan, Cheston
Poggio, Tomaso
author_facet Anselmi, Fabio
Rosasco, Lorenzo
Tan, Cheston
Poggio, Tomaso
author_sort Anselmi, Fabio
collection MIT
description We extend i-theory to incorporate not only pooling but also rectifying nonlinearities in an extended HW module (eHW) designed for supervised learning. The two operations roughly correspond to invariance and selectivity, respectively. Under the assumption of normalized inputs, we show that appropriate linear combinations of rectifying nonlinearities are equivalent to radial kernels. If pooling is present an equivalent kernel also exist. Thus present-day DCNs (Deep Convolutional Networks) can be exactly equivalent to a hierarchy of kernel machines with pooling and non-pooling layers. Finally, we describe a conjecture for theoretically understanding hierarchies of such modules. A main consequence of the conjecture is that hierarchies of eHW modules minimize memory requirements while computing a selective and invariant representation.
first_indexed 2024-09-23T08:48:17Z
format Technical Report
id mit-1721.1/100200
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T08:48:17Z
publishDate 2015
publisher Center for Brains, Minds and Machines (CBMM), arXiv
record_format dspace
spelling mit-1721.1/1002002019-08-02T03:02:31Z Deep Convolutional Networks are Hierarchical Kernel Machines Anselmi, Fabio Rosasco, Lorenzo Tan, Cheston Poggio, Tomaso i-theory extended HW module (eHW) Invariance Selectivity Hierarchy Machine Learning We extend i-theory to incorporate not only pooling but also rectifying nonlinearities in an extended HW module (eHW) designed for supervised learning. The two operations roughly correspond to invariance and selectivity, respectively. Under the assumption of normalized inputs, we show that appropriate linear combinations of rectifying nonlinearities are equivalent to radial kernels. If pooling is present an equivalent kernel also exist. Thus present-day DCNs (Deep Convolutional Networks) can be exactly equivalent to a hierarchy of kernel machines with pooling and non-pooling layers. Finally, we describe a conjecture for theoretically understanding hierarchies of such modules. A main consequence of the conjecture is that hierarchies of eHW modules minimize memory requirements while computing a selective and invariant representation. This work was supported by the Center for Brains, Minds and Machines (CBMM), funded by NSF STC award CCF-1231216. 2015-12-11T22:26:00Z 2015-12-11T22:26:00Z 2015-08-05 Technical Report Working Paper Other http://hdl.handle.net/1721.1/100200 arXiv:1508.01084 en_US CBMM Memo Series;035 Attribution-NonCommercial 3.0 United States http://creativecommons.org/licenses/by-nc/3.0/us/ application/pdf Center for Brains, Minds and Machines (CBMM), arXiv
spellingShingle i-theory
extended HW module (eHW)
Invariance
Selectivity
Hierarchy
Machine Learning
Anselmi, Fabio
Rosasco, Lorenzo
Tan, Cheston
Poggio, Tomaso
Deep Convolutional Networks are Hierarchical Kernel Machines
title Deep Convolutional Networks are Hierarchical Kernel Machines
title_full Deep Convolutional Networks are Hierarchical Kernel Machines
title_fullStr Deep Convolutional Networks are Hierarchical Kernel Machines
title_full_unstemmed Deep Convolutional Networks are Hierarchical Kernel Machines
title_short Deep Convolutional Networks are Hierarchical Kernel Machines
title_sort deep convolutional networks are hierarchical kernel machines
topic i-theory
extended HW module (eHW)
Invariance
Selectivity
Hierarchy
Machine Learning
url http://hdl.handle.net/1721.1/100200
work_keys_str_mv AT anselmifabio deepconvolutionalnetworksarehierarchicalkernelmachines
AT rosascolorenzo deepconvolutionalnetworksarehierarchicalkernelmachines
AT tancheston deepconvolutionalnetworksarehierarchicalkernelmachines
AT poggiotomaso deepconvolutionalnetworksarehierarchicalkernelmachines