Eyeriss v2: A Flexible Accelerator for Emerging Deep Neural Networks on Mobile Devices

© 2019 IEEE. A recent trend in deep neural network (DNN) development is to extend the reach of deep learning applications to platforms that are more resource- and energy-constrained, e.g., mobile devices. These endeavors aim to reduce the DNN model size and improve hardware processing efficiency, and have resulted in DNNs that are much more compact in their structures and/or have high data sparsity. These compact or sparse models differ from the traditional large ones in that their layer shapes and sizes vary much more widely, and they often require specialized hardware to exploit sparsity for performance improvement. Therefore, many DNN accelerators designed for large DNNs do not perform well on these models. In this paper, we present Eyeriss v2, a DNN accelerator architecture designed for running compact and sparse DNNs. To deal with the widely varying layer shapes and sizes, it introduces a highly flexible on-chip network, called hierarchical mesh, that can adapt to the different amounts of data reuse and bandwidth required by different data types, which improves the utilization of the computation resources. Furthermore, Eyeriss v2 can process sparse data directly in the compressed domain for both weights and activations, and is therefore able to improve both processing speed and energy efficiency with sparse models. Overall, with sparse MobileNet, Eyeriss v2 in a 65-nm CMOS process achieves a throughput of 1470.6 inferences/s and an energy efficiency of 2560.3 inferences/J at a batch size of 1, which is 12.6× faster and 2.5× more energy-efficient than the original Eyeriss running MobileNet.
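
The abstract's sparsity claim rests on processing weights and activations directly in compressed form, so that zero operands never consume a multiply. Below is a minimal Python sketch of that idea using a compressed sparse column (CSC) encoding, the same format the paper uses for its weights and activations; the function names and the simple two-loop datapath here are illustrative assumptions, not Eyeriss v2's actual processing-element logic.

```python
import numpy as np

def csc_encode(W):
    """Encode a dense weight matrix in compressed sparse column (CSC) form:
    the nonzero values, their row indices, and per-column start offsets."""
    data, row_idx, col_ptr = [], [], [0]
    rows, cols = W.shape
    for j in range(cols):
        for i in range(rows):
            if W[i, j] != 0:
                data.append(W[i, j])
                row_idx.append(i)
        col_ptr.append(len(data))
    return np.array(data), np.array(row_idx), np.array(col_ptr)

def sparse_matvec(data, row_idx, col_ptr, x, num_rows):
    """y = W @ x computed on the compressed weights: zero activations are
    skipped outright, and within a selected column only nonzero weights are
    visited, so every multiply has two nonzero operands."""
    y = np.zeros(num_rows)
    for j, xj in enumerate(x):
        if xj == 0:
            continue                       # sparse activation: no work issued
        for k in range(col_ptr[j], col_ptr[j + 1]):
            y[row_idx[k]] += data[k] * xj  # only nonzero weights reach here
    return y

# Quick check against the dense computation (hypothetical toy data).
W = np.array([[0, 2, 0], [1, 0, 0], [0, 0, 3]], dtype=float)
x = np.array([0.0, 5.0, 1.0])
data, rows, cols = csc_encode(W)
assert np.allclose(sparse_matvec(data, rows, cols, x, W.shape[0]), W @ x)
```

In this toy case only two of the nine weight slots ever reach the multiplier; in hardware, the same operand skipping is what yields the speed and energy gains the abstract reports for sparse MobileNet.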

Bibliographic Details
Main Authors: Chen, Yu-Hsin; Yang, Tien-Ju; Emer, Joel S.; Sze, Vivienne
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format: Article
Language: English
Published: Institute of Electrical and Electronics Engineers (IEEE), 2021
Published in: IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 2019
DOI: 10.1109/JETCAS.2019.2910232
License: Creative Commons Attribution-NonCommercial-ShareAlike 4.0 (http://creativecommons.org/licenses/by-nc-sa/4.0/)
Online Access: https://hdl.handle.net/1721.1/134768