Hardware Acceleration of Large-Scale CMOS Invertible Logic Based on Sparse Hamiltonian Matrices

Invertible logic has been recently presented that can realize bidirectional computing based on Hamiltonians for solving several critical issues, such as integer factorization and training neural networks. However, a hardware architecture for supporting large-scale general-purpose invertible logic ha...

Full description

Bibliographic Details
Main Authors:	Naoya Onizawa, Akira Tamakoshi, Takahiro Hanyu
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Open Journal of Circuits and Systems
Subjects:	Boltzmann machine sparse matrix FPGA integer factorization
Online Access:	https://ieeexplore.ieee.org/document/9645044/

_version_	1819364482580742144
author	Naoya Onizawa Akira Tamakoshi Takahiro Hanyu
author_facet	Naoya Onizawa Akira Tamakoshi Takahiro Hanyu
author_sort	Naoya Onizawa
collection	DOAJ
description	Invertible logic has been recently presented that can realize bidirectional computing based on Hamiltonians for solving several critical issues, such as integer factorization and training neural networks. However, a hardware architecture for supporting large-scale general-purpose invertible logic has not been studied. In this paper, we introduce a scalable hardware architecture based on sparse Hamiltonian matrices. In order to store and compute the Hamiltonians efficiently in hardware, a sparse matrix representation of PTELL (partitioned and transposed ELLPACK) is proposed. A memory size of PTELL can be smaller than that of a conventional ELL by reducing the number of paddings while parallel reading of non-zero values are realized for high-throughput operations. As a result, PTELL achieves around 1% and 10% memory usages of a conventional dense and ELL matrices, respectively, in case of invertible multipliers. In addition, the proposed hardware accelerator of invertible logic for supporting arbitrary Hamiltonians is implemented on Xilinx VU9P FPGA, which achieves around two orders of magnitude faster than a 16-core Intel Xeon implementation.
first_indexed	2024-12-24T22:59:39Z
format	Article
id	doaj.art-20826c96c8834c70930036fcdf2f8f58
institution	Directory Open Access Journal
issn	2644-1225
language	English
last_indexed	2024-12-24T22:59:39Z
publishDate	2021-01-01
publisher	IEEE
record_format	Article
series	IEEE Open Journal of Circuits and Systems
spelling	doaj.art-20826c96c8834c70930036fcdf2f8f582022-12-21T16:35:10ZengIEEEIEEE Open Journal of Circuits and Systems2644-12252021-01-01278279110.1109/OJCAS.2021.31165849645044Hardware Acceleration of Large-Scale CMOS Invertible Logic Based on Sparse Hamiltonian MatricesNaoya Onizawa0https://orcid.org/0000-0002-4855-7081Akira Tamakoshi1https://orcid.org/0000-0002-7348-8196Takahiro Hanyu2Research Institute of Electrical Communication, Tohoku University, Sendai, JapanResearch Institute of Electrical Communication, Tohoku University, Sendai, JapanResearch Institute of Electrical Communication, Tohoku University, Sendai, JapanInvertible logic has been recently presented that can realize bidirectional computing based on Hamiltonians for solving several critical issues, such as integer factorization and training neural networks. However, a hardware architecture for supporting large-scale general-purpose invertible logic has not been studied. In this paper, we introduce a scalable hardware architecture based on sparse Hamiltonian matrices. In order to store and compute the Hamiltonians efficiently in hardware, a sparse matrix representation of PTELL (partitioned and transposed ELLPACK) is proposed. A memory size of PTELL can be smaller than that of a conventional ELL by reducing the number of paddings while parallel reading of non-zero values are realized for high-throughput operations. As a result, PTELL achieves around 1% and 10% memory usages of a conventional dense and ELL matrices, respectively, in case of invertible multipliers. In addition, the proposed hardware accelerator of invertible logic for supporting arbitrary Hamiltonians is implemented on Xilinx VU9P FPGA, which achieves around two orders of magnitude faster than a 16-core Intel Xeon implementation.https://ieeexplore.ieee.org/document/9645044/Boltzmann machinesparse matrixFPGAinteger factorization
spellingShingle	Naoya Onizawa Akira Tamakoshi Takahiro Hanyu Hardware Acceleration of Large-Scale CMOS Invertible Logic Based on Sparse Hamiltonian Matrices IEEE Open Journal of Circuits and Systems Boltzmann machine sparse matrix FPGA integer factorization
title	Hardware Acceleration of Large-Scale CMOS Invertible Logic Based on Sparse Hamiltonian Matrices
title_full	Hardware Acceleration of Large-Scale CMOS Invertible Logic Based on Sparse Hamiltonian Matrices
title_fullStr	Hardware Acceleration of Large-Scale CMOS Invertible Logic Based on Sparse Hamiltonian Matrices
title_full_unstemmed	Hardware Acceleration of Large-Scale CMOS Invertible Logic Based on Sparse Hamiltonian Matrices
title_short	Hardware Acceleration of Large-Scale CMOS Invertible Logic Based on Sparse Hamiltonian Matrices
title_sort	hardware acceleration of large scale cmos invertible logic based on sparse hamiltonian matrices
topic	Boltzmann machine sparse matrix FPGA integer factorization
url	https://ieeexplore.ieee.org/document/9645044/
work_keys_str_mv	AT naoyaonizawa hardwareaccelerationoflargescalecmosinvertiblelogicbasedonsparsehamiltonianmatrices AT akiratamakoshi hardwareaccelerationoflargescalecmosinvertiblelogicbasedonsparsehamiltonianmatrices AT takahirohanyu hardwareaccelerationoflargescalecmosinvertiblelogicbasedonsparsehamiltonianmatrices

Hardware Acceleration of Large-Scale CMOS Invertible Logic Based on Sparse Hamiltonian Matrices

Similar Items