FPGA implementation of spiking convolutional neural networks for voice keyword recognition

The spiking convolutional neural network (SCNN) is a hybrid model of both the spiking neural network (SNN) and convolutional neural network (CNN). It is known to be power efficient and accurate. Although SCNN has shown promising accuracy for voice keyword recognition in software environment, its com...

Full description

Bibliographic Details
Main Author: Ng, Wei Soon
Other Authors: Goh Wang Ling
Format: Final Year Project (FYP)
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/149182
_version_ 1824454841070518272
author Ng, Wei Soon
author2 Goh Wang Ling
author_facet Goh Wang Ling
Ng, Wei Soon
author_sort Ng, Wei Soon
collection NTU
description The spiking convolutional neural network (SCNN) is a hybrid model of both the spiking neural network (SNN) and convolutional neural network (CNN). It is known to be power efficient and accurate. Although SCNN has shown promising accuracy for voice keyword recognition in software environment, its computation time remains the biggest challenge of the model. To improve the computational speed of SCNN, a highly efficient Field-Programmable Gate Array (FPGA)-based SCNN hardware architecture for voice keyword recognition is proposed in this project. The design methodology engages the parallelism and flexibility of FPGA to achieve high speed computation. The proposed architecture is fully pipelined to maximize the efficiency of the system. Furthermore, the spiking model used in this work is the Izhkevich spiking model [1] which is of much less complexity and yet able to achieve competitive accuracy when compared to the Hodgkin-Huxley spiking model [2]. The proposed SCNN model achieves an accuracy of 89% with zero accuracy loss upon mapping from the software environment to the hardware environment. Besides, through retraining and fixed-point quantization techniques, the memory consumption of weights and other parameters stored on FPGA is reduced by 8 times without compromising the accuracy. The proposed design flow for implementing the SCNN model serves as a good reference for future design, to help reduce the development time of SCNN model on FPGA.
first_indexed 2025-02-19T03:28:43Z
format Final Year Project (FYP)
id ntu-10356/149182
institution Nanyang Technological University
language English
last_indexed 2025-02-19T03:28:43Z
publishDate 2021
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/1491822023-07-07T18:17:38Z FPGA implementation of spiking convolutional neural networks for voice keyword recognition Ng, Wei Soon Goh Wang Ling School of Electrical and Electronic Engineering Gao Yuan EWLGOH@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Electrical and electronic engineering The spiking convolutional neural network (SCNN) is a hybrid model of both the spiking neural network (SNN) and convolutional neural network (CNN). It is known to be power efficient and accurate. Although SCNN has shown promising accuracy for voice keyword recognition in software environment, its computation time remains the biggest challenge of the model. To improve the computational speed of SCNN, a highly efficient Field-Programmable Gate Array (FPGA)-based SCNN hardware architecture for voice keyword recognition is proposed in this project. The design methodology engages the parallelism and flexibility of FPGA to achieve high speed computation. The proposed architecture is fully pipelined to maximize the efficiency of the system. Furthermore, the spiking model used in this work is the Izhkevich spiking model [1] which is of much less complexity and yet able to achieve competitive accuracy when compared to the Hodgkin-Huxley spiking model [2]. The proposed SCNN model achieves an accuracy of 89% with zero accuracy loss upon mapping from the software environment to the hardware environment. Besides, through retraining and fixed-point quantization techniques, the memory consumption of weights and other parameters stored on FPGA is reduced by 8 times without compromising the accuracy. The proposed design flow for implementing the SCNN model serves as a good reference for future design, to help reduce the development time of SCNN model on FPGA. Bachelor of Engineering (Electrical and Electronic Engineering) 2021-05-28T04:08:58Z 2021-05-28T04:08:58Z 2021 Final Year Project (FYP) Ng, W. S. (2021). FPGA implementation of spiking convolutional neural networks for voice keyword recognition. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149182 https://hdl.handle.net/10356/149182 en B2063-201 application/pdf Nanyang Technological University
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Electrical and electronic engineering
Ng, Wei Soon
FPGA implementation of spiking convolutional neural networks for voice keyword recognition
title FPGA implementation of spiking convolutional neural networks for voice keyword recognition
title_full FPGA implementation of spiking convolutional neural networks for voice keyword recognition
title_fullStr FPGA implementation of spiking convolutional neural networks for voice keyword recognition
title_full_unstemmed FPGA implementation of spiking convolutional neural networks for voice keyword recognition
title_short FPGA implementation of spiking convolutional neural networks for voice keyword recognition
title_sort fpga implementation of spiking convolutional neural networks for voice keyword recognition
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Electrical and electronic engineering
url https://hdl.handle.net/10356/149182
work_keys_str_mv AT ngweisoon fpgaimplementationofspikingconvolutionalneuralnetworksforvoicekeywordrecognition