FPGA implementation of spiking convolutional neural networks for voice keyword recognition
The spiking convolutional neural network (SCNN) is a hybrid model of both the spiking neural network (SNN) and convolutional neural network (CNN). It is known to be power efficient and accurate. Although SCNN has shown promising accuracy for voice keyword recognition in software environment, its com...
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project (FYP) |
Language: | English |
Published: |
Nanyang Technological University
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/149182 |
_version_ | 1824454841070518272 |
---|---|
author | Ng, Wei Soon |
author2 | Goh Wang Ling |
author_facet | Goh Wang Ling Ng, Wei Soon |
author_sort | Ng, Wei Soon |
collection | NTU |
description | The spiking convolutional neural network (SCNN) is a hybrid model of both the spiking neural network (SNN) and convolutional neural network (CNN). It is known to be power efficient and accurate. Although SCNN has shown promising accuracy for voice keyword recognition in software environment, its computation time remains the biggest challenge of the model.
To improve the computational speed of SCNN, a highly efficient Field-Programmable Gate Array (FPGA)-based SCNN hardware architecture for voice keyword recognition is proposed in this project. The design methodology engages the parallelism and flexibility of FPGA to achieve high speed computation. The proposed architecture is fully pipelined to maximize the efficiency of the system. Furthermore, the spiking model used in this work is the Izhkevich spiking model [1] which is of much less complexity and yet able to achieve competitive accuracy when compared to the Hodgkin-Huxley spiking model [2].
The proposed SCNN model achieves an accuracy of 89% with zero accuracy loss upon mapping from the software environment to the hardware environment. Besides, through retraining and fixed-point quantization techniques, the memory consumption of weights and other parameters stored on FPGA is reduced by 8 times without compromising the accuracy. The proposed design flow for implementing the SCNN model serves as a good reference for future design, to help reduce the development time of SCNN model on FPGA. |
first_indexed | 2025-02-19T03:28:43Z |
format | Final Year Project (FYP) |
id | ntu-10356/149182 |
institution | Nanyang Technological University |
language | English |
last_indexed | 2025-02-19T03:28:43Z |
publishDate | 2021 |
publisher | Nanyang Technological University |
record_format | dspace |
spelling | ntu-10356/1491822023-07-07T18:17:38Z FPGA implementation of spiking convolutional neural networks for voice keyword recognition Ng, Wei Soon Goh Wang Ling School of Electrical and Electronic Engineering Gao Yuan EWLGOH@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Electrical and electronic engineering The spiking convolutional neural network (SCNN) is a hybrid model of both the spiking neural network (SNN) and convolutional neural network (CNN). It is known to be power efficient and accurate. Although SCNN has shown promising accuracy for voice keyword recognition in software environment, its computation time remains the biggest challenge of the model. To improve the computational speed of SCNN, a highly efficient Field-Programmable Gate Array (FPGA)-based SCNN hardware architecture for voice keyword recognition is proposed in this project. The design methodology engages the parallelism and flexibility of FPGA to achieve high speed computation. The proposed architecture is fully pipelined to maximize the efficiency of the system. Furthermore, the spiking model used in this work is the Izhkevich spiking model [1] which is of much less complexity and yet able to achieve competitive accuracy when compared to the Hodgkin-Huxley spiking model [2]. The proposed SCNN model achieves an accuracy of 89% with zero accuracy loss upon mapping from the software environment to the hardware environment. Besides, through retraining and fixed-point quantization techniques, the memory consumption of weights and other parameters stored on FPGA is reduced by 8 times without compromising the accuracy. The proposed design flow for implementing the SCNN model serves as a good reference for future design, to help reduce the development time of SCNN model on FPGA. Bachelor of Engineering (Electrical and Electronic Engineering) 2021-05-28T04:08:58Z 2021-05-28T04:08:58Z 2021 Final Year Project (FYP) Ng, W. S. (2021). FPGA implementation of spiking convolutional neural networks for voice keyword recognition. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149182 https://hdl.handle.net/10356/149182 en B2063-201 application/pdf Nanyang Technological University |
spellingShingle | Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Electrical and electronic engineering Ng, Wei Soon FPGA implementation of spiking convolutional neural networks for voice keyword recognition |
title | FPGA implementation of spiking convolutional neural networks for voice keyword recognition |
title_full | FPGA implementation of spiking convolutional neural networks for voice keyword recognition |
title_fullStr | FPGA implementation of spiking convolutional neural networks for voice keyword recognition |
title_full_unstemmed | FPGA implementation of spiking convolutional neural networks for voice keyword recognition |
title_short | FPGA implementation of spiking convolutional neural networks for voice keyword recognition |
title_sort | fpga implementation of spiking convolutional neural networks for voice keyword recognition |
topic | Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Electrical and electronic engineering |
url | https://hdl.handle.net/10356/149182 |
work_keys_str_mv | AT ngweisoon fpgaimplementationofspikingconvolutionalneuralnetworksforvoicekeywordrecognition |