Consumer Document Analytical Accelerator Hardware

Document scanning devices are used for visual character recognition, followed by text analytics in the software. Often such character extraction is insecure, and any third party can manipulate the information. On the other hand, near-edge processing devices are restrained by limited resources and co...

Full description

Bibliographic Details
Main Authors: Aswani Radhakrishnan, Dibyasha Mahapatra, Alex James
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10018187/
_version_ 1828011343002730496
author Aswani Radhakrishnan
Dibyasha Mahapatra
Alex James
author_facet Aswani Radhakrishnan
Dibyasha Mahapatra
Alex James
author_sort Aswani Radhakrishnan
collection DOAJ
description Document scanning devices are used for visual character recognition, followed by text analytics in the software. Often such character extraction is insecure, and any third party can manipulate the information. On the other hand, near-edge processing devices are restrained by limited resources and connectivity issues. The primary factors that lead to exploring independent hardware devices with natural language processing (NLP) capabilities are latency during cloud processing and computing costs. This paper introduces a hardware accelerator for information retrieval using memristive TF-IDF implementation. In this system, each sentence is represented using a memristive crossbar layer, with each column containing a single word. The number of matching scores for the TF and IDF values was implemented using operational amplifier-based comparator accumulator circuits. The circuit is designed with a 180nm CMOS process, Knowm Multi-Stable Switch memristor model, and WOx device parameters. We compared its performance with that of a standard benchmark dataset. Variability and device-to-device related issues were also taken into consideration in the analysis. This paper concludes with implementing TF-IDF score calculation for applications such as information retrieval and text summarization.
first_indexed 2024-04-10T09:14:55Z
format Article
id doaj.art-fdc9bd67423a4727a0573ed98c7466a3
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-10T09:14:55Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-fdc9bd67423a4727a0573ed98c7466a32023-02-21T00:02:04ZengIEEEIEEE Access2169-35362023-01-01115161516710.1109/ACCESS.2023.323746310018187Consumer Document Analytical Accelerator HardwareAswani Radhakrishnan0https://orcid.org/0000-0002-1086-1870Dibyasha Mahapatra1Alex James2https://orcid.org/0000-0001-5655-1213School of Electronic Systems and Automation, Kerala University of Digital Sciences Innovation and Technology (Digital University Kerala), Thiruvananthapuram, Kerala, IndiaSchool of Electronic Systems and Automation, Kerala University of Digital Sciences Innovation and Technology (Digital University Kerala), Thiruvananthapuram, Kerala, IndiaSchool of Electronic Systems and Automation, Kerala University of Digital Sciences Innovation and Technology (Digital University Kerala), Thiruvananthapuram, Kerala, IndiaDocument scanning devices are used for visual character recognition, followed by text analytics in the software. Often such character extraction is insecure, and any third party can manipulate the information. On the other hand, near-edge processing devices are restrained by limited resources and connectivity issues. The primary factors that lead to exploring independent hardware devices with natural language processing (NLP) capabilities are latency during cloud processing and computing costs. This paper introduces a hardware accelerator for information retrieval using memristive TF-IDF implementation. In this system, each sentence is represented using a memristive crossbar layer, with each column containing a single word. The number of matching scores for the TF and IDF values was implemented using operational amplifier-based comparator accumulator circuits. The circuit is designed with a 180nm CMOS process, Knowm Multi-Stable Switch memristor model, and WOx device parameters. We compared its performance with that of a standard benchmark dataset. Variability and device-to-device related issues were also taken into consideration in the analysis. This paper concludes with implementing TF-IDF score calculation for applications such as information retrieval and text summarization.https://ieeexplore.ieee.org/document/10018187/Natural language processingTF-IDFhardware acceleratormemristive systemsmemristoranalog computation
spellingShingle Aswani Radhakrishnan
Dibyasha Mahapatra
Alex James
Consumer Document Analytical Accelerator Hardware
IEEE Access
Natural language processing
TF-IDF
hardware accelerator
memristive systems
memristor
analog computation
title Consumer Document Analytical Accelerator Hardware
title_full Consumer Document Analytical Accelerator Hardware
title_fullStr Consumer Document Analytical Accelerator Hardware
title_full_unstemmed Consumer Document Analytical Accelerator Hardware
title_short Consumer Document Analytical Accelerator Hardware
title_sort consumer document analytical accelerator hardware
topic Natural language processing
TF-IDF
hardware accelerator
memristive systems
memristor
analog computation
url https://ieeexplore.ieee.org/document/10018187/
work_keys_str_mv AT aswaniradhakrishnan consumerdocumentanalyticalacceleratorhardware
AT dibyashamahapatra consumerdocumentanalyticalacceleratorhardware
AT alexjames consumerdocumentanalyticalacceleratorhardware