Deep learning-based information retrieval with normalized dominant feature subset and weighted vector model

Multimedia data, which includes textual information, is employed in a variety of practical computer vision applications. More than a million new records are added to social media and news sites every day, and the text content they contain has gotten increasingly complex. Finding a meaningful text re...

Full description

Bibliographic Details
Main Authors:	Poluru Eswaraiah, Hussain Syed
Format:	Article
Language:	English
Published:	PeerJ Inc. 2024-01-01
Series:	PeerJ Computer Science
Subjects:	Big data Feature extraction Feature subset Feature selection Feature vector
Online Access:	https://peerj.com/articles/cs-1805.pdf

_version_	1827375284710539264
author	Poluru Eswaraiah Hussain Syed
author_facet	Poluru Eswaraiah Hussain Syed
author_sort	Poluru Eswaraiah
collection	DOAJ
description	Multimedia data, which includes textual information, is employed in a variety of practical computer vision applications. More than a million new records are added to social media and news sites every day, and the text content they contain has gotten increasingly complex. Finding a meaningful text record in an archive might be challenging for computer vision researchers. Most image searches still employ the tried and true language-based techniques of query text and metadata. Substantial work has been done in the past two decades on content-based text retrieval and analysis that still has limitations. The importance of feature extraction in search engines is often overlooked. Web and product search engines, recommendation systems, and question-answering activities frequently leverage these features. Extracting high-quality machine learning features from large text volumes is a challenge for many open-source software packages. Creating an effective feature set manually is a time-consuming process, but with deep learning, new actual feature demos from training data are analyzed. As a novel feature extraction method, deep learning has made great strides in text mining. Automatically training a deep learning model with the most pertinent text attributes requires massive datasets with millions of variables. In this research, a Normalized Dominant Feature Subset with Weighted Vector Model (NDFS-WVM) is proposed that is used for feature extraction and selection for information retrieval from big data using natural language processing models. The suggested model outperforms the conventional models in terms of text retrieval. The proposed model achieves 98.6% accuracy in information retrieval.
first_indexed	2024-03-08T11:48:41Z
format	Article
id	doaj.art-b712905953454140a632fdc4e429ffce
institution	Directory Open Access Journal
issn	2376-5992
language	English
last_indexed	2024-03-08T11:48:41Z
publishDate	2024-01-01
publisher	PeerJ Inc.
record_format	Article
series	PeerJ Computer Science
spelling	doaj.art-b712905953454140a632fdc4e429ffce2024-01-24T15:05:21ZengPeerJ Inc.PeerJ Computer Science2376-59922024-01-0110e180510.7717/peerj-cs.1805Deep learning-based information retrieval with normalized dominant feature subset and weighted vector modelPoluru Eswaraiah0Hussain Syed1School of Computer Science and Engineering, VIT-AP University, Amaravati, Andhra Pradesh, IndiaSchool of Computer Science and Engineering, VIT-AP University, Amaravati, Andhra Pradesh, IndiaMultimedia data, which includes textual information, is employed in a variety of practical computer vision applications. More than a million new records are added to social media and news sites every day, and the text content they contain has gotten increasingly complex. Finding a meaningful text record in an archive might be challenging for computer vision researchers. Most image searches still employ the tried and true language-based techniques of query text and metadata. Substantial work has been done in the past two decades on content-based text retrieval and analysis that still has limitations. The importance of feature extraction in search engines is often overlooked. Web and product search engines, recommendation systems, and question-answering activities frequently leverage these features. Extracting high-quality machine learning features from large text volumes is a challenge for many open-source software packages. Creating an effective feature set manually is a time-consuming process, but with deep learning, new actual feature demos from training data are analyzed. As a novel feature extraction method, deep learning has made great strides in text mining. Automatically training a deep learning model with the most pertinent text attributes requires massive datasets with millions of variables. In this research, a Normalized Dominant Feature Subset with Weighted Vector Model (NDFS-WVM) is proposed that is used for feature extraction and selection for information retrieval from big data using natural language processing models. The suggested model outperforms the conventional models in terms of text retrieval. The proposed model achieves 98.6% accuracy in information retrieval.https://peerj.com/articles/cs-1805.pdfBig dataFeature extractionFeature subsetFeature selectionFeature vector
spellingShingle	Poluru Eswaraiah Hussain Syed Deep learning-based information retrieval with normalized dominant feature subset and weighted vector model PeerJ Computer Science Big data Feature extraction Feature subset Feature selection Feature vector
title	Deep learning-based information retrieval with normalized dominant feature subset and weighted vector model
title_full	Deep learning-based information retrieval with normalized dominant feature subset and weighted vector model
title_fullStr	Deep learning-based information retrieval with normalized dominant feature subset and weighted vector model
title_full_unstemmed	Deep learning-based information retrieval with normalized dominant feature subset and weighted vector model
title_short	Deep learning-based information retrieval with normalized dominant feature subset and weighted vector model
title_sort	deep learning based information retrieval with normalized dominant feature subset and weighted vector model
topic	Big data Feature extraction Feature subset Feature selection Feature vector
url	https://peerj.com/articles/cs-1805.pdf
work_keys_str_mv	AT polurueswaraiah deeplearningbasedinformationretrievalwithnormalizeddominantfeaturesubsetandweightedvectormodel AT hussainsyed deeplearningbasedinformationretrievalwithnormalizeddominantfeaturesubsetandweightedvectormodel

Deep learning-based information retrieval with normalized dominant feature subset and weighted vector model

Similar Items