PERFORMANCE EVALUATION OF INFORMATION RETRIEVAL SYSTEM USING VECTOR SPACE MODEL: A COMPARATIVE ANALYSIS

The increasing use of the internet has created a vast amount of digital information and it is expanding extremely fast. Therefore, Information retrieval becomes a challenging task to fetch relevant information for users. The aim of this paper was to examine and evaluate the performance of the Inform...

Deskribapen osoa

Xehetasun bibliografikoak
Egile Nagusiak: Omar Al-rassam, Miran Hama Saeed Mohammed Amin, Zhenar Shaho Faeq
Formatua: Artikulua
Hizkuntza:Arabic
Argitaratua: University of Information Technology and Communications 2021-09-01
Saila:Iraqi Journal for Computers and Informatics
Gaiak:
Sarrera elektronikoa:https://ijci.uoitc.edu.iq/index.php/ijci/article/view/332
_version_ 1827784184890916864
author Omar Al-rassam
Miran Hama Saeed Mohammed Amin
Zhenar Shaho Faeq
author_facet Omar Al-rassam
Miran Hama Saeed Mohammed Amin
Zhenar Shaho Faeq
author_sort Omar Al-rassam
collection DOAJ
description The increasing use of the internet has created a vast amount of digital information and it is expanding extremely fast. Therefore, Information retrieval becomes a challenging task to fetch relevant information for users. The aim of this paper was to examine and evaluate the performance of the Information retrieval system through eight experiments to test all the features that can be used in a vector space model. These experiments were compared to show the best and the worst implemented features. The features are represented by applying (tf.idf, stop words, stemming), (tf.idf, No- stop words, stemming), (tf.idf, No- stop words, No-stemming), (tf.idf, stop words, No-stemming), (tf, stop words, stemming), (tf, No- stop words, stemming), (tf, No- stop words, No-stemming), (tf, stop words, No-stemming). Results showed that using stop words, stemming approach, and tf.idf improve the performance of the system. However, when tf was used without using stop words and stemming approaches the performance of the system is declined. In addition, results showed that stop words have a significant effect on the system while the stemming approach has no noticeable effect particularly with tf.
first_indexed 2024-03-11T15:59:45Z
format Article
id doaj.art-4c2e5526a2ff467b9a5c3f85bb2e1264
institution Directory Open Access Journal
issn 2313-190X
2520-4912
language Arabic
last_indexed 2024-03-11T15:59:45Z
publishDate 2021-09-01
publisher University of Information Technology and Communications
record_format Article
series Iraqi Journal for Computers and Informatics
spelling doaj.art-4c2e5526a2ff467b9a5c3f85bb2e12642023-10-25T07:52:40ZaraUniversity of Information Technology and CommunicationsIraqi Journal for Computers and Informatics2313-190X2520-49122021-09-014726910.25195/ijci.v47i2.332293PERFORMANCE EVALUATION OF INFORMATION RETRIEVAL SYSTEM USING VECTOR SPACE MODEL: A COMPARATIVE ANALYSISOmar Al-rassam0Miran Hama Saeed Mohammed Amin1Zhenar Shaho Faeq2Koya UniversityKoya UniversityKoya UniversityThe increasing use of the internet has created a vast amount of digital information and it is expanding extremely fast. Therefore, Information retrieval becomes a challenging task to fetch relevant information for users. The aim of this paper was to examine and evaluate the performance of the Information retrieval system through eight experiments to test all the features that can be used in a vector space model. These experiments were compared to show the best and the worst implemented features. The features are represented by applying (tf.idf, stop words, stemming), (tf.idf, No- stop words, stemming), (tf.idf, No- stop words, No-stemming), (tf.idf, stop words, No-stemming), (tf, stop words, stemming), (tf, No- stop words, stemming), (tf, No- stop words, No-stemming), (tf, stop words, No-stemming). Results showed that using stop words, stemming approach, and tf.idf improve the performance of the system. However, when tf was used without using stop words and stemming approaches the performance of the system is declined. In addition, results showed that stop words have a significant effect on the system while the stemming approach has no noticeable effect particularly with tf.https://ijci.uoitc.edu.iq/index.php/ijci/article/view/332information retrievalvector space modelinverse document frequencyterm frequencystemming
spellingShingle Omar Al-rassam
Miran Hama Saeed Mohammed Amin
Zhenar Shaho Faeq
PERFORMANCE EVALUATION OF INFORMATION RETRIEVAL SYSTEM USING VECTOR SPACE MODEL: A COMPARATIVE ANALYSIS
Iraqi Journal for Computers and Informatics
information retrieval
vector space model
inverse document frequency
term frequency
stemming
title PERFORMANCE EVALUATION OF INFORMATION RETRIEVAL SYSTEM USING VECTOR SPACE MODEL: A COMPARATIVE ANALYSIS
title_full PERFORMANCE EVALUATION OF INFORMATION RETRIEVAL SYSTEM USING VECTOR SPACE MODEL: A COMPARATIVE ANALYSIS
title_fullStr PERFORMANCE EVALUATION OF INFORMATION RETRIEVAL SYSTEM USING VECTOR SPACE MODEL: A COMPARATIVE ANALYSIS
title_full_unstemmed PERFORMANCE EVALUATION OF INFORMATION RETRIEVAL SYSTEM USING VECTOR SPACE MODEL: A COMPARATIVE ANALYSIS
title_short PERFORMANCE EVALUATION OF INFORMATION RETRIEVAL SYSTEM USING VECTOR SPACE MODEL: A COMPARATIVE ANALYSIS
title_sort performance evaluation of information retrieval system using vector space model a comparative analysis
topic information retrieval
vector space model
inverse document frequency
term frequency
stemming
url https://ijci.uoitc.edu.iq/index.php/ijci/article/view/332
work_keys_str_mv AT omaralrassam performanceevaluationofinformationretrievalsystemusingvectorspacemodelacomparativeanalysis
AT miranhamasaeedmohammedamin performanceevaluationofinformationretrievalsystemusingvectorspacemodelacomparativeanalysis
AT zhenarshahofaeq performanceevaluationofinformationretrievalsystemusingvectorspacemodelacomparativeanalysis