PERFORMANCE EVALUATION OF INFORMATION RETRIEVAL SYSTEM USING VECTOR SPACE MODEL: A COMPARATIVE ANALYSIS
The increasing use of the internet has created a vast amount of digital information and it is expanding extremely fast. Therefore, Information retrieval becomes a challenging task to fetch relevant information for users. The aim of this paper was to examine and evaluate the performance of the Inform...
Egile Nagusiak: | , , |
---|---|
Formatua: | Artikulua |
Hizkuntza: | Arabic |
Argitaratua: |
University of Information Technology and Communications
2021-09-01
|
Saila: | Iraqi Journal for Computers and Informatics |
Gaiak: | |
Sarrera elektronikoa: | https://ijci.uoitc.edu.iq/index.php/ijci/article/view/332 |
_version_ | 1827784184890916864 |
---|---|
author | Omar Al-rassam Miran Hama Saeed Mohammed Amin Zhenar Shaho Faeq |
author_facet | Omar Al-rassam Miran Hama Saeed Mohammed Amin Zhenar Shaho Faeq |
author_sort | Omar Al-rassam |
collection | DOAJ |
description | The increasing use of the internet has created a vast amount of digital information and it is expanding extremely fast. Therefore, Information retrieval becomes a challenging task to fetch relevant information for users. The aim of this paper was to examine and evaluate the performance of the Information retrieval system through eight experiments to test all the features that can be used in a vector space model. These experiments were compared to show the best and the worst implemented features. The features are represented by applying (tf.idf, stop words, stemming), (tf.idf, No- stop words, stemming), (tf.idf, No- stop words, No-stemming), (tf.idf, stop words, No-stemming), (tf, stop words, stemming), (tf, No- stop words, stemming), (tf, No- stop words, No-stemming), (tf, stop words, No-stemming). Results showed that using stop words, stemming approach, and tf.idf improve the performance of the system. However, when tf was used without using stop words and stemming approaches the performance of the system is declined. In addition, results showed that stop words have a significant effect on the system while the stemming approach has no noticeable effect particularly with tf. |
first_indexed | 2024-03-11T15:59:45Z |
format | Article |
id | doaj.art-4c2e5526a2ff467b9a5c3f85bb2e1264 |
institution | Directory Open Access Journal |
issn | 2313-190X 2520-4912 |
language | Arabic |
last_indexed | 2024-03-11T15:59:45Z |
publishDate | 2021-09-01 |
publisher | University of Information Technology and Communications |
record_format | Article |
series | Iraqi Journal for Computers and Informatics |
spelling | doaj.art-4c2e5526a2ff467b9a5c3f85bb2e12642023-10-25T07:52:40ZaraUniversity of Information Technology and CommunicationsIraqi Journal for Computers and Informatics2313-190X2520-49122021-09-014726910.25195/ijci.v47i2.332293PERFORMANCE EVALUATION OF INFORMATION RETRIEVAL SYSTEM USING VECTOR SPACE MODEL: A COMPARATIVE ANALYSISOmar Al-rassam0Miran Hama Saeed Mohammed Amin1Zhenar Shaho Faeq2Koya UniversityKoya UniversityKoya UniversityThe increasing use of the internet has created a vast amount of digital information and it is expanding extremely fast. Therefore, Information retrieval becomes a challenging task to fetch relevant information for users. The aim of this paper was to examine and evaluate the performance of the Information retrieval system through eight experiments to test all the features that can be used in a vector space model. These experiments were compared to show the best and the worst implemented features. The features are represented by applying (tf.idf, stop words, stemming), (tf.idf, No- stop words, stemming), (tf.idf, No- stop words, No-stemming), (tf.idf, stop words, No-stemming), (tf, stop words, stemming), (tf, No- stop words, stemming), (tf, No- stop words, No-stemming), (tf, stop words, No-stemming). Results showed that using stop words, stemming approach, and tf.idf improve the performance of the system. However, when tf was used without using stop words and stemming approaches the performance of the system is declined. In addition, results showed that stop words have a significant effect on the system while the stemming approach has no noticeable effect particularly with tf.https://ijci.uoitc.edu.iq/index.php/ijci/article/view/332information retrievalvector space modelinverse document frequencyterm frequencystemming |
spellingShingle | Omar Al-rassam Miran Hama Saeed Mohammed Amin Zhenar Shaho Faeq PERFORMANCE EVALUATION OF INFORMATION RETRIEVAL SYSTEM USING VECTOR SPACE MODEL: A COMPARATIVE ANALYSIS Iraqi Journal for Computers and Informatics information retrieval vector space model inverse document frequency term frequency stemming |
title | PERFORMANCE EVALUATION OF INFORMATION RETRIEVAL SYSTEM USING VECTOR SPACE MODEL: A COMPARATIVE ANALYSIS |
title_full | PERFORMANCE EVALUATION OF INFORMATION RETRIEVAL SYSTEM USING VECTOR SPACE MODEL: A COMPARATIVE ANALYSIS |
title_fullStr | PERFORMANCE EVALUATION OF INFORMATION RETRIEVAL SYSTEM USING VECTOR SPACE MODEL: A COMPARATIVE ANALYSIS |
title_full_unstemmed | PERFORMANCE EVALUATION OF INFORMATION RETRIEVAL SYSTEM USING VECTOR SPACE MODEL: A COMPARATIVE ANALYSIS |
title_short | PERFORMANCE EVALUATION OF INFORMATION RETRIEVAL SYSTEM USING VECTOR SPACE MODEL: A COMPARATIVE ANALYSIS |
title_sort | performance evaluation of information retrieval system using vector space model a comparative analysis |
topic | information retrieval vector space model inverse document frequency term frequency stemming |
url | https://ijci.uoitc.edu.iq/index.php/ijci/article/view/332 |
work_keys_str_mv | AT omaralrassam performanceevaluationofinformationretrievalsystemusingvectorspacemodelacomparativeanalysis AT miranhamasaeedmohammedamin performanceevaluationofinformationretrievalsystemusingvectorspacemodelacomparativeanalysis AT zhenarshahofaeq performanceevaluationofinformationretrievalsystemusingvectorspacemodelacomparativeanalysis |