On Producing Accurate Rating Predictions in Sparse Collaborative Filtering Datasets

The typical goal of a collaborative filtering algorithm is the minimisation of the deviation between rating predictions and factual user ratings so that the recommender system offers suggestions for appropriate items, achieving a higher prediction value. The datasets on which collaborative filtering...

Full description

Bibliographic Details
Main Authors: Dionisis Margaris, Costas Vassilakis, Dimitris Spiliotopoulos
Format: Article
Language:English
Published: MDPI AG 2022-06-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/13/6/302
_version_ 1827659865047171072
author Dionisis Margaris
Costas Vassilakis
Dimitris Spiliotopoulos
author_facet Dionisis Margaris
Costas Vassilakis
Dimitris Spiliotopoulos
author_sort Dionisis Margaris
collection DOAJ
description The typical goal of a collaborative filtering algorithm is the minimisation of the deviation between rating predictions and factual user ratings so that the recommender system offers suggestions for appropriate items, achieving a higher prediction value. The datasets on which collaborative filtering algorithms are applied vary in terms of sparsity, i.e., regarding the percentage of empty cells in the user–item rating matrices. Sparsity is an important factor affecting rating prediction accuracy, since research has proven that collaborative filtering over sparse datasets exhibits a lower accuracy. The present work aims to explore, in a broader context, the factors related to rating prediction accuracy in sparse collaborative filtering datasets, indicating that recommending the items that simply achieve higher prediction values than others, without considering other factors, in some cases, can reduce recommendation accuracy and negatively affect the recommender system’s success. An extensive evaluation is conducted using sparse collaborative filtering datasets. It is found that the number of near neighbours used for the prediction formulation, the rating average of the user for whom the prediction is generated and the rating average of the item concerning the prediction can indicate, in many cases, whether the rating prediction produced is reliable or not.
first_indexed 2024-03-09T23:30:16Z
format Article
id doaj.art-7aaefbaced1f471badb6bf26e924ea1d
institution Directory Open Access Journal
issn 2078-2489
language English
last_indexed 2024-03-09T23:30:16Z
publishDate 2022-06-01
publisher MDPI AG
record_format Article
series Information
spelling doaj.art-7aaefbaced1f471badb6bf26e924ea1d2023-11-23T17:10:05ZengMDPI AGInformation2078-24892022-06-0113630210.3390/info13060302On Producing Accurate Rating Predictions in Sparse Collaborative Filtering DatasetsDionisis Margaris0Costas Vassilakis1Dimitris Spiliotopoulos2Department of Digital Systems, University of the Peloponnese, Valioti’s Building, Kladas, 23100 Sparta, GreeceDepartment of Informatics and Telecommunications, University of the Peloponnese, Akadimaikou G. K. Vlachou, 22131 Tripoli, GreeceDepartment of Management Science and Technology, University of the Peloponnese, Akadimaikou G. K. Vlachou, 22131 Tripoli, GreeceThe typical goal of a collaborative filtering algorithm is the minimisation of the deviation between rating predictions and factual user ratings so that the recommender system offers suggestions for appropriate items, achieving a higher prediction value. The datasets on which collaborative filtering algorithms are applied vary in terms of sparsity, i.e., regarding the percentage of empty cells in the user–item rating matrices. Sparsity is an important factor affecting rating prediction accuracy, since research has proven that collaborative filtering over sparse datasets exhibits a lower accuracy. The present work aims to explore, in a broader context, the factors related to rating prediction accuracy in sparse collaborative filtering datasets, indicating that recommending the items that simply achieve higher prediction values than others, without considering other factors, in some cases, can reduce recommendation accuracy and negatively affect the recommender system’s success. An extensive evaluation is conducted using sparse collaborative filtering datasets. It is found that the number of near neighbours used for the prediction formulation, the rating average of the user for whom the prediction is generated and the rating average of the item concerning the prediction can indicate, in many cases, whether the rating prediction produced is reliable or not.https://www.mdpi.com/2078-2489/13/6/302recommender systemspersonalisationcollaborative filteringsparse datasetsrating predictionreliability
spellingShingle Dionisis Margaris
Costas Vassilakis
Dimitris Spiliotopoulos
On Producing Accurate Rating Predictions in Sparse Collaborative Filtering Datasets
Information
recommender systems
personalisation
collaborative filtering
sparse datasets
rating prediction
reliability
title On Producing Accurate Rating Predictions in Sparse Collaborative Filtering Datasets
title_full On Producing Accurate Rating Predictions in Sparse Collaborative Filtering Datasets
title_fullStr On Producing Accurate Rating Predictions in Sparse Collaborative Filtering Datasets
title_full_unstemmed On Producing Accurate Rating Predictions in Sparse Collaborative Filtering Datasets
title_short On Producing Accurate Rating Predictions in Sparse Collaborative Filtering Datasets
title_sort on producing accurate rating predictions in sparse collaborative filtering datasets
topic recommender systems
personalisation
collaborative filtering
sparse datasets
rating prediction
reliability
url https://www.mdpi.com/2078-2489/13/6/302
work_keys_str_mv AT dionisismargaris onproducingaccurateratingpredictionsinsparsecollaborativefilteringdatasets
AT costasvassilakis onproducingaccurateratingpredictionsinsparsecollaborativefilteringdatasets
AT dimitrisspiliotopoulos onproducingaccurateratingpredictionsinsparsecollaborativefilteringdatasets