Automated Playlist Continuation with Apache PredictionIO

The Minrva project team, a software development research group based at the University of Illinois Library, developed a data-focused recommender system to participate in the creative track of the 2018 ACM RecSys Challenge, which focused on music recommendation. We describe here the large-scale data...

Full description

Bibliographic Details
Main Author: Jim Hahn
Format: Article
Language:English
Published: Code4Lib 2018-11-01
Series:Code4Lib Journal
Online Access:https://journal.code4lib.org/articles/13850
_version_ 1811222785523974144
author Jim Hahn
author_facet Jim Hahn
author_sort Jim Hahn
collection DOAJ
description The Minrva project team, a software development research group based at the University of Illinois Library, developed a data-focused recommender system to participate in the creative track of the 2018 ACM RecSys Challenge, which focused on music recommendation. We describe here the large-scale data processing the Minrva team researched and developed for foundational reconciliation of the Million Playlist Dataset using external authority data on the web (e.g. VIAF, WikiData). The secondary focus of the research was evaluating and adapting the processing tools that support data reconciliation. This paper reports on the playlist enrichment process, indexing, and subsequent recommendation model developed for the music recommendation challenge.
first_indexed 2024-04-12T08:21:21Z
format Article
id doaj.art-c7d28e2f8c794a119af0bec222efe483
institution Directory Open Access Journal
issn 1940-5758
language English
last_indexed 2024-04-12T08:21:21Z
publishDate 2018-11-01
publisher Code4Lib
record_format Article
series Code4Lib Journal
spelling doaj.art-c7d28e2f8c794a119af0bec222efe4832022-12-22T03:40:34ZengCode4LibCode4Lib Journal1940-57582018-11-014213850Automated Playlist Continuation with Apache PredictionIOJim HahnThe Minrva project team, a software development research group based at the University of Illinois Library, developed a data-focused recommender system to participate in the creative track of the 2018 ACM RecSys Challenge, which focused on music recommendation. We describe here the large-scale data processing the Minrva team researched and developed for foundational reconciliation of the Million Playlist Dataset using external authority data on the web (e.g. VIAF, WikiData). The secondary focus of the research was evaluating and adapting the processing tools that support data reconciliation. This paper reports on the playlist enrichment process, indexing, and subsequent recommendation model developed for the music recommendation challenge.https://journal.code4lib.org/articles/13850
spellingShingle Jim Hahn
Automated Playlist Continuation with Apache PredictionIO
Code4Lib Journal
title Automated Playlist Continuation with Apache PredictionIO
title_full Automated Playlist Continuation with Apache PredictionIO
title_fullStr Automated Playlist Continuation with Apache PredictionIO
title_full_unstemmed Automated Playlist Continuation with Apache PredictionIO
title_short Automated Playlist Continuation with Apache PredictionIO
title_sort automated playlist continuation with apache predictionio
url https://journal.code4lib.org/articles/13850
work_keys_str_mv AT jimhahn automatedplaylistcontinuationwithapachepredictionio