Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters

Wineinformatics is a new data science research area that focuses on large amounts of wine-related data. Most of the current Wineinformatics researches are focused on supervised learning to predict the wine quality, price, region and weather. In this research, unsupervised learning using K-means clus...

Full description

Bibliographic Details
Main Authors: Jared McCune, Alex Riley, Bernard Chen
Format: Article
Language:English
Published: MDPI AG 2021-02-01
Series:Fermentation
Subjects:
Online Access:https://www.mdpi.com/2311-5637/7/1/27
_version_ 1797396185300860928
author Jared McCune
Alex Riley
Bernard Chen
author_facet Jared McCune
Alex Riley
Bernard Chen
author_sort Jared McCune
collection DOAJ
description Wineinformatics is a new data science research area that focuses on large amounts of wine-related data. Most of the current Wineinformatics researches are focused on supervised learning to predict the wine quality, price, region and weather. In this research, unsupervised learning using K-means clustering with optimal K search and filtration process is studied on a Bordeaux-region specific dataset to form clusters and find representative wines in each cluster. 14,349 wines representing the 21st century Bordeaux dataset are clustered into 43 and 13 clusters with detailed analysis on the number of wines, dominant wine characteristics, average wine grades, and representative wines in each cluster. Similar research results are also generated and presented on 435 elite wines (wines that scored 95 points and above on a 100 points scale). The information generated from this research can be beneficial to wine vendors to make a selection given the limited number of wines they can realistically offer, to connoisseurs to study wines in a target region/vintage/price with a representative short list, and to wine consumers to get recommendations. Many possible researches can adopt the same process to analyze and find representative wines in different wine making regions/countries, vintages, or pivot points. This paper opens up a new door for Wineinformatics in unsupervised learning researches.
first_indexed 2024-03-09T00:47:39Z
format Article
id doaj.art-6b42eaf0171e4175a1762dde7283aa41
institution Directory Open Access Journal
issn 2311-5637
language English
last_indexed 2024-03-09T00:47:39Z
publishDate 2021-02-01
publisher MDPI AG
record_format Article
series Fermentation
spelling doaj.art-6b42eaf0171e4175a1762dde7283aa412023-12-11T17:26:46ZengMDPI AGFermentation2311-56372021-02-01712710.3390/fermentation7010027Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of ClustersJared McCune0Alex Riley1Bernard Chen2Department of Computer Science, University of Central Arkansas, Conway, AR 72034, USADepartment of Computer Science, University of Central Arkansas, Conway, AR 72034, USADepartment of Computer Science, University of Central Arkansas, Conway, AR 72034, USAWineinformatics is a new data science research area that focuses on large amounts of wine-related data. Most of the current Wineinformatics researches are focused on supervised learning to predict the wine quality, price, region and weather. In this research, unsupervised learning using K-means clustering with optimal K search and filtration process is studied on a Bordeaux-region specific dataset to form clusters and find representative wines in each cluster. 14,349 wines representing the 21st century Bordeaux dataset are clustered into 43 and 13 clusters with detailed analysis on the number of wines, dominant wine characteristics, average wine grades, and representative wines in each cluster. Similar research results are also generated and presented on 435 elite wines (wines that scored 95 points and above on a 100 points scale). The information generated from this research can be beneficial to wine vendors to make a selection given the limited number of wines they can realistically offer, to connoisseurs to study wines in a target region/vintage/price with a representative short list, and to wine consumers to get recommendations. Many possible researches can adopt the same process to analyze and find representative wines in different wine making regions/countries, vintages, or pivot points. This paper opens up a new door for Wineinformatics in unsupervised learning researches.https://www.mdpi.com/2311-5637/7/1/27Wineinformaticscomputational wine wheelclusteringk-meansattribute selection
spellingShingle Jared McCune
Alex Riley
Bernard Chen
Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters
Fermentation
Wineinformatics
computational wine wheel
clustering
k-means
attribute selection
title Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters
title_full Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters
title_fullStr Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters
title_full_unstemmed Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters
title_short Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters
title_sort clustering in wineinformatics with attribute selection to increase uniqueness of clusters
topic Wineinformatics
computational wine wheel
clustering
k-means
attribute selection
url https://www.mdpi.com/2311-5637/7/1/27
work_keys_str_mv AT jaredmccune clusteringinwineinformaticswithattributeselectiontoincreaseuniquenessofclusters
AT alexriley clusteringinwineinformaticswithattributeselectiontoincreaseuniquenessofclusters
AT bernardchen clusteringinwineinformaticswithattributeselectiontoincreaseuniquenessofclusters