Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters
Wineinformatics is a new data science research area that focuses on large amounts of wine-related data. Most of the current Wineinformatics researches are focused on supervised learning to predict the wine quality, price, region and weather. In this research, unsupervised learning using K-means clus...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-02-01
|
Series: | Fermentation |
Subjects: | |
Online Access: | https://www.mdpi.com/2311-5637/7/1/27 |
_version_ | 1797396185300860928 |
---|---|
author | Jared McCune Alex Riley Bernard Chen |
author_facet | Jared McCune Alex Riley Bernard Chen |
author_sort | Jared McCune |
collection | DOAJ |
description | Wineinformatics is a new data science research area that focuses on large amounts of wine-related data. Most of the current Wineinformatics researches are focused on supervised learning to predict the wine quality, price, region and weather. In this research, unsupervised learning using K-means clustering with optimal K search and filtration process is studied on a Bordeaux-region specific dataset to form clusters and find representative wines in each cluster. 14,349 wines representing the 21st century Bordeaux dataset are clustered into 43 and 13 clusters with detailed analysis on the number of wines, dominant wine characteristics, average wine grades, and representative wines in each cluster. Similar research results are also generated and presented on 435 elite wines (wines that scored 95 points and above on a 100 points scale). The information generated from this research can be beneficial to wine vendors to make a selection given the limited number of wines they can realistically offer, to connoisseurs to study wines in a target region/vintage/price with a representative short list, and to wine consumers to get recommendations. Many possible researches can adopt the same process to analyze and find representative wines in different wine making regions/countries, vintages, or pivot points. This paper opens up a new door for Wineinformatics in unsupervised learning researches. |
first_indexed | 2024-03-09T00:47:39Z |
format | Article |
id | doaj.art-6b42eaf0171e4175a1762dde7283aa41 |
institution | Directory Open Access Journal |
issn | 2311-5637 |
language | English |
last_indexed | 2024-03-09T00:47:39Z |
publishDate | 2021-02-01 |
publisher | MDPI AG |
record_format | Article |
series | Fermentation |
spelling | doaj.art-6b42eaf0171e4175a1762dde7283aa412023-12-11T17:26:46ZengMDPI AGFermentation2311-56372021-02-01712710.3390/fermentation7010027Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of ClustersJared McCune0Alex Riley1Bernard Chen2Department of Computer Science, University of Central Arkansas, Conway, AR 72034, USADepartment of Computer Science, University of Central Arkansas, Conway, AR 72034, USADepartment of Computer Science, University of Central Arkansas, Conway, AR 72034, USAWineinformatics is a new data science research area that focuses on large amounts of wine-related data. Most of the current Wineinformatics researches are focused on supervised learning to predict the wine quality, price, region and weather. In this research, unsupervised learning using K-means clustering with optimal K search and filtration process is studied on a Bordeaux-region specific dataset to form clusters and find representative wines in each cluster. 14,349 wines representing the 21st century Bordeaux dataset are clustered into 43 and 13 clusters with detailed analysis on the number of wines, dominant wine characteristics, average wine grades, and representative wines in each cluster. Similar research results are also generated and presented on 435 elite wines (wines that scored 95 points and above on a 100 points scale). The information generated from this research can be beneficial to wine vendors to make a selection given the limited number of wines they can realistically offer, to connoisseurs to study wines in a target region/vintage/price with a representative short list, and to wine consumers to get recommendations. Many possible researches can adopt the same process to analyze and find representative wines in different wine making regions/countries, vintages, or pivot points. This paper opens up a new door for Wineinformatics in unsupervised learning researches.https://www.mdpi.com/2311-5637/7/1/27Wineinformaticscomputational wine wheelclusteringk-meansattribute selection |
spellingShingle | Jared McCune Alex Riley Bernard Chen Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters Fermentation Wineinformatics computational wine wheel clustering k-means attribute selection |
title | Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters |
title_full | Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters |
title_fullStr | Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters |
title_full_unstemmed | Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters |
title_short | Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters |
title_sort | clustering in wineinformatics with attribute selection to increase uniqueness of clusters |
topic | Wineinformatics computational wine wheel clustering k-means attribute selection |
url | https://www.mdpi.com/2311-5637/7/1/27 |
work_keys_str_mv | AT jaredmccune clusteringinwineinformaticswithattributeselectiontoincreaseuniquenessofclusters AT alexriley clusteringinwineinformaticswithattributeselectiontoincreaseuniquenessofclusters AT bernardchen clusteringinwineinformaticswithattributeselectiontoincreaseuniquenessofclusters |