Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters

Wineinformatics is a new data science research area that focuses on large amounts of wine-related data. Most of the current Wineinformatics researches are focused on supervised learning to predict the wine quality, price, region and weather. In this research, unsupervised learning using K-means clus...

Full description

Bibliographic Details
Main Authors:	Jared McCune, Alex Riley, Bernard Chen
Format:	Article
Language:	English
Published:	MDPI AG 2021-02-01
Series:	Fermentation
Subjects:	Wineinformatics computational wine wheel clustering k-means attribute selection
Online Access:	https://www.mdpi.com/2311-5637/7/1/27

_version_	1797396185300860928
author	Jared McCune Alex Riley Bernard Chen
author_facet	Jared McCune Alex Riley Bernard Chen
author_sort	Jared McCune
collection	DOAJ
description	Wineinformatics is a new data science research area that focuses on large amounts of wine-related data. Most of the current Wineinformatics researches are focused on supervised learning to predict the wine quality, price, region and weather. In this research, unsupervised learning using K-means clustering with optimal K search and filtration process is studied on a Bordeaux-region specific dataset to form clusters and find representative wines in each cluster. 14,349 wines representing the 21st century Bordeaux dataset are clustered into 43 and 13 clusters with detailed analysis on the number of wines, dominant wine characteristics, average wine grades, and representative wines in each cluster. Similar research results are also generated and presented on 435 elite wines (wines that scored 95 points and above on a 100 points scale). The information generated from this research can be beneficial to wine vendors to make a selection given the limited number of wines they can realistically offer, to connoisseurs to study wines in a target region/vintage/price with a representative short list, and to wine consumers to get recommendations. Many possible researches can adopt the same process to analyze and find representative wines in different wine making regions/countries, vintages, or pivot points. This paper opens up a new door for Wineinformatics in unsupervised learning researches.
first_indexed	2024-03-09T00:47:39Z
format	Article
id	doaj.art-6b42eaf0171e4175a1762dde7283aa41
institution	Directory Open Access Journal
issn	2311-5637
language	English
last_indexed	2024-03-09T00:47:39Z
publishDate	2021-02-01
publisher	MDPI AG
record_format	Article
series	Fermentation
spelling	doaj.art-6b42eaf0171e4175a1762dde7283aa412023-12-11T17:26:46ZengMDPI AGFermentation2311-56372021-02-01712710.3390/fermentation7010027Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of ClustersJared McCune0Alex Riley1Bernard Chen2Department of Computer Science, University of Central Arkansas, Conway, AR 72034, USADepartment of Computer Science, University of Central Arkansas, Conway, AR 72034, USADepartment of Computer Science, University of Central Arkansas, Conway, AR 72034, USAWineinformatics is a new data science research area that focuses on large amounts of wine-related data. Most of the current Wineinformatics researches are focused on supervised learning to predict the wine quality, price, region and weather. In this research, unsupervised learning using K-means clustering with optimal K search and filtration process is studied on a Bordeaux-region specific dataset to form clusters and find representative wines in each cluster. 14,349 wines representing the 21st century Bordeaux dataset are clustered into 43 and 13 clusters with detailed analysis on the number of wines, dominant wine characteristics, average wine grades, and representative wines in each cluster. Similar research results are also generated and presented on 435 elite wines (wines that scored 95 points and above on a 100 points scale). The information generated from this research can be beneficial to wine vendors to make a selection given the limited number of wines they can realistically offer, to connoisseurs to study wines in a target region/vintage/price with a representative short list, and to wine consumers to get recommendations. Many possible researches can adopt the same process to analyze and find representative wines in different wine making regions/countries, vintages, or pivot points. This paper opens up a new door for Wineinformatics in unsupervised learning researches.https://www.mdpi.com/2311-5637/7/1/27Wineinformaticscomputational wine wheelclusteringk-meansattribute selection
spellingShingle	Jared McCune Alex Riley Bernard Chen Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters Fermentation Wineinformatics computational wine wheel clustering k-means attribute selection
title	Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters
title_full	Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters
title_fullStr	Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters
title_full_unstemmed	Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters
title_short	Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters
title_sort	clustering in wineinformatics with attribute selection to increase uniqueness of clusters
topic	Wineinformatics computational wine wheel clustering k-means attribute selection
url	https://www.mdpi.com/2311-5637/7/1/27
work_keys_str_mv	AT jaredmccune clusteringinwineinformaticswithattributeselectiontoincreaseuniquenessofclusters AT alexriley clusteringinwineinformaticswithattributeselectiontoincreaseuniquenessofclusters AT bernardchen clusteringinwineinformaticswithattributeselectiontoincreaseuniquenessofclusters

Clustering in Wineinformatics with Attribute Selection to Increase Uniqueness of Clusters

Similar Items