Automatic Classification of Photos by Tourist Attractions Using Deep Learning Model and Image Feature Vector Clustering

With the rise of social media platforms, tourists tend to share their experiences in the form of texts, photos, and videos on social media. These user-generated contents (UGC) play an important role in shaping tourism destination images (TDI) and directly affect the decision-making process of touris...

Full description

Bibliographic Details
Main Authors: Jiyeon Kim, Youngok Kang
Format: Article
Language:English
Published: MDPI AG 2022-04-01
Series:ISPRS International Journal of Geo-Information
Subjects:
Online Access:https://www.mdpi.com/2220-9964/11/4/245
_version_ 1827628524012306432
author Jiyeon Kim
Youngok Kang
author_facet Jiyeon Kim
Youngok Kang
author_sort Jiyeon Kim
collection DOAJ
description With the rise of social media platforms, tourists tend to share their experiences in the form of texts, photos, and videos on social media. These user-generated contents (UGC) play an important role in shaping tourism destination images (TDI) and directly affect the decision-making process of tourists. Among UGCs, photos represent tourists’ visual preferences for a specific area. Paying attention to the value of photos, several studies have attempted to analyze them using deep learning technology. However, the research methods that analyze tourism photos using recent deep learning technology have a limitation in that they cannot properly classify unique photos appearing in specific tourist attractions with predetermined photo categories such as Places365 or ImageNet dataset or it takes a lot of time and effort to build a separate training dataset to train the model and to generate a tourism photo classification category according to a specific tourist destination. The purpose of this study is to propose a method of automatically classifying tourist photos by tourist attractions by applying the methods of the image feature vector clustering and the deep learning model. To this end, first, we collected photos attached to reviews posted by foreign tourists on TripAdvisor. Second, we embedded individual images as 512-dimensional feature vectors using the VGG16 network pre-trained with Places365 and reduced them to two dimensions with t-SNE(t-Distributed Stochastic Neighbor Embedding). Then, clusters were extracted through HDBSCAN(Hierarchical Clustering and Density-Based Spatial Clustering of Applications with Noise) analysis and set as a regional image category. Finally, the Siamese Network was applied to remove noise photos within the cluster and classify photos according to the category. In addition, this study attempts to confirm the validity of the proposed method by applying it to two representative tourist attractions such as ‘Gyeongbokgung Palace’ and ‘Insadong’ in Seoul. As a result, it was possible to identify which visual elements of tourist attractions are attractive to tourists. This method has the advantages in that it is not necessary to create a classification category in advance, it is possible to flexibly extract categories for each tourist destination, and it is able to improve classification performance even with a rather small volume of a dataset.
first_indexed 2024-03-09T13:34:42Z
format Article
id doaj.art-0a785d4d80074158af4ea1f9e57b65e5
institution Directory Open Access Journal
issn 2220-9964
language English
last_indexed 2024-03-09T13:34:42Z
publishDate 2022-04-01
publisher MDPI AG
record_format Article
series ISPRS International Journal of Geo-Information
spelling doaj.art-0a785d4d80074158af4ea1f9e57b65e52023-11-30T21:13:41ZengMDPI AGISPRS International Journal of Geo-Information2220-99642022-04-0111424510.3390/ijgi11040245Automatic Classification of Photos by Tourist Attractions Using Deep Learning Model and Image Feature Vector ClusteringJiyeon Kim0Youngok Kang1Department of Social Studies, Ewha Womans University, Seoul 03760, KoreaDepartment of Social Studies, Ewha Womans University, Seoul 03760, KoreaWith the rise of social media platforms, tourists tend to share their experiences in the form of texts, photos, and videos on social media. These user-generated contents (UGC) play an important role in shaping tourism destination images (TDI) and directly affect the decision-making process of tourists. Among UGCs, photos represent tourists’ visual preferences for a specific area. Paying attention to the value of photos, several studies have attempted to analyze them using deep learning technology. However, the research methods that analyze tourism photos using recent deep learning technology have a limitation in that they cannot properly classify unique photos appearing in specific tourist attractions with predetermined photo categories such as Places365 or ImageNet dataset or it takes a lot of time and effort to build a separate training dataset to train the model and to generate a tourism photo classification category according to a specific tourist destination. The purpose of this study is to propose a method of automatically classifying tourist photos by tourist attractions by applying the methods of the image feature vector clustering and the deep learning model. To this end, first, we collected photos attached to reviews posted by foreign tourists on TripAdvisor. Second, we embedded individual images as 512-dimensional feature vectors using the VGG16 network pre-trained with Places365 and reduced them to two dimensions with t-SNE(t-Distributed Stochastic Neighbor Embedding). Then, clusters were extracted through HDBSCAN(Hierarchical Clustering and Density-Based Spatial Clustering of Applications with Noise) analysis and set as a regional image category. Finally, the Siamese Network was applied to remove noise photos within the cluster and classify photos according to the category. In addition, this study attempts to confirm the validity of the proposed method by applying it to two representative tourist attractions such as ‘Gyeongbokgung Palace’ and ‘Insadong’ in Seoul. As a result, it was possible to identify which visual elements of tourist attractions are attractive to tourists. This method has the advantages in that it is not necessary to create a classification category in advance, it is possible to flexibly extract categories for each tourist destination, and it is able to improve classification performance even with a rather small volume of a dataset.https://www.mdpi.com/2220-9964/11/4/245image feature vectorclusteringSiamese Networkautomatic classification of tourist photosdeep learning model
spellingShingle Jiyeon Kim
Youngok Kang
Automatic Classification of Photos by Tourist Attractions Using Deep Learning Model and Image Feature Vector Clustering
ISPRS International Journal of Geo-Information
image feature vector
clustering
Siamese Network
automatic classification of tourist photos
deep learning model
title Automatic Classification of Photos by Tourist Attractions Using Deep Learning Model and Image Feature Vector Clustering
title_full Automatic Classification of Photos by Tourist Attractions Using Deep Learning Model and Image Feature Vector Clustering
title_fullStr Automatic Classification of Photos by Tourist Attractions Using Deep Learning Model and Image Feature Vector Clustering
title_full_unstemmed Automatic Classification of Photos by Tourist Attractions Using Deep Learning Model and Image Feature Vector Clustering
title_short Automatic Classification of Photos by Tourist Attractions Using Deep Learning Model and Image Feature Vector Clustering
title_sort automatic classification of photos by tourist attractions using deep learning model and image feature vector clustering
topic image feature vector
clustering
Siamese Network
automatic classification of tourist photos
deep learning model
url https://www.mdpi.com/2220-9964/11/4/245
work_keys_str_mv AT jiyeonkim automaticclassificationofphotosbytouristattractionsusingdeeplearningmodelandimagefeaturevectorclustering
AT youngokkang automaticclassificationofphotosbytouristattractionsusingdeeplearningmodelandimagefeaturevectorclustering