Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example
In recent years, when planning and determining a travel destination, residents often make the best of Internet techniques to access extensive travel information. Search engines undeniably reveal visitors' real-time preferences when planning to visit a destination. More and more researchers have...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
KeAi Communications Co. Ltd.
2021-06-01
|
Series: | Data Science and Management |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2666764921000138 |
_version_ | 1797976432446537728 |
---|---|
author | Lin Wang Sirui Wang Zhe Yuan Lu Peng |
author_facet | Lin Wang Sirui Wang Zhe Yuan Lu Peng |
author_sort | Lin Wang |
collection | DOAJ |
description | In recent years, when planning and determining a travel destination, residents often make the best of Internet techniques to access extensive travel information. Search engines undeniably reveal visitors' real-time preferences when planning to visit a destination. More and more researchers have adopted tourism-related search engine data in the field of tourism prediction. However, few studies use search engine data to conduct cluster analysis to identify residents' choice toward a tourism destination. In the present study, 146 keywords related to “Beijing tourism” are obtained from Baidu index and principal component analysis (PCA) is applied to reduce the dimensionality of keywords obtained by Baidu index. Modified affinity propagation (MAP) clustering algorithm is used to classify provinces into several groups to identify the choice of residents to travel to Beijing. The result shows that residents in Hebei province are most likely to travel to Beijing. The cluster result also shows that PCA–MAP performs better than other clustering methods such as K-means, linkage, and Affinity Propogation (AP) in terms of silhouette coefficient and Calinski–Harabaz index. We also distinguish the difference of residents’ choice to travel to Beijing during the peak tourist season and off-season. The residents of Tianjing are inclined to travel to Beijing during the peak tourist season. The residents of Guangdong, Hebei, Henan, Jiangsu, Liaoning, Shanghai, Shandong, and Zhejiang have high attention to travel to Beijing during both seasons. |
first_indexed | 2024-04-11T04:50:50Z |
format | Article |
id | doaj.art-3e9fef6d7ec9449aa2756aec81d8e69b |
institution | Directory Open Access Journal |
issn | 2666-7649 |
language | English |
last_indexed | 2024-04-11T04:50:50Z |
publishDate | 2021-06-01 |
publisher | KeAi Communications Co. Ltd. |
record_format | Article |
series | Data Science and Management |
spelling | doaj.art-3e9fef6d7ec9449aa2756aec81d8e69b2022-12-27T04:38:23ZengKeAi Communications Co. Ltd.Data Science and Management2666-76492021-06-0121219Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an exampleLin Wang0Sirui Wang1Zhe Yuan2Lu Peng3School of Management, Huazhong University of Science and Technology, Wuhan, 430074, ChinaSchool of Management, Huazhong University of Science and Technology, Wuhan, 430074, ChinaLéonard de Vinci Pôle Universitaire, Research Center, 92 916 Paris La Défense, FranceSchool of Management, Wuhan University of Technology, Wuhan, 430070, China; Corresponding author.In recent years, when planning and determining a travel destination, residents often make the best of Internet techniques to access extensive travel information. Search engines undeniably reveal visitors' real-time preferences when planning to visit a destination. More and more researchers have adopted tourism-related search engine data in the field of tourism prediction. However, few studies use search engine data to conduct cluster analysis to identify residents' choice toward a tourism destination. In the present study, 146 keywords related to “Beijing tourism” are obtained from Baidu index and principal component analysis (PCA) is applied to reduce the dimensionality of keywords obtained by Baidu index. Modified affinity propagation (MAP) clustering algorithm is used to classify provinces into several groups to identify the choice of residents to travel to Beijing. The result shows that residents in Hebei province are most likely to travel to Beijing. The cluster result also shows that PCA–MAP performs better than other clustering methods such as K-means, linkage, and Affinity Propogation (AP) in terms of silhouette coefficient and Calinski–Harabaz index. We also distinguish the difference of residents’ choice to travel to Beijing during the peak tourist season and off-season. The residents of Tianjing are inclined to travel to Beijing during the peak tourist season. The residents of Guangdong, Hebei, Henan, Jiangsu, Liaoning, Shanghai, Shandong, and Zhejiang have high attention to travel to Beijing during both seasons.http://www.sciencedirect.com/science/article/pii/S2666764921000138Principal component analysis (PCA)Affinity propagationBaidu index dataCluster analysis |
spellingShingle | Lin Wang Sirui Wang Zhe Yuan Lu Peng Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example Data Science and Management Principal component analysis (PCA) Affinity propagation Baidu index data Cluster analysis |
title | Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example |
title_full | Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example |
title_fullStr | Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example |
title_full_unstemmed | Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example |
title_short | Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example |
title_sort | analyzing potential tourist behavior using pca and modified affinity propagation clustering based on baidu index taking beijing city as an example |
topic | Principal component analysis (PCA) Affinity propagation Baidu index data Cluster analysis |
url | http://www.sciencedirect.com/science/article/pii/S2666764921000138 |
work_keys_str_mv | AT linwang analyzingpotentialtouristbehaviorusingpcaandmodifiedaffinitypropagationclusteringbasedonbaiduindextakingbeijingcityasanexample AT siruiwang analyzingpotentialtouristbehaviorusingpcaandmodifiedaffinitypropagationclusteringbasedonbaiduindextakingbeijingcityasanexample AT zheyuan analyzingpotentialtouristbehaviorusingpcaandmodifiedaffinitypropagationclusteringbasedonbaiduindextakingbeijingcityasanexample AT lupeng analyzingpotentialtouristbehaviorusingpcaandmodifiedaffinitypropagationclusteringbasedonbaiduindextakingbeijingcityasanexample |