Parallel Sequential Pattern Mining of Massive Trajectory Data

The trajectory pattern mining problem has recently attracted much attention due to the rapid development of location-acquisition technologies, and parallel computing essentially provides an alternative method for handling this problem. This study precisely addresses the problem of parallel mining of...

Full description

Bibliographic Details
Main Authors:	Shaojie Qiao, Tianrui Li, Jing Peng
Format:	Article
Language:	English
Published:	Springer 2010-09-01
Series:	International Journal of Computational Intelligence Systems
Subjects:	parallel computing; trajectory sequential patterns; prefix projection; data parallel formulation; task parallel formulation; massive trajectory data
Online Access:	https://www.atlantis-press.com/article/1980.pdf

_version_	1817982898871992320
author	Shaojie Qiao Tianrui Li Jing Peng
author_facet	Shaojie Qiao Tianrui Li Jing Peng
author_sort	Shaojie Qiao
collection	DOAJ
description	The trajectory pattern mining problem has recently attracted much attention due to the rapid development of location-acquisition technologies, and parallel computing essentially provides an alternative method for handling this problem. This study precisely addresses the problem of parallel mining of trajectory sequential patterns based on the newly proposed concepts with regard to trajectory pattern mining. We propose an efficient and effective parallel sequential patterns mining (plute) algorithm that includes three essential techniques: prefix projection, data parallel formulation, and task parallel formulation. Firstly, the prefix projection technique is used to decompose the search space as well as greatly reduce the candidate trajectory sequences. Secondly, the data parallel formulation decomposes the computations associated with counting the support of trajectory patterns. Thirdly, the task parallel formulation employs the MapReduce programming model to assign the computations across a set of machines in a scalable and easy-to-use fashion. Based on the properties of parallel trajectory sequences, item pruning and sequence pruning strategies are applied to further prune the candidate sequences. Extensive experiments are conducted to evaluate the performance of plute in terms of parallel computing time and communication cost among processors. Experimental results show that plute outperforms the previously proposed parallel mining strategy (PartSpan) in mining massive trajectory data.
first_indexed	2024-04-13T23:26:04Z
format	Article
id	doaj.art-65ef6bda286f48b1a98305a8093f6dc6
institution	Directory Open Access Journal
issn	1875-6883
language	English
last_indexed	2024-04-13T23:26:04Z
publishDate	2010-09-01
publisher	Springer
record_format	Article
series	International Journal of Computational Intelligence Systems
spelling	doaj.art-65ef6bda286f48b1a98305a8093f6dc62022-12-22T02:25:03ZengSpringerInternational Journal of Computational Intelligence Systems1875-68832010-09-013310.2991/ijcis.2010.3.3.10Parallel Sequential Pattern Mining of Massive Trajectory DataShaojie QiaoTianrui LiJing PengThe trajectory pattern mining problem has recently attracted much attention due to the rapid development of location-acquisition technologies, and parallel computing essentially provides an alternative method for handling this problem. This study precisely addresses the problem of parallel mining of trajectory sequential patterns based on the newly proposed concepts with regard to trajectory pattern mining. We propose an efficient and effective parallel sequential patterns mining (plute) algorithm that includes three essential techniques: prefix projection, data parallel formulation, and task parallel formulation. Firstly, the prefix projection technique is used to decompose the search space as well as greatly reduce the candidate trajectory sequences. Secondly, the data parallel formulation decomposes the computations associated with counting the support of trajectory patterns. Thirdly, the task parallel formulation employs the MapReduce programming model to assign the computations across a set of machines in a scalable and easy-to-use fashion. Based on the properties of parallel trajectory sequences, item pruning and sequence pruning strategies are applied to further prune the candidate sequences. Extensive experiments are conducted to evaluate the performance of plute in terms of parallel computing time and communication cost among processors. Experimental results show that plute outperforms the previously proposed parallel mining strategy (PartSpan) in mining massive trajectory data.https://www.atlantis-press.com/article/1980.pdfparallel computing; trajectory sequential patterns; prefix projection; data parallel formulation; task parallel formulation; massive trajectory data
spellingShingle	Shaojie Qiao Tianrui Li Jing Peng Parallel Sequential Pattern Mining of Massive Trajectory Data International Journal of Computational Intelligence Systems parallel computing; trajectory sequential patterns; prefix projection; data parallel formulation; task parallel formulation; massive trajectory data
title	Parallel Sequential Pattern Mining of Massive Trajectory Data
title_full	Parallel Sequential Pattern Mining of Massive Trajectory Data
title_fullStr	Parallel Sequential Pattern Mining of Massive Trajectory Data
title_full_unstemmed	Parallel Sequential Pattern Mining of Massive Trajectory Data
title_short	Parallel Sequential Pattern Mining of Massive Trajectory Data
title_sort	parallel sequential pattern mining of massive trajectory data
topic	parallel computing; trajectory sequential patterns; prefix projection; data parallel formulation; task parallel formulation; massive trajectory data
url	https://www.atlantis-press.com/article/1980.pdf
work_keys_str_mv	AT shaojieqiao parallelsequentialpatternminingofmassivetrajectorydata AT tianruili parallelsequentialpatternminingofmassivetrajectorydata AT jingpeng parallelsequentialpatternminingofmassivetrajectorydata

Parallel Sequential Pattern Mining of Massive Trajectory Data

Similar Items