Parallel Sequential Pattern Mining of Massive Trajectory Data

The trajectory pattern mining problem has recently attracted much attention due to the rapid development of location-acquisition technologies, and parallel computing essentially provides an alternative method for handling this problem. This study precisely addresses the problem of parallel mining of...

Full description

Bibliographic Details
Main Authors: Shaojie Qiao, Tianrui Li, Jing Peng
Format: Article
Language:English
Published: Springer 2010-09-01
Series:International Journal of Computational Intelligence Systems
Subjects:
Online Access:https://www.atlantis-press.com/article/1980.pdf
_version_ 1817982898871992320
author Shaojie Qiao
Tianrui Li
Jing Peng
author_facet Shaojie Qiao
Tianrui Li
Jing Peng
author_sort Shaojie Qiao
collection DOAJ
description The trajectory pattern mining problem has recently attracted much attention due to the rapid development of location-acquisition technologies, and parallel computing essentially provides an alternative method for handling this problem. This study precisely addresses the problem of parallel mining of trajectory sequential patterns based on the newly proposed concepts with regard to trajectory pattern mining. We propose an efficient and effective parallel sequential patterns mining (plute) algorithm that includes three essential techniques: prefix projection, data parallel formulation, and task parallel formulation. Firstly, the prefix projection technique is used to decompose the search space as well as greatly reduce the candidate trajectory sequences. Secondly, the data parallel formulation decomposes the computations associated with counting the support of trajectory patterns. Thirdly, the task parallel formulation employs the MapReduce programming model to assign the computations across a set of machines in a scalable and easy-to-use fashion. Based on the properties of parallel trajectory sequences, item pruning and sequence pruning strategies are applied to further prune the candidate sequences. Extensive experiments are conducted to evaluate the performance of plute in terms of parallel computing time and communication cost among processors. Experimental results show that plute outperforms the previously proposed parallel mining strategy (PartSpan) in mining massive trajectory data.
first_indexed 2024-04-13T23:26:04Z
format Article
id doaj.art-65ef6bda286f48b1a98305a8093f6dc6
institution Directory Open Access Journal
issn 1875-6883
language English
last_indexed 2024-04-13T23:26:04Z
publishDate 2010-09-01
publisher Springer
record_format Article
series International Journal of Computational Intelligence Systems
spelling doaj.art-65ef6bda286f48b1a98305a8093f6dc62022-12-22T02:25:03ZengSpringerInternational Journal of Computational Intelligence Systems1875-68832010-09-013310.2991/ijcis.2010.3.3.10Parallel Sequential Pattern Mining of Massive Trajectory DataShaojie QiaoTianrui LiJing PengThe trajectory pattern mining problem has recently attracted much attention due to the rapid development of location-acquisition technologies, and parallel computing essentially provides an alternative method for handling this problem. This study precisely addresses the problem of parallel mining of trajectory sequential patterns based on the newly proposed concepts with regard to trajectory pattern mining. We propose an efficient and effective parallel sequential patterns mining (plute) algorithm that includes three essential techniques: prefix projection, data parallel formulation, and task parallel formulation. Firstly, the prefix projection technique is used to decompose the search space as well as greatly reduce the candidate trajectory sequences. Secondly, the data parallel formulation decomposes the computations associated with counting the support of trajectory patterns. Thirdly, the task parallel formulation employs the MapReduce programming model to assign the computations across a set of machines in a scalable and easy-to-use fashion. Based on the properties of parallel trajectory sequences, item pruning and sequence pruning strategies are applied to further prune the candidate sequences. Extensive experiments are conducted to evaluate the performance of plute in terms of parallel computing time and communication cost among processors. Experimental results show that plute outperforms the previously proposed parallel mining strategy (PartSpan) in mining massive trajectory data.https://www.atlantis-press.com/article/1980.pdfparallel computing; trajectory sequential patterns; prefix projection; data parallel formulation; task parallel formulation; massive trajectory data
spellingShingle Shaojie Qiao
Tianrui Li
Jing Peng
Parallel Sequential Pattern Mining of Massive Trajectory Data
International Journal of Computational Intelligence Systems
parallel computing; trajectory sequential patterns; prefix projection; data parallel formulation; task parallel formulation; massive trajectory data
title Parallel Sequential Pattern Mining of Massive Trajectory Data
title_full Parallel Sequential Pattern Mining of Massive Trajectory Data
title_fullStr Parallel Sequential Pattern Mining of Massive Trajectory Data
title_full_unstemmed Parallel Sequential Pattern Mining of Massive Trajectory Data
title_short Parallel Sequential Pattern Mining of Massive Trajectory Data
title_sort parallel sequential pattern mining of massive trajectory data
topic parallel computing; trajectory sequential patterns; prefix projection; data parallel formulation; task parallel formulation; massive trajectory data
url https://www.atlantis-press.com/article/1980.pdf
work_keys_str_mv AT shaojieqiao parallelsequentialpatternminingofmassivetrajectorydata
AT tianruili parallelsequentialpatternminingofmassivetrajectorydata
AT jingpeng parallelsequentialpatternminingofmassivetrajectorydata