Summary: | The <i>location-based aggregate queries</i>, consisting of the <i>shortest average distance query</i> (<i>SAvgDQ</i>), the <i>shortest minimal distance query</i> (<i>SMinDQ</i>), the <i>shortest maximal distance query</i> (<i>SMaxDQ</i>), and the <i>shortest sum distance query</i> (<i>SSumDQ</i>) are new types of location-based queries. Such queries can be used to provide the user with useful object information by considering both the spatial closeness of objects to the query object and the neighboring relationship between objects. Due to a large amount of location-based aggregate queries that need to be evaluated concurrently, the centralized processing system would suffer a heavy query load, leading eventually to poor performance. As a result, in this paper, we focus on developing the distributed processing technique to answer multiple location-based aggregate queries, based on the <i>MapReduce</i> platform. We first design a grid structure to manage information of objects by taking into account the storage balance, and then develop a distributed processing algorithm, namely the <i>MapReduce-based aggregate query algorithm</i> (<i>MRAggQ algorithm</i>), to efficiently process the location-based aggregate queries in a distributed manner. Extensive experiments using synthetic and real datasets are conducted to demonstrate the scalability and the efficiency of the proposed processing algorithm.
|