Efficient spatial data partitioning for distributed $$k$$ k NN joins

Efficient spatial data partitioning for distributed $$k$$ k NN joins

Abstract Parallel processing of large spatial datasets over distributed systems has become a core part of modern data analytic systems like Apache Hadoop and Apache Spark. The general-purpose design of these systems does not natively account for the data’s spatial attributes and results in poor scal...

Full description

Bibliographic Details
Main Authors:	Ayman Zeidan, Huy T. Vo
Format:	Article
Language:	English
Published:	SpringerOpen 2022-06-01
Series:	Journal of Big Data
Subjects:	Big data Spatial data Spatial query Indexing Partitioning Technique
Online Access:	https://doi.org/10.1186/s40537-022-00587-2

Similar Items

R*-Grove: Balanced Spatial Partitioning for Large-Scale Datasets
by: Tin Vu, et al.
Published: (2020-08-01)

Efficient Group <i>K</i> Nearest-Neighbor Spatial Query Processing in Apache Spark
by: Panagiotis Moutafis, et al.
Published: (2021-11-01)

CoPart: a context-based partitioning technique for big data
by: Sara Migliorini, et al.
Published: (2021-01-01)

A PID-Based kNN Query Processing Algorithm for Spatial Data
by: Baiyou Qiao, et al.
Published: (2022-10-01)

Trajectory Clustering and <i>k</i>-NN for Robust Privacy Preserving <i>k</i>-NN Query Processing in GeoSpark
by: Elias Dritsas, et al.
Published: (2020-07-01)

Distributed Distance Join Algorithm for Massive Spatial Data
by: WANG Ru-bin, LI Rui-yuan, HE Hua-jun, LIU Tong, LI Tian-rui
Published: (2022-01-01)

Skewness-Based Partitioning in SpatialHadoop
by: Alberto Belussi, et al.
Published: (2020-03-01)

Vector Spatial Big Data Storage and Optimized Query Based on the Multi-Level Hilbert Grid Index in HBase
by: Hua Jiang, et al.
Published: (2018-05-01)

Clustering-based method for big spatial data partitioning
by: Alaa Aldin Zein, et al.
Published: (2023-06-01)

OPTIMIZATION OF LOCATION BASED QUERIES USING SPATIAL INDEXING
by: S. Geetha, et al.
Published: (2014-04-01)

Quadrant-Based Minimum Bounding Rectangle-Tree Indexing Method for Similarity Queries over Big Spatial Data in HBase
by: Bumjoon Jo, et al.
Published: (2018-09-01)

SparkNN: A Distributed In-Memory Data Partitioning for KNN Queries on Big Spatial Data
by: Zaher Al Aghbari, et al.
Published: (2020-08-01)

Spatial Operations
by: Anda VELICANU
Published: (2010-09-01)

A MapReduce-Based Big Spatial Data Framework for Solving the Problem of Covering a Polygon with Orthogonal Rectangles
by: Süleyman Eken, et al.
Published: (2019-01-01)

An Enhanced Partitioning Approach in SpatialHadoop for Handling Big Spatial Data
by: Abdulaziz Shehab, et al.
Published: (2023-02-01)

GeoSpark SQL: An Effective Framework Enabling Spatial Queries on Spark
by: Zhou Huang, et al.
Published: (2017-09-01)

LocationSpark: In-memory Distributed Spatial Query Processing and Optimization
by: Mingjie Tang, et al.
Published: (2020-10-01)

Supporting Efficient Family Joins for Big Data Tables via Multiple Freedom Family Index
by: Qiang Zhu, et al.
Published: (2025-01-01)

Spatial Concept Query Based on Lattice-Tree
by: Aopeng Xu, et al.
Published: (2022-05-01)

Enabling Efficient Distributed Spatial Join on Large Scale Vector-Raster Data Lakes
by: Sebastian Villarroya, et al.
Published: (2022-01-01)

Survey of Continuous Queries over Spatial-Textual Data Streams
by: YANG Rong, NIU Baoning
Published: (2021-04-01)

A personalized query method for spatial keywords in indoor environments
by: Liping Zhang, et al.
Published: (2024-11-01)

A Distributed Air Index Based on Maximum Boundary Rectangle over Grid-Cells for Wireless Non-Flat Spatial Data Broadcast
by: Seokjin Im, et al.
Published: (2014-06-01)

Top-k Spatial Preference Queries in Directed Road Networks
by: Muhammad Attique, et al.
Published: (2016-09-01)

HiIndex: An Efficient Spatial Index for Rapid Visualization of Large-Scale Geographic Vector Data
by: Zebang Liu, et al.
Published: (2021-09-01)

cKd-tree: A Compact Kd-tree
by: Gilberto Gutierrez, et al.
Published: (2024-01-01)

A Hierarchical Spatial Network Index for Arbitrarily Distributed Spatial Objects
by: Xiangqiang Min, et al.
Published: (2021-12-01)

Grand challenges for the spatial information community
by: Leye Wang, et al.
Published: (2020-06-01)

Evaluating partitioning and bucketing strategies for Hive-based Big Data Warehousing systems
by: Eduarda Costa, et al.
Published: (2019-05-01)

Computation of High-Frequency Sub-National Spatial Consumer Price Indexes Using Web Scraping Techniques
by: Ilaria Benedetti, et al.
Published: (2022-04-01)

RBSEP: a reassignment and buffer based streaming edge partitioning approach
by: Monireh Taimouri, et al.
Published: (2019-10-01)

Construct Trip Graphs by Using Taxi Trajectory Data
by: Hao Yu, et al.
Published: (2023-02-01)

Operator parallel optimization strategy for distributed databases
by: LIU Wenjie, et al.
Published: (2024-06-01)

An LSM-Tree Index for Spatial Data
by: Junjun He, et al.
Published: (2022-03-01)

The RLR-tree: a reinforcement learning based R-tree for spatial data
by: Gu, Tu, et al.
Published: (2023)

QRB-tree Indexing: Optimized Spatial Index Expanding upon the QR-tree Index
by: Jieqing Yu, et al.
Published: (2021-10-01)

A Dynamic Data Structure to Efficiently Find the Points below a Line and Estimate Their Number
by: Bart Kuijpers, et al.
Published: (2017-03-01)

Distance-Constraint k-Nearest Neighbor Searching in Mobile Sensor Networks
by: Yongkoo Han, et al.
Published: (2015-07-01)

In-Path Oracles for Road Networks
by: Debajyoti Ghosh, et al.
Published: (2023-07-01)

DAPR-tree: a distributed spatial data indexing scheme with data access patterns to support Digital Earth initiatives
by: Jizhe Xia, et al.
Published: (2020-12-01)