QoS-Aware Approximate Query Processing for Smart Cities Spatial Data Streams

Large amounts of georeferenced data streams arrive daily to stream processing systems. This is attributable to the overabundance of affordable IoT devices. In addition, interested practitioners desire to exploit Internet of Things (IoT) data streams for strategic decision-making purposes. However, m...

Full description

Bibliographic Details
Main Authors: Isam Mashhour Al Jawarneh, Paolo Bellavista, Antonio Corradi, Luca Foschini, Rebecca Montanari
Format: Article
Language:English
Published: MDPI AG 2021-06-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/21/12/4160
_version_ 1797529774782939136
author Isam Mashhour Al Jawarneh
Paolo Bellavista
Antonio Corradi
Luca Foschini
Rebecca Montanari
author_facet Isam Mashhour Al Jawarneh
Paolo Bellavista
Antonio Corradi
Luca Foschini
Rebecca Montanari
author_sort Isam Mashhour Al Jawarneh
collection DOAJ
description Large amounts of georeferenced data streams arrive daily to stream processing systems. This is attributable to the overabundance of affordable IoT devices. In addition, interested practitioners desire to exploit Internet of Things (IoT) data streams for strategic decision-making purposes. However, mobility data are highly skewed and their arrival rates fluctuate. This nature poses an extra challenge on data stream processing systems, which are required in order to achieve pre-specified latency and accuracy goals. In this paper, we propose ApproxSSPS, which is a system for approximate processing of geo-referenced mobility data, at scale with quality of service guarantees. We focus on stateful aggregations (e.g., means, counts) and top-N queries. ApproxSSPS features a controller that interactively learns the latency statistics and calculates proper sampling rates to meet latency or/and accuracy targets. An overarching trait of ApproxSSPS is its ability to strike a plausible balance between latency and accuracy targets. We evaluate ApproxSSPS on Apache Spark Structured Streaming with real mobility data. We also compared ApproxSSPS against a state-of-the-art online adaptive processing system. Our extensive experiments prove that ApproxSSPS can fulfill latency and accuracy targets with varying sets of parameter configurations and load intensities (i.e., transient peaks in data loads versus slow arriving streams). Moreover, our results show that ApproxSSPS outperforms the baseline counterpart by significant magnitudes. In short, ApproxSSPS is a novel spatial data stream processing system that can deliver real accurate results in a timely manner, by dynamically specifying the limits on data samples.
first_indexed 2024-03-10T10:19:39Z
format Article
id doaj.art-8b9f925e1a634801b07285364308f31c
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-10T10:19:39Z
publishDate 2021-06-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-8b9f925e1a634801b07285364308f31c2023-11-22T00:32:58ZengMDPI AGSensors1424-82202021-06-012112416010.3390/s21124160QoS-Aware Approximate Query Processing for Smart Cities Spatial Data StreamsIsam Mashhour Al Jawarneh0Paolo Bellavista1Antonio Corradi2Luca Foschini3Rebecca Montanari4Dipartimento di Informatica—Scienza e Ingegneria, University of Bologna, Viale del Risorgimento 2, 40136 Bologna, ItalyDipartimento di Informatica—Scienza e Ingegneria, University of Bologna, Viale del Risorgimento 2, 40136 Bologna, ItalyDipartimento di Informatica—Scienza e Ingegneria, University of Bologna, Viale del Risorgimento 2, 40136 Bologna, ItalyDipartimento di Informatica—Scienza e Ingegneria, University of Bologna, Viale del Risorgimento 2, 40136 Bologna, ItalyDipartimento di Informatica—Scienza e Ingegneria, University of Bologna, Viale del Risorgimento 2, 40136 Bologna, ItalyLarge amounts of georeferenced data streams arrive daily to stream processing systems. This is attributable to the overabundance of affordable IoT devices. In addition, interested practitioners desire to exploit Internet of Things (IoT) data streams for strategic decision-making purposes. However, mobility data are highly skewed and their arrival rates fluctuate. This nature poses an extra challenge on data stream processing systems, which are required in order to achieve pre-specified latency and accuracy goals. In this paper, we propose ApproxSSPS, which is a system for approximate processing of geo-referenced mobility data, at scale with quality of service guarantees. We focus on stateful aggregations (e.g., means, counts) and top-N queries. ApproxSSPS features a controller that interactively learns the latency statistics and calculates proper sampling rates to meet latency or/and accuracy targets. An overarching trait of ApproxSSPS is its ability to strike a plausible balance between latency and accuracy targets. We evaluate ApproxSSPS on Apache Spark Structured Streaming with real mobility data. We also compared ApproxSSPS against a state-of-the-art online adaptive processing system. Our extensive experiments prove that ApproxSSPS can fulfill latency and accuracy targets with varying sets of parameter configurations and load intensities (i.e., transient peaks in data loads versus slow arriving streams). Moreover, our results show that ApproxSSPS outperforms the baseline counterpart by significant magnitudes. In short, ApproxSSPS is a novel spatial data stream processing system that can deliver real accurate results in a timely manner, by dynamically specifying the limits on data samples.https://www.mdpi.com/1424-8220/21/12/4160mobility dataApache Sparkapproximate query processingspatial dataInternet of Thingssampling
spellingShingle Isam Mashhour Al Jawarneh
Paolo Bellavista
Antonio Corradi
Luca Foschini
Rebecca Montanari
QoS-Aware Approximate Query Processing for Smart Cities Spatial Data Streams
Sensors
mobility data
Apache Spark
approximate query processing
spatial data
Internet of Things
sampling
title QoS-Aware Approximate Query Processing for Smart Cities Spatial Data Streams
title_full QoS-Aware Approximate Query Processing for Smart Cities Spatial Data Streams
title_fullStr QoS-Aware Approximate Query Processing for Smart Cities Spatial Data Streams
title_full_unstemmed QoS-Aware Approximate Query Processing for Smart Cities Spatial Data Streams
title_short QoS-Aware Approximate Query Processing for Smart Cities Spatial Data Streams
title_sort qos aware approximate query processing for smart cities spatial data streams
topic mobility data
Apache Spark
approximate query processing
spatial data
Internet of Things
sampling
url https://www.mdpi.com/1424-8220/21/12/4160
work_keys_str_mv AT isammashhouraljawarneh qosawareapproximatequeryprocessingforsmartcitiesspatialdatastreams
AT paolobellavista qosawareapproximatequeryprocessingforsmartcitiesspatialdatastreams
AT antoniocorradi qosawareapproximatequeryprocessingforsmartcitiesspatialdatastreams
AT lucafoschini qosawareapproximatequeryprocessingforsmartcitiesspatialdatastreams
AT rebeccamontanari qosawareapproximatequeryprocessingforsmartcitiesspatialdatastreams