IoTDQ: An Industrial IoT Data Analysis Library for Apache IoTDB

There is a growing demand for time series data analysis in industry areas. Apache IoTDB is a time series database designed for the Internet of Things (IoT) with enhanced storage and I/O performance. With User-Defined Functions (UDF) provided, computation for time series can be executed on Apache IoT...

Full description

Bibliographic Details
Main Authors: Pengyu Chen, Wendi He, Wenxuan Ma, Xiangdong Huang, Chen Wang
Format: Article
Language:English
Published: Tsinghua University Press 2024-03-01
Series:Big Data Mining and Analytics
Subjects:
Online Access:https://www.sciopen.com/article/10.26599/BDMA.2023.9020010
_version_ 1797370354121834496
author Pengyu Chen
Wendi He
Wenxuan Ma
Xiangdong Huang
Chen Wang
author_facet Pengyu Chen
Wendi He
Wenxuan Ma
Xiangdong Huang
Chen Wang
author_sort Pengyu Chen
collection DOAJ
description There is a growing demand for time series data analysis in industry areas. Apache IoTDB is a time series database designed for the Internet of Things (IoT) with enhanced storage and I/O performance. With User-Defined Functions (UDF) provided, computation for time series can be executed on Apache IoTDB directly. To satisfy most of the common requirements in industrial time series analysis, we create a UDF library, IoTDQ, on Apache IoTDB. This library integrates stream computation functions on data quality analysis, data profiling, anomaly detection, data repairing, etc. IoTDQ enables users to conduct a wide range of analyses, such as monitoring, error diagnosis, equipment reliability analysis. It provides a framework for users to examine IoT time series with data quality problems. Experiments show that IoTDQ keeps the same level of performance compared to mainstream alternatives, and shortens I/O consumption for Apache IoTDB users.
first_indexed 2024-03-08T18:01:17Z
format Article
id doaj.art-6aa7d7abdd4c45f49ef93079c1fed53f
institution Directory Open Access Journal
issn 2096-0654
language English
last_indexed 2024-03-08T18:01:17Z
publishDate 2024-03-01
publisher Tsinghua University Press
record_format Article
series Big Data Mining and Analytics
spelling doaj.art-6aa7d7abdd4c45f49ef93079c1fed53f2024-01-02T01:34:00ZengTsinghua University PressBig Data Mining and Analytics2096-06542024-03-0171294110.26599/BDMA.2023.9020010IoTDQ: An Industrial IoT Data Analysis Library for Apache IoTDBPengyu Chen0Wendi He1Wenxuan Ma2Xiangdong Huang3Chen Wang4School of Software, Tsinghua University, Beijing 100084, ChinaSchool of Software, Tsinghua University, Beijing 100084, ChinaSchool of Software, Tsinghua University, Beijing 100084, ChinaSchool of Software, Tsinghua University, Beijing 100084, ChinaNational Engineering Research Center for Big Data Software (NERCBDS), Tsinghua University, Beijing 100084, ChinaThere is a growing demand for time series data analysis in industry areas. Apache IoTDB is a time series database designed for the Internet of Things (IoT) with enhanced storage and I/O performance. With User-Defined Functions (UDF) provided, computation for time series can be executed on Apache IoTDB directly. To satisfy most of the common requirements in industrial time series analysis, we create a UDF library, IoTDQ, on Apache IoTDB. This library integrates stream computation functions on data quality analysis, data profiling, anomaly detection, data repairing, etc. IoTDQ enables users to conduct a wide range of analyses, such as monitoring, error diagnosis, equipment reliability analysis. It provides a framework for users to examine IoT time series with data quality problems. Experiments show that IoTDQ keeps the same level of performance compared to mainstream alternatives, and shortens I/O consumption for Apache IoTDB users.https://www.sciopen.com/article/10.26599/BDMA.2023.9020010industrial big datadata qualitydata mining and analytics
spellingShingle Pengyu Chen
Wendi He
Wenxuan Ma
Xiangdong Huang
Chen Wang
IoTDQ: An Industrial IoT Data Analysis Library for Apache IoTDB
Big Data Mining and Analytics
industrial big data
data quality
data mining and analytics
title IoTDQ: An Industrial IoT Data Analysis Library for Apache IoTDB
title_full IoTDQ: An Industrial IoT Data Analysis Library for Apache IoTDB
title_fullStr IoTDQ: An Industrial IoT Data Analysis Library for Apache IoTDB
title_full_unstemmed IoTDQ: An Industrial IoT Data Analysis Library for Apache IoTDB
title_short IoTDQ: An Industrial IoT Data Analysis Library for Apache IoTDB
title_sort iotdq an industrial iot data analysis library for apache iotdb
topic industrial big data
data quality
data mining and analytics
url https://www.sciopen.com/article/10.26599/BDMA.2023.9020010
work_keys_str_mv AT pengyuchen iotdqanindustrialiotdataanalysislibraryforapacheiotdb
AT wendihe iotdqanindustrialiotdataanalysislibraryforapacheiotdb
AT wenxuanma iotdqanindustrialiotdataanalysislibraryforapacheiotdb
AT xiangdonghuang iotdqanindustrialiotdataanalysislibraryforapacheiotdb
AT chenwang iotdqanindustrialiotdataanalysislibraryforapacheiotdb