Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques
Worldwide, electricity production exceeds its consumption which leads to wasted financial and energy resources. Machine learning models can be utilized to predict the future consumption to avoid these significant losses. This paper presents the data for the monthly electricity consumption on the com...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2023-12-01
|
Series: | Data in Brief |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352340923007898 |
_version_ | 1797429960417214464 |
---|---|
author | Mariam Hosny Omnia Abu Waraga Manar Abu Talib Mohamed Abdallah |
author_facet | Mariam Hosny Omnia Abu Waraga Manar Abu Talib Mohamed Abdallah |
author_sort | Mariam Hosny |
collection | DOAJ |
description | Worldwide, electricity production exceeds its consumption which leads to wasted financial and energy resources. Machine learning models can be utilized to predict the future consumption to avoid these significant losses. This paper presents the data for the monthly electricity consumption on the community level during May 2017–December 2019 in Dubai, United Arab Emirates. It was acquired from Dubai Pulse, an online repository containing consumption data from Dubai Electricity and Water Authority which provides utility services to the Emirate. Multiple parameters, such as population and number of buildings, were acquired from Dubai Statistics Center in addition to temperature which was obtained from Dubai International Airport. Additional features, such as expatriate ratio, number of customers, and building occupancy, were computed from the available data and utilized to generate a dataset towards accurate prediction. Various linear regression variants, support vector machines, decision tree models, ensemble models, and neural networks were implemented to forecast electricity consumption. The models were trained on two different formats of the same dataset, which were generated by sorting the data with respect to time, named as temporally ordered dataset, and by randomly dividing the data, labelled as randomly split dataset. In addition, the dependence of the models on the amount of data was identified by varying the size of the testing data. Moreover, two cross-validation (CV) procedures, namely rolling CV method and moving CV method, were applied to assess the reliability of the models. All analyses were evaluated by utilizing several performance metrics, namely root mean squared error, coefficient of determination, i.e., R2, 10-fold CV score, mean absolute error, median absolute error, and computational time. Furthermore, this data could be utilized to analyze the effect of coronavirus disease 2019 (COVID-19) prevention measures in Dubai on electricity usage as well as evaluate the consumption patterns at the consumer level. |
first_indexed | 2024-03-09T09:21:48Z |
format | Article |
id | doaj.art-13b8129aab484160af532b9224b4b366 |
institution | Directory Open Access Journal |
issn | 2352-3409 |
language | English |
last_indexed | 2024-03-09T09:21:48Z |
publishDate | 2023-12-01 |
publisher | Elsevier |
record_format | Article |
series | Data in Brief |
spelling | doaj.art-13b8129aab484160af532b9224b4b3662023-12-02T07:00:10ZengElsevierData in Brief2352-34092023-12-0151109718Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniquesMariam Hosny0Omnia Abu Waraga1Manar Abu Talib2Mohamed Abdallah3Department of Civil and Environmental Engineering, University of Sharjah, United Arab EmiratesDepartment of Computer Science, University of Sharjah, United Arab EmiratesDepartment of Computer Science, University of Sharjah, United Arab Emirates; Corresponding author.Department of Civil and Environmental Engineering, University of Sharjah, United Arab EmiratesWorldwide, electricity production exceeds its consumption which leads to wasted financial and energy resources. Machine learning models can be utilized to predict the future consumption to avoid these significant losses. This paper presents the data for the monthly electricity consumption on the community level during May 2017–December 2019 in Dubai, United Arab Emirates. It was acquired from Dubai Pulse, an online repository containing consumption data from Dubai Electricity and Water Authority which provides utility services to the Emirate. Multiple parameters, such as population and number of buildings, were acquired from Dubai Statistics Center in addition to temperature which was obtained from Dubai International Airport. Additional features, such as expatriate ratio, number of customers, and building occupancy, were computed from the available data and utilized to generate a dataset towards accurate prediction. Various linear regression variants, support vector machines, decision tree models, ensemble models, and neural networks were implemented to forecast electricity consumption. The models were trained on two different formats of the same dataset, which were generated by sorting the data with respect to time, named as temporally ordered dataset, and by randomly dividing the data, labelled as randomly split dataset. In addition, the dependence of the models on the amount of data was identified by varying the size of the testing data. Moreover, two cross-validation (CV) procedures, namely rolling CV method and moving CV method, were applied to assess the reliability of the models. All analyses were evaluated by utilizing several performance metrics, namely root mean squared error, coefficient of determination, i.e., R2, 10-fold CV score, mean absolute error, median absolute error, and computational time. Furthermore, this data could be utilized to analyze the effect of coronavirus disease 2019 (COVID-19) prevention measures in Dubai on electricity usage as well as evaluate the consumption patterns at the consumer level.http://www.sciencedirect.com/science/article/pii/S2352340923007898Electricity consumption predictionMachine learning modelsArtificial neural networkCross validation |
spellingShingle | Mariam Hosny Omnia Abu Waraga Manar Abu Talib Mohamed Abdallah Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques Data in Brief Electricity consumption prediction Machine learning models Artificial neural network Cross validation |
title | Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques |
title_full | Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques |
title_fullStr | Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques |
title_full_unstemmed | Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques |
title_short | Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques |
title_sort | simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques |
topic | Electricity consumption prediction Machine learning models Artificial neural network Cross validation |
url | http://www.sciencedirect.com/science/article/pii/S2352340923007898 |
work_keys_str_mv | AT mariamhosny simulationofelectricityconsumptiondatausingmultipleartificialintelligencemodelsandcrossvalidationtechniques AT omniaabuwaraga simulationofelectricityconsumptiondatausingmultipleartificialintelligencemodelsandcrossvalidationtechniques AT manarabutalib simulationofelectricityconsumptiondatausingmultipleartificialintelligencemodelsandcrossvalidationtechniques AT mohamedabdallah simulationofelectricityconsumptiondatausingmultipleartificialintelligencemodelsandcrossvalidationtechniques |