A data-driven approach to quantify disparities in power outages

Abstract This research proposes a data-driven approach to identify possible disparities in a utility’s outage management practices. The approach has been illustrated for an Investor-Owned Utility located in the Midwest region in the U.S. Power outage data for approximately 5 years between March 2017...

Full description

Bibliographic Details
Main Authors: Arkaprabha Bhattacharyya, Makarand Hastak
Format: Article
Language:English
Published: Nature Portfolio 2023-05-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-023-34186-9
_version_ 1797832087765516288
author Arkaprabha Bhattacharyya
Makarand Hastak
author_facet Arkaprabha Bhattacharyya
Makarand Hastak
author_sort Arkaprabha Bhattacharyya
collection DOAJ
description Abstract This research proposes a data-driven approach to identify possible disparities in a utility’s outage management practices. The approach has been illustrated for an Investor-Owned Utility located in the Midwest region in the U.S. Power outage data for approximately 5 years between March 2017 and January 2022 was collected for 36 ZIP/postal codes located within the utility’s service territory. The collected data was used to calculate the total number of outages, customers affected, and the duration of outages during those 5 years for each ZIP code. Next, each variable was normalized with respect to the population density of the ZIP code. After normalizing, a K-means clustering algorithm was implemented that created five clusters out of those 36 ZIP codes. The difference in the outage parameters was found to be statistically significant. This indicated differential experience with power outages in different ZIP codes. Next, three Generalized Linear Models were developed to test if the presence of critical facilities such as hospitals, 911 centers, and fire stations, as socioeconomic and demographic characteristics of the ZIP codes, can explain their differential experience with the power outage. It was found that the annual duration of outages is lower in the ZIP codes where critical facilities are located. On the other hand, ZIP codes with lower median household income have experienced more power outages, i.e., higher outage counts in those 5 years. Lastly, the ZIP codes with a higher percentage of the White population have experienced more severe outages that have affected more customers.
first_indexed 2024-04-09T14:02:08Z
format Article
id doaj.art-5e86d16cdfff499f91846f00e5a91756
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-04-09T14:02:08Z
publishDate 2023-05-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-5e86d16cdfff499f91846f00e5a917562023-05-07T11:12:58ZengNature PortfolioScientific Reports2045-23222023-05-0113111210.1038/s41598-023-34186-9A data-driven approach to quantify disparities in power outagesArkaprabha Bhattacharyya0Makarand Hastak1Lyles School of Civil Engineering, Purdue UniversityDivision of Construction Engineering and Management, Civil Engineering, Purdue UniversityAbstract This research proposes a data-driven approach to identify possible disparities in a utility’s outage management practices. The approach has been illustrated for an Investor-Owned Utility located in the Midwest region in the U.S. Power outage data for approximately 5 years between March 2017 and January 2022 was collected for 36 ZIP/postal codes located within the utility’s service territory. The collected data was used to calculate the total number of outages, customers affected, and the duration of outages during those 5 years for each ZIP code. Next, each variable was normalized with respect to the population density of the ZIP code. After normalizing, a K-means clustering algorithm was implemented that created five clusters out of those 36 ZIP codes. The difference in the outage parameters was found to be statistically significant. This indicated differential experience with power outages in different ZIP codes. Next, three Generalized Linear Models were developed to test if the presence of critical facilities such as hospitals, 911 centers, and fire stations, as socioeconomic and demographic characteristics of the ZIP codes, can explain their differential experience with the power outage. It was found that the annual duration of outages is lower in the ZIP codes where critical facilities are located. On the other hand, ZIP codes with lower median household income have experienced more power outages, i.e., higher outage counts in those 5 years. Lastly, the ZIP codes with a higher percentage of the White population have experienced more severe outages that have affected more customers.https://doi.org/10.1038/s41598-023-34186-9
spellingShingle Arkaprabha Bhattacharyya
Makarand Hastak
A data-driven approach to quantify disparities in power outages
Scientific Reports
title A data-driven approach to quantify disparities in power outages
title_full A data-driven approach to quantify disparities in power outages
title_fullStr A data-driven approach to quantify disparities in power outages
title_full_unstemmed A data-driven approach to quantify disparities in power outages
title_short A data-driven approach to quantify disparities in power outages
title_sort data driven approach to quantify disparities in power outages
url https://doi.org/10.1038/s41598-023-34186-9
work_keys_str_mv AT arkaprabhabhattacharyya adatadrivenapproachtoquantifydisparitiesinpoweroutages
AT makarandhastak adatadrivenapproachtoquantifydisparitiesinpoweroutages
AT arkaprabhabhattacharyya datadrivenapproachtoquantifydisparitiesinpoweroutages
AT makarandhastak datadrivenapproachtoquantifydisparitiesinpoweroutages