A data-driven approach to quantify disparities in power outages
Abstract This research proposes a data-driven approach to identify possible disparities in a utility’s outage management practices. The approach has been illustrated for an Investor-Owned Utility located in the Midwest region in the U.S. Power outage data for approximately 5 years between March 2017...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Portfolio
2023-05-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-023-34186-9 |
_version_ | 1797832087765516288 |
---|---|
author | Arkaprabha Bhattacharyya Makarand Hastak |
author_facet | Arkaprabha Bhattacharyya Makarand Hastak |
author_sort | Arkaprabha Bhattacharyya |
collection | DOAJ |
description | Abstract This research proposes a data-driven approach to identify possible disparities in a utility’s outage management practices. The approach has been illustrated for an Investor-Owned Utility located in the Midwest region in the U.S. Power outage data for approximately 5 years between March 2017 and January 2022 was collected for 36 ZIP/postal codes located within the utility’s service territory. The collected data was used to calculate the total number of outages, customers affected, and the duration of outages during those 5 years for each ZIP code. Next, each variable was normalized with respect to the population density of the ZIP code. After normalizing, a K-means clustering algorithm was implemented that created five clusters out of those 36 ZIP codes. The difference in the outage parameters was found to be statistically significant. This indicated differential experience with power outages in different ZIP codes. Next, three Generalized Linear Models were developed to test if the presence of critical facilities such as hospitals, 911 centers, and fire stations, as socioeconomic and demographic characteristics of the ZIP codes, can explain their differential experience with the power outage. It was found that the annual duration of outages is lower in the ZIP codes where critical facilities are located. On the other hand, ZIP codes with lower median household income have experienced more power outages, i.e., higher outage counts in those 5 years. Lastly, the ZIP codes with a higher percentage of the White population have experienced more severe outages that have affected more customers. |
first_indexed | 2024-04-09T14:02:08Z |
format | Article |
id | doaj.art-5e86d16cdfff499f91846f00e5a91756 |
institution | Directory Open Access Journal |
issn | 2045-2322 |
language | English |
last_indexed | 2024-04-09T14:02:08Z |
publishDate | 2023-05-01 |
publisher | Nature Portfolio |
record_format | Article |
series | Scientific Reports |
spelling | doaj.art-5e86d16cdfff499f91846f00e5a917562023-05-07T11:12:58ZengNature PortfolioScientific Reports2045-23222023-05-0113111210.1038/s41598-023-34186-9A data-driven approach to quantify disparities in power outagesArkaprabha Bhattacharyya0Makarand Hastak1Lyles School of Civil Engineering, Purdue UniversityDivision of Construction Engineering and Management, Civil Engineering, Purdue UniversityAbstract This research proposes a data-driven approach to identify possible disparities in a utility’s outage management practices. The approach has been illustrated for an Investor-Owned Utility located in the Midwest region in the U.S. Power outage data for approximately 5 years between March 2017 and January 2022 was collected for 36 ZIP/postal codes located within the utility’s service territory. The collected data was used to calculate the total number of outages, customers affected, and the duration of outages during those 5 years for each ZIP code. Next, each variable was normalized with respect to the population density of the ZIP code. After normalizing, a K-means clustering algorithm was implemented that created five clusters out of those 36 ZIP codes. The difference in the outage parameters was found to be statistically significant. This indicated differential experience with power outages in different ZIP codes. Next, three Generalized Linear Models were developed to test if the presence of critical facilities such as hospitals, 911 centers, and fire stations, as socioeconomic and demographic characteristics of the ZIP codes, can explain their differential experience with the power outage. It was found that the annual duration of outages is lower in the ZIP codes where critical facilities are located. On the other hand, ZIP codes with lower median household income have experienced more power outages, i.e., higher outage counts in those 5 years. Lastly, the ZIP codes with a higher percentage of the White population have experienced more severe outages that have affected more customers.https://doi.org/10.1038/s41598-023-34186-9 |
spellingShingle | Arkaprabha Bhattacharyya Makarand Hastak A data-driven approach to quantify disparities in power outages Scientific Reports |
title | A data-driven approach to quantify disparities in power outages |
title_full | A data-driven approach to quantify disparities in power outages |
title_fullStr | A data-driven approach to quantify disparities in power outages |
title_full_unstemmed | A data-driven approach to quantify disparities in power outages |
title_short | A data-driven approach to quantify disparities in power outages |
title_sort | data driven approach to quantify disparities in power outages |
url | https://doi.org/10.1038/s41598-023-34186-9 |
work_keys_str_mv | AT arkaprabhabhattacharyya adatadrivenapproachtoquantifydisparitiesinpoweroutages AT makarandhastak adatadrivenapproachtoquantifydisparitiesinpoweroutages AT arkaprabhabhattacharyya datadrivenapproachtoquantifydisparitiesinpoweroutages AT makarandhastak datadrivenapproachtoquantifydisparitiesinpoweroutages |