Spatial-Temporal Distribution Analysis of Industrial Heat Sources in the US with Geocoded, Tree-Based, Large-Scale Clustering

Heavy industrial burning contributes significantly to the greenhouse gas (GHG) emissions. It is responsible for almost one-quarter of the global energy-related CO<sub>2</sub> emissions and its share continues to grow. Mostly, those industrial emissions are accompanied by a great deal of...

Full description

Bibliographic Details
Main Authors: Yan Ma, Caihong Ma, Peng Liu, Jin Yang, Yuzhu Wang, Yueqin Zhu, Xiaoping Du
Format: Article
Language:English
Published: MDPI AG 2020-09-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/12/18/3069
_version_ 1797553230613315584
author Yan Ma
Caihong Ma
Peng Liu
Jin Yang
Yuzhu Wang
Yueqin Zhu
Xiaoping Du
author_facet Yan Ma
Caihong Ma
Peng Liu
Jin Yang
Yuzhu Wang
Yueqin Zhu
Xiaoping Du
author_sort Yan Ma
collection DOAJ
description Heavy industrial burning contributes significantly to the greenhouse gas (GHG) emissions. It is responsible for almost one-quarter of the global energy-related CO<sub>2</sub> emissions and its share continues to grow. Mostly, those industrial emissions are accompanied by a great deal of high-temperature heat emissions from the combustion of carbon-based fuels by steel, petrochemical, or cement plants. Fortunately, these industrial heat emission sources treated as thermal anomalies can be detected by satellite-borne sensors in a quantitive way. However, most of the dominant remote sensing-based fire detection methods barely work well for heavy industrial heat source discernment. Although the object-oriented approach, especially the data clustering-based approach, has guided a novel method of detection, it is still limited by the costly computation and storage resources. Furthermore, when scaling to a national, or even global, long time-series detection, it is greatly challenged by the tremendous computation introduced by the incredible large-scale data clustering of tens of millions of high-dimensional fire data points. Therefore, we proposed an improved parallel identification method with geocoded, task-tree-based, large-scale clustering for the spatial-temporal distribution analysis of industrial heat emitters across the United States from long time-series active Visible Infrared Imaging Radiometer Suite (VIIRS) data. A recursive <i>k</i>-means clustering method is introduced to gradually segment and cluster industrial heat objects. Furthermore, in order to avoid the blindness caused by random cluster center initialization, the time series VIIRS hotspots data are spatially pre-grouped into GeoSOT-encoded grid tasks which are also treated as initial clustering objects. In addition, some grouped parallel clustering strategy together with geocoding-aware task tree scheduling is adopted to sufficiently exploit parallelism and performance optimization. Then, the spatial-temporal distribution pattern and its changing trend of industrial heat emitters across the United States are analyzed with the identified industrial heat sources. Eventually, the performance experiment also demonstrated the efficiency and encouraging scalability of this approach.
first_indexed 2024-03-10T16:12:27Z
format Article
id doaj.art-418ec69a815e42fbb4b4675c67e40bc2
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-10T16:12:27Z
publishDate 2020-09-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-418ec69a815e42fbb4b4675c67e40bc22023-11-20T14:20:06ZengMDPI AGRemote Sensing2072-42922020-09-011218306910.3390/rs12183069Spatial-Temporal Distribution Analysis of Industrial Heat Sources in the US with Geocoded, Tree-Based, Large-Scale ClusteringYan Ma0Caihong Ma1Peng Liu2Jin Yang3Yuzhu Wang4Yueqin Zhu5Xiaoping Du6Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, ChinaSchool of Information Engineering, China University of Geosciences, Beijing 100083, ChinaDevelopment Research Center of China Geological Surve, Beijing 100037, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, ChinaHeavy industrial burning contributes significantly to the greenhouse gas (GHG) emissions. It is responsible for almost one-quarter of the global energy-related CO<sub>2</sub> emissions and its share continues to grow. Mostly, those industrial emissions are accompanied by a great deal of high-temperature heat emissions from the combustion of carbon-based fuels by steel, petrochemical, or cement plants. Fortunately, these industrial heat emission sources treated as thermal anomalies can be detected by satellite-borne sensors in a quantitive way. However, most of the dominant remote sensing-based fire detection methods barely work well for heavy industrial heat source discernment. Although the object-oriented approach, especially the data clustering-based approach, has guided a novel method of detection, it is still limited by the costly computation and storage resources. Furthermore, when scaling to a national, or even global, long time-series detection, it is greatly challenged by the tremendous computation introduced by the incredible large-scale data clustering of tens of millions of high-dimensional fire data points. Therefore, we proposed an improved parallel identification method with geocoded, task-tree-based, large-scale clustering for the spatial-temporal distribution analysis of industrial heat emitters across the United States from long time-series active Visible Infrared Imaging Radiometer Suite (VIIRS) data. A recursive <i>k</i>-means clustering method is introduced to gradually segment and cluster industrial heat objects. Furthermore, in order to avoid the blindness caused by random cluster center initialization, the time series VIIRS hotspots data are spatially pre-grouped into GeoSOT-encoded grid tasks which are also treated as initial clustering objects. In addition, some grouped parallel clustering strategy together with geocoding-aware task tree scheduling is adopted to sufficiently exploit parallelism and performance optimization. Then, the spatial-temporal distribution pattern and its changing trend of industrial heat emitters across the United States are analyzed with the identified industrial heat sources. Eventually, the performance experiment also demonstrated the efficiency and encouraging scalability of this approach.https://www.mdpi.com/2072-4292/12/18/3069industrial heat sourcesclusteringparallel computingVIIRSheavy industrial layout
spellingShingle Yan Ma
Caihong Ma
Peng Liu
Jin Yang
Yuzhu Wang
Yueqin Zhu
Xiaoping Du
Spatial-Temporal Distribution Analysis of Industrial Heat Sources in the US with Geocoded, Tree-Based, Large-Scale Clustering
Remote Sensing
industrial heat sources
clustering
parallel computing
VIIRS
heavy industrial layout
title Spatial-Temporal Distribution Analysis of Industrial Heat Sources in the US with Geocoded, Tree-Based, Large-Scale Clustering
title_full Spatial-Temporal Distribution Analysis of Industrial Heat Sources in the US with Geocoded, Tree-Based, Large-Scale Clustering
title_fullStr Spatial-Temporal Distribution Analysis of Industrial Heat Sources in the US with Geocoded, Tree-Based, Large-Scale Clustering
title_full_unstemmed Spatial-Temporal Distribution Analysis of Industrial Heat Sources in the US with Geocoded, Tree-Based, Large-Scale Clustering
title_short Spatial-Temporal Distribution Analysis of Industrial Heat Sources in the US with Geocoded, Tree-Based, Large-Scale Clustering
title_sort spatial temporal distribution analysis of industrial heat sources in the us with geocoded tree based large scale clustering
topic industrial heat sources
clustering
parallel computing
VIIRS
heavy industrial layout
url https://www.mdpi.com/2072-4292/12/18/3069
work_keys_str_mv AT yanma spatialtemporaldistributionanalysisofindustrialheatsourcesintheuswithgeocodedtreebasedlargescaleclustering
AT caihongma spatialtemporaldistributionanalysisofindustrialheatsourcesintheuswithgeocodedtreebasedlargescaleclustering
AT pengliu spatialtemporaldistributionanalysisofindustrialheatsourcesintheuswithgeocodedtreebasedlargescaleclustering
AT jinyang spatialtemporaldistributionanalysisofindustrialheatsourcesintheuswithgeocodedtreebasedlargescaleclustering
AT yuzhuwang spatialtemporaldistributionanalysisofindustrialheatsourcesintheuswithgeocodedtreebasedlargescaleclustering
AT yueqinzhu spatialtemporaldistributionanalysisofindustrialheatsourcesintheuswithgeocodedtreebasedlargescaleclustering
AT xiaopingdu spatialtemporaldistributionanalysisofindustrialheatsourcesintheuswithgeocodedtreebasedlargescaleclustering