Retrieve of total suspended matter in typical lakes in China based on broad bandwidth satellite data: Random forest model with Forel-Ule Index

Total Suspended Matter is the core parameter of water color remote sensing and the important indicator for water quality evaluation of lakes. Rapid and high-precision monitoring of TSM is an important guarantee for water quality remote-sensing applications. China has launched many broad-bandwidth re...

Full description

Bibliographic Details
Main Authors: Mingjian Zhai, Xiang Zhou, Zui Tao, Tingting Lv, Hongming Zhang, Ruoxi Li, Yuxuan Huang
Format: Article
Language:English
Published: Frontiers Media S.A. 2023-02-01
Series:Frontiers in Environmental Science
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fenvs.2023.1132346/full
_version_ 1811167241855565824
author Mingjian Zhai
Mingjian Zhai
Xiang Zhou
Zui Tao
Tingting Lv
Hongming Zhang
Ruoxi Li
Ruoxi Li
Yuxuan Huang
Yuxuan Huang
author_facet Mingjian Zhai
Mingjian Zhai
Xiang Zhou
Zui Tao
Tingting Lv
Hongming Zhang
Ruoxi Li
Ruoxi Li
Yuxuan Huang
Yuxuan Huang
author_sort Mingjian Zhai
collection DOAJ
description Total Suspended Matter is the core parameter of water color remote sensing and the important indicator for water quality evaluation of lakes. Rapid and high-precision monitoring of TSM is an important guarantee for water quality remote-sensing applications. China has launched many broad-bandwidth remote sensing satellites, all of which have similar bandwidth. The coordinated observation of multiple satellites can effectively meet the large-scale and high-frequency dynamic monitoring requirements of TSM concentration in lakes. This study proposed a machine-learning model to retrieve the TSM concentration from broad bandwidth satellites. The reliability and accuracy of various retrieve models (i.e., linear regression model, support vector regression model, random forest model, and back propagation neural networks model) were evaluated through the in-situ datasets of TSM concentration in lakes. The RF model was selected as the retrieved model of TSM concentration using broad bandwidth satellites. The results showed that 1) Compared with four machine learning models, the RF model can provide better performance (R2=0.88, Mean Absolute Percentage Error (MAPE) = 22.5%). Similarly, compared with the documented six TSM retrieve model, the RF retrieve model also has substantial advantages. 2) the Forel-Ule Index (FUI) can effectively enhance the precision and accuracy of the TSM retrieve model. 3) The RF model has good generalization ability and accuracy in the validation datasets (Lake Chagan: MAPE = 3.7%, Lake Changdang: MAPE = 4.3%). 4) The RF model was applied to the broad bandwidth satellites retrieve of TSM concentrations in Lake Bosten, Lake Chagan, and Lake Changdang, and the MAPEs were 5.3%, 8.1%, and 12.1%, respectively. This study showed that the RF model could effectively improve the retrieve performance and generalization ability of the broad bandwidth satellite’s TSM concentration, which meets the accuracy requirements of high-frequency dynamic monitoring of TSM concentration.
first_indexed 2024-04-10T16:05:59Z
format Article
id doaj.art-73533bface644d92bb332976e5a22c38
institution Directory Open Access Journal
issn 2296-665X
language English
last_indexed 2024-04-10T16:05:59Z
publishDate 2023-02-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Environmental Science
spelling doaj.art-73533bface644d92bb332976e5a22c382023-02-10T05:10:55ZengFrontiers Media S.A.Frontiers in Environmental Science2296-665X2023-02-011110.3389/fenvs.2023.11323461132346Retrieve of total suspended matter in typical lakes in China based on broad bandwidth satellite data: Random forest model with Forel-Ule IndexMingjian Zhai0Mingjian Zhai1Xiang Zhou2Zui Tao3Tingting Lv4Hongming Zhang5Ruoxi Li6Ruoxi Li7Yuxuan Huang8Yuxuan Huang9Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaSchool of University of Chinese Academy of Sciences, Beijing, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaSchool of University of Chinese Academy of Sciences, Beijing, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaSchool of University of Chinese Academy of Sciences, Beijing, ChinaTotal Suspended Matter is the core parameter of water color remote sensing and the important indicator for water quality evaluation of lakes. Rapid and high-precision monitoring of TSM is an important guarantee for water quality remote-sensing applications. China has launched many broad-bandwidth remote sensing satellites, all of which have similar bandwidth. The coordinated observation of multiple satellites can effectively meet the large-scale and high-frequency dynamic monitoring requirements of TSM concentration in lakes. This study proposed a machine-learning model to retrieve the TSM concentration from broad bandwidth satellites. The reliability and accuracy of various retrieve models (i.e., linear regression model, support vector regression model, random forest model, and back propagation neural networks model) were evaluated through the in-situ datasets of TSM concentration in lakes. The RF model was selected as the retrieved model of TSM concentration using broad bandwidth satellites. The results showed that 1) Compared with four machine learning models, the RF model can provide better performance (R2=0.88, Mean Absolute Percentage Error (MAPE) = 22.5%). Similarly, compared with the documented six TSM retrieve model, the RF retrieve model also has substantial advantages. 2) the Forel-Ule Index (FUI) can effectively enhance the precision and accuracy of the TSM retrieve model. 3) The RF model has good generalization ability and accuracy in the validation datasets (Lake Chagan: MAPE = 3.7%, Lake Changdang: MAPE = 4.3%). 4) The RF model was applied to the broad bandwidth satellites retrieve of TSM concentrations in Lake Bosten, Lake Chagan, and Lake Changdang, and the MAPEs were 5.3%, 8.1%, and 12.1%, respectively. This study showed that the RF model could effectively improve the retrieve performance and generalization ability of the broad bandwidth satellite’s TSM concentration, which meets the accuracy requirements of high-frequency dynamic monitoring of TSM concentration.https://www.frontiersin.org/articles/10.3389/fenvs.2023.1132346/fulltotal suspended matterbroad bandwidth satelliteChinese lakesmachine learningForel-Ule Index
spellingShingle Mingjian Zhai
Mingjian Zhai
Xiang Zhou
Zui Tao
Tingting Lv
Hongming Zhang
Ruoxi Li
Ruoxi Li
Yuxuan Huang
Yuxuan Huang
Retrieve of total suspended matter in typical lakes in China based on broad bandwidth satellite data: Random forest model with Forel-Ule Index
Frontiers in Environmental Science
total suspended matter
broad bandwidth satellite
Chinese lakes
machine learning
Forel-Ule Index
title Retrieve of total suspended matter in typical lakes in China based on broad bandwidth satellite data: Random forest model with Forel-Ule Index
title_full Retrieve of total suspended matter in typical lakes in China based on broad bandwidth satellite data: Random forest model with Forel-Ule Index
title_fullStr Retrieve of total suspended matter in typical lakes in China based on broad bandwidth satellite data: Random forest model with Forel-Ule Index
title_full_unstemmed Retrieve of total suspended matter in typical lakes in China based on broad bandwidth satellite data: Random forest model with Forel-Ule Index
title_short Retrieve of total suspended matter in typical lakes in China based on broad bandwidth satellite data: Random forest model with Forel-Ule Index
title_sort retrieve of total suspended matter in typical lakes in china based on broad bandwidth satellite data random forest model with forel ule index
topic total suspended matter
broad bandwidth satellite
Chinese lakes
machine learning
Forel-Ule Index
url https://www.frontiersin.org/articles/10.3389/fenvs.2023.1132346/full
work_keys_str_mv AT mingjianzhai retrieveoftotalsuspendedmatterintypicallakesinchinabasedonbroadbandwidthsatellitedatarandomforestmodelwithforeluleindex
AT mingjianzhai retrieveoftotalsuspendedmatterintypicallakesinchinabasedonbroadbandwidthsatellitedatarandomforestmodelwithforeluleindex
AT xiangzhou retrieveoftotalsuspendedmatterintypicallakesinchinabasedonbroadbandwidthsatellitedatarandomforestmodelwithforeluleindex
AT zuitao retrieveoftotalsuspendedmatterintypicallakesinchinabasedonbroadbandwidthsatellitedatarandomforestmodelwithforeluleindex
AT tingtinglv retrieveoftotalsuspendedmatterintypicallakesinchinabasedonbroadbandwidthsatellitedatarandomforestmodelwithforeluleindex
AT hongmingzhang retrieveoftotalsuspendedmatterintypicallakesinchinabasedonbroadbandwidthsatellitedatarandomforestmodelwithforeluleindex
AT ruoxili retrieveoftotalsuspendedmatterintypicallakesinchinabasedonbroadbandwidthsatellitedatarandomforestmodelwithforeluleindex
AT ruoxili retrieveoftotalsuspendedmatterintypicallakesinchinabasedonbroadbandwidthsatellitedatarandomforestmodelwithforeluleindex
AT yuxuanhuang retrieveoftotalsuspendedmatterintypicallakesinchinabasedonbroadbandwidthsatellitedatarandomforestmodelwithforeluleindex
AT yuxuanhuang retrieveoftotalsuspendedmatterintypicallakesinchinabasedonbroadbandwidthsatellitedatarandomforestmodelwithforeluleindex