Multi-Gene Genetic Programming Regression Model for Prediction of Transient Storage Model Parameters in Natural Rivers

A Transient Storage Model (TSM), which considers the storage exchange process that induces an abnormal mixing phenomenon, has been widely used to analyze solute transport in natural rivers. The primary step in applying TSM is a calibration of four key parameters: flow zone dispersion coefficient (&l...

Full description

Bibliographic Details
Main Authors: Hyoseob Noh, Siyoon Kwon, Il Won Seo, Donghae Baek, Sung Hyun Jung
Format: Article
Language:English
Published: MDPI AG 2020-12-01
Series:Water
Subjects:
Online Access:https://www.mdpi.com/2073-4441/13/1/76
_version_ 1797542776519262208
author Hyoseob Noh
Siyoon Kwon
Il Won Seo
Donghae Baek
Sung Hyun Jung
author_facet Hyoseob Noh
Siyoon Kwon
Il Won Seo
Donghae Baek
Sung Hyun Jung
author_sort Hyoseob Noh
collection DOAJ
description A Transient Storage Model (TSM), which considers the storage exchange process that induces an abnormal mixing phenomenon, has been widely used to analyze solute transport in natural rivers. The primary step in applying TSM is a calibration of four key parameters: flow zone dispersion coefficient (<inline-formula><math display="inline"><semantics><msub><mi>K</mi><mi>f</mi></msub></semantics></math></inline-formula>), main flow zone area (<inline-formula><math display="inline"><semantics><msub><mi>A</mi><mi>f</mi></msub></semantics></math></inline-formula>), storage zone area (<inline-formula><math display="inline"><semantics><msub><mi>A</mi><mi>s</mi></msub></semantics></math></inline-formula>), and storage exchange rate (<inline-formula><math display="inline"><semantics><mi>α</mi></semantics></math></inline-formula>); by fitting the measured Breakthrough Curves (BTCs). In this study, to overcome the costly tracer tests necessary for parameter calibration, two dimensionless empirical models were derived to estimate TSM parameters, using multi-gene genetic programming (MGGP) and principal components regression (PCR). A total of 128 datasets with complete variables from 14 published papers were chosen from an extensive meta-analysis and were applied to derivations. The performance comparison revealed that the MGGP-based equations yielded superior prediction results. According to TSM analysis of field experiment data from Cheongmi Creek, South Korea, although all assessed empirical equations produced acceptable BTCs, the MGGP model was superior to the other models in parameter values. The predicted BTCs obtained by the empirical models in some highly complicated reaches were biased due to misprediction of <inline-formula><math display="inline"><semantics><msub><mi>A</mi><mi>f</mi></msub></semantics></math></inline-formula>. Sensitivity analyses of MGGP models showed that the sinuosity is the most influential factor in <inline-formula><math display="inline"><semantics><msub><mi>K</mi><mi>f</mi></msub></semantics></math></inline-formula>, while <inline-formula><math display="inline"><semantics><msub><mi>A</mi><mi>f</mi></msub></semantics></math></inline-formula>, <inline-formula><math display="inline"><semantics><msub><mi>A</mi><mi>s</mi></msub></semantics></math></inline-formula>, and <inline-formula><math display="inline"><semantics><mi>α</mi></semantics></math></inline-formula>, are more sensitive to <inline-formula><math display="inline"><semantics><mrow><mi>U</mi><mo>/</mo><msub><mi>U</mi><mo>*</mo></msub></mrow></semantics></math></inline-formula>. This study proves that the MGGP-based model can be used for economic TSM analysis, thus providing an alternative option to direct calibration and the inverse modeling initial parameters.
first_indexed 2024-03-10T13:35:18Z
format Article
id doaj.art-e226e728f1a1408c8c1325a1f6a84cfa
institution Directory Open Access Journal
issn 2073-4441
language English
last_indexed 2024-03-10T13:35:18Z
publishDate 2020-12-01
publisher MDPI AG
record_format Article
series Water
spelling doaj.art-e226e728f1a1408c8c1325a1f6a84cfa2023-11-21T07:37:17ZengMDPI AGWater2073-44412020-12-011317610.3390/w13010076Multi-Gene Genetic Programming Regression Model for Prediction of Transient Storage Model Parameters in Natural RiversHyoseob Noh0Siyoon Kwon1Il Won Seo2Donghae Baek3Sung Hyun Jung4Department of Civil and Environmental Engineering, Seoul National University, Seoul 08826, KoreaDepartment of Civil and Environmental Engineering, Seoul National University, Seoul 08826, KoreaDepartment of Civil and Environmental Engineering, Seoul National University, Seoul 08826, KoreaKorea Institute of Civil Engineering and Building Technology, 283, Goyangdae-ro, Ilsanseo-gu, Goyang-si 10223, Gyeonggi-do, KoreaDepartment of Civil and Environmental Engineering, Seoul National University, Seoul 08826, KoreaA Transient Storage Model (TSM), which considers the storage exchange process that induces an abnormal mixing phenomenon, has been widely used to analyze solute transport in natural rivers. The primary step in applying TSM is a calibration of four key parameters: flow zone dispersion coefficient (<inline-formula><math display="inline"><semantics><msub><mi>K</mi><mi>f</mi></msub></semantics></math></inline-formula>), main flow zone area (<inline-formula><math display="inline"><semantics><msub><mi>A</mi><mi>f</mi></msub></semantics></math></inline-formula>), storage zone area (<inline-formula><math display="inline"><semantics><msub><mi>A</mi><mi>s</mi></msub></semantics></math></inline-formula>), and storage exchange rate (<inline-formula><math display="inline"><semantics><mi>α</mi></semantics></math></inline-formula>); by fitting the measured Breakthrough Curves (BTCs). In this study, to overcome the costly tracer tests necessary for parameter calibration, two dimensionless empirical models were derived to estimate TSM parameters, using multi-gene genetic programming (MGGP) and principal components regression (PCR). A total of 128 datasets with complete variables from 14 published papers were chosen from an extensive meta-analysis and were applied to derivations. The performance comparison revealed that the MGGP-based equations yielded superior prediction results. According to TSM analysis of field experiment data from Cheongmi Creek, South Korea, although all assessed empirical equations produced acceptable BTCs, the MGGP model was superior to the other models in parameter values. The predicted BTCs obtained by the empirical models in some highly complicated reaches were biased due to misprediction of <inline-formula><math display="inline"><semantics><msub><mi>A</mi><mi>f</mi></msub></semantics></math></inline-formula>. Sensitivity analyses of MGGP models showed that the sinuosity is the most influential factor in <inline-formula><math display="inline"><semantics><msub><mi>K</mi><mi>f</mi></msub></semantics></math></inline-formula>, while <inline-formula><math display="inline"><semantics><msub><mi>A</mi><mi>f</mi></msub></semantics></math></inline-formula>, <inline-formula><math display="inline"><semantics><msub><mi>A</mi><mi>s</mi></msub></semantics></math></inline-formula>, and <inline-formula><math display="inline"><semantics><mi>α</mi></semantics></math></inline-formula>, are more sensitive to <inline-formula><math display="inline"><semantics><mrow><mi>U</mi><mo>/</mo><msub><mi>U</mi><mo>*</mo></msub></mrow></semantics></math></inline-formula>. This study proves that the MGGP-based model can be used for economic TSM analysis, thus providing an alternative option to direct calibration and the inverse modeling initial parameters.https://www.mdpi.com/2073-4441/13/1/76hydromorphic variableMultigene Genetic Programming (MGGP)sensitivity analysissolute transportTransient Storage Model (TSM)TSM parameter estimation
spellingShingle Hyoseob Noh
Siyoon Kwon
Il Won Seo
Donghae Baek
Sung Hyun Jung
Multi-Gene Genetic Programming Regression Model for Prediction of Transient Storage Model Parameters in Natural Rivers
Water
hydromorphic variable
Multigene Genetic Programming (MGGP)
sensitivity analysis
solute transport
Transient Storage Model (TSM)
TSM parameter estimation
title Multi-Gene Genetic Programming Regression Model for Prediction of Transient Storage Model Parameters in Natural Rivers
title_full Multi-Gene Genetic Programming Regression Model for Prediction of Transient Storage Model Parameters in Natural Rivers
title_fullStr Multi-Gene Genetic Programming Regression Model for Prediction of Transient Storage Model Parameters in Natural Rivers
title_full_unstemmed Multi-Gene Genetic Programming Regression Model for Prediction of Transient Storage Model Parameters in Natural Rivers
title_short Multi-Gene Genetic Programming Regression Model for Prediction of Transient Storage Model Parameters in Natural Rivers
title_sort multi gene genetic programming regression model for prediction of transient storage model parameters in natural rivers
topic hydromorphic variable
Multigene Genetic Programming (MGGP)
sensitivity analysis
solute transport
Transient Storage Model (TSM)
TSM parameter estimation
url https://www.mdpi.com/2073-4441/13/1/76
work_keys_str_mv AT hyoseobnoh multigenegeneticprogrammingregressionmodelforpredictionoftransientstoragemodelparametersinnaturalrivers
AT siyoonkwon multigenegeneticprogrammingregressionmodelforpredictionoftransientstoragemodelparametersinnaturalrivers
AT ilwonseo multigenegeneticprogrammingregressionmodelforpredictionoftransientstoragemodelparametersinnaturalrivers
AT donghaebaek multigenegeneticprogrammingregressionmodelforpredictionoftransientstoragemodelparametersinnaturalrivers
AT sunghyunjung multigenegeneticprogrammingregressionmodelforpredictionoftransientstoragemodelparametersinnaturalrivers