SOM clustering of 21-year data of a small pristine boreal lake

In order to improve our understanding of the connections between the biological processes and abiotic factors, we clustered complex long-term ecological data with the self-organizing map (SOM) technique. The available 21-year long (1990–2010) data set from a small pristine humic lake, in southern Fi...

Full description

Bibliographic Details
Main Authors: Voutilainen Ari, Arvola Lauri
Format: Article
Language:English
Published: EDP Sciences 2017-01-01
Series:Knowledge and Management of Aquatic Ecosystems
Subjects:
Online Access:https://doi.org/10.1051/kmae/2017027
_version_ 1818152914629165056
author Voutilainen Ari
Arvola Lauri
author_facet Voutilainen Ari
Arvola Lauri
author_sort Voutilainen Ari
collection DOAJ
description In order to improve our understanding of the connections between the biological processes and abiotic factors, we clustered complex long-term ecological data with the self-organizing map (SOM) technique. The available 21-year long (1990–2010) data set from a small pristine humic lake, in southern Finland, consisted of 27 meteorological, physical, chemical, and biological variables. The SOM grouped the data into three categories of which the first one was the largest with 12 variables, including metabolic processes, dissolved oxygen, total nitrogen and phosphorus, chlorophyll a, and taxonomical groups of plankton known to exist in spring. The second cluster comprised of water temperature and precipitation together with cyanobacteria, algae, rotifers, and crustacean zooplankton, an association emphasized with summer. The third cluster was consisted of six physical and chemical variables linked to autumn, and to the effects of inflow and/or water column mixing. SOM is a useful method for grouping the variables of such a large multi-dimensional data set, especially, when the purpose is to draw comprehensive conclusions rather than to search for associations across sporadic variables. Sampling should minimize the number of missing values. Even flexible statistical techniques, such as SOM, are vulnerable to biased results due to incomplete data.
first_indexed 2024-12-11T14:02:17Z
format Article
id doaj.art-7bf56d7821744cb685b3c1483f0c60a7
institution Directory Open Access Journal
issn 1961-9502
language English
last_indexed 2024-12-11T14:02:17Z
publishDate 2017-01-01
publisher EDP Sciences
record_format Article
series Knowledge and Management of Aquatic Ecosystems
spelling doaj.art-7bf56d7821744cb685b3c1483f0c60a72022-12-22T01:03:49ZengEDP SciencesKnowledge and Management of Aquatic Ecosystems1961-95022017-01-0104183610.1051/kmae/2017027kmae170041SOM clustering of 21-year data of a small pristine boreal lakeVoutilainen AriArvola LauriIn order to improve our understanding of the connections between the biological processes and abiotic factors, we clustered complex long-term ecological data with the self-organizing map (SOM) technique. The available 21-year long (1990–2010) data set from a small pristine humic lake, in southern Finland, consisted of 27 meteorological, physical, chemical, and biological variables. The SOM grouped the data into three categories of which the first one was the largest with 12 variables, including metabolic processes, dissolved oxygen, total nitrogen and phosphorus, chlorophyll a, and taxonomical groups of plankton known to exist in spring. The second cluster comprised of water temperature and precipitation together with cyanobacteria, algae, rotifers, and crustacean zooplankton, an association emphasized with summer. The third cluster was consisted of six physical and chemical variables linked to autumn, and to the effects of inflow and/or water column mixing. SOM is a useful method for grouping the variables of such a large multi-dimensional data set, especially, when the purpose is to draw comprehensive conclusions rather than to search for associations across sporadic variables. Sampling should minimize the number of missing values. Even flexible statistical techniques, such as SOM, are vulnerable to biased results due to incomplete data.https://doi.org/10.1051/kmae/2017027boreal lakedata partitioningecological complexitylong-term dataself-organizing map
spellingShingle Voutilainen Ari
Arvola Lauri
SOM clustering of 21-year data of a small pristine boreal lake
Knowledge and Management of Aquatic Ecosystems
boreal lake
data partitioning
ecological complexity
long-term data
self-organizing map
title SOM clustering of 21-year data of a small pristine boreal lake
title_full SOM clustering of 21-year data of a small pristine boreal lake
title_fullStr SOM clustering of 21-year data of a small pristine boreal lake
title_full_unstemmed SOM clustering of 21-year data of a small pristine boreal lake
title_short SOM clustering of 21-year data of a small pristine boreal lake
title_sort som clustering of 21 year data of a small pristine boreal lake
topic boreal lake
data partitioning
ecological complexity
long-term data
self-organizing map
url https://doi.org/10.1051/kmae/2017027
work_keys_str_mv AT voutilainenari somclusteringof21yeardataofasmallpristineboreallake
AT arvolalauri somclusteringof21yeardataofasmallpristineboreallake