Hidden molecular clues in marine sediments revealed by untargeted mass spectrometry imaging

Mass spectrometry imaging (MSI) in sedimentary archives can produce records of molecular proxies at μm-scale resolution. For example, in annually varved sediments of the Santa Barbara Basin, such a fine resolution allows deciphering sub-annual distributions of archaeal tetraether lipids, haptophyte-...

Full description

Bibliographic Details
Main Authors: Weimin Liu, Susanne Alfken, Lars Wörmer, Julius S. Lipp, Kai-Uwe Hinrichs
Format: Article
Language:English
Published: Frontiers Media S.A. 2022-08-01
Series:Frontiers in Earth Science
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/feart.2022.931157/full
_version_ 1811214529302888448
author Weimin Liu
Susanne Alfken
Lars Wörmer
Julius S. Lipp
Kai-Uwe Hinrichs
author_facet Weimin Liu
Susanne Alfken
Lars Wörmer
Julius S. Lipp
Kai-Uwe Hinrichs
author_sort Weimin Liu
collection DOAJ
description Mass spectrometry imaging (MSI) in sedimentary archives can produce records of molecular proxies at μm-scale resolution. For example, in annually varved sediments of the Santa Barbara Basin, such a fine resolution allows deciphering sub-annual distributions of archaeal tetraether lipids, haptophyte-derived alkenones, and sterols. Herein, we reported the establishment of an untargeted data processing workflow aimed at dissecting the MSI datasets and extracting information beyond that obtained by targeted analysis of known molecular proxies. The combination of MSI and the untargeted workflow not only increases the spatial resolution for molecular stratigraphy but also dramatically broadens the number and diversity of molecular signals evaluated, enabling us to discover unique molecular signatures imprinted by various biogeochemical processes. We applied the proposed workflow to two MSI datasets that were both measured on the uppermost ∼10 cm of the Santa Barbara Basin sediments while covering different mass ranges. Two matrices of 18625×293 and 18963×323 (number of spectra × number of peaks) were, respectively, extracted after peak alignment using bin-wise kernel density estimation and subsequent peak picking by peak prominence filtering combined with geochemical context-based filtering. Feature extraction by non-negative matrix factorization revealed in total 15 stable molecular clusters with distinct spatial distributions in the sediments. Each cluster typically comprised several to dozens of compounds, with the majority of compounds in each cluster likely belonging to similar chemical taxonomies. Some of these clusters can be linked to specific biogeochemical processes. For example, chlorin-like compounds are possibly related to diatom production, alkenones are related to coccolithophorid production, and steranes and long-chain fatty acids likely represent terrigenous input. Supervised learning from these data mining results further extracted molecular signatures with proxy potential that appear to be linked to specific environmental conditions inferred from historical oceanographic data. However, generalizability to other sedimentary settings will require further investigation.
first_indexed 2024-04-12T06:06:18Z
format Article
id doaj.art-9ac065d2866f4c8bbd7b5bdf6214d858
institution Directory Open Access Journal
issn 2296-6463
language English
last_indexed 2024-04-12T06:06:18Z
publishDate 2022-08-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Earth Science
spelling doaj.art-9ac065d2866f4c8bbd7b5bdf6214d8582022-12-22T03:44:51ZengFrontiers Media S.A.Frontiers in Earth Science2296-64632022-08-011010.3389/feart.2022.931157931157Hidden molecular clues in marine sediments revealed by untargeted mass spectrometry imagingWeimin LiuSusanne AlfkenLars WörmerJulius S. LippKai-Uwe HinrichsMass spectrometry imaging (MSI) in sedimentary archives can produce records of molecular proxies at μm-scale resolution. For example, in annually varved sediments of the Santa Barbara Basin, such a fine resolution allows deciphering sub-annual distributions of archaeal tetraether lipids, haptophyte-derived alkenones, and sterols. Herein, we reported the establishment of an untargeted data processing workflow aimed at dissecting the MSI datasets and extracting information beyond that obtained by targeted analysis of known molecular proxies. The combination of MSI and the untargeted workflow not only increases the spatial resolution for molecular stratigraphy but also dramatically broadens the number and diversity of molecular signals evaluated, enabling us to discover unique molecular signatures imprinted by various biogeochemical processes. We applied the proposed workflow to two MSI datasets that were both measured on the uppermost ∼10 cm of the Santa Barbara Basin sediments while covering different mass ranges. Two matrices of 18625×293 and 18963×323 (number of spectra × number of peaks) were, respectively, extracted after peak alignment using bin-wise kernel density estimation and subsequent peak picking by peak prominence filtering combined with geochemical context-based filtering. Feature extraction by non-negative matrix factorization revealed in total 15 stable molecular clusters with distinct spatial distributions in the sediments. Each cluster typically comprised several to dozens of compounds, with the majority of compounds in each cluster likely belonging to similar chemical taxonomies. Some of these clusters can be linked to specific biogeochemical processes. For example, chlorin-like compounds are possibly related to diatom production, alkenones are related to coccolithophorid production, and steranes and long-chain fatty acids likely represent terrigenous input. Supervised learning from these data mining results further extracted molecular signatures with proxy potential that appear to be linked to specific environmental conditions inferred from historical oceanographic data. However, generalizability to other sedimentary settings will require further investigation.https://www.frontiersin.org/articles/10.3389/feart.2022.931157/fullmarine sedimentssupervised learningmass spectrometry imaginguntargeted data miningmolecular signaturebiomarker discovery
spellingShingle Weimin Liu
Susanne Alfken
Lars Wörmer
Julius S. Lipp
Kai-Uwe Hinrichs
Hidden molecular clues in marine sediments revealed by untargeted mass spectrometry imaging
Frontiers in Earth Science
marine sediments
supervised learning
mass spectrometry imaging
untargeted data mining
molecular signature
biomarker discovery
title Hidden molecular clues in marine sediments revealed by untargeted mass spectrometry imaging
title_full Hidden molecular clues in marine sediments revealed by untargeted mass spectrometry imaging
title_fullStr Hidden molecular clues in marine sediments revealed by untargeted mass spectrometry imaging
title_full_unstemmed Hidden molecular clues in marine sediments revealed by untargeted mass spectrometry imaging
title_short Hidden molecular clues in marine sediments revealed by untargeted mass spectrometry imaging
title_sort hidden molecular clues in marine sediments revealed by untargeted mass spectrometry imaging
topic marine sediments
supervised learning
mass spectrometry imaging
untargeted data mining
molecular signature
biomarker discovery
url https://www.frontiersin.org/articles/10.3389/feart.2022.931157/full
work_keys_str_mv AT weiminliu hiddenmolecularcluesinmarinesedimentsrevealedbyuntargetedmassspectrometryimaging
AT susannealfken hiddenmolecularcluesinmarinesedimentsrevealedbyuntargetedmassspectrometryimaging
AT larswormer hiddenmolecularcluesinmarinesedimentsrevealedbyuntargetedmassspectrometryimaging
AT juliusslipp hiddenmolecularcluesinmarinesedimentsrevealedbyuntargetedmassspectrometryimaging
AT kaiuwehinrichs hiddenmolecularcluesinmarinesedimentsrevealedbyuntargetedmassspectrometryimaging