Information sharing in high-dimensional gene expression data for improved parameter estimation in concentration-response modelling.

In toxicological concentration-response studies, a frequent goal is the determination of an 'alert concentration', i.e. the lowest concentration where a notable change in the response in comparison to the control is observed. In high-throughput gene expression experiments, e.g. based on mi...

Full description

Bibliographic Details
Main Authors: Franziska Kappenberg, Jörg Rahnenführer
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2023-01-01
Series:PLoS ONE
Online Access:https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0293180&type=printable
_version_ 1797637553591943168
author Franziska Kappenberg
Jörg Rahnenführer
author_facet Franziska Kappenberg
Jörg Rahnenführer
author_sort Franziska Kappenberg
collection DOAJ
description In toxicological concentration-response studies, a frequent goal is the determination of an 'alert concentration', i.e. the lowest concentration where a notable change in the response in comparison to the control is observed. In high-throughput gene expression experiments, e.g. based on microarray or RNA-seq technology, concentration-response profiles can be measured for thousands of genes simultaneously. One approach for determining the alert concentration is given by fitting a parametric model to the data which allows interpolation between the tested concentrations. It is well known that the quality of a model fit improves with the number of measured data points. However, adding new replicates for existing concentrations or even several replicates for new concentrations is time-consuming and expensive. Here, we propose an empirical Bayes approach to information sharing across genes, where in essence a weighted mean of the individual estimate for one specific parameter of a fitted model and the mean of all estimates of the entire set of genes is calculated as a result. Results of a controlled plasmode simulation study show that for many genes a notable improvement in terms of the mean squared error (MSE) between estimate and true underlying value of the parameter can be observed. However, for some genes, the MSE increases, and this cannot be prevented by using a more sophisticated prior distribution in the Bayesian approach.
first_indexed 2024-03-11T12:51:02Z
format Article
id doaj.art-c6cb078f932c456b99b41455f5029951
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-03-11T12:51:02Z
publishDate 2023-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-c6cb078f932c456b99b41455f50299512023-11-04T05:32:59ZengPublic Library of Science (PLoS)PLoS ONE1932-62032023-01-011810e029318010.1371/journal.pone.0293180Information sharing in high-dimensional gene expression data for improved parameter estimation in concentration-response modelling.Franziska KappenbergJörg RahnenführerIn toxicological concentration-response studies, a frequent goal is the determination of an 'alert concentration', i.e. the lowest concentration where a notable change in the response in comparison to the control is observed. In high-throughput gene expression experiments, e.g. based on microarray or RNA-seq technology, concentration-response profiles can be measured for thousands of genes simultaneously. One approach for determining the alert concentration is given by fitting a parametric model to the data which allows interpolation between the tested concentrations. It is well known that the quality of a model fit improves with the number of measured data points. However, adding new replicates for existing concentrations or even several replicates for new concentrations is time-consuming and expensive. Here, we propose an empirical Bayes approach to information sharing across genes, where in essence a weighted mean of the individual estimate for one specific parameter of a fitted model and the mean of all estimates of the entire set of genes is calculated as a result. Results of a controlled plasmode simulation study show that for many genes a notable improvement in terms of the mean squared error (MSE) between estimate and true underlying value of the parameter can be observed. However, for some genes, the MSE increases, and this cannot be prevented by using a more sophisticated prior distribution in the Bayesian approach.https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0293180&type=printable
spellingShingle Franziska Kappenberg
Jörg Rahnenführer
Information sharing in high-dimensional gene expression data for improved parameter estimation in concentration-response modelling.
PLoS ONE
title Information sharing in high-dimensional gene expression data for improved parameter estimation in concentration-response modelling.
title_full Information sharing in high-dimensional gene expression data for improved parameter estimation in concentration-response modelling.
title_fullStr Information sharing in high-dimensional gene expression data for improved parameter estimation in concentration-response modelling.
title_full_unstemmed Information sharing in high-dimensional gene expression data for improved parameter estimation in concentration-response modelling.
title_short Information sharing in high-dimensional gene expression data for improved parameter estimation in concentration-response modelling.
title_sort information sharing in high dimensional gene expression data for improved parameter estimation in concentration response modelling
url https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0293180&type=printable
work_keys_str_mv AT franziskakappenberg informationsharinginhighdimensionalgeneexpressiondataforimprovedparameterestimationinconcentrationresponsemodelling
AT jorgrahnenfuhrer informationsharinginhighdimensionalgeneexpressiondataforimprovedparameterestimationinconcentrationresponsemodelling