Information sharing in high-dimensional gene expression data for improved parameter estimation in concentration-response modelling.
In toxicological concentration-response studies, a frequent goal is the determination of an 'alert concentration', i.e. the lowest concentration where a notable change in the response in comparison to the control is observed. In high-throughput gene expression experiments, e.g. based on mi...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2023-01-01
|
Series: | PLoS ONE |
Online Access: | https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0293180&type=printable |
_version_ | 1797637553591943168 |
---|---|
author | Franziska Kappenberg Jörg Rahnenführer |
author_facet | Franziska Kappenberg Jörg Rahnenführer |
author_sort | Franziska Kappenberg |
collection | DOAJ |
description | In toxicological concentration-response studies, a frequent goal is the determination of an 'alert concentration', i.e. the lowest concentration where a notable change in the response in comparison to the control is observed. In high-throughput gene expression experiments, e.g. based on microarray or RNA-seq technology, concentration-response profiles can be measured for thousands of genes simultaneously. One approach for determining the alert concentration is given by fitting a parametric model to the data which allows interpolation between the tested concentrations. It is well known that the quality of a model fit improves with the number of measured data points. However, adding new replicates for existing concentrations or even several replicates for new concentrations is time-consuming and expensive. Here, we propose an empirical Bayes approach to information sharing across genes, where in essence a weighted mean of the individual estimate for one specific parameter of a fitted model and the mean of all estimates of the entire set of genes is calculated as a result. Results of a controlled plasmode simulation study show that for many genes a notable improvement in terms of the mean squared error (MSE) between estimate and true underlying value of the parameter can be observed. However, for some genes, the MSE increases, and this cannot be prevented by using a more sophisticated prior distribution in the Bayesian approach. |
first_indexed | 2024-03-11T12:51:02Z |
format | Article |
id | doaj.art-c6cb078f932c456b99b41455f5029951 |
institution | Directory Open Access Journal |
issn | 1932-6203 |
language | English |
last_indexed | 2024-03-11T12:51:02Z |
publishDate | 2023-01-01 |
publisher | Public Library of Science (PLoS) |
record_format | Article |
series | PLoS ONE |
spelling | doaj.art-c6cb078f932c456b99b41455f50299512023-11-04T05:32:59ZengPublic Library of Science (PLoS)PLoS ONE1932-62032023-01-011810e029318010.1371/journal.pone.0293180Information sharing in high-dimensional gene expression data for improved parameter estimation in concentration-response modelling.Franziska KappenbergJörg RahnenführerIn toxicological concentration-response studies, a frequent goal is the determination of an 'alert concentration', i.e. the lowest concentration where a notable change in the response in comparison to the control is observed. In high-throughput gene expression experiments, e.g. based on microarray or RNA-seq technology, concentration-response profiles can be measured for thousands of genes simultaneously. One approach for determining the alert concentration is given by fitting a parametric model to the data which allows interpolation between the tested concentrations. It is well known that the quality of a model fit improves with the number of measured data points. However, adding new replicates for existing concentrations or even several replicates for new concentrations is time-consuming and expensive. Here, we propose an empirical Bayes approach to information sharing across genes, where in essence a weighted mean of the individual estimate for one specific parameter of a fitted model and the mean of all estimates of the entire set of genes is calculated as a result. Results of a controlled plasmode simulation study show that for many genes a notable improvement in terms of the mean squared error (MSE) between estimate and true underlying value of the parameter can be observed. However, for some genes, the MSE increases, and this cannot be prevented by using a more sophisticated prior distribution in the Bayesian approach.https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0293180&type=printable |
spellingShingle | Franziska Kappenberg Jörg Rahnenführer Information sharing in high-dimensional gene expression data for improved parameter estimation in concentration-response modelling. PLoS ONE |
title | Information sharing in high-dimensional gene expression data for improved parameter estimation in concentration-response modelling. |
title_full | Information sharing in high-dimensional gene expression data for improved parameter estimation in concentration-response modelling. |
title_fullStr | Information sharing in high-dimensional gene expression data for improved parameter estimation in concentration-response modelling. |
title_full_unstemmed | Information sharing in high-dimensional gene expression data for improved parameter estimation in concentration-response modelling. |
title_short | Information sharing in high-dimensional gene expression data for improved parameter estimation in concentration-response modelling. |
title_sort | information sharing in high dimensional gene expression data for improved parameter estimation in concentration response modelling |
url | https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0293180&type=printable |
work_keys_str_mv | AT franziskakappenberg informationsharinginhighdimensionalgeneexpressiondataforimprovedparameterestimationinconcentrationresponsemodelling AT jorgrahnenfuhrer informationsharinginhighdimensionalgeneexpressiondataforimprovedparameterestimationinconcentrationresponsemodelling |