Representing high throughput expression profiles via perturbation barcodes reveals compound targets.

High throughput mRNA expression profiling can be used to characterize the response of cell culture models to perturbations such as pharmacologic modulators and genetic perturbations. As profiling campaigns expand in scope, it is important to homogenize, summarize, and analyze the resulting data in a...

Full description

Bibliographic Details
Main Authors: Tracey M Filzen, Peter S Kutchukian, Jeffrey D Hermes, Jing Li, Matthew Tudor
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-02-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC5300121?pdf=render
_version_ 1818036302952529920
author Tracey M Filzen
Peter S Kutchukian
Jeffrey D Hermes
Jing Li
Matthew Tudor
author_facet Tracey M Filzen
Peter S Kutchukian
Jeffrey D Hermes
Jing Li
Matthew Tudor
author_sort Tracey M Filzen
collection DOAJ
description High throughput mRNA expression profiling can be used to characterize the response of cell culture models to perturbations such as pharmacologic modulators and genetic perturbations. As profiling campaigns expand in scope, it is important to homogenize, summarize, and analyze the resulting data in a manner that captures significant biological signals in spite of various noise sources such as batch effects and stochastic variation. We used the L1000 platform for large-scale profiling of 978 representative genes across thousands of compound treatments. Here, a method is described that uses deep learning techniques to convert the expression changes of the landmark genes into a perturbation barcode that reveals important features of the underlying data, performing better than the raw data in revealing important biological insights. The barcode captures compound structure and target information, and predicts a compound's high throughput screening promiscuity, to a higher degree than the original data measurements, indicating that the approach uncovers underlying factors of the expression data that are otherwise entangled or masked by noise. Furthermore, we demonstrate that visualizations derived from the perturbation barcode can be used to more sensitively assign functions to unknown compounds through a guilt-by-association approach, which we use to predict and experimentally validate the activity of compounds on the MAPK pathway. The demonstrated application of deep metric learning to large-scale chemical genetics projects highlights the utility of this and related approaches to the extraction of insights and testable hypotheses from big, sometimes noisy data.
first_indexed 2024-12-10T07:08:48Z
format Article
id doaj.art-9ef656e486684a0cb10a586c64ff5b5d
institution Directory Open Access Journal
issn 1553-734X
1553-7358
language English
last_indexed 2024-12-10T07:08:48Z
publishDate 2017-02-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Computational Biology
spelling doaj.art-9ef656e486684a0cb10a586c64ff5b5d2022-12-22T01:58:07ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582017-02-01132e100533510.1371/journal.pcbi.1005335Representing high throughput expression profiles via perturbation barcodes reveals compound targets.Tracey M FilzenPeter S KutchukianJeffrey D HermesJing LiMatthew TudorHigh throughput mRNA expression profiling can be used to characterize the response of cell culture models to perturbations such as pharmacologic modulators and genetic perturbations. As profiling campaigns expand in scope, it is important to homogenize, summarize, and analyze the resulting data in a manner that captures significant biological signals in spite of various noise sources such as batch effects and stochastic variation. We used the L1000 platform for large-scale profiling of 978 representative genes across thousands of compound treatments. Here, a method is described that uses deep learning techniques to convert the expression changes of the landmark genes into a perturbation barcode that reveals important features of the underlying data, performing better than the raw data in revealing important biological insights. The barcode captures compound structure and target information, and predicts a compound's high throughput screening promiscuity, to a higher degree than the original data measurements, indicating that the approach uncovers underlying factors of the expression data that are otherwise entangled or masked by noise. Furthermore, we demonstrate that visualizations derived from the perturbation barcode can be used to more sensitively assign functions to unknown compounds through a guilt-by-association approach, which we use to predict and experimentally validate the activity of compounds on the MAPK pathway. The demonstrated application of deep metric learning to large-scale chemical genetics projects highlights the utility of this and related approaches to the extraction of insights and testable hypotheses from big, sometimes noisy data.http://europepmc.org/articles/PMC5300121?pdf=render
spellingShingle Tracey M Filzen
Peter S Kutchukian
Jeffrey D Hermes
Jing Li
Matthew Tudor
Representing high throughput expression profiles via perturbation barcodes reveals compound targets.
PLoS Computational Biology
title Representing high throughput expression profiles via perturbation barcodes reveals compound targets.
title_full Representing high throughput expression profiles via perturbation barcodes reveals compound targets.
title_fullStr Representing high throughput expression profiles via perturbation barcodes reveals compound targets.
title_full_unstemmed Representing high throughput expression profiles via perturbation barcodes reveals compound targets.
title_short Representing high throughput expression profiles via perturbation barcodes reveals compound targets.
title_sort representing high throughput expression profiles via perturbation barcodes reveals compound targets
url http://europepmc.org/articles/PMC5300121?pdf=render
work_keys_str_mv AT traceymfilzen representinghighthroughputexpressionprofilesviaperturbationbarcodesrevealscompoundtargets
AT peterskutchukian representinghighthroughputexpressionprofilesviaperturbationbarcodesrevealscompoundtargets
AT jeffreydhermes representinghighthroughputexpressionprofilesviaperturbationbarcodesrevealscompoundtargets
AT jingli representinghighthroughputexpressionprofilesviaperturbationbarcodesrevealscompoundtargets
AT matthewtudor representinghighthroughputexpressionprofilesviaperturbationbarcodesrevealscompoundtargets