Visualizing complex feature interactions and feature sharing in genomic deep neural networks

BACKGROUND: Visualization tools for deep learning models typically focus on discovering key input features without considering how such low level features are combined in intermediate layers to make decisions. Moreover, many of these methods examine a network’s response to specific input examples th...

Full description

Bibliographic Details
Main Authors: Liu, Ge, Zeng, Haoyang, Gifford, David K
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Language:English
Published: BioMed Central 2020
Online Access:https://hdl.handle.net/1721.1/126253
_version_ 1811090306547843072
author Liu, Ge
Zeng, Haoyang
Gifford, David K
author2 Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
author_facet Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Liu, Ge
Zeng, Haoyang
Gifford, David K
author_sort Liu, Ge
collection MIT
description BACKGROUND: Visualization tools for deep learning models typically focus on discovering key input features without considering how such low level features are combined in intermediate layers to make decisions. Moreover, many of these methods examine a network’s response to specific input examples that may be insufficient to reveal the complexity of model decision making. RESULTS: We present DeepResolve, an analysis framework for deep convolutional models of genome function that visualizes how input features contribute individually and combinatorially to network decisions. Unlike other methods, DeepResolve does not depend upon the analysis of a predefined set of inputs. Rather, it uses gradient ascent to stochastically explore intermediate feature maps to 1) discover important features, 2) visualize their contribution and interaction patterns, and 3) analyze feature sharing across tasks that suggests shared biological mechanism. We demonstrate the visualization of decision making using our proposed method on deep neural networks trained on both experimental and synthetic data. DeepResolve is competitive with existing visualization tools in discovering key sequence features, and identifies certain negative features and non-additive feature interactions that are not easily observed with existing tools. It also recovers similarities between poorly correlated classes which are not observed by traditional methods. DeepResolve reveals that DeepSEA’s learned decision structure is shared across genome annotations including histone marks, DNase hypersensitivity, and transcription factor binding. We identify groups of TFs that suggest known shared biological mechanism, and recover correlation between DNA hypersensitivities and TF/Chromatin marks. CONCLUSIONS: DeepResolve is capable of visualizing complex feature contribution patterns and feature interactions that contribute to decision making in genomic deep convolutional networks. It also recovers feature sharing and class similarities which suggest interesting biological mechanisms. DeepResolve is compatible with existing visualization tools and provides complementary insights.
first_indexed 2024-09-23T14:41:56Z
format Article
id mit-1721.1/126253
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T14:41:56Z
publishDate 2020
publisher BioMed Central
record_format dspace
spelling mit-1721.1/1262532022-10-01T22:05:37Z Visualizing complex feature interactions and feature sharing in genomic deep neural networks Liu, Ge Zeng, Haoyang Gifford, David K Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory BACKGROUND: Visualization tools for deep learning models typically focus on discovering key input features without considering how such low level features are combined in intermediate layers to make decisions. Moreover, many of these methods examine a network’s response to specific input examples that may be insufficient to reveal the complexity of model decision making. RESULTS: We present DeepResolve, an analysis framework for deep convolutional models of genome function that visualizes how input features contribute individually and combinatorially to network decisions. Unlike other methods, DeepResolve does not depend upon the analysis of a predefined set of inputs. Rather, it uses gradient ascent to stochastically explore intermediate feature maps to 1) discover important features, 2) visualize their contribution and interaction patterns, and 3) analyze feature sharing across tasks that suggests shared biological mechanism. We demonstrate the visualization of decision making using our proposed method on deep neural networks trained on both experimental and synthetic data. DeepResolve is competitive with existing visualization tools in discovering key sequence features, and identifies certain negative features and non-additive feature interactions that are not easily observed with existing tools. It also recovers similarities between poorly correlated classes which are not observed by traditional methods. DeepResolve reveals that DeepSEA’s learned decision structure is shared across genome annotations including histone marks, DNase hypersensitivity, and transcription factor binding. We identify groups of TFs that suggest known shared biological mechanism, and recover correlation between DNA hypersensitivities and TF/Chromatin marks. CONCLUSIONS: DeepResolve is capable of visualizing complex feature contribution patterns and feature interactions that contribute to decision making in genomic deep convolutional networks. It also recovers feature sharing and class similarities which suggest interesting biological mechanisms. DeepResolve is compatible with existing visualization tools and provides complementary insights. NIH (Grants U01HG007037 and R01CA218094) 2020-07-17T20:33:32Z 2020-07-17T20:33:32Z 2019-07-19 2020-06-26T11:02:07Z Article http://purl.org/eprint/type/JournalArticle 1471-2105 https://hdl.handle.net/1721.1/126253 Liu, Ge, Haoyang Zeng, and David K. Gifford. "Visualizing complex feature interactions and feature sharing in genomic deep neural networks." BMC Bioinformatics 20 (July 2019): no. 401 doi 10.1186/s12859-019-2957-4 ©2019 Author(s) en 10.1186/s12859-019-2957-4 BMC Bioinformatics Creative Commons Attribution https://creativecommons.org/licenses/by/4.0/ The Author(s) application/pdf BioMed Central BioMed Central
spellingShingle Liu, Ge
Zeng, Haoyang
Gifford, David K
Visualizing complex feature interactions and feature sharing in genomic deep neural networks
title Visualizing complex feature interactions and feature sharing in genomic deep neural networks
title_full Visualizing complex feature interactions and feature sharing in genomic deep neural networks
title_fullStr Visualizing complex feature interactions and feature sharing in genomic deep neural networks
title_full_unstemmed Visualizing complex feature interactions and feature sharing in genomic deep neural networks
title_short Visualizing complex feature interactions and feature sharing in genomic deep neural networks
title_sort visualizing complex feature interactions and feature sharing in genomic deep neural networks
url https://hdl.handle.net/1721.1/126253
work_keys_str_mv AT liuge visualizingcomplexfeatureinteractionsandfeaturesharingingenomicdeepneuralnetworks
AT zenghaoyang visualizingcomplexfeatureinteractionsandfeaturesharingingenomicdeepneuralnetworks
AT gifforddavidk visualizingcomplexfeatureinteractionsandfeaturesharingingenomicdeepneuralnetworks