A pitfall for machine learning methods aiming to predict across cell types

A pitfall for machine learning methods aiming to predict across cell types

Abstract Machine learning models that predict genomic activity are most useful when they make accurate predictions across cell types. Here, we show that when the training and test sets contain the same genomic loci, the resulting model may falsely appear to perform well by effectively memorizing the...

Full description

Bibliographic Details
Main Authors:	Jacob Schreiber, Ritambhara Singh, Jeffrey Bilmes, William Stafford Noble
Format:	Article
Language:	English
Published:	BMC 2020-11-01
Series:	Genome Biology
Subjects:	Machine learning Epigenomics Genomics
Online Access:	http://link.springer.com/article/10.1186/s13059-020-02177-y

Similar Items

Epiphany: predicting Hi-C contact maps from 1D epigenomic signals
by: Rui Yang, et al.
Published: (2023-06-01)

Completing the ENCODE3 compendium yields accurate imputations across a variety of assays and human biosamples
by: Jacob Schreiber, et al.
Published: (2020-03-01)

Marginalizing the genomic architecture to identify crosstalk across cancer and neurodegeneration
by: Amit Sharma, et al.
Published: (2023-02-01)

Predicting liver cancer on epigenomics data using machine learning
by: Vishalkumar Vekariya, et al.
Published: (2022-09-01)

Editorial: Evolution of crop genomes and epigenomes
by: Hai Du, et al.
Published: (2022-09-01)

Manipulation of Epigenome: Opportunities and Pitfalls in Fighting Autoimmune Diseases
by: Hassan Higazi, et al.
Published: (2022-12-01)

Using recursive feature elimination in random forest to account for correlated variables in high dimensional data
by: Burcu F. Darst, et al.
Published: (2018-09-01)

Epigenetic Inheritance Across the Landscape
by: Amy Vaughn Whipple, et al.
Published: (2016-10-01)

Enhanced JBrowse plugins for epigenomics data visualization
by: Brigitte T. Hofmeister, et al.
Published: (2018-04-01)

Clonal Selection and Evolution of HTLV-1-Infected Cells Driven by Genetic and Epigenetic Alteration
by: Makoto Yamagishi, et al.
Published: (2022-03-01)

Crop improvement using life cycle datasets acquired under field conditions
by: Keiichi eMochida, et al.
Published: (2015-09-01)

Editorial: Epigenomic polymorphisms: The drivers of diversity and heterogeneity
by: Tanvir-Ul-Hassan Dar, et al.
Published: (2022-10-01)

Senomic view of the cell: Senome versus Genome
by: František Baluška, et al.
Published: (2018-05-01)

Comparative epigenomics by machine learning approach for neuroblastoma
by: Ryuichi P. Sugino, et al.
Published: (2022-12-01)

COOBoostR: An Extreme Gradient Boosting-Based Tool for Robust Tissue or Cell-of-Origin Prediction of Tumors
by: Sungmin Yang, et al.
Published: (2022-12-01)

Genome and Epigenome Disorders and Male Infertility: Feedback from 15 Years of Clinical and Research Experience
by: Debbie Montjean, et al.
Published: (2024-03-01)

Elucidating the heterogeneity of endometriosis using multi-omics
by: Cheuk, SKC
Published: (2020)

Genomics and Epigenomics in the Molecular Biology of Melanoma—A Prerequisite for Biomarkers Studies
by: Daniela Luminita Zob, et al.
Published: (2022-12-01)

Habit acquisition in the context of neuronal genomic and epigenomic mosaicism
by: Francisco Javier Novo
Published: (2014-04-01)

Radioactive contamination in Chernobyl and (epi)genetic stability of plants – A review
by: Veronika Lancíková, et al.
Published: (2020-09-01)

Editorial: Genomics and Epigenomics of Cancer Immunotherapy: Challenges and Clinical Implications
by: Malak Abedalthagafi
Published: (2021-07-01)

Editorial: Modern machine learning approaches for quantitative inference of gene regulation from genomic and epigenomic features
by: Michael Banf, et al.
Published: (2023-11-01)

Integrating whole genome sequencing, methylation, gene expression, topologically associated domain information in regulatory mutation prediction: A study of follicular lymphoma
by: Amna Farooq, et al.
Published: (2022-01-01)

Author Correction: Avocado: a multi-scale deep tensor factorization method learns a latent representation of the human epigenome
by: Jacob Schreiber, et al.
Published: (2021-09-01)

Avocado: a multi-scale deep tensor factorization method learns a latent representation of the human epigenome
by: Jacob Schreiber, et al.
Published: (2020-03-01)

Increased DNA methylation variability in type 1 diabetes across three immune effector cell types
by: Paul, Dirk S., et al.
Published: (2018)

Data mining and machine learning approaches for the integration of genome-wide association and methylation data: methodology and main conclusions from GAW20
by: Burcu Darst, et al.
Published: (2018-09-01)

Coordinates and intervals in graph-based reference genomes
by: Knut D. Rand, et al.
Published: (2017-05-01)

Mitochondrion at the Crossroad Between Nutrients and Epigenome
by: Giusi Taormina, et al.
Published: (2019-10-01)

Novel Approaches for Identifying the Molecular Background of Schizophrenia
by: Arkadiy K. Golov, et al.
Published: (2020-01-01)

Super-enhancers are transcriptionally more active and cell type-specific than stretch enhancers
by: Aziz Khan, et al.
Published: (2018-09-01)

Cell type-specific histone acetylation profiling of Alzheimer’s disease subjects and integration with genetics
by: Easwaran Ramamurthy, et al.
Published: (2023-01-01)

Omics Application in Animal Science—A Special Emphasis on Stress Response and Damaging Behaviour in Pigs
by: Claudia Kasper, et al.
Published: (2020-08-01)

Precision Medicine in Childhood Asthma: Omic Studies of Treatment Response
by: Javier Perez-Garcia, et al.
Published: (2020-04-01)

Leveraging Genomics, Transcriptomics, and Epigenomics to Understand the Biology and Chemoresistance of Ovarian Cancer
by: Sandra Muñoz-Galván, et al.
Published: (2021-08-01)

Crossing Bacterial Genomic Features and Methylation Patterns with MeStudio: An Epigenomic Analysis Tool
by: Christopher Riccardi, et al.
Published: (2022-12-01)

Abitudini ed ereditarietà: la rivincita di Lamarck?
by: Alessandro Capitanini, et al.
Published: (2021-07-01)

Multi-Omics Profiling Approach to Asthma: An Evolving Paradigm
by: Yadu Gautam, et al.
Published: (2022-01-01)

Mechanisms of Reactivation and Its Interaction with
by: Young Shin Song, et al.
Published: (2020-09-01)

Multi-Omics Analysis in Initiation and Progression of Meningiomas: From Pathogenesis to Diagnosis
by: Jiachen Liu, et al.
Published: (2020-08-01)