Characterization of a strain-specific CD-1 reference genome reveals potential inter- and intra-strain functional variability

Abstract Background CD-1 is an outbred mouse stock that is frequently used in toxicology, pharmacology, and fundamental biomedical research. Although inbred strains are typically better suited for such studies due to minimal genetic variability, outbred stocks confer practical advantages over inbred...

Full description

Bibliographic Details
Main Authors: Yoon Hee Jung, Hsiao-Lin V. Wang, Samir Ali, Victor G. Corces, Isaac Kremsky
Format: Article
Language:English
Published: BMC 2023-08-01
Series:BMC Genomics
Subjects:
Online Access:https://doi.org/10.1186/s12864-023-09523-x
_version_ 1797453786891943936
author Yoon Hee Jung
Hsiao-Lin V. Wang
Samir Ali
Victor G. Corces
Isaac Kremsky
author_facet Yoon Hee Jung
Hsiao-Lin V. Wang
Samir Ali
Victor G. Corces
Isaac Kremsky
author_sort Yoon Hee Jung
collection DOAJ
description Abstract Background CD-1 is an outbred mouse stock that is frequently used in toxicology, pharmacology, and fundamental biomedical research. Although inbred strains are typically better suited for such studies due to minimal genetic variability, outbred stocks confer practical advantages over inbred strains, such as improved breeding performance and low cost. Knowledge of the full genetic variability of CD-1 would make it more useful in toxicology, pharmacology, and fundamental biomedical research. Results We performed deep genomic DNA sequencing of CD-1 mice and used the data to identify genome-wide SNPs, indels, and germline transposable elements relative to the mm10 reference genome. We used multiple genome-wide sequencing data types and previously published CD-1 SNPs to validate our called variants. We used the called variants to construct a strain-specific CD-1 reference genome, which we show can improve mappability and reduce experimental biases from genome-wide sequencing data derived from CD-1 mice. Based on previously published ChIP-seq and ATAC-seq data, we find evidence that genetic variation between CD-1 mice can lead to alterations in transcription factor binding. We also identified a number of variants in the coding region of genes which could have effects on translation of genes. Conclusions We have identified millions of previously unidentified CD-1 variants with the potential to confound studies involving CD-1. We used the identified variants to construct a CD-1-specific reference genome, which can improve accuracy and reduce bias when aligning genomics data derived from CD-1 mice.
first_indexed 2024-03-09T15:27:53Z
format Article
id doaj.art-09779d1438674ca8ba3c0692f49e57dc
institution Directory Open Access Journal
issn 1471-2164
language English
last_indexed 2024-03-09T15:27:53Z
publishDate 2023-08-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj.art-09779d1438674ca8ba3c0692f49e57dc2023-11-26T12:25:38ZengBMCBMC Genomics1471-21642023-08-0124111810.1186/s12864-023-09523-xCharacterization of a strain-specific CD-1 reference genome reveals potential inter- and intra-strain functional variabilityYoon Hee Jung0Hsiao-Lin V. Wang1Samir Ali2Victor G. Corces3Isaac Kremsky4Department of Human Genetics, Emory University School of MedicineDepartment of Human Genetics, Emory University School of MedicineDepartment of Basic Sciences, Loma Linda University School of MedicineDepartment of Human Genetics, Emory University School of MedicineDepartment of Basic Sciences, Loma Linda University School of MedicineAbstract Background CD-1 is an outbred mouse stock that is frequently used in toxicology, pharmacology, and fundamental biomedical research. Although inbred strains are typically better suited for such studies due to minimal genetic variability, outbred stocks confer practical advantages over inbred strains, such as improved breeding performance and low cost. Knowledge of the full genetic variability of CD-1 would make it more useful in toxicology, pharmacology, and fundamental biomedical research. Results We performed deep genomic DNA sequencing of CD-1 mice and used the data to identify genome-wide SNPs, indels, and germline transposable elements relative to the mm10 reference genome. We used multiple genome-wide sequencing data types and previously published CD-1 SNPs to validate our called variants. We used the called variants to construct a strain-specific CD-1 reference genome, which we show can improve mappability and reduce experimental biases from genome-wide sequencing data derived from CD-1 mice. Based on previously published ChIP-seq and ATAC-seq data, we find evidence that genetic variation between CD-1 mice can lead to alterations in transcription factor binding. We also identified a number of variants in the coding region of genes which could have effects on translation of genes. Conclusions We have identified millions of previously unidentified CD-1 variants with the potential to confound studies involving CD-1. We used the identified variants to construct a CD-1-specific reference genome, which can improve accuracy and reduce bias when aligning genomics data derived from CD-1 mice.https://doi.org/10.1186/s12864-023-09523-xCD-1SNPsIndelsTransposable elementsReference genome
spellingShingle Yoon Hee Jung
Hsiao-Lin V. Wang
Samir Ali
Victor G. Corces
Isaac Kremsky
Characterization of a strain-specific CD-1 reference genome reveals potential inter- and intra-strain functional variability
BMC Genomics
CD-1
SNPs
Indels
Transposable elements
Reference genome
title Characterization of a strain-specific CD-1 reference genome reveals potential inter- and intra-strain functional variability
title_full Characterization of a strain-specific CD-1 reference genome reveals potential inter- and intra-strain functional variability
title_fullStr Characterization of a strain-specific CD-1 reference genome reveals potential inter- and intra-strain functional variability
title_full_unstemmed Characterization of a strain-specific CD-1 reference genome reveals potential inter- and intra-strain functional variability
title_short Characterization of a strain-specific CD-1 reference genome reveals potential inter- and intra-strain functional variability
title_sort characterization of a strain specific cd 1 reference genome reveals potential inter and intra strain functional variability
topic CD-1
SNPs
Indels
Transposable elements
Reference genome
url https://doi.org/10.1186/s12864-023-09523-x
work_keys_str_mv AT yoonheejung characterizationofastrainspecificcd1referencegenomerevealspotentialinterandintrastrainfunctionalvariability
AT hsiaolinvwang characterizationofastrainspecificcd1referencegenomerevealspotentialinterandintrastrainfunctionalvariability
AT samirali characterizationofastrainspecificcd1referencegenomerevealspotentialinterandintrastrainfunctionalvariability
AT victorgcorces characterizationofastrainspecificcd1referencegenomerevealspotentialinterandintrastrainfunctionalvariability
AT isaackremsky characterizationofastrainspecificcd1referencegenomerevealspotentialinterandintrastrainfunctionalvariability