An Improved Canine Genome and a Comprehensive Catalogue of Coding Genes and Non-Coding Transcripts
The domestic dog, Canis familiaris, is a well-established model system for mapping trait and disease loci. While the original draft sequence was of good quality, gaps were abundant, particularly in promoter regions of the genome, negatively impacting the annotation and study of candidate genes. Here,...
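Main Authors: | Hoeppner, Marc P.; Lundquist, Andrew; Pirun, Mono; Meadows, Jennifer R. S.; Zamani, Neda; Johnson, Jeremy; Sundstrom, Gorel; Cook, April; FitzGerald, Michael G.; Swofford, Ross; Mauceli, Evan; Moghadam, Behrooz Torabi; Greka, Anna; Alfoldi, Jessica; Abouelleil, Amr; Aftuck, Lynne; Bessette, Daniel; Berlin, Aaron M.; Brown, Adam; Gearin, Gary; Lui, Annie; Macdonald, J. Pendexter; Priest, Margaret; Shea, Terrance; Turner-Maier, Jason; Zimmer, Andrew; di Palma, Federica; Lindblad-Toh, Kerstin; Grabherr, Manfred G.; Lander, Eric Steven |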
---|---|
Other Authors: | Massachusetts Institute of Technology. Department of Biology |
Format: | Article |
Language: | en_US |
Published: | Public Library of Science, 2014 |
Online Access: | http://hdl.handle.net/1721.1/86301 |
_version_ | 1811096101794611200 |
---|---|
author | Hoeppner, Marc P.; Lundquist, Andrew; Pirun, Mono; Meadows, Jennifer R. S.; Zamani, Neda; Johnson, Jeremy; Sundstrom, Gorel; Cook, April; FitzGerald, Michael G.; Swofford, Ross; Mauceli, Evan; Moghadam, Behrooz Torabi; Greka, Anna; Alfoldi, Jessica; Abouelleil, Amr; Aftuck, Lynne; Bessette, Daniel; Berlin, Aaron M.; Brown, Adam; Gearin, Gary; Lui, Annie; Macdonald, J. Pendexter; Priest, Margaret; Shea, Terrance; Turner-Maier, Jason; Zimmer, Andrew; di Palma, Federica; Lindblad-Toh, Kerstin; Grabherr, Manfred G.; Lander, Eric Steven |
author2 | Massachusetts Institute of Technology. Department of Biology |
author_sort | Hoeppner, Marc P. |
collection | MIT |
description | The domestic dog, Canis familiaris, is a well-established model system for mapping trait and disease loci. While the original draft sequence was of good quality, gaps were abundant, particularly in promoter regions of the genome, negatively impacting the annotation and study of candidate genes. Here, we present an improved genome build, canFam3.1, which includes 85 Mb of novel sequence and now covers 99.8% of the euchromatic portion of the genome. We also present multiple RNA-Sequencing data sets from 10 different canine tissues to catalog ~175,000 expressed loci. While about 90% of the coding genes previously annotated by EnsEMBL have measurable expression in at least one sample, the number of transcript isoforms detected by our data expands the EnsEMBL annotations by a factor of four. Syntenic comparison with the human genome revealed an additional ~3,000 loci that are characterized as protein coding in human and were also expressed in the dog, suggesting that those were previously not annotated in the EnsEMBL canine gene set. In addition to ~20,700 high-confidence protein coding loci, we found ~4,600 antisense transcripts overlapping exons of protein coding genes; ~7,200 intergenic multi-exon transcripts without coding potential, likely candidates for long intergenic non-coding RNAs (lincRNAs); and ~11,000 transcripts that were reported by two different library construction methods but did not fit any of the above categories. Of the lincRNAs, about 6,000 have no annotated orthologs in human or mouse. In a functional analysis, shRNA knockdown of two novel transcripts in a mouse kidney cell line altered cell morphology and motility. All in all, we provide a much-improved annotation of the canine genome and suggest regulatory functions for several of the novel non-coding transcripts. |
first_indexed | 2024-09-23T16:38:16Z |
format | Article |
id | mit-1721.1/86301 |
institution | Massachusetts Institute of Technology |
language | en_US |
last_indexed | 2024-09-23T16:38:16Z |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | dspace |
spelling | mit-1721.1/86301; 2022-09-29T20:28:43Z; Lander, Eric S.; funding: European Science Foundation (EURYI award); National Human Genome Research Institute (U.S.) (NHGRI (U54 HG003067)); Uppsala University; Swedish Medical Research Council; Swedish Research Council FORMAS; European Commission (FP7-LUPA, GA-201370); Science for Life Laboratory (Stockholm, Sweden) (Start-up grant); 2014-04-30T19:29:53Z; 2014-04-30T19:29:53Z; 2014-03; 2013-11; Article; http://purl.org/eprint/type/JournalArticle; ISSN 1932-6203; http://hdl.handle.net/1721.1/86301; citation: Hoeppner, Marc P., Andrew Lundquist, Mono Pirun, Jennifer R. S. Meadows, Neda Zamani, Jeremy Johnson, Görel Sundström, et al. “An Improved Canine Genome and a Comprehensive Catalogue of Coding Genes and Non-Coding Transcripts.” Edited by Brian P. Chadwick. PLoS ONE 9, no. 3 (March 13, 2014): e91172; en_US; http://dx.doi.org/10.1371/journal.pone.0091172; PLoS ONE; Creative Commons Attribution; http://creativecommons.org/licenses/by/4.0/; application/pdf; Public Library of Science; PLoS |
title | An Improved Canine Genome and a Comprehensive Catalogue of Coding Genes and Non-Coding Transcripts |
url | http://hdl.handle.net/1721.1/86301 |