Transcript annotation in FANTOM3: mouse gene catalog based on physical cDNAs.


Bibliographic Details
Main Authors: Norihiro Maeda, Takeya Kasukawa, Rieko Oyama, Julian Gough, Martin Frith, Pär G Engström, Boris Lenhard, Rajith N Aturaliya, Serge Batalov, Kirk W Beisel, Carol J Bult, Colin F Fletcher, Alistair R R Forrest, Masaaki Furuno, David Hill, Masayoshi Itoh, Mutsumi Kanamori-Katayama, Shintaro Katayama, Masaru Katoh, Tsugumi Kawashima, John Quackenbush, Timothy Ravasi, Brian Z Ring, Kazuhiro Shibata, Koji Sugiura, Yoichi Takenaka, Rohan D Teasdale, Christine A Wells, Yunxia Zhu, Chikatoshi Kai, Jun Kawai, David A Hume, Piero Carninci, Yoshihide Hayashizaki
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2006-04-01
Series: PLoS Genetics
Online Access: http://europepmc.org/articles/PMC1449903?pdf=render
collection DOAJ
description The international FANTOM consortium aims to produce a comprehensive picture of the mammalian transcriptome, based upon an extensive cDNA collection and functional annotation of full-length enriched cDNAs. The previous dataset, FANTOM2, comprised 60,770 full-length enriched cDNAs. Functional annotation revealed that this cDNA dataset contained only about half of the estimated number of mouse protein-coding genes, indicating that a number of cDNAs still remained to be collected and identified. To pursue the complete gene catalog that covers all predicted mouse genes, cloning and sequencing of full-length enriched cDNAs has been continued since FANTOM2. In FANTOM3, 42,031 newly isolated cDNAs were subjected to functional annotation, and the annotation of 4,347 FANTOM2 cDNAs was updated. To accomplish accurate functional annotation, we improved our automated annotation pipeline by introducing new coding sequence prediction programs and developed a Web-based annotation interface for simplifying the annotation procedures to reduce manual annotation errors. Automated coding sequence and function prediction was followed with manual curation and review by expert curators. A total of 102,801 full-length enriched mouse cDNAs were annotated. Out of 102,801 transcripts, 56,722 were functionally annotated as protein coding (including partial or truncated transcripts), providing to our knowledge the greatest current coverage of the mouse proteome by full-length cDNAs. The total number of distinct non-protein-coding transcripts increased to 34,030. The FANTOM3 annotation system, consisting of automated computational prediction, manual curation, and final expert curation, facilitated the comprehensive characterization of the mouse transcriptome, and could be applied to the transcriptomes of other species.
first_indexed 2024-12-11T09:27:31Z
format Article
id doaj.art-715f2767f50e469fbc189398f0eb0f8f
institution Directory Open Access Journal
issn 1553-7390
1553-7404
language English
last_indexed 2024-12-11T09:27:31Z
publishDate 2006-04-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Genetics
spelling doaj.art-715f2767f50e469fbc189398f0eb0f8f | 2022-12-22T01:13:06Z | eng | Public Library of Science (PLoS) | PLoS Genetics | 1553-7390, 1553-7404 | 2006-04-01 | 2(4): e62 | 10.1371/journal.pgen.0020062 | Transcript annotation in FANTOM3: mouse gene catalog based on physical cDNAs. | http://europepmc.org/articles/PMC1449903?pdf=render
title Transcript annotation in FANTOM3: mouse gene catalog based on physical cDNAs.
url http://europepmc.org/articles/PMC1449903?pdf=render