GSA: Genome Sequence Archive*

With the rapid development of sequencing technologies towards higher throughput and lower cost, sequence data are generated at an unprecedentedly explosive rate. To provide an efficient and easy-to-use platform for managing huge sequence data, here we present Genome Sequence Archive (GSA; http://big...

Full description

Bibliographic Details
Main Authors: Yanqing Wang, Fuhai Song, Junwei Zhu, Sisi Zhang, Yadong Yang, Tingting Chen, Bixia Tang, Lili Dong, Nan Ding, Qian Zhang, Zhouxian Bai, Xunong Dong, Huanxin Chen, Mingyuan Sun, Shuang Zhai, Yubin Sun, Lei Yu, Li Lan, Jingfa Xiao, Xiangdong Fang, Hongxing Lei, Zhang Zhang, Wenming Zhao
Format: Article
Language:English
Published: Elsevier 2017-02-01
Series:Genomics, Proteomics & Bioinformatics
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1672022917300025
_version_ 1797368734396973056
author Yanqing Wang
Fuhai Song
Junwei Zhu
Sisi Zhang
Yadong Yang
Tingting Chen
Bixia Tang
Lili Dong
Nan Ding
Qian Zhang
Zhouxian Bai
Xunong Dong
Huanxin Chen
Mingyuan Sun
Shuang Zhai
Yubin Sun
Lei Yu
Li Lan
Jingfa Xiao
Xiangdong Fang
Hongxing Lei
Zhang Zhang
Wenming Zhao
author_facet Yanqing Wang
Fuhai Song
Junwei Zhu
Sisi Zhang
Yadong Yang
Tingting Chen
Bixia Tang
Lili Dong
Nan Ding
Qian Zhang
Zhouxian Bai
Xunong Dong
Huanxin Chen
Mingyuan Sun
Shuang Zhai
Yubin Sun
Lei Yu
Li Lan
Jingfa Xiao
Xiangdong Fang
Hongxing Lei
Zhang Zhang
Wenming Zhao
author_sort Yanqing Wang
collection DOAJ
description With the rapid development of sequencing technologies towards higher throughput and lower cost, sequence data are generated at an unprecedentedly explosive rate. To provide an efficient and easy-to-use platform for managing huge sequence data, here we present Genome Sequence Archive (GSA; http://bigd.big.ac.cn/gsa or http://gsa.big.ac.cn), a data repository for archiving raw sequence data. In compliance with data standards and structures of the International Nucleotide Sequence Database Collaboration (INSDC), GSA adopts four data objects (BioProject, BioSample, Experiment, and Run) for data organization, accepts raw sequence reads produced by a variety of sequencing platforms, stores both sequence reads and metadata submitted from all over the world, and makes all these data publicly available to worldwide scientific communities. In the era of big data, GSA is not only an important complement to existing INSDC members by alleviating the increasing burdens of handling sequence data deluge, but also takes the significant responsibility for global big data archive and provides free unrestricted access to all publicly available data in support of research activities throughout the world.
first_indexed 2024-03-08T17:35:56Z
format Article
id doaj.art-7c47f4e8a1334b1fbd0d945c1b117d75
institution Directory Open Access Journal
issn 1672-0229
language English
last_indexed 2024-03-08T17:35:56Z
publishDate 2017-02-01
publisher Elsevier
record_format Article
series Genomics, Proteomics & Bioinformatics
spelling doaj.art-7c47f4e8a1334b1fbd0d945c1b117d752024-01-02T12:07:07ZengElsevierGenomics, Proteomics & Bioinformatics1672-02292017-02-01151141810.1016/j.gpb.2017.01.001GSA: Genome Sequence Archive*Yanqing Wang0Fuhai Song1Junwei Zhu2Sisi Zhang3Yadong Yang4Tingting Chen5Bixia Tang6Lili Dong7Nan Ding8Qian Zhang9Zhouxian Bai10Xunong Dong11Huanxin Chen12Mingyuan Sun13Shuang Zhai14Yubin Sun15Lei Yu16Li Lan17Jingfa Xiao18Xiangdong Fang19Hongxing Lei20Zhang Zhang21Wenming Zhao22BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaWith the rapid development of sequencing technologies towards higher throughput and lower cost, sequence data are generated at an unprecedentedly explosive rate. To provide an efficient and easy-to-use platform for managing huge sequence data, here we present Genome Sequence Archive (GSA; http://bigd.big.ac.cn/gsa or http://gsa.big.ac.cn), a data repository for archiving raw sequence data. In compliance with data standards and structures of the International Nucleotide Sequence Database Collaboration (INSDC), GSA adopts four data objects (BioProject, BioSample, Experiment, and Run) for data organization, accepts raw sequence reads produced by a variety of sequencing platforms, stores both sequence reads and metadata submitted from all over the world, and makes all these data publicly available to worldwide scientific communities. In the era of big data, GSA is not only an important complement to existing INSDC members by alleviating the increasing burdens of handling sequence data deluge, but also takes the significant responsibility for global big data archive and provides free unrestricted access to all publicly available data in support of research activities throughout the world.http://www.sciencedirect.com/science/article/pii/S1672022917300025Genome Sequence ArchiveGSABig dataRaw sequence dataINSDC
spellingShingle Yanqing Wang
Fuhai Song
Junwei Zhu
Sisi Zhang
Yadong Yang
Tingting Chen
Bixia Tang
Lili Dong
Nan Ding
Qian Zhang
Zhouxian Bai
Xunong Dong
Huanxin Chen
Mingyuan Sun
Shuang Zhai
Yubin Sun
Lei Yu
Li Lan
Jingfa Xiao
Xiangdong Fang
Hongxing Lei
Zhang Zhang
Wenming Zhao
GSA: Genome Sequence Archive*
Genomics, Proteomics & Bioinformatics
Genome Sequence Archive
GSA
Big data
Raw sequence data
INSDC
title GSA: Genome Sequence Archive*
title_full GSA: Genome Sequence Archive*
title_fullStr GSA: Genome Sequence Archive*
title_full_unstemmed GSA: Genome Sequence Archive*
title_short GSA: Genome Sequence Archive*
title_sort gsa genome sequence archive
topic Genome Sequence Archive
GSA
Big data
Raw sequence data
INSDC
url http://www.sciencedirect.com/science/article/pii/S1672022917300025
work_keys_str_mv AT yanqingwang gsagenomesequencearchive
AT fuhaisong gsagenomesequencearchive
AT junweizhu gsagenomesequencearchive
AT sisizhang gsagenomesequencearchive
AT yadongyang gsagenomesequencearchive
AT tingtingchen gsagenomesequencearchive
AT bixiatang gsagenomesequencearchive
AT lilidong gsagenomesequencearchive
AT nanding gsagenomesequencearchive
AT qianzhang gsagenomesequencearchive
AT zhouxianbai gsagenomesequencearchive
AT xunongdong gsagenomesequencearchive
AT huanxinchen gsagenomesequencearchive
AT mingyuansun gsagenomesequencearchive
AT shuangzhai gsagenomesequencearchive
AT yubinsun gsagenomesequencearchive
AT leiyu gsagenomesequencearchive
AT lilan gsagenomesequencearchive
AT jingfaxiao gsagenomesequencearchive
AT xiangdongfang gsagenomesequencearchive
AT hongxinglei gsagenomesequencearchive
AT zhangzhang gsagenomesequencearchive
AT wenmingzhao gsagenomesequencearchive