GSA: Genome Sequence Archive*
With the rapid development of sequencing technologies towards higher throughput and lower cost, sequence data are generated at an unprecedentedly explosive rate. To provide an efficient and easy-to-use platform for managing huge sequence data, here we present Genome Sequence Archive (GSA; http://big...
Main Authors: | , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2017-02-01
|
Series: | Genomics, Proteomics & Bioinformatics |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S1672022917300025 |
_version_ | 1797368734396973056 |
---|---|
author | Yanqing Wang Fuhai Song Junwei Zhu Sisi Zhang Yadong Yang Tingting Chen Bixia Tang Lili Dong Nan Ding Qian Zhang Zhouxian Bai Xunong Dong Huanxin Chen Mingyuan Sun Shuang Zhai Yubin Sun Lei Yu Li Lan Jingfa Xiao Xiangdong Fang Hongxing Lei Zhang Zhang Wenming Zhao |
author_facet | Yanqing Wang Fuhai Song Junwei Zhu Sisi Zhang Yadong Yang Tingting Chen Bixia Tang Lili Dong Nan Ding Qian Zhang Zhouxian Bai Xunong Dong Huanxin Chen Mingyuan Sun Shuang Zhai Yubin Sun Lei Yu Li Lan Jingfa Xiao Xiangdong Fang Hongxing Lei Zhang Zhang Wenming Zhao |
author_sort | Yanqing Wang |
collection | DOAJ |
description | With the rapid development of sequencing technologies towards higher throughput and lower cost, sequence data are generated at an unprecedentedly explosive rate. To provide an efficient and easy-to-use platform for managing huge sequence data, here we present Genome Sequence Archive (GSA; http://bigd.big.ac.cn/gsa or http://gsa.big.ac.cn), a data repository for archiving raw sequence data. In compliance with data standards and structures of the International Nucleotide Sequence Database Collaboration (INSDC), GSA adopts four data objects (BioProject, BioSample, Experiment, and Run) for data organization, accepts raw sequence reads produced by a variety of sequencing platforms, stores both sequence reads and metadata submitted from all over the world, and makes all these data publicly available to worldwide scientific communities. In the era of big data, GSA is not only an important complement to existing INSDC members by alleviating the increasing burdens of handling sequence data deluge, but also takes the significant responsibility for global big data archive and provides free unrestricted access to all publicly available data in support of research activities throughout the world. |
first_indexed | 2024-03-08T17:35:56Z |
format | Article |
id | doaj.art-7c47f4e8a1334b1fbd0d945c1b117d75 |
institution | Directory Open Access Journal |
issn | 1672-0229 |
language | English |
last_indexed | 2024-03-08T17:35:56Z |
publishDate | 2017-02-01 |
publisher | Elsevier |
record_format | Article |
series | Genomics, Proteomics & Bioinformatics |
spelling | doaj.art-7c47f4e8a1334b1fbd0d945c1b117d752024-01-02T12:07:07ZengElsevierGenomics, Proteomics & Bioinformatics1672-02292017-02-01151141810.1016/j.gpb.2017.01.001GSA: Genome Sequence Archive*Yanqing Wang0Fuhai Song1Junwei Zhu2Sisi Zhang3Yadong Yang4Tingting Chen5Bixia Tang6Lili Dong7Nan Ding8Qian Zhang9Zhouxian Bai10Xunong Dong11Huanxin Chen12Mingyuan Sun13Shuang Zhai14Yubin Sun15Lei Yu16Li Lan17Jingfa Xiao18Xiangdong Fang19Hongxing Lei20Zhang Zhang21Wenming Zhao22BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaBIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, ChinaWith the rapid development of sequencing technologies towards higher throughput and lower cost, sequence data are generated at an unprecedentedly explosive rate. To provide an efficient and easy-to-use platform for managing huge sequence data, here we present Genome Sequence Archive (GSA; http://bigd.big.ac.cn/gsa or http://gsa.big.ac.cn), a data repository for archiving raw sequence data. In compliance with data standards and structures of the International Nucleotide Sequence Database Collaboration (INSDC), GSA adopts four data objects (BioProject, BioSample, Experiment, and Run) for data organization, accepts raw sequence reads produced by a variety of sequencing platforms, stores both sequence reads and metadata submitted from all over the world, and makes all these data publicly available to worldwide scientific communities. In the era of big data, GSA is not only an important complement to existing INSDC members by alleviating the increasing burdens of handling sequence data deluge, but also takes the significant responsibility for global big data archive and provides free unrestricted access to all publicly available data in support of research activities throughout the world.http://www.sciencedirect.com/science/article/pii/S1672022917300025Genome Sequence ArchiveGSABig dataRaw sequence dataINSDC |
spellingShingle | Yanqing Wang Fuhai Song Junwei Zhu Sisi Zhang Yadong Yang Tingting Chen Bixia Tang Lili Dong Nan Ding Qian Zhang Zhouxian Bai Xunong Dong Huanxin Chen Mingyuan Sun Shuang Zhai Yubin Sun Lei Yu Li Lan Jingfa Xiao Xiangdong Fang Hongxing Lei Zhang Zhang Wenming Zhao GSA: Genome Sequence Archive* Genomics, Proteomics & Bioinformatics Genome Sequence Archive GSA Big data Raw sequence data INSDC |
title | GSA: Genome Sequence Archive* |
title_full | GSA: Genome Sequence Archive* |
title_fullStr | GSA: Genome Sequence Archive* |
title_full_unstemmed | GSA: Genome Sequence Archive* |
title_short | GSA: Genome Sequence Archive* |
title_sort | gsa genome sequence archive |
topic | Genome Sequence Archive GSA Big data Raw sequence data INSDC |
url | http://www.sciencedirect.com/science/article/pii/S1672022917300025 |
work_keys_str_mv | AT yanqingwang gsagenomesequencearchive AT fuhaisong gsagenomesequencearchive AT junweizhu gsagenomesequencearchive AT sisizhang gsagenomesequencearchive AT yadongyang gsagenomesequencearchive AT tingtingchen gsagenomesequencearchive AT bixiatang gsagenomesequencearchive AT lilidong gsagenomesequencearchive AT nanding gsagenomesequencearchive AT qianzhang gsagenomesequencearchive AT zhouxianbai gsagenomesequencearchive AT xunongdong gsagenomesequencearchive AT huanxinchen gsagenomesequencearchive AT mingyuansun gsagenomesequencearchive AT shuangzhai gsagenomesequencearchive AT yubinsun gsagenomesequencearchive AT leiyu gsagenomesequencearchive AT lilan gsagenomesequencearchive AT jingfaxiao gsagenomesequencearchive AT xiangdongfang gsagenomesequencearchive AT hongxinglei gsagenomesequencearchive AT zhangzhang gsagenomesequencearchive AT wenmingzhao gsagenomesequencearchive |