Computational analysis of next generation sequencing data and its applications in clinical oncology

Next generation sequencing (NGS) has made great strides in sequencing technology as it enables sequencing of genes in a high throughput manner with low cost. Various NGS platforms such as Illumina, Roche, ABI/SOLiD are used for wet-lab analysis of NGS data and computational tools such as BWA, Bowtie...

Bibliographic Details
Main Authors: Rucha M. Wadapurkar, Renu Vyas
Format: Article
Language: English
Published: Elsevier 2018-01-01
Series: Informatics in Medicine Unlocked
Online Access: http://www.sciencedirect.com/science/article/pii/S2352914818300790
_version_ 1811266889493512192
author Rucha M. Wadapurkar
Renu Vyas
author_facet Rucha M. Wadapurkar
Renu Vyas
author_sort Rucha M. Wadapurkar
collection DOAJ
description Next generation sequencing (NGS) has made great strides in sequencing technology, as it enables high-throughput sequencing of genes at low cost. Various NGS platforms such as Illumina, Roche, and ABI/SOLiD are used for wet-lab analysis of NGS data, and computational tools such as BWA, Bowtie, Galaxy, and SanGeniX are used for dry-lab analysis of NGS data. One important aspect of NGS data is its use in early disease diagnosis, especially in cancer, which was not previously possible with conventional sequencing technologies such as Sanger sequencing. NGS can identify mutations that conventional sequencing technologies cannot, as researchers can now sequence the whole genome, exome, or transcriptome. Exome sequencing is preferred, as a higher number of mutations are found in the exome part of genes. The present comprehensive review covers the complete NGS data analysis workflow, including alignment of NGS reads, identification and annotation of mutations, and visualization; it also discusses software tools for variant identification and annotation, evaluates structural variation in NGS data, and surveys different DNA sequencing technologies. In the field of clinical oncology, NGS has already proven its usefulness, and the mortality rate has been reduced, as doctors can now suggest a proper treatment to their patients by checking the complete genomic profile. However, data storage and the complexity of interpreting the enormous amounts of data obtained with NGS remain a computational challenge to researchers, because for each sample a number of different, very large analysis files are generated between the raw sequence read file and the final result file. The resulting NGS data are very complex, and their interpretation requires expert bioinformatics assistance: a large number of mutations are identified from samples, but differentiating the clinically significant mutations among them with appropriate validation methods is a challenging task. This review is intended to provide researchers with a complete overview of NGS, along with knowledge of how the tools are employed and insight into the identification and interpretation of cancer mutations for clinical diagnostics. Keywords: Next generation sequencing, Mutations, Cancer, Sanger sequencing, Variant identification and annotation, Data analysis
first_indexed 2024-04-12T20:51:37Z
format Article
id doaj.art-673423188c5f48579451cf89b75d4b91
institution Directory Open Access Journal
issn 2352-9148
language English
last_indexed 2024-04-12T20:51:37Z
publishDate 2018-01-01
publisher Elsevier
record_format Article
series Informatics in Medicine Unlocked
spelling doaj.art-673423188c5f48579451cf89b75d4b91
2022-12-22T03:17:07Z
eng
Elsevier
Informatics in Medicine Unlocked
2352-9148
2018-01-01
11
75
82
Computational analysis of next generation sequencing data and its applications in clinical oncology
Rucha M. Wadapurkar (0)
Renu Vyas (1)
MIT School of Bioengineering Sciences and Research, MIT ADT University, Raj Baugh Campus, Loni Kalbhor, Pune 412201, Maharashtra, India
Corresponding author.; MIT School of Bioengineering Sciences and Research, MIT ADT University, Raj Baugh Campus, Loni Kalbhor, Pune 412201, Maharashtra, India
http://www.sciencedirect.com/science/article/pii/S2352914818300790
spellingShingle Rucha M. Wadapurkar
Renu Vyas
Computational analysis of next generation sequencing data and its applications in clinical oncology
Informatics in Medicine Unlocked
title Computational analysis of next generation sequencing data and its applications in clinical oncology
title_full Computational analysis of next generation sequencing data and its applications in clinical oncology
title_fullStr Computational analysis of next generation sequencing data and its applications in clinical oncology
title_full_unstemmed Computational analysis of next generation sequencing data and its applications in clinical oncology
title_short Computational analysis of next generation sequencing data and its applications in clinical oncology
title_sort computational analysis of next generation sequencing data and its applications in clinical oncology
url http://www.sciencedirect.com/science/article/pii/S2352914818300790
work_keys_str_mv AT ruchamwadapurkar computationalanalysisofnextgenerationsequencingdataanditsapplicationsinclinicaloncology
AT renuvyas computationalanalysisofnextgenerationsequencingdataanditsapplicationsinclinicaloncology