Fast processing RNA-Seq on multicore processor

RNA Sequencing (RNA-Seq) is the sequencing and analysis of transcriptomes. The main purpose of RNA-Seq analysis is to find out the presence and quantity of RNA in an experimental sample under a specific condition. Essentially, RNA raw sequence data was massive. It can be as big as hundreds of Gigaby...

Full description

Bibliographic Details
Main Authors: Lee, Jia Bin, Abdul Hamid, Nor Asilah Wati, Ismail, Zurita, Laham, Mohamed Faris
Format: Article
Language:English
Published: College of Science for Women/ University of Baghdad 2021
Online Access:http://psasir.upm.edu.my/id/eprint/97311/1/ABSTRACT.pdf
_version_ 1825937848355782656
author Lee, Jia Bin
Abdul Hamid, Nor Asilah Wati
Ismail, Zurita
Laham, Mohamed Faris
author_facet Lee, Jia Bin
Abdul Hamid, Nor Asilah Wati
Ismail, Zurita
Laham, Mohamed Faris
author_sort Lee, Jia Bin
collection UPM
description RNA Sequencing (RNA-Seq) is the sequencing and analysis of transcriptomes. The main purpose of RNA-Seq analysis is to find out the presence and quantity of RNA in an experimental sample under a specific condition. Essentially, RNA raw sequence data was massive. It can be as big as hundreds of Gigabytes (GB). This massive data always makes the processing time become longer and take several days. A multicore processor can speed up a program by separating the tasks and running the tasks’ errands concurrently. Hence, a multicore processor will be a suitable choice to overcome this problem. Therefore, this study aims to use an Intel multicore processor to improve the RNA-Seq speed and analyze RNA-Seq analysis's performance with a multiprocessor. This study only processed RNA-Seq from quality control analysis until sorted the BAM (Binary Alignment/Map) file content. Three different sizes of RNA paired end has been used to make the comparison. The final experiment results showed that the implementation of RNA-Seq on an Intel multicore processor could achieve a higher speedup. The total processing time of RNA-Seq with the largest size of RNA raw sequence data (66.3 Megabytes) decreased from 317.638 seconds to 211.916 seconds. The reduced processing time was 105 seconds and near to 2 minutes. Furthermore, for the smallest RNA raw sequence data size, the total processing time decreased from 212.380 seconds to 163.961 seconds which reduced 48 seconds.
first_indexed 2024-03-06T11:05:41Z
format Article
id upm.eprints-97311
institution Universiti Putra Malaysia
language English
last_indexed 2024-03-06T11:05:41Z
publishDate 2021
publisher College of Science for Women/ University of Baghdad
record_format dspace
spelling upm.eprints-973112022-09-05T08:47:13Z http://psasir.upm.edu.my/id/eprint/97311/ Fast processing RNA-Seq on multicore processor Lee, Jia Bin Abdul Hamid, Nor Asilah Wati Ismail, Zurita Laham, Mohamed Faris RNA Sequencing (RNA-Seq) is the sequencing and analysis of transcriptomes. The main purpose of RNA-Seq analysis is to find out the presence and quantity of RNA in an experimental sample under a specific condition. Essentially, RNA raw sequence data was massive. It can be as big as hundreds of Gigabytes (GB). This massive data always makes the processing time become longer and take several days. A multicore processor can speed up a program by separating the tasks and running the tasks’ errands concurrently. Hence, a multicore processor will be a suitable choice to overcome this problem. Therefore, this study aims to use an Intel multicore processor to improve the RNA-Seq speed and analyze RNA-Seq analysis's performance with a multiprocessor. This study only processed RNA-Seq from quality control analysis until sorted the BAM (Binary Alignment/Map) file content. Three different sizes of RNA paired end has been used to make the comparison. The final experiment results showed that the implementation of RNA-Seq on an Intel multicore processor could achieve a higher speedup. The total processing time of RNA-Seq with the largest size of RNA raw sequence data (66.3 Megabytes) decreased from 317.638 seconds to 211.916 seconds. The reduced processing time was 105 seconds and near to 2 minutes. Furthermore, for the smallest RNA raw sequence data size, the total processing time decreased from 212.380 seconds to 163.961 seconds which reduced 48 seconds. College of Science for Women/ University of Baghdad 2021 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/97311/1/ABSTRACT.pdf Lee, Jia Bin and Abdul Hamid, Nor Asilah Wati and Ismail, Zurita and Laham, Mohamed Faris (2021) Fast processing RNA-Seq on multicore processor. Baghdad Science Journal, 18 (suppl.4). pp. 1413-1422. ISSN 2078-8665; ESSN: 2411-7986 https://bsj.uobaghdad.edu.iq/index.php/BSJ/article/view/6642 10.21123/bsj.2021.18.4(Suppl.).1413
spellingShingle Lee, Jia Bin
Abdul Hamid, Nor Asilah Wati
Ismail, Zurita
Laham, Mohamed Faris
Fast processing RNA-Seq on multicore processor
title Fast processing RNA-Seq on multicore processor
title_full Fast processing RNA-Seq on multicore processor
title_fullStr Fast processing RNA-Seq on multicore processor
title_full_unstemmed Fast processing RNA-Seq on multicore processor
title_short Fast processing RNA-Seq on multicore processor
title_sort fast processing rna seq on multicore processor
url http://psasir.upm.edu.my/id/eprint/97311/1/ABSTRACT.pdf
work_keys_str_mv AT leejiabin fastprocessingrnaseqonmulticoreprocessor
AT abdulhamidnorasilahwati fastprocessingrnaseqonmulticoreprocessor
AT ismailzurita fastprocessingrnaseqonmulticoreprocessor
AT lahammohamedfaris fastprocessingrnaseqonmulticoreprocessor