Integrated the different data from the variety database for querying motif sequence / Mohd Taufik Mishan, Zanariah Idrus and Jasmin Ilyani Ahmad

In protein secondary structure prediction, biologists need to use various types of sequence data from multiple biological repositories that are publicly available on the Internet. Much research has been done on minimizing the number of repositories needed for the predi...

Bibliographic Details
Main Authors: Mishan, Mohd Taufik, Idrus, Zanariah, Ahmad, Jasmin Ilyani
Format: Research Reports
Language: English
Published: Research Management Institute 2014
Subjects: Electronic Computers. Computer Science; Multimedia systems; Database management
Online Access: https://ir.uitm.edu.my/id/eprint/22434/1/LP_MOHD%20TAUFIK%20MISHAN%20IRMI%20K%2014_5.pdf
author Mishan, Mohd Taufik
Idrus, Zanariah
Ahmad, Jasmin Ilyani
collection UITM
description In protein secondary structure prediction, biologists need to use various types of sequence data from multiple biological repositories that are publicly available on the Internet. Much research has been done on minimizing the number of repositories needed for the prediction procedures. However, the size, complexity and number of the repositories used make it a major challenge to integrate all the different data into one repository or database. This challenge is known as the syntactic heterogeneity problem. The aim of this research is to overcome the problem by transforming the different data from a variety of databases such as Prosite, Blast, Print and PDB, held in flat-file and other formats, into relational form using XML and ASP.NET. Previous studies consider the XML approach a better choice for biological data integration. This research also finds that queries against a relational database incorporating an XML schema perform better once the various data have been integrated into one repository or relational database using a metadata framework. As a result, the research shows that a tool can search protein secondary structure data of different kinds and sizes stored in the relational database, and that the results can be retrieved faster and more reliably.
first_indexed 2024-03-06T01:51:49Z
format Research Reports
id uitm.eprints-2434
institution Universiti Teknologi MARA
language English
last_indexed 2024-03-06T01:51:49Z
publishDate 2014
publisher Research Management Institute
record_format dspace
spelling uitm.eprints-2434 2018-12-13T02:05:06Z https://ir.uitm.edu.my/id/eprint/22434/ Research Management Institute 2014-04 Research Reports NonPeerReviewed text en https://ir.uitm.edu.my/id/eprint/22434/1/LP_MOHD%20TAUFIK%20MISHAN%20IRMI%20K%2014_5.pdf (2014) [Research Reports] (Unpublished)
title Integrated the different data from the variety database for querying motif sequence / Mohd Taufik Mishan, Zanariah Idrus and Jasmin Ilyani Ahmad
topic Electronic Computers. Computer Science
Multimedia systems
Database management
url https://ir.uitm.edu.my/id/eprint/22434/1/LP_MOHD%20TAUFIK%20MISHAN%20IRMI%20K%2014_5.pdf