Triclustering Discovery Using the <i>δ</i>-Trimax Method on Microarray Gene Expression Data

Clustering is a mathematical approach that allows one to find a group of data with similar attributes. This approach is also often used in the field of computer science to group a large amounts of data. Triclustering analysis is an analysis technique on 3D data (observation—attribute—context). Tricl...

Full description

Bibliographic Details
Main Authors:	Titin Siswantining, Noval Saputra, Devvi Sarwinda, Herley Shaori Al-Ash
Format:	Article
Language:	English
Published:	MDPI AG 2021-03-01
Series:	Symmetry
Subjects:	biomarker HiPSC HIV-1 mean square residual three dimensions
Online Access:	https://www.mdpi.com/2073-8994/13/3/437

_version_	1797412309645131776
author	Titin Siswantining Noval Saputra Devvi Sarwinda Herley Shaori Al-Ash
author_facet	Titin Siswantining Noval Saputra Devvi Sarwinda Herley Shaori Al-Ash
author_sort	Titin Siswantining
collection	DOAJ
description	Clustering is a mathematical approach that allows one to find a group of data with similar attributes. This approach is also often used in the field of computer science to group a large amounts of data. Triclustering analysis is an analysis technique on 3D data (observation—attribute—context). Triclustering analysis can group observations on several attributes and contexts simultaneously. Triclustering analysis has been frequently applied to analyze microarray gene expression data. We proposed the <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mi>δ</mi></semantics></math></inline-formula>-Trimax method to perform triclustering analysis on microarray gene expression data. The <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mi>δ</mi></semantics></math></inline-formula>-Trimax method aims to find a tricluster that has a mean square residual smaller than <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mi>δ</mi></semantics></math></inline-formula> and a maximum volume. Tricluster is obtained by deleting nodes from 3D data using multiple node deletion and single node deletion algorithms. The tricluster candidates that have been obtained are checked again by adding some previously deleted nodes using the node addition algorithm. In this research, the program improvement of the <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mi>δ</mi></semantics></math></inline-formula>-Trimax method was carried out and also the calculation of the resulting tricluster evaluation result. The <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mi>δ</mi></semantics></math></inline-formula>-Trimax method is implemented in two microarray gene expression data. The first implementation was carried out on gene expression data from the differentiation process of human-induced pluripotent stem cells (HiPSCs) from patients with heart disease, resulting in the best simulation when <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>δ</mi><mo>=</mo><mn>0.0068</mn></mrow></semantics></math></inline-formula>, <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>λ</mi><mo>=</mo><mn>1.2</mn></mrow></semantics></math></inline-formula>, and obtained five tricluster, which are considered as characteristics of heart disease. The second implementation was implemented on HIV-1 data, best simulation when <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>δ</mi><mo>=</mo><mn>0.0046</mn></mrow></semantics></math></inline-formula>, <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>λ</mi><mo>=</mo><mn>1.25</mn></mrow></semantics></math></inline-formula> and produced three genes as biomarkers, with the gene names <i>AGFG1</i>, <i>EGR1</i> and <i>HLA-C</i>. This gene group can be used by medical experts in providing further treatment.
first_indexed	2024-03-09T05:00:18Z
format	Article
id	doaj.art-9de3bfef2f0a4c04ab381b3983fc1b8d
institution	Directory Open Access Journal
issn	2073-8994
language	English
last_indexed	2024-03-09T05:00:18Z
publishDate	2021-03-01
publisher	MDPI AG
record_format	Article
series	Symmetry
spelling	doaj.art-9de3bfef2f0a4c04ab381b3983fc1b8d2023-12-03T13:00:53ZengMDPI AGSymmetry2073-89942021-03-0113343710.3390/sym13030437Triclustering Discovery Using the <i>δ</i>-Trimax Method on Microarray Gene Expression DataTitin Siswantining0Noval Saputra1Devvi Sarwinda2Herley Shaori Al-Ash3Department of Mathematics, Universitas Indonesia, Depok 16424, IndonesiaDepartment of Mathematics, Universitas Indonesia, Depok 16424, IndonesiaDepartment of Mathematics, Universitas Indonesia, Depok 16424, IndonesiaDepartment of Mathematics, Universitas Indonesia, Depok 16424, IndonesiaClustering is a mathematical approach that allows one to find a group of data with similar attributes. This approach is also often used in the field of computer science to group a large amounts of data. Triclustering analysis is an analysis technique on 3D data (observation—attribute—context). Triclustering analysis can group observations on several attributes and contexts simultaneously. Triclustering analysis has been frequently applied to analyze microarray gene expression data. We proposed the <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mi>δ</mi></semantics></math></inline-formula>-Trimax method to perform triclustering analysis on microarray gene expression data. The <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mi>δ</mi></semantics></math></inline-formula>-Trimax method aims to find a tricluster that has a mean square residual smaller than <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mi>δ</mi></semantics></math></inline-formula> and a maximum volume. Tricluster is obtained by deleting nodes from 3D data using multiple node deletion and single node deletion algorithms. The tricluster candidates that have been obtained are checked again by adding some previously deleted nodes using the node addition algorithm. In this research, the program improvement of the <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mi>δ</mi></semantics></math></inline-formula>-Trimax method was carried out and also the calculation of the resulting tricluster evaluation result. The <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mi>δ</mi></semantics></math></inline-formula>-Trimax method is implemented in two microarray gene expression data. The first implementation was carried out on gene expression data from the differentiation process of human-induced pluripotent stem cells (HiPSCs) from patients with heart disease, resulting in the best simulation when <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>δ</mi><mo>=</mo><mn>0.0068</mn></mrow></semantics></math></inline-formula>, <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>λ</mi><mo>=</mo><mn>1.2</mn></mrow></semantics></math></inline-formula>, and obtained five tricluster, which are considered as characteristics of heart disease. The second implementation was implemented on HIV-1 data, best simulation when <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>δ</mi><mo>=</mo><mn>0.0046</mn></mrow></semantics></math></inline-formula>, <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>λ</mi><mo>=</mo><mn>1.25</mn></mrow></semantics></math></inline-formula> and produced three genes as biomarkers, with the gene names <i>AGFG1</i>, <i>EGR1</i> and <i>HLA-C</i>. This gene group can be used by medical experts in providing further treatment.https://www.mdpi.com/2073-8994/13/3/437biomarkerHiPSCHIV-1mean square residualthree dimensions
spellingShingle	Titin Siswantining Noval Saputra Devvi Sarwinda Herley Shaori Al-Ash Triclustering Discovery Using the <i>δ</i>-Trimax Method on Microarray Gene Expression Data Symmetry biomarker HiPSC HIV-1 mean square residual three dimensions
title	Triclustering Discovery Using the <i>δ</i>-Trimax Method on Microarray Gene Expression Data
title_full	Triclustering Discovery Using the <i>δ</i>-Trimax Method on Microarray Gene Expression Data
title_fullStr	Triclustering Discovery Using the <i>δ</i>-Trimax Method on Microarray Gene Expression Data
title_full_unstemmed	Triclustering Discovery Using the <i>δ</i>-Trimax Method on Microarray Gene Expression Data
title_short	Triclustering Discovery Using the <i>δ</i>-Trimax Method on Microarray Gene Expression Data
title_sort	triclustering discovery using the i δ i trimax method on microarray gene expression data
topic	biomarker HiPSC HIV-1 mean square residual three dimensions
url	https://www.mdpi.com/2073-8994/13/3/437
work_keys_str_mv	AT titinsiswantining triclusteringdiscoveryusingtheiditrimaxmethodonmicroarraygeneexpressiondata AT novalsaputra triclusteringdiscoveryusingtheiditrimaxmethodonmicroarraygeneexpressiondata AT devvisarwinda triclusteringdiscoveryusingtheiditrimaxmethodonmicroarraygeneexpressiondata AT herleyshaorialash triclusteringdiscoveryusingtheiditrimaxmethodonmicroarraygeneexpressiondata

Triclustering Discovery Using the <i>δ</i>-Trimax Method on Microarray Gene Expression Data

Similar Items