BioDiscViz: A visualization support and consensus signature selector for BioDiscML results.

Machine learning (ML) algorithms are powerful tools to find complex patterns and biomarker signatures when conventional statistical methods fail to identify them. While the ML field made significant progress, state of the art methodologies to build efficient and non-overfitting models are not always...

Full description

Bibliographic Details
Main Authors: Sophiane Bouirdene, Mickael Leclercq, Léopold Quitté, Steve Bilodeau, Arnaud Droit
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2023-01-01
Series:PLoS ONE
Online Access:https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0294750&type=printable
_version_ 1797392926052974592
author Sophiane Bouirdene
Mickael Leclercq
Léopold Quitté
Steve Bilodeau
Arnaud Droit
author_facet Sophiane Bouirdene
Mickael Leclercq
Léopold Quitté
Steve Bilodeau
Arnaud Droit
author_sort Sophiane Bouirdene
collection DOAJ
description Machine learning (ML) algorithms are powerful tools to find complex patterns and biomarker signatures when conventional statistical methods fail to identify them. While the ML field made significant progress, state of the art methodologies to build efficient and non-overfitting models are not always applied in the literature. To this purpose, automatic programs, such as BioDiscML, were designed to identify biomarker signatures and correlated features while escaping overfitting using multiple evaluation strategies, such as cross validation, bootstrapping and repeated holdout. To further improve BioDiscML and reach a broader audience, better visualization support and flexibility in choosing the best models and signatures are needed. Thus, to provide researchers with an easily accessible and usable tool for in depth investigation of the results from BioDiscML outputs, we developed a visual interaction tool called BioDiscViz. This tool provides summaries, tables and graphics, in the form of Principal Component Analysis (PCA) plots, UMAP, t-SNE, heatmaps and boxplots for the best model and the correlated features. Furthermore, this tool also provides visual support to extract a consensus signature from BioDiscML models using a combination of filters. BioDiscViz will be a great visual support for research using ML, hence new opportunities in this field by opening it to a broader community.
first_indexed 2024-03-08T23:54:39Z
format Article
id doaj.art-4fb625a368814489a93ecb37543115b3
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-03-08T23:54:39Z
publishDate 2023-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-4fb625a368814489a93ecb37543115b32023-12-13T05:32:25ZengPublic Library of Science (PLoS)PLoS ONE1932-62032023-01-011811e029475010.1371/journal.pone.0294750BioDiscViz: A visualization support and consensus signature selector for BioDiscML results.Sophiane BouirdeneMickael LeclercqLéopold QuittéSteve BilodeauArnaud DroitMachine learning (ML) algorithms are powerful tools to find complex patterns and biomarker signatures when conventional statistical methods fail to identify them. While the ML field made significant progress, state of the art methodologies to build efficient and non-overfitting models are not always applied in the literature. To this purpose, automatic programs, such as BioDiscML, were designed to identify biomarker signatures and correlated features while escaping overfitting using multiple evaluation strategies, such as cross validation, bootstrapping and repeated holdout. To further improve BioDiscML and reach a broader audience, better visualization support and flexibility in choosing the best models and signatures are needed. Thus, to provide researchers with an easily accessible and usable tool for in depth investigation of the results from BioDiscML outputs, we developed a visual interaction tool called BioDiscViz. This tool provides summaries, tables and graphics, in the form of Principal Component Analysis (PCA) plots, UMAP, t-SNE, heatmaps and boxplots for the best model and the correlated features. Furthermore, this tool also provides visual support to extract a consensus signature from BioDiscML models using a combination of filters. BioDiscViz will be a great visual support for research using ML, hence new opportunities in this field by opening it to a broader community.https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0294750&type=printable
spellingShingle Sophiane Bouirdene
Mickael Leclercq
Léopold Quitté
Steve Bilodeau
Arnaud Droit
BioDiscViz: A visualization support and consensus signature selector for BioDiscML results.
PLoS ONE
title BioDiscViz: A visualization support and consensus signature selector for BioDiscML results.
title_full BioDiscViz: A visualization support and consensus signature selector for BioDiscML results.
title_fullStr BioDiscViz: A visualization support and consensus signature selector for BioDiscML results.
title_full_unstemmed BioDiscViz: A visualization support and consensus signature selector for BioDiscML results.
title_short BioDiscViz: A visualization support and consensus signature selector for BioDiscML results.
title_sort biodiscviz a visualization support and consensus signature selector for biodiscml results
url https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0294750&type=printable
work_keys_str_mv AT sophianebouirdene biodiscvizavisualizationsupportandconsensussignatureselectorforbiodiscmlresults
AT mickaelleclercq biodiscvizavisualizationsupportandconsensussignatureselectorforbiodiscmlresults
AT leopoldquitte biodiscvizavisualizationsupportandconsensussignatureselectorforbiodiscmlresults
AT stevebilodeau biodiscvizavisualizationsupportandconsensussignatureselectorforbiodiscmlresults
AT arnauddroit biodiscvizavisualizationsupportandconsensussignatureselectorforbiodiscmlresults