A knowledge discovery methodology from EEG data for cyclic alternating pattern detection

Abstract Background Detection and quantification of cyclic alternating patterns (CAP) components has the potential to serve as a disease bio-marker. Few methods exist to discriminate all the different CAP components, they do not present appropriate sensitivities, and often they are evaluated based o...

Full description

Bibliographic Details
Main Authors: Fátima Machado, Francisco Sales, Clara Santos, António Dourado, C. A. Teixeira
Format: Article
Language:English
Published: BMC 2018-12-01
Series:BioMedical Engineering OnLine
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12938-018-0616-z
_version_ 1811338366059282432
author Fátima Machado
Francisco Sales
Clara Santos
António Dourado
C. A. Teixeira
author_facet Fátima Machado
Francisco Sales
Clara Santos
António Dourado
C. A. Teixeira
author_sort Fátima Machado
collection DOAJ
description Abstract Background Detection and quantification of cyclic alternating patterns (CAP) components has the potential to serve as a disease bio-marker. Few methods exist to discriminate all the different CAP components, they do not present appropriate sensitivities, and often they are evaluated based on accuracy (AC) that is not an appropriate measure for imbalanced datasets. Methods We describe a knowledge discovery methodology in data (KDD) aiming the development of automatic CAP scoring approaches. Automatic CAP scoring was faced from two perspectives: the binary distinction between A-phases and B-phases, and also for multi-class classification of the different CAP components. The most important KDD stages are: extraction of 55 features, feature ranking/transformation, and classification. Classification is performed by (i) support vector machine (SVM), (ii) k-nearest neighbors (k-NN), and (iii) discriminant analysis. We report the weighted accuracy (WAC) that accounts for class imbalance. Results The study includes 30 subjects from the CAP Sleep Database of Physionet. The best alternative for the discrimination of the different A-phase subtypes involved feature ranking by the minimum redundancy maximum relevance algorithm (mRMR) and classification by SVM, with a WAC of 51%. Concerning the binary discrimination between A-phases and B-phases, k-NN with mRMR ranking achieved the best WAC of 80%. Conclusions We describe a KDD that, to the best of our knowledge, was for the first time applied to CAP scoring. In particular, the fully discrimination of the three different A-phases subtypes is a new perspective, since past works tried multi-class approaches but based on grouping of different sub-types. We also considered the weighted accuracy, in addition to simple accuracy, resulting in a more trustworthy performance assessment. Globally, better subtype sensitivities than other published approaches were achieved.
first_indexed 2024-04-13T18:08:46Z
format Article
id doaj.art-e6a56e13d9c64c7b89427245b2e817c7
institution Directory Open Access Journal
issn 1475-925X
language English
last_indexed 2024-04-13T18:08:46Z
publishDate 2018-12-01
publisher BMC
record_format Article
series BioMedical Engineering OnLine
spelling doaj.art-e6a56e13d9c64c7b89427245b2e817c72022-12-22T02:35:58ZengBMCBioMedical Engineering OnLine1475-925X2018-12-0117112310.1186/s12938-018-0616-zA knowledge discovery methodology from EEG data for cyclic alternating pattern detectionFátima Machado0Francisco Sales1Clara Santos2António Dourado3C. A. Teixeira4CISUC-Centro de Informática e Sistemas da Universidade de Coimbra, Departamento de Engenharia Informática, Faculdade de Ciências e Tecnologia, Universidade de CoimbraCentro Integrado de Epilepsia, Centro Hospitalar e Universitário de CoimbraCentro de Medicina do Sono do Hospital Geral CoimbraCISUC-Centro de Informática e Sistemas da Universidade de Coimbra, Departamento de Engenharia Informática, Faculdade de Ciências e Tecnologia, Universidade de CoimbraCISUC-Centro de Informática e Sistemas da Universidade de Coimbra, Departamento de Engenharia Informática, Faculdade de Ciências e Tecnologia, Universidade de CoimbraAbstract Background Detection and quantification of cyclic alternating patterns (CAP) components has the potential to serve as a disease bio-marker. Few methods exist to discriminate all the different CAP components, they do not present appropriate sensitivities, and often they are evaluated based on accuracy (AC) that is not an appropriate measure for imbalanced datasets. Methods We describe a knowledge discovery methodology in data (KDD) aiming the development of automatic CAP scoring approaches. Automatic CAP scoring was faced from two perspectives: the binary distinction between A-phases and B-phases, and also for multi-class classification of the different CAP components. The most important KDD stages are: extraction of 55 features, feature ranking/transformation, and classification. Classification is performed by (i) support vector machine (SVM), (ii) k-nearest neighbors (k-NN), and (iii) discriminant analysis. We report the weighted accuracy (WAC) that accounts for class imbalance. Results The study includes 30 subjects from the CAP Sleep Database of Physionet. The best alternative for the discrimination of the different A-phase subtypes involved feature ranking by the minimum redundancy maximum relevance algorithm (mRMR) and classification by SVM, with a WAC of 51%. Concerning the binary discrimination between A-phases and B-phases, k-NN with mRMR ranking achieved the best WAC of 80%. Conclusions We describe a KDD that, to the best of our knowledge, was for the first time applied to CAP scoring. In particular, the fully discrimination of the three different A-phases subtypes is a new perspective, since past works tried multi-class approaches but based on grouping of different sub-types. We also considered the weighted accuracy, in addition to simple accuracy, resulting in a more trustworthy performance assessment. Globally, better subtype sensitivities than other published approaches were achieved.http://link.springer.com/article/10.1186/s12938-018-0616-zCyclic alternating patternA-phase detectionEEG processingKnowledge discovery in data
spellingShingle Fátima Machado
Francisco Sales
Clara Santos
António Dourado
C. A. Teixeira
A knowledge discovery methodology from EEG data for cyclic alternating pattern detection
BioMedical Engineering OnLine
Cyclic alternating pattern
A-phase detection
EEG processing
Knowledge discovery in data
title A knowledge discovery methodology from EEG data for cyclic alternating pattern detection
title_full A knowledge discovery methodology from EEG data for cyclic alternating pattern detection
title_fullStr A knowledge discovery methodology from EEG data for cyclic alternating pattern detection
title_full_unstemmed A knowledge discovery methodology from EEG data for cyclic alternating pattern detection
title_short A knowledge discovery methodology from EEG data for cyclic alternating pattern detection
title_sort knowledge discovery methodology from eeg data for cyclic alternating pattern detection
topic Cyclic alternating pattern
A-phase detection
EEG processing
Knowledge discovery in data
url http://link.springer.com/article/10.1186/s12938-018-0616-z
work_keys_str_mv AT fatimamachado aknowledgediscoverymethodologyfromeegdataforcyclicalternatingpatterndetection
AT franciscosales aknowledgediscoverymethodologyfromeegdataforcyclicalternatingpatterndetection
AT clarasantos aknowledgediscoverymethodologyfromeegdataforcyclicalternatingpatterndetection
AT antoniodourado aknowledgediscoverymethodologyfromeegdataforcyclicalternatingpatterndetection
AT cateixeira aknowledgediscoverymethodologyfromeegdataforcyclicalternatingpatterndetection
AT fatimamachado knowledgediscoverymethodologyfromeegdataforcyclicalternatingpatterndetection
AT franciscosales knowledgediscoverymethodologyfromeegdataforcyclicalternatingpatterndetection
AT clarasantos knowledgediscoverymethodologyfromeegdataforcyclicalternatingpatterndetection
AT antoniodourado knowledgediscoverymethodologyfromeegdataforcyclicalternatingpatterndetection
AT cateixeira knowledgediscoverymethodologyfromeegdataforcyclicalternatingpatterndetection