An automatic method using MFCC features for sleep stage classification

Abstract Sleep stage classification is a necessary step for diagnosing sleep disorders. Generally, experts use traditional methods based on every 30 seconds (s) of the biological signals, such as electrooculograms (EOGs), electrocardiograms (ECGs), electromyograms (EMGs), and electroencephalograms (...

Full description

Bibliographic Details
Main Authors: Wei Pei, Yan Li, Peng Wen, Fuwen Yang, Xiaopeng Ji
Format: Article
Language:English
Published: SpringerOpen 2024-02-01
Series:Brain Informatics
Subjects:
Online Access:https://doi.org/10.1186/s40708-024-00219-w
_version_ 1797272794264764416
author Wei Pei
Yan Li
Peng Wen
Fuwen Yang
Xiaopeng Ji
author_facet Wei Pei
Yan Li
Peng Wen
Fuwen Yang
Xiaopeng Ji
author_sort Wei Pei
collection DOAJ
description Abstract Sleep stage classification is a necessary step for diagnosing sleep disorders. Generally, experts use traditional methods based on every 30 seconds (s) of the biological signals, such as electrooculograms (EOGs), electrocardiograms (ECGs), electromyograms (EMGs), and electroencephalograms (EEGs), to classify sleep stages. Recently, various state-of-the-art approaches based on a deep learning model have been demonstrated to have efficient and accurate outcomes in sleep stage classification. In this paper, a novel deep convolutional neural network (CNN) combined with a long short-time memory (LSTM) model is proposed for sleep scoring tasks. A key frequency domain feature named Mel-frequency Cepstral Coefficient (MFCC) is extracted from EEG and EMG signals. The proposed method can learn features from frequency domains on different bio-signal channels. It firstly extracts the MFCC features from multi-channel signals, and then inputs them to several convolutional layers and an LSTM layer. Secondly, the learned representations are fed to a fully connected layer and a softmax classifier for sleep stage classification. The experiments are conducted on two widely used sleep datasets, Sleep Heart Health Study (SHHS) and Vincent’s University Hospital/University College Dublin Sleep Apnoea (UCDDB) to test the effectiveness of the method. The results of this study indicate that the model can perform well in the classification of sleep stages using the features of the 2-dimensional (2D) MFCC feature. The advantage of using the feature is that it can be used to input a two-dimensional data stream, which can be used to retain information about each sleep stage. Using 2D data streams can reduce the time it takes to retrieve the data from the one-dimensional stream. Another advantage of this method is that it eliminates the need for deep layers, which can help improve the performance of the model. For instance, by reducing the number of layers, our seven layers of the model structure takes around 400 s to train and test 100 subjects in the SHHS1 dataset. Its best accuracy and Cohen’s kappa are 82.35% and 0.75 for the SHHS dataset, and 73.07% and 0.63 for the UCDDB dataset, respectively.
first_indexed 2024-03-07T14:34:28Z
format Article
id doaj.art-f2b49e59c3074fb291277fbc26c80415
institution Directory Open Access Journal
issn 2198-4018
2198-4026
language English
last_indexed 2024-03-07T14:34:28Z
publishDate 2024-02-01
publisher SpringerOpen
record_format Article
series Brain Informatics
spelling doaj.art-f2b49e59c3074fb291277fbc26c804152024-03-05T20:46:18ZengSpringerOpenBrain Informatics2198-40182198-40262024-02-0111111310.1186/s40708-024-00219-wAn automatic method using MFCC features for sleep stage classificationWei Pei0Yan Li1Peng Wen2Fuwen Yang3Xiaopeng Ji4School of Mathematics, Physics and Computing, University of Southern QueenslandSchool of Mathematics, Physics and Computing, University of Southern QueenslandSchool of Engineering, University of Southern QueenslandSchool of Engineering and Built Environment, Griffith UniversitySchool of Mathematics, Physics and Computing, University of Southern QueenslandAbstract Sleep stage classification is a necessary step for diagnosing sleep disorders. Generally, experts use traditional methods based on every 30 seconds (s) of the biological signals, such as electrooculograms (EOGs), electrocardiograms (ECGs), electromyograms (EMGs), and electroencephalograms (EEGs), to classify sleep stages. Recently, various state-of-the-art approaches based on a deep learning model have been demonstrated to have efficient and accurate outcomes in sleep stage classification. In this paper, a novel deep convolutional neural network (CNN) combined with a long short-time memory (LSTM) model is proposed for sleep scoring tasks. A key frequency domain feature named Mel-frequency Cepstral Coefficient (MFCC) is extracted from EEG and EMG signals. The proposed method can learn features from frequency domains on different bio-signal channels. It firstly extracts the MFCC features from multi-channel signals, and then inputs them to several convolutional layers and an LSTM layer. Secondly, the learned representations are fed to a fully connected layer and a softmax classifier for sleep stage classification. The experiments are conducted on two widely used sleep datasets, Sleep Heart Health Study (SHHS) and Vincent’s University Hospital/University College Dublin Sleep Apnoea (UCDDB) to test the effectiveness of the method. The results of this study indicate that the model can perform well in the classification of sleep stages using the features of the 2-dimensional (2D) MFCC feature. The advantage of using the feature is that it can be used to input a two-dimensional data stream, which can be used to retain information about each sleep stage. Using 2D data streams can reduce the time it takes to retrieve the data from the one-dimensional stream. Another advantage of this method is that it eliminates the need for deep layers, which can help improve the performance of the model. For instance, by reducing the number of layers, our seven layers of the model structure takes around 400 s to train and test 100 subjects in the SHHS1 dataset. Its best accuracy and Cohen’s kappa are 82.35% and 0.75 for the SHHS dataset, and 73.07% and 0.63 for the UCDDB dataset, respectively.https://doi.org/10.1186/s40708-024-00219-wSleep stagesConvolutional neural networkMel-frequency cepstral coefficientsLong short-term memory
spellingShingle Wei Pei
Yan Li
Peng Wen
Fuwen Yang
Xiaopeng Ji
An automatic method using MFCC features for sleep stage classification
Brain Informatics
Sleep stages
Convolutional neural network
Mel-frequency cepstral coefficients
Long short-term memory
title An automatic method using MFCC features for sleep stage classification
title_full An automatic method using MFCC features for sleep stage classification
title_fullStr An automatic method using MFCC features for sleep stage classification
title_full_unstemmed An automatic method using MFCC features for sleep stage classification
title_short An automatic method using MFCC features for sleep stage classification
title_sort automatic method using mfcc features for sleep stage classification
topic Sleep stages
Convolutional neural network
Mel-frequency cepstral coefficients
Long short-term memory
url https://doi.org/10.1186/s40708-024-00219-w
work_keys_str_mv AT weipei anautomaticmethodusingmfccfeaturesforsleepstageclassification
AT yanli anautomaticmethodusingmfccfeaturesforsleepstageclassification
AT pengwen anautomaticmethodusingmfccfeaturesforsleepstageclassification
AT fuwenyang anautomaticmethodusingmfccfeaturesforsleepstageclassification
AT xiaopengji anautomaticmethodusingmfccfeaturesforsleepstageclassification
AT weipei automaticmethodusingmfccfeaturesforsleepstageclassification
AT yanli automaticmethodusingmfccfeaturesforsleepstageclassification
AT pengwen automaticmethodusingmfccfeaturesforsleepstageclassification
AT fuwenyang automaticmethodusingmfccfeaturesforsleepstageclassification
AT xiaopengji automaticmethodusingmfccfeaturesforsleepstageclassification