CL-SPO2Net: Contrastive Learning Spatiotemporal Attention Network for Non-Contact Video-Based SpO2 Estimation

Video-based peripheral oxygen saturation (SpO2) estimation, utilizing solely RGB cameras, offers a non-contact approach to measuring blood oxygen levels. Previous studies set a stable and unchanging environment as the premise for non-contact blood oxygen estimation. Additionally, they utilized a sma...

Full description

Bibliographic Details
Main Authors: Jiahe Peng, Weihua Su, Haiyong Chen, Jingsheng Sun, Zandong Tian
Format: Article
Language:English
Published: MDPI AG 2024-01-01
Series:Bioengineering
Subjects:
Online Access:https://www.mdpi.com/2306-5354/11/2/113
_version_ 1797298868786823168
author Jiahe Peng
Weihua Su
Haiyong Chen
Jingsheng Sun
Zandong Tian
author_facet Jiahe Peng
Weihua Su
Haiyong Chen
Jingsheng Sun
Zandong Tian
author_sort Jiahe Peng
collection DOAJ
description Video-based peripheral oxygen saturation (SpO2) estimation, utilizing solely RGB cameras, offers a non-contact approach to measuring blood oxygen levels. Previous studies set a stable and unchanging environment as the premise for non-contact blood oxygen estimation. Additionally, they utilized a small amount of labeled data for system training and learning. However, it is challenging to train optimal model parameters with a small dataset. The accuracy of blood oxygen detection is easily affected by ambient light and subject movement. To address these issues, this paper proposes a contrastive learning spatiotemporal attention network (CL-SPO2Net), an innovative semi-supervised network for video-based SpO2 estimation. Spatiotemporal similarities in remote photoplethysmography (rPPG) signals were found in video segments containing facial or hand regions. Subsequently, integrating deep neural networks with machine learning expertise enabled the estimation of SpO2. The method had good feasibility in the case of small-scale labeled datasets, with the mean absolute error between the camera and the reference pulse oximeter of 0.85% in the stable environment, 1.13% with lighting fluctuations, and 1.20% in the facial rotation situation.
first_indexed 2024-03-07T22:41:21Z
format Article
id doaj.art-1ada87bdcbb64740ba35f84111eb1ed6
institution Directory Open Access Journal
issn 2306-5354
language English
last_indexed 2024-03-07T22:41:21Z
publishDate 2024-01-01
publisher MDPI AG
record_format Article
series Bioengineering
spelling doaj.art-1ada87bdcbb64740ba35f84111eb1ed62024-02-23T15:07:50ZengMDPI AGBioengineering2306-53542024-01-0111211310.3390/bioengineering11020113CL-SPO2Net: Contrastive Learning Spatiotemporal Attention Network for Non-Contact Video-Based SpO2 EstimationJiahe Peng0Weihua Su1Haiyong Chen2Jingsheng Sun3Zandong Tian4School of Artificial Intelligence, Hebei University of Technology, Tianjin 300401, ChinaSchool of Mechanical Engineering, Hebei University of Technology, Tianjin 300401, ChinaSchool of Artificial Intelligence, Hebei University of Technology, Tianjin 300401, ChinaSchool of Artificial Intelligence, Hebei University of Technology, Tianjin 300401, ChinaSchool of Artificial Intelligence, Hebei University of Technology, Tianjin 300401, ChinaVideo-based peripheral oxygen saturation (SpO2) estimation, utilizing solely RGB cameras, offers a non-contact approach to measuring blood oxygen levels. Previous studies set a stable and unchanging environment as the premise for non-contact blood oxygen estimation. Additionally, they utilized a small amount of labeled data for system training and learning. However, it is challenging to train optimal model parameters with a small dataset. The accuracy of blood oxygen detection is easily affected by ambient light and subject movement. To address these issues, this paper proposes a contrastive learning spatiotemporal attention network (CL-SPO2Net), an innovative semi-supervised network for video-based SpO2 estimation. Spatiotemporal similarities in remote photoplethysmography (rPPG) signals were found in video segments containing facial or hand regions. Subsequently, integrating deep neural networks with machine learning expertise enabled the estimation of SpO2. The method had good feasibility in the case of small-scale labeled datasets, with the mean absolute error between the camera and the reference pulse oximeter of 0.85% in the stable environment, 1.13% with lighting fluctuations, and 1.20% in the facial rotation situation.https://www.mdpi.com/2306-5354/11/2/113contrastive learningperipheral oxygen saturation (SpO2)computer visiondeep learningremote photoplethysmography (rPPG)attention mechanism
spellingShingle Jiahe Peng
Weihua Su
Haiyong Chen
Jingsheng Sun
Zandong Tian
CL-SPO2Net: Contrastive Learning Spatiotemporal Attention Network for Non-Contact Video-Based SpO2 Estimation
Bioengineering
contrastive learning
peripheral oxygen saturation (SpO2)
computer vision
deep learning
remote photoplethysmography (rPPG)
attention mechanism
title CL-SPO2Net: Contrastive Learning Spatiotemporal Attention Network for Non-Contact Video-Based SpO2 Estimation
title_full CL-SPO2Net: Contrastive Learning Spatiotemporal Attention Network for Non-Contact Video-Based SpO2 Estimation
title_fullStr CL-SPO2Net: Contrastive Learning Spatiotemporal Attention Network for Non-Contact Video-Based SpO2 Estimation
title_full_unstemmed CL-SPO2Net: Contrastive Learning Spatiotemporal Attention Network for Non-Contact Video-Based SpO2 Estimation
title_short CL-SPO2Net: Contrastive Learning Spatiotemporal Attention Network for Non-Contact Video-Based SpO2 Estimation
title_sort cl spo2net contrastive learning spatiotemporal attention network for non contact video based spo2 estimation
topic contrastive learning
peripheral oxygen saturation (SpO2)
computer vision
deep learning
remote photoplethysmography (rPPG)
attention mechanism
url https://www.mdpi.com/2306-5354/11/2/113
work_keys_str_mv AT jiahepeng clspo2netcontrastivelearningspatiotemporalattentionnetworkfornoncontactvideobasedspo2estimation
AT weihuasu clspo2netcontrastivelearningspatiotemporalattentionnetworkfornoncontactvideobasedspo2estimation
AT haiyongchen clspo2netcontrastivelearningspatiotemporalattentionnetworkfornoncontactvideobasedspo2estimation
AT jingshengsun clspo2netcontrastivelearningspatiotemporalattentionnetworkfornoncontactvideobasedspo2estimation
AT zandongtian clspo2netcontrastivelearningspatiotemporalattentionnetworkfornoncontactvideobasedspo2estimation