Transformative Approach for Heart Rate Prediction from Face Videos Using Local and Global Multi-Head Self-Attention

Heart rate estimation from face videos is an emerging technology that offers numerous potential applications in healthcare and human–computer interaction. However, most of the existing approaches often overlook the importance of long-range spatiotemporal dependencies, which is essential for robust m...

Full description

Bibliographic Details
Main Authors: Smera Premkumar, J. Anitha, Daniela Danciulescu, D. Jude Hemanth
Format: Article
Language:English
Published: MDPI AG 2023-12-01
Series:Technologies
Subjects:
Online Access:https://www.mdpi.com/2227-7080/12/1/2
_version_ 1797342489751846912
author Smera Premkumar
J. Anitha
Daniela Danciulescu
D. Jude Hemanth
author_facet Smera Premkumar
J. Anitha
Daniela Danciulescu
D. Jude Hemanth
author_sort Smera Premkumar
collection DOAJ
description Heart rate estimation from face videos is an emerging technology that offers numerous potential applications in healthcare and human–computer interaction. However, most of the existing approaches often overlook the importance of long-range spatiotemporal dependencies, which is essential for robust measurement of heart rate prediction. Additionally, they involve extensive pre-processing steps to enhance the prediction accuracy, resulting in high computational complexity. In this paper, we propose an innovative solution called LGTransPPG. This end-to-end transformer-based framework eliminates the need for pre-processing steps while achieving improved efficiency and accuracy. LGTransPPG incorporates local and global aggregation techniques to capture fine-grained facial features and contextual information. By leveraging the power of transformers, our framework can effectively model long-range dependencies and temporal dynamics, enhancing the heart rate prediction process. The proposed approach is evaluated on three publicly available datasets, demonstrating its robustness and generalizability. Furthermore, we achieved a high Pearson correlation coefficient (PCC) value of 0.88, indicating its superior efficiency and accuracy between the predicted and actual heart rate values.
first_indexed 2024-03-08T10:34:02Z
format Article
id doaj.art-064ce49f1a384aa58392d09c08e5a814
institution Directory Open Access Journal
issn 2227-7080
language English
last_indexed 2024-03-08T10:34:02Z
publishDate 2023-12-01
publisher MDPI AG
record_format Article
series Technologies
spelling doaj.art-064ce49f1a384aa58392d09c08e5a8142024-01-26T18:40:03ZengMDPI AGTechnologies2227-70802023-12-01121210.3390/technologies12010002Transformative Approach for Heart Rate Prediction from Face Videos Using Local and Global Multi-Head Self-AttentionSmera Premkumar0J. Anitha1Daniela Danciulescu2D. Jude Hemanth3Department of ECE, Karunya Institute of Technology and Sciences, Coimbatore 641114, IndiaDepartment of ECE, Karunya Institute of Technology and Sciences, Coimbatore 641114, IndiaDepartment of Computer Science, University of Craiova, 200585 Craiova, RomaniaDepartment of ECE, Karunya Institute of Technology and Sciences, Coimbatore 641114, IndiaHeart rate estimation from face videos is an emerging technology that offers numerous potential applications in healthcare and human–computer interaction. However, most of the existing approaches often overlook the importance of long-range spatiotemporal dependencies, which is essential for robust measurement of heart rate prediction. Additionally, they involve extensive pre-processing steps to enhance the prediction accuracy, resulting in high computational complexity. In this paper, we propose an innovative solution called LGTransPPG. This end-to-end transformer-based framework eliminates the need for pre-processing steps while achieving improved efficiency and accuracy. LGTransPPG incorporates local and global aggregation techniques to capture fine-grained facial features and contextual information. By leveraging the power of transformers, our framework can effectively model long-range dependencies and temporal dynamics, enhancing the heart rate prediction process. The proposed approach is evaluated on three publicly available datasets, demonstrating its robustness and generalizability. Furthermore, we achieved a high Pearson correlation coefficient (PCC) value of 0.88, indicating its superior efficiency and accuracy between the predicted and actual heart rate values.https://www.mdpi.com/2227-7080/12/1/2remote photoplethysmographytransformerheart rate prediction
spellingShingle Smera Premkumar
J. Anitha
Daniela Danciulescu
D. Jude Hemanth
Transformative Approach for Heart Rate Prediction from Face Videos Using Local and Global Multi-Head Self-Attention
Technologies
remote photoplethysmography
transformer
heart rate prediction
title Transformative Approach for Heart Rate Prediction from Face Videos Using Local and Global Multi-Head Self-Attention
title_full Transformative Approach for Heart Rate Prediction from Face Videos Using Local and Global Multi-Head Self-Attention
title_fullStr Transformative Approach for Heart Rate Prediction from Face Videos Using Local and Global Multi-Head Self-Attention
title_full_unstemmed Transformative Approach for Heart Rate Prediction from Face Videos Using Local and Global Multi-Head Self-Attention
title_short Transformative Approach for Heart Rate Prediction from Face Videos Using Local and Global Multi-Head Self-Attention
title_sort transformative approach for heart rate prediction from face videos using local and global multi head self attention
topic remote photoplethysmography
transformer
heart rate prediction
url https://www.mdpi.com/2227-7080/12/1/2
work_keys_str_mv AT smerapremkumar transformativeapproachforheartratepredictionfromfacevideosusinglocalandglobalmultiheadselfattention
AT janitha transformativeapproachforheartratepredictionfromfacevideosusinglocalandglobalmultiheadselfattention
AT danieladanciulescu transformativeapproachforheartratepredictionfromfacevideosusinglocalandglobalmultiheadselfattention
AT djudehemanth transformativeapproachforheartratepredictionfromfacevideosusinglocalandglobalmultiheadselfattention