Machine learning and ontology in eCoaching for personalized activity level monitoring and recommendation generation

Abstract Leading a sedentary lifestyle may cause numerous health problems. Therefore, passive lifestyle changes should be given priority to avoid severe long-term damage. Automatic health coaching system may help people manage a healthy lifestyle with continuous health state monitoring and personali...

Full description

Bibliographic Details
Main Authors: Ayan Chatterjee, Nibedita Pahari, Andreas Prinz, Michael Riegler
Format: Article
Language:English
Published: Nature Portfolio 2022-11-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-022-24118-4
_version_ 1797865123772104704
author Ayan Chatterjee
Nibedita Pahari
Andreas Prinz
Michael Riegler
author_facet Ayan Chatterjee
Nibedita Pahari
Andreas Prinz
Michael Riegler
author_sort Ayan Chatterjee
collection DOAJ
description Abstract Leading a sedentary lifestyle may cause numerous health problems. Therefore, passive lifestyle changes should be given priority to avoid severe long-term damage. Automatic health coaching system may help people manage a healthy lifestyle with continuous health state monitoring and personalized recommendation generation with machine learning (ML). This study proposes a semantic ontology model to annotate the ML-prediction outcomes and personal preferences to conceptualize personalized recommendation generation with a hybrid approach. We use a transfer learning approach to improve ML model training and its performance, and an incremental learning approach to handle daily growing data and fit them into the ML models. Furthermore, we propose a personalized activity recommendation algorithm for a healthy lifestyle by combining transfer learning, incremental learning, the proposed semantic ontology model, and personal preference data. For the overall experiment, we use public and private activity datasets collected from healthy adults (n = 33 for public datasets; n = 16 for private datasets). The standard ML algorithms have been used to investigate the possibility of classifying daily physical activity levels into the following activity classes: sedentary (0), low active (1), active (2), highly active (3), and rigorous active (4). The daily step count, low physical activity, medium physical activity, and vigorous physical activity serve as input for the classification models. We first use publicly available Fitbit datasets to build the initial classification models. Subsequently, we re-use the pre-trained ML classifiers on the private MOX2-5 dataset using transfer learning. We test several standard algorithms and select the best-performing model with optimized configuration for our use case by empirical testing. We find that DecisionTreeClassifier with a criterion "entropy” outperforms other ML classifiers with a mean accuracy score of 97.50% (F1 = 97.00, precision = 97.00, recall = 98.00, MCC = 96.78) and 96.10% (F1 = 96.00, precision = 96.00, recall = 96.00, MCC = 96.10) in Fitbit and MOX2-5 datasets, respectively. Using transfer learning, the DecisionTreeClassifier with a criterion "entropy" outperforms other classifiers with a mean accuracy score of 97.99% (F1 = 98.00, precision = 98.00, recall = 98.00, MCC = 96.79). Therefore, the transfer learning approach improves the machine learning model performance by ≈ 1.98% for defined datasets and settings on MOX2-5 datasets. The Hermit reasoner outperforms other reasoners with an average reasoning time of 1.1–2.1 s, under defined settings in our proposed ontology model. Our proposed algorithm for personalized recommendations conceptualizes a direction to combine the classification results and personal preferences in an ontology for activity eCoaching. The proposed method of combining machine learning technology with semantic rules is an invaluable asset in personalized recommendation generation. Moreover, the semantic rules in the knowledge base and SPARQL (SPARQL Protocol and RDF Query Language) query processing in the query engine helps to understand the logic behind the personalized recommendation generation.
first_indexed 2024-04-09T23:02:52Z
format Article
id doaj.art-68a95d3cb64046a4aa9e33f48c75a920
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-04-09T23:02:52Z
publishDate 2022-11-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-68a95d3cb64046a4aa9e33f48c75a9202023-03-22T10:51:05ZengNature PortfolioScientific Reports2045-23222022-11-0112112610.1038/s41598-022-24118-4Machine learning and ontology in eCoaching for personalized activity level monitoring and recommendation generationAyan Chatterjee0Nibedita Pahari1Andreas Prinz2Michael Riegler3Department of Information and Communication Technology, Centre for e-Health, University of AgderDepartment of Software Engineering, Tietoevry Norway ASDepartment of Information and Communication Technology, Centre for e-Health, University of AgderDepartment of Holistic Systems, Simula Metropolitan Center for Digital Engineering (SimulaMet)Abstract Leading a sedentary lifestyle may cause numerous health problems. Therefore, passive lifestyle changes should be given priority to avoid severe long-term damage. Automatic health coaching system may help people manage a healthy lifestyle with continuous health state monitoring and personalized recommendation generation with machine learning (ML). This study proposes a semantic ontology model to annotate the ML-prediction outcomes and personal preferences to conceptualize personalized recommendation generation with a hybrid approach. We use a transfer learning approach to improve ML model training and its performance, and an incremental learning approach to handle daily growing data and fit them into the ML models. Furthermore, we propose a personalized activity recommendation algorithm for a healthy lifestyle by combining transfer learning, incremental learning, the proposed semantic ontology model, and personal preference data. For the overall experiment, we use public and private activity datasets collected from healthy adults (n = 33 for public datasets; n = 16 for private datasets). The standard ML algorithms have been used to investigate the possibility of classifying daily physical activity levels into the following activity classes: sedentary (0), low active (1), active (2), highly active (3), and rigorous active (4). The daily step count, low physical activity, medium physical activity, and vigorous physical activity serve as input for the classification models. We first use publicly available Fitbit datasets to build the initial classification models. Subsequently, we re-use the pre-trained ML classifiers on the private MOX2-5 dataset using transfer learning. We test several standard algorithms and select the best-performing model with optimized configuration for our use case by empirical testing. We find that DecisionTreeClassifier with a criterion "entropy” outperforms other ML classifiers with a mean accuracy score of 97.50% (F1 = 97.00, precision = 97.00, recall = 98.00, MCC = 96.78) and 96.10% (F1 = 96.00, precision = 96.00, recall = 96.00, MCC = 96.10) in Fitbit and MOX2-5 datasets, respectively. Using transfer learning, the DecisionTreeClassifier with a criterion "entropy" outperforms other classifiers with a mean accuracy score of 97.99% (F1 = 98.00, precision = 98.00, recall = 98.00, MCC = 96.79). Therefore, the transfer learning approach improves the machine learning model performance by ≈ 1.98% for defined datasets and settings on MOX2-5 datasets. The Hermit reasoner outperforms other reasoners with an average reasoning time of 1.1–2.1 s, under defined settings in our proposed ontology model. Our proposed algorithm for personalized recommendations conceptualizes a direction to combine the classification results and personal preferences in an ontology for activity eCoaching. The proposed method of combining machine learning technology with semantic rules is an invaluable asset in personalized recommendation generation. Moreover, the semantic rules in the knowledge base and SPARQL (SPARQL Protocol and RDF Query Language) query processing in the query engine helps to understand the logic behind the personalized recommendation generation.https://doi.org/10.1038/s41598-022-24118-4
spellingShingle Ayan Chatterjee
Nibedita Pahari
Andreas Prinz
Michael Riegler
Machine learning and ontology in eCoaching for personalized activity level monitoring and recommendation generation
Scientific Reports
title Machine learning and ontology in eCoaching for personalized activity level monitoring and recommendation generation
title_full Machine learning and ontology in eCoaching for personalized activity level monitoring and recommendation generation
title_fullStr Machine learning and ontology in eCoaching for personalized activity level monitoring and recommendation generation
title_full_unstemmed Machine learning and ontology in eCoaching for personalized activity level monitoring and recommendation generation
title_short Machine learning and ontology in eCoaching for personalized activity level monitoring and recommendation generation
title_sort machine learning and ontology in ecoaching for personalized activity level monitoring and recommendation generation
url https://doi.org/10.1038/s41598-022-24118-4
work_keys_str_mv AT ayanchatterjee machinelearningandontologyinecoachingforpersonalizedactivitylevelmonitoringandrecommendationgeneration
AT nibeditapahari machinelearningandontologyinecoachingforpersonalizedactivitylevelmonitoringandrecommendationgeneration
AT andreasprinz machinelearningandontologyinecoachingforpersonalizedactivitylevelmonitoringandrecommendationgeneration
AT michaelriegler machinelearningandontologyinecoachingforpersonalizedactivitylevelmonitoringandrecommendationgeneration