Correlates of physical activity behavior in adults: a data mining approach

Abstract Purpose A data mining approach was applied to establish a multilevel hierarchy predicting physical activity (PA) behavior, and to methodologically identify the correlates of PA behavior. Methods Cross-sectional data from the population-based Northern Finland Birth Cohort 1966 study, collect...

Full description

Bibliographic Details
Main Authors: Vahid Farrahi, Maisa Niemelä, Mikko Kärmeniemi, Soile Puhakka, Maarit Kangas, Raija Korpelainen, Timo Jämsä
Format: Article
Language:English
Published: BMC 2020-07-01
Series:International Journal of Behavioral Nutrition and Physical Activity
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12966-020-00996-7
_version_ 1811327875961323520
author Vahid Farrahi
Maisa Niemelä
Mikko Kärmeniemi
Soile Puhakka
Maarit Kangas
Raija Korpelainen
Timo Jämsä
author_facet Vahid Farrahi
Maisa Niemelä
Mikko Kärmeniemi
Soile Puhakka
Maarit Kangas
Raija Korpelainen
Timo Jämsä
author_sort Vahid Farrahi
collection DOAJ
description Abstract Purpose A data mining approach was applied to establish a multilevel hierarchy predicting physical activity (PA) behavior, and to methodologically identify the correlates of PA behavior. Methods Cross-sectional data from the population-based Northern Finland Birth Cohort 1966 study, collected in the most recent follow-up at age 46, were used to create a hierarchy using the chi-square automatic interaction detection (CHAID) decision tree technique for predicting PA behavior. PA behavior is defined as active or inactive based on machine-learned activity profiles, which were previously created through a multidimensional (clustering) approach on continuous accelerometer-measured activity intensities in one week. The input variables (predictors) used for decision tree fitting consisted of individual, demographical, psychological, behavioral, environmental, and physical factors. Using generalized linear mixed models, we also analyzed how factors emerging from the model were associated with three PA metrics, including daily time (minutes per day) in sedentary (SED), light PA (LPA), and moderate-to-vigorous PA (MVPA), to assure the relative importance of methodologically identified factors. Results Of the 4582 participants with valid accelerometer data at the latest follow-up, 2701 and 1881 had active and inactive profiles, respectively. We used a total of 168 factors as input variables to classify these two PA behaviors. Out of these 168 factors, the decision tree selected 36 factors of different domains from which 54 subgroups of participants were formed. The emerging factors from the model explained minutes per day in SED, LPA, and/or MVPA, including body fat percentage (SED: B = 26.5, LPA: B = − 16.1, and MVPA: B = − 11.7), normalized heart rate recovery 60 s after exercise (SED: B = -16.1, LPA: B = 9.9, and MVPA: B = 9.6), average weekday total sitting time (SED: B = 34.1, LPA: B = -25.3, and MVPA: B = -5.8), and extravagance score (SED: B = 6.3 and LPA: B = − 3.7). Conclusions Using data mining, we established a data-driven model composed of 36 different factors of relative importance from empirical data. This model may be used to identify subgroups for multilevel intervention allocation and design. Additionally, this study methodologically discovered an extensive set of factors that can be a basis for additional hypothesis testing in PA correlates research.
first_indexed 2024-04-13T15:15:37Z
format Article
id doaj.art-67b44416b3bf4011a2017a3b857db421
institution Directory Open Access Journal
issn 1479-5868
language English
last_indexed 2024-04-13T15:15:37Z
publishDate 2020-07-01
publisher BMC
record_format Article
series International Journal of Behavioral Nutrition and Physical Activity
spelling doaj.art-67b44416b3bf4011a2017a3b857db4212022-12-22T02:41:52ZengBMCInternational Journal of Behavioral Nutrition and Physical Activity1479-58682020-07-0117111410.1186/s12966-020-00996-7Correlates of physical activity behavior in adults: a data mining approachVahid Farrahi0Maisa Niemelä1Mikko Kärmeniemi2Soile Puhakka3Maarit Kangas4Raija Korpelainen5Timo Jämsä6Research Unit of Medical Imaging, Physics and Technology, University of OuluResearch Unit of Medical Imaging, Physics and Technology, University of OuluMedical Research Center, Oulu University Hospital and University of OuluCenter for Life Course Health Research, University of OuluResearch Unit of Medical Imaging, Physics and Technology, University of OuluMedical Research Center, Oulu University Hospital and University of OuluResearch Unit of Medical Imaging, Physics and Technology, University of OuluAbstract Purpose A data mining approach was applied to establish a multilevel hierarchy predicting physical activity (PA) behavior, and to methodologically identify the correlates of PA behavior. Methods Cross-sectional data from the population-based Northern Finland Birth Cohort 1966 study, collected in the most recent follow-up at age 46, were used to create a hierarchy using the chi-square automatic interaction detection (CHAID) decision tree technique for predicting PA behavior. PA behavior is defined as active or inactive based on machine-learned activity profiles, which were previously created through a multidimensional (clustering) approach on continuous accelerometer-measured activity intensities in one week. The input variables (predictors) used for decision tree fitting consisted of individual, demographical, psychological, behavioral, environmental, and physical factors. Using generalized linear mixed models, we also analyzed how factors emerging from the model were associated with three PA metrics, including daily time (minutes per day) in sedentary (SED), light PA (LPA), and moderate-to-vigorous PA (MVPA), to assure the relative importance of methodologically identified factors. Results Of the 4582 participants with valid accelerometer data at the latest follow-up, 2701 and 1881 had active and inactive profiles, respectively. We used a total of 168 factors as input variables to classify these two PA behaviors. Out of these 168 factors, the decision tree selected 36 factors of different domains from which 54 subgroups of participants were formed. The emerging factors from the model explained minutes per day in SED, LPA, and/or MVPA, including body fat percentage (SED: B = 26.5, LPA: B = − 16.1, and MVPA: B = − 11.7), normalized heart rate recovery 60 s after exercise (SED: B = -16.1, LPA: B = 9.9, and MVPA: B = 9.6), average weekday total sitting time (SED: B = 34.1, LPA: B = -25.3, and MVPA: B = -5.8), and extravagance score (SED: B = 6.3 and LPA: B = − 3.7). Conclusions Using data mining, we established a data-driven model composed of 36 different factors of relative importance from empirical data. This model may be used to identify subgroups for multilevel intervention allocation and design. Additionally, this study methodologically discovered an extensive set of factors that can be a basis for additional hypothesis testing in PA correlates research.http://link.springer.com/article/10.1186/s12966-020-00996-7Decision treeCHAIDMultilevel modelPredictionClassification
spellingShingle Vahid Farrahi
Maisa Niemelä
Mikko Kärmeniemi
Soile Puhakka
Maarit Kangas
Raija Korpelainen
Timo Jämsä
Correlates of physical activity behavior in adults: a data mining approach
International Journal of Behavioral Nutrition and Physical Activity
Decision tree
CHAID
Multilevel model
Prediction
Classification
title Correlates of physical activity behavior in adults: a data mining approach
title_full Correlates of physical activity behavior in adults: a data mining approach
title_fullStr Correlates of physical activity behavior in adults: a data mining approach
title_full_unstemmed Correlates of physical activity behavior in adults: a data mining approach
title_short Correlates of physical activity behavior in adults: a data mining approach
title_sort correlates of physical activity behavior in adults a data mining approach
topic Decision tree
CHAID
Multilevel model
Prediction
Classification
url http://link.springer.com/article/10.1186/s12966-020-00996-7
work_keys_str_mv AT vahidfarrahi correlatesofphysicalactivitybehaviorinadultsadataminingapproach
AT maisaniemela correlatesofphysicalactivitybehaviorinadultsadataminingapproach
AT mikkokarmeniemi correlatesofphysicalactivitybehaviorinadultsadataminingapproach
AT soilepuhakka correlatesofphysicalactivitybehaviorinadultsadataminingapproach
AT maaritkangas correlatesofphysicalactivitybehaviorinadultsadataminingapproach
AT raijakorpelainen correlatesofphysicalactivitybehaviorinadultsadataminingapproach
AT timojamsa correlatesofphysicalactivitybehaviorinadultsadataminingapproach