Identifying groups of children's social mobility opportunity for public health applications using k-means clustering

Background: The Opportunity Atlas project is a pioneering effort to trace social mobility and adulthood socioeconomic outcomes back to childhood residence. Half of the variation in adulthood socioeconomic outcomes was explainable by neighborhood-level socioeconomic characteristics during childhood....

Full description

Bibliographic Details
Main Authors: Sarah Zelasky, Chantel L. Martin, Christopher Weaver, Lisa K. Baxter, Kristen M. Rappazzo
Format: Article
Language:English
Published: Elsevier 2023-09-01
Series:Heliyon
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2405844023074583
_version_ 1827803034775715840
author Sarah Zelasky
Chantel L. Martin
Christopher Weaver
Lisa K. Baxter
Kristen M. Rappazzo
author_facet Sarah Zelasky
Chantel L. Martin
Christopher Weaver
Lisa K. Baxter
Kristen M. Rappazzo
author_sort Sarah Zelasky
collection DOAJ
description Background: The Opportunity Atlas project is a pioneering effort to trace social mobility and adulthood socioeconomic outcomes back to childhood residence. Half of the variation in adulthood socioeconomic outcomes was explainable by neighborhood-level socioeconomic characteristics during childhood. Clustering census tracts by Opportunity Atlas characteristics would allow for further exploration of variance in social mobility. Our objectives here are to identify and describe spatial clustering trends within Opportunity Atlas outcomes. Methods: We utilized a k-means clustering machine learning approach with four outcome variables (individual income, incarceration rate, employment, and percent of residents living in a neighborhood with low levels of poverty) each given at five parental income levels (1st, 25th, 50th, 75th, and 100th percentiles of the national distribution) to create clusters of census tracts across the contiguous United States (US) and within each Environmental Protection Agency region. Results: At the national level, the algorithm identified seven distinct clusters; the highest opportunity clusters occurred in the Northern Midwest and Northeast, and the lowest opportunity clusters occurred in rural areas of the Southwest and Southeast. For regional analyses, we identified between five to nine clusters within each region. PCA loadings fluctuate across parental income levels; income and low poverty neighborhood residence explain a substantial amount of variance across all variables, but there are differences in contributions across parental income levels for many components. Conclusions: Using data from the Opportunity Atlas, we have taken four social mobility opportunity outcome variables each stratified at five parental income levels and created nationwide and EPA region-specific clusters that group together census tracts with similar opportunity profiles. The development of clusters that can serve as a combined index of social mobility opportunity is an important contribution of this work, and this in turn can be employed in future investigations of factors associated with children's social mobility.
first_indexed 2024-03-11T20:48:37Z
format Article
id doaj.art-9ef21c90bbea488786652970158e3592
institution Directory Open Access Journal
issn 2405-8440
language English
last_indexed 2024-03-11T20:48:37Z
publishDate 2023-09-01
publisher Elsevier
record_format Article
series Heliyon
spelling doaj.art-9ef21c90bbea488786652970158e35922023-10-01T06:02:58ZengElsevierHeliyon2405-84402023-09-0199e20250Identifying groups of children's social mobility opportunity for public health applications using k-means clusteringSarah Zelasky0Chantel L. Martin1Christopher Weaver2Lisa K. Baxter3Kristen M. Rappazzo4Oak Ridge Associated Universities at the U.S. Environmental Protection Agency, Chapel Hill, NC, USADepartment of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC 27599, USAU.S. Environmental Protection Agency, Office of Research and Development, Center for Public Health and Environmental Assessment, Research Triangle Park, Durham, NC, USAU.S. Environmental Protection Agency, Office of Research and Development, Center for Public Health and Environmental Assessment, Research Triangle Park, Durham, NC, USAU.S. Environmental Protection Agency, Office of Research and Development, Center for Public Health and Environmental Assessment, Research Triangle Park, Durham, NC, USA; Corresponding author.Background: The Opportunity Atlas project is a pioneering effort to trace social mobility and adulthood socioeconomic outcomes back to childhood residence. Half of the variation in adulthood socioeconomic outcomes was explainable by neighborhood-level socioeconomic characteristics during childhood. Clustering census tracts by Opportunity Atlas characteristics would allow for further exploration of variance in social mobility. Our objectives here are to identify and describe spatial clustering trends within Opportunity Atlas outcomes. Methods: We utilized a k-means clustering machine learning approach with four outcome variables (individual income, incarceration rate, employment, and percent of residents living in a neighborhood with low levels of poverty) each given at five parental income levels (1st, 25th, 50th, 75th, and 100th percentiles of the national distribution) to create clusters of census tracts across the contiguous United States (US) and within each Environmental Protection Agency region. Results: At the national level, the algorithm identified seven distinct clusters; the highest opportunity clusters occurred in the Northern Midwest and Northeast, and the lowest opportunity clusters occurred in rural areas of the Southwest and Southeast. For regional analyses, we identified between five to nine clusters within each region. PCA loadings fluctuate across parental income levels; income and low poverty neighborhood residence explain a substantial amount of variance across all variables, but there are differences in contributions across parental income levels for many components. Conclusions: Using data from the Opportunity Atlas, we have taken four social mobility opportunity outcome variables each stratified at five parental income levels and created nationwide and EPA region-specific clusters that group together census tracts with similar opportunity profiles. The development of clusters that can serve as a combined index of social mobility opportunity is an important contribution of this work, and this in turn can be employed in future investigations of factors associated with children's social mobility.http://www.sciencedirect.com/science/article/pii/S2405844023074583ClusteringK-meansSocioeconomicsUpward social mobilityOpportunity
spellingShingle Sarah Zelasky
Chantel L. Martin
Christopher Weaver
Lisa K. Baxter
Kristen M. Rappazzo
Identifying groups of children's social mobility opportunity for public health applications using k-means clustering
Heliyon
Clustering
K-means
Socioeconomics
Upward social mobility
Opportunity
title Identifying groups of children's social mobility opportunity for public health applications using k-means clustering
title_full Identifying groups of children's social mobility opportunity for public health applications using k-means clustering
title_fullStr Identifying groups of children's social mobility opportunity for public health applications using k-means clustering
title_full_unstemmed Identifying groups of children's social mobility opportunity for public health applications using k-means clustering
title_short Identifying groups of children's social mobility opportunity for public health applications using k-means clustering
title_sort identifying groups of children s social mobility opportunity for public health applications using k means clustering
topic Clustering
K-means
Socioeconomics
Upward social mobility
Opportunity
url http://www.sciencedirect.com/science/article/pii/S2405844023074583
work_keys_str_mv AT sarahzelasky identifyinggroupsofchildrenssocialmobilityopportunityforpublichealthapplicationsusingkmeansclustering
AT chantellmartin identifyinggroupsofchildrenssocialmobilityopportunityforpublichealthapplicationsusingkmeansclustering
AT christopherweaver identifyinggroupsofchildrenssocialmobilityopportunityforpublichealthapplicationsusingkmeansclustering
AT lisakbaxter identifyinggroupsofchildrenssocialmobilityopportunityforpublichealthapplicationsusingkmeansclustering
AT kristenmrappazzo identifyinggroupsofchildrenssocialmobilityopportunityforpublichealthapplicationsusingkmeansclustering