Identifying groups of children's social mobility opportunity for public health applications using k-means clustering
Background: The Opportunity Atlas project is a pioneering effort to trace social mobility and adulthood socioeconomic outcomes back to childhood residence. Half of the variation in adulthood socioeconomic outcomes was explainable by neighborhood-level socioeconomic characteristics during childhood....
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Elsevier
2023-09-01
|
| Series: | Heliyon |
| Subjects: | |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S2405844023074583 |
| _version_ | 1827803034775715840 |
|---|---|
| author | Sarah Zelasky Chantel L. Martin Christopher Weaver Lisa K. Baxter Kristen M. Rappazzo |
| author_facet | Sarah Zelasky Chantel L. Martin Christopher Weaver Lisa K. Baxter Kristen M. Rappazzo |
| author_sort | Sarah Zelasky |
| collection | DOAJ |
| description | Background: The Opportunity Atlas project is a pioneering effort to trace social mobility and adulthood socioeconomic outcomes back to childhood residence. Half of the variation in adulthood socioeconomic outcomes was explainable by neighborhood-level socioeconomic characteristics during childhood. Clustering census tracts by Opportunity Atlas characteristics would allow for further exploration of variance in social mobility. Our objectives here are to identify and describe spatial clustering trends within Opportunity Atlas outcomes. Methods: We utilized a k-means clustering machine learning approach with four outcome variables (individual income, incarceration rate, employment, and percent of residents living in a neighborhood with low levels of poverty) each given at five parental income levels (1st, 25th, 50th, 75th, and 100th percentiles of the national distribution) to create clusters of census tracts across the contiguous United States (US) and within each Environmental Protection Agency region. Results: At the national level, the algorithm identified seven distinct clusters; the highest opportunity clusters occurred in the Northern Midwest and Northeast, and the lowest opportunity clusters occurred in rural areas of the Southwest and Southeast. For regional analyses, we identified between five to nine clusters within each region. PCA loadings fluctuate across parental income levels; income and low poverty neighborhood residence explain a substantial amount of variance across all variables, but there are differences in contributions across parental income levels for many components. Conclusions: Using data from the Opportunity Atlas, we have taken four social mobility opportunity outcome variables each stratified at five parental income levels and created nationwide and EPA region-specific clusters that group together census tracts with similar opportunity profiles. The development of clusters that can serve as a combined index of social mobility opportunity is an important contribution of this work, and this in turn can be employed in future investigations of factors associated with children's social mobility. |
| first_indexed | 2024-03-11T20:48:37Z |
| format | Article |
| id | doaj.art-9ef21c90bbea488786652970158e3592 |
| institution | Directory Open Access Journal |
| issn | 2405-8440 |
| language | English |
| last_indexed | 2024-03-11T20:48:37Z |
| publishDate | 2023-09-01 |
| publisher | Elsevier |
| record_format | Article |
| series | Heliyon |
| spelling | doaj.art-9ef21c90bbea488786652970158e35922023-10-01T06:02:58ZengElsevierHeliyon2405-84402023-09-0199e20250Identifying groups of children's social mobility opportunity for public health applications using k-means clusteringSarah Zelasky0Chantel L. Martin1Christopher Weaver2Lisa K. Baxter3Kristen M. Rappazzo4Oak Ridge Associated Universities at the U.S. Environmental Protection Agency, Chapel Hill, NC, USADepartment of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC 27599, USAU.S. Environmental Protection Agency, Office of Research and Development, Center for Public Health and Environmental Assessment, Research Triangle Park, Durham, NC, USAU.S. Environmental Protection Agency, Office of Research and Development, Center for Public Health and Environmental Assessment, Research Triangle Park, Durham, NC, USAU.S. Environmental Protection Agency, Office of Research and Development, Center for Public Health and Environmental Assessment, Research Triangle Park, Durham, NC, USA; Corresponding author.Background: The Opportunity Atlas project is a pioneering effort to trace social mobility and adulthood socioeconomic outcomes back to childhood residence. Half of the variation in adulthood socioeconomic outcomes was explainable by neighborhood-level socioeconomic characteristics during childhood. Clustering census tracts by Opportunity Atlas characteristics would allow for further exploration of variance in social mobility. Our objectives here are to identify and describe spatial clustering trends within Opportunity Atlas outcomes. Methods: We utilized a k-means clustering machine learning approach with four outcome variables (individual income, incarceration rate, employment, and percent of residents living in a neighborhood with low levels of poverty) each given at five parental income levels (1st, 25th, 50th, 75th, and 100th percentiles of the national distribution) to create clusters of census tracts across the contiguous United States (US) and within each Environmental Protection Agency region. Results: At the national level, the algorithm identified seven distinct clusters; the highest opportunity clusters occurred in the Northern Midwest and Northeast, and the lowest opportunity clusters occurred in rural areas of the Southwest and Southeast. For regional analyses, we identified between five to nine clusters within each region. PCA loadings fluctuate across parental income levels; income and low poverty neighborhood residence explain a substantial amount of variance across all variables, but there are differences in contributions across parental income levels for many components. Conclusions: Using data from the Opportunity Atlas, we have taken four social mobility opportunity outcome variables each stratified at five parental income levels and created nationwide and EPA region-specific clusters that group together census tracts with similar opportunity profiles. The development of clusters that can serve as a combined index of social mobility opportunity is an important contribution of this work, and this in turn can be employed in future investigations of factors associated with children's social mobility.http://www.sciencedirect.com/science/article/pii/S2405844023074583ClusteringK-meansSocioeconomicsUpward social mobilityOpportunity |
| spellingShingle | Sarah Zelasky Chantel L. Martin Christopher Weaver Lisa K. Baxter Kristen M. Rappazzo Identifying groups of children's social mobility opportunity for public health applications using k-means clustering Heliyon Clustering K-means Socioeconomics Upward social mobility Opportunity |
| title | Identifying groups of children's social mobility opportunity for public health applications using k-means clustering |
| title_full | Identifying groups of children's social mobility opportunity for public health applications using k-means clustering |
| title_fullStr | Identifying groups of children's social mobility opportunity for public health applications using k-means clustering |
| title_full_unstemmed | Identifying groups of children's social mobility opportunity for public health applications using k-means clustering |
| title_short | Identifying groups of children's social mobility opportunity for public health applications using k-means clustering |
| title_sort | identifying groups of children s social mobility opportunity for public health applications using k means clustering |
| topic | Clustering K-means Socioeconomics Upward social mobility Opportunity |
| url | http://www.sciencedirect.com/science/article/pii/S2405844023074583 |
| work_keys_str_mv | AT sarahzelasky identifyinggroupsofchildrenssocialmobilityopportunityforpublichealthapplicationsusingkmeansclustering AT chantellmartin identifyinggroupsofchildrenssocialmobilityopportunityforpublichealthapplicationsusingkmeansclustering AT christopherweaver identifyinggroupsofchildrenssocialmobilityopportunityforpublichealthapplicationsusingkmeansclustering AT lisakbaxter identifyinggroupsofchildrenssocialmobilityopportunityforpublichealthapplicationsusingkmeansclustering AT kristenmrappazzo identifyinggroupsofchildrenssocialmobilityopportunityforpublichealthapplicationsusingkmeansclustering |