Student Achievement Trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal database
Introduction Longitudinal data that tracks student achievement over many years are crucial for understanding children's learning and for guiding effective policies and interventions. Despite being Canada's most populous province, Ontario lacks such large-scale and longitudinal data on stu...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Swansea University
2023-02-01
|
Series: | International Journal of Population Data Science |
Subjects: | |
Online Access: | https://ijpds.org/article/view/1843 |
_version_ | 1797430090760454144 |
---|---|
author | Jeanne Sinclair Scott Davies Magdalena Janus |
author_facet | Jeanne Sinclair Scott Davies Magdalena Janus |
author_sort | Jeanne Sinclair |
collection | DOAJ |
description |
Introduction
Longitudinal data that tracks student achievement over many years are crucial for understanding children's learning and for guiding effective policies and interventions. Despite being Canada's most populous province, Ontario lacks such large-scale and longitudinal data on student learning. Linking datasets across cohorts requires rigorous linkage protocols, flexible handling of complex cohort structures, methods to validate linked datasets, and viable organizational partnerships. We linked administrative data on early child development and educational achievement and merged two datasets on characteristics of students' neighborhoods and schools. We developed a linkage protocol and validated how the resulting database could be generalized to Ontario's student population.
Methods and analysis
Two main individual-level data sources were linked: 1) the Early Development Instrument (EDI), a school readiness assessment of all Ontario public school kindergartners that is administered in three-year cycles, and 2) Ontario's Educational Quality and Assessment Office's (EQAO) math and reading assessments in grades 3, 6, 9, and 10. To compensate for their lack of a common personal identification number, a deterministic linkage process was developed using several administrative variables. A school-level and a neighborhood-level dataset were also later linked. We examined differences between unlinked and linked cases across several variables.
Results and implications
We successfully linked 50% of the EDI's 374,239 cases, 86,778 of which contained all five datapoints, creating a database tracking achievement for multiple cohorts from kindergarten through grade 10, with covariates for their development, demographics, affect, neighborhoods, and schools. Analyses revealed only negligible differences between linked and unlinked cases across several demographic measures, while small differences were detected across a neighborhood socioeconomic index and some measures of child development. In conclusion, we recommend the filling of key voids in sustainable research capacity by creating representative data through linkage protocols and data verification.
|
first_indexed | 2024-03-09T09:23:34Z |
format | Article |
id | doaj.art-e701d30ccf014cad8faaab4de6222cc3 |
institution | Directory Open Access Journal |
issn | 2399-4908 |
language | English |
last_indexed | 2024-03-09T09:23:34Z |
publishDate | 2023-02-01 |
publisher | Swansea University |
record_format | Article |
series | International Journal of Population Data Science |
spelling | doaj.art-e701d30ccf014cad8faaab4de6222cc32023-12-02T06:57:22ZengSwansea UniversityInternational Journal of Population Data Science2399-49082023-02-018110.23889/ijpds.v8i1.1843Student Achievement Trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal databaseJeanne Sinclair0Scott Davies1Magdalena Janus2Memorial University Faculty of Education, 323 Prince Philip Drive, St. John's, NL A1B 3X8, CanadaDepartment of Leadership, Higher and Adult Education, Ontario Institute for Studies in Education, 252 Bloor Street West, Toronto, ON M5S 1V6, CanadaOfford Centre for Child Studies, BAHT 132, 1280 Main Street West, Hamilton, ON L8S 4K1, Canada Introduction Longitudinal data that tracks student achievement over many years are crucial for understanding children's learning and for guiding effective policies and interventions. Despite being Canada's most populous province, Ontario lacks such large-scale and longitudinal data on student learning. Linking datasets across cohorts requires rigorous linkage protocols, flexible handling of complex cohort structures, methods to validate linked datasets, and viable organizational partnerships. We linked administrative data on early child development and educational achievement and merged two datasets on characteristics of students' neighborhoods and schools. We developed a linkage protocol and validated how the resulting database could be generalized to Ontario's student population. Methods and analysis Two main individual-level data sources were linked: 1) the Early Development Instrument (EDI), a school readiness assessment of all Ontario public school kindergartners that is administered in three-year cycles, and 2) Ontario's Educational Quality and Assessment Office's (EQAO) math and reading assessments in grades 3, 6, 9, and 10. To compensate for their lack of a common personal identification number, a deterministic linkage process was developed using several administrative variables. A school-level and a neighborhood-level dataset were also later linked. We examined differences between unlinked and linked cases across several variables. Results and implications We successfully linked 50% of the EDI's 374,239 cases, 86,778 of which contained all five datapoints, creating a database tracking achievement for multiple cohorts from kindergarten through grade 10, with covariates for their development, demographics, affect, neighborhoods, and schools. Analyses revealed only negligible differences between linked and unlinked cases across several demographic measures, while small differences were detected across a neighborhood socioeconomic index and some measures of child development. In conclusion, we recommend the filling of key voids in sustainable research capacity by creating representative data through linkage protocols and data verification. https://ijpds.org/article/view/1843longitudinal datadata linkagestudent achievement data |
spellingShingle | Jeanne Sinclair Scott Davies Magdalena Janus Student Achievement Trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal database International Journal of Population Data Science longitudinal data data linkage student achievement data |
title | Student Achievement Trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal database |
title_full | Student Achievement Trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal database |
title_fullStr | Student Achievement Trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal database |
title_full_unstemmed | Student Achievement Trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal database |
title_short | Student Achievement Trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal database |
title_sort | student achievement trajectories in ontario creating and validating a province wide multi cohort and longitudinal database |
topic | longitudinal data data linkage student achievement data |
url | https://ijpds.org/article/view/1843 |
work_keys_str_mv | AT jeannesinclair studentachievementtrajectoriesinontariocreatingandvalidatingaprovincewidemulticohortandlongitudinaldatabase AT scottdavies studentachievementtrajectoriesinontariocreatingandvalidatingaprovincewidemulticohortandlongitudinaldatabase AT magdalenajanus studentachievementtrajectoriesinontariocreatingandvalidatingaprovincewidemulticohortandlongitudinaldatabase |