Student Achievement Trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal database

Introduction Longitudinal data that tracks student achievement over many years are crucial for understanding children's learning and for guiding effective policies and interventions. Despite being Canada's most populous province, Ontario lacks such large-scale and longitudinal data on stu...

Full description

Bibliographic Details
Main Authors: Jeanne Sinclair, Scott Davies, Magdalena Janus
Format: Article
Language:English
Published: Swansea University 2023-02-01
Series:International Journal of Population Data Science
Subjects:
Online Access:https://ijpds.org/article/view/1843
_version_ 1797430090760454144
author Jeanne Sinclair
Scott Davies
Magdalena Janus
author_facet Jeanne Sinclair
Scott Davies
Magdalena Janus
author_sort Jeanne Sinclair
collection DOAJ
description Introduction Longitudinal data that tracks student achievement over many years are crucial for understanding children's learning and for guiding effective policies and interventions. Despite being Canada's most populous province, Ontario lacks such large-scale and longitudinal data on student learning. Linking datasets across cohorts requires rigorous linkage protocols, flexible handling of complex cohort structures, methods to validate linked datasets, and viable organizational partnerships. We linked administrative data on early child development and educational achievement and merged two datasets on characteristics of students' neighborhoods and schools. We developed a linkage protocol and validated how the resulting database could be generalized to Ontario's student population. Methods and analysis Two main individual-level data sources were linked: 1) the Early Development Instrument (EDI), a school readiness assessment of all Ontario public school kindergartners that is administered in three-year cycles, and 2) Ontario's Educational Quality and Assessment Office's (EQAO) math and reading assessments in grades 3, 6, 9, and 10. To compensate for their lack of a common personal identification number, a deterministic linkage process was developed using several administrative variables. A school-level and a neighborhood-level dataset were also later linked. We examined differences between unlinked and linked cases across several variables. Results and implications We successfully linked 50% of the EDI's 374,239 cases, 86,778 of which contained all five datapoints, creating a database tracking achievement for multiple cohorts from kindergarten through grade 10, with covariates for their development, demographics, affect, neighborhoods, and schools. Analyses revealed only negligible differences between linked and unlinked cases across several demographic measures, while small differences were detected across a neighborhood socioeconomic index and some measures of child development. In conclusion, we recommend the filling of key voids in sustainable research capacity by creating representative data through linkage protocols and data verification.
first_indexed 2024-03-09T09:23:34Z
format Article
id doaj.art-e701d30ccf014cad8faaab4de6222cc3
institution Directory Open Access Journal
issn 2399-4908
language English
last_indexed 2024-03-09T09:23:34Z
publishDate 2023-02-01
publisher Swansea University
record_format Article
series International Journal of Population Data Science
spelling doaj.art-e701d30ccf014cad8faaab4de6222cc32023-12-02T06:57:22ZengSwansea UniversityInternational Journal of Population Data Science2399-49082023-02-018110.23889/ijpds.v8i1.1843Student Achievement Trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal databaseJeanne Sinclair0Scott Davies1Magdalena Janus2Memorial University Faculty of Education, 323 Prince Philip Drive, St. John's, NL A1B 3X8, CanadaDepartment of Leadership, Higher and Adult Education, Ontario Institute for Studies in Education, 252 Bloor Street West, Toronto, ON M5S 1V6, CanadaOfford Centre for Child Studies, BAHT 132, 1280 Main Street West, Hamilton, ON L8S 4K1, Canada Introduction Longitudinal data that tracks student achievement over many years are crucial for understanding children's learning and for guiding effective policies and interventions. Despite being Canada's most populous province, Ontario lacks such large-scale and longitudinal data on student learning. Linking datasets across cohorts requires rigorous linkage protocols, flexible handling of complex cohort structures, methods to validate linked datasets, and viable organizational partnerships. We linked administrative data on early child development and educational achievement and merged two datasets on characteristics of students' neighborhoods and schools. We developed a linkage protocol and validated how the resulting database could be generalized to Ontario's student population. Methods and analysis Two main individual-level data sources were linked: 1) the Early Development Instrument (EDI), a school readiness assessment of all Ontario public school kindergartners that is administered in three-year cycles, and 2) Ontario's Educational Quality and Assessment Office's (EQAO) math and reading assessments in grades 3, 6, 9, and 10. To compensate for their lack of a common personal identification number, a deterministic linkage process was developed using several administrative variables. A school-level and a neighborhood-level dataset were also later linked. We examined differences between unlinked and linked cases across several variables. Results and implications We successfully linked 50% of the EDI's 374,239 cases, 86,778 of which contained all five datapoints, creating a database tracking achievement for multiple cohorts from kindergarten through grade 10, with covariates for their development, demographics, affect, neighborhoods, and schools. Analyses revealed only negligible differences between linked and unlinked cases across several demographic measures, while small differences were detected across a neighborhood socioeconomic index and some measures of child development. In conclusion, we recommend the filling of key voids in sustainable research capacity by creating representative data through linkage protocols and data verification. https://ijpds.org/article/view/1843longitudinal datadata linkagestudent achievement data
spellingShingle Jeanne Sinclair
Scott Davies
Magdalena Janus
Student Achievement Trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal database
International Journal of Population Data Science
longitudinal data
data linkage
student achievement data
title Student Achievement Trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal database
title_full Student Achievement Trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal database
title_fullStr Student Achievement Trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal database
title_full_unstemmed Student Achievement Trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal database
title_short Student Achievement Trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal database
title_sort student achievement trajectories in ontario creating and validating a province wide multi cohort and longitudinal database
topic longitudinal data
data linkage
student achievement data
url https://ijpds.org/article/view/1843
work_keys_str_mv AT jeannesinclair studentachievementtrajectoriesinontariocreatingandvalidatingaprovincewidemulticohortandlongitudinaldatabase
AT scottdavies studentachievementtrajectoriesinontariocreatingandvalidatingaprovincewidemulticohortandlongitudinaldatabase
AT magdalenajanus studentachievementtrajectoriesinontariocreatingandvalidatingaprovincewidemulticohortandlongitudinaldatabase