Design and Creation of Multi-Source Enduring Linked Assets

Introduction The Australian Institute of Health and Welfare is collaborating with a range of government and other institutions to build enduring data assets for improving analysis and informed policy outcomes. There were lessons learnt that can be shared, in addition to the architecture and linkage...

Full description

Bibliographic Details
Main Author: Elena Ougrinovsk
Format: Article
Language:English
Published: Swansea University 2020-12-01
Series:International Journal of Population Data Science
Online Access:https://ijpds.org/article/view/1583
_version_ 1797428759754702848
author Elena Ougrinovsk
author_facet Elena Ougrinovsk
author_sort Elena Ougrinovsk
collection DOAJ
description Introduction The Australian Institute of Health and Welfare is collaborating with a range of government and other institutions to build enduring data assets for improving analysis and informed policy outcomes. There were lessons learnt that can be shared, in addition to the architecture and linkage techniques. The enduring assets were created by linking States’ and Territories’ health or welfare data to Commonwealth datasets such as Medicare Consumer Directory (MCD), Residential Aged Care (RAC) and National Death Index (NDI) data. The linkage was carried out by the Australian Institute of Health and Welfare (AIHW) Data Integration Services Centre (DISC). Objectives and Approach To create the integrated asset, the linkage spine was assembled by de-duplicating and linking MCD and NDI data. The states’ datasets and other commonwealth datasets involved in the project were linked to this spine. Each unique individual in the spine was assigned Personal Project Number (PPN) which was added to each record linked to the individual. The unlinked individuals from these datasets were de-duplicated and assigned different PPNs. Names, dates of birth and addresses were used in probabilistic linkage process. To enable investigators to interrogate the sequences of the events without releasing the exact dates, the central events file was created. It contains date differences for every event in the asset, calculated as the difference (in days) between event and not released “date zero”, different for each individual. Results Between 96% and 99% of records in the supplied datasets were linked to the spine with linkage accuracy at least 98.5%. The linkage rates depends on the data completeness and the nature of the datasets as not all individuals accessing states’ servicers are eligible for Medicare. Conclusion / Implications The person-focused de-identified analytical assets allow to study journeys of the individual through Australian health and welfare systems which transcends jurisdictional boundaries.
first_indexed 2024-03-09T09:03:31Z
format Article
id doaj.art-ceeded76af8f433cb6fdf2c60f659a30
institution Directory Open Access Journal
issn 2399-4908
language English
last_indexed 2024-03-09T09:03:31Z
publishDate 2020-12-01
publisher Swansea University
record_format Article
series International Journal of Population Data Science
spelling doaj.art-ceeded76af8f433cb6fdf2c60f659a302023-12-02T11:00:36ZengSwansea UniversityInternational Journal of Population Data Science2399-49082020-12-015510.23889/ijpds.v5i5.1583Design and Creation of Multi-Source Enduring Linked AssetsElena Ougrinovsk0University Of MelbourneIntroduction The Australian Institute of Health and Welfare is collaborating with a range of government and other institutions to build enduring data assets for improving analysis and informed policy outcomes. There were lessons learnt that can be shared, in addition to the architecture and linkage techniques. The enduring assets were created by linking States’ and Territories’ health or welfare data to Commonwealth datasets such as Medicare Consumer Directory (MCD), Residential Aged Care (RAC) and National Death Index (NDI) data. The linkage was carried out by the Australian Institute of Health and Welfare (AIHW) Data Integration Services Centre (DISC). Objectives and Approach To create the integrated asset, the linkage spine was assembled by de-duplicating and linking MCD and NDI data. The states’ datasets and other commonwealth datasets involved in the project were linked to this spine. Each unique individual in the spine was assigned Personal Project Number (PPN) which was added to each record linked to the individual. The unlinked individuals from these datasets were de-duplicated and assigned different PPNs. Names, dates of birth and addresses were used in probabilistic linkage process. To enable investigators to interrogate the sequences of the events without releasing the exact dates, the central events file was created. It contains date differences for every event in the asset, calculated as the difference (in days) between event and not released “date zero”, different for each individual. Results Between 96% and 99% of records in the supplied datasets were linked to the spine with linkage accuracy at least 98.5%. The linkage rates depends on the data completeness and the nature of the datasets as not all individuals accessing states’ servicers are eligible for Medicare. Conclusion / Implications The person-focused de-identified analytical assets allow to study journeys of the individual through Australian health and welfare systems which transcends jurisdictional boundaries.https://ijpds.org/article/view/1583
spellingShingle Elena Ougrinovsk
Design and Creation of Multi-Source Enduring Linked Assets
International Journal of Population Data Science
title Design and Creation of Multi-Source Enduring Linked Assets
title_full Design and Creation of Multi-Source Enduring Linked Assets
title_fullStr Design and Creation of Multi-Source Enduring Linked Assets
title_full_unstemmed Design and Creation of Multi-Source Enduring Linked Assets
title_short Design and Creation of Multi-Source Enduring Linked Assets
title_sort design and creation of multi source enduring linked assets
url https://ijpds.org/article/view/1583
work_keys_str_mv AT elenaougrinovsk designandcreationofmultisourceenduringlinkedassets