Algorithm-based mapping of products in a branded Canadian food and beverage database to their equivalents in Health Canada’s Canadian Nutrient File

IntroductionThere is increasing recognition of the value of linking food sales databases to national food composition tables for population nutrition research.ObjectivesExpanding upon automated and manual database mapping approaches in the literature, our aim was to match 1,179 food products in the...

Full description

Bibliographic Details
Main Authors: Sappho Z. Gilbert, Conor L. Morrison, Qiuyu J. Chen, Jesman Punian, Jodi T. Bernstein, Mahsa Jessri
Format: Article
Language:English
Published: Frontiers Media S.A. 2023-02-01
Series:Frontiers in Nutrition
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fnut.2022.1013516/full
_version_ 1797905165168148480
author Sappho Z. Gilbert
Sappho Z. Gilbert
Conor L. Morrison
Qiuyu J. Chen
Jesman Punian
Jodi T. Bernstein
Mahsa Jessri
Mahsa Jessri
author_facet Sappho Z. Gilbert
Sappho Z. Gilbert
Conor L. Morrison
Qiuyu J. Chen
Jesman Punian
Jodi T. Bernstein
Mahsa Jessri
Mahsa Jessri
author_sort Sappho Z. Gilbert
collection DOAJ
description IntroductionThere is increasing recognition of the value of linking food sales databases to national food composition tables for population nutrition research.ObjectivesExpanding upon automated and manual database mapping approaches in the literature, our aim was to match 1,179 food products in the Canadian data subset of Euromonitor International’s Passport Nutrition to their closest respective equivalents in Health Canada’s Canadian Nutrient File (CNF).MethodsMatching took place in two major steps. First, an algorithm based on thresholds of maximal nutrient difference (between Euromonitor and CNF foods) and fuzzy matching was executed to offer match options. If a nutritionally appropriate match was available among the algorithm suggestions, it was selected. When the suggested set contained no nutritionally sound matches, the Euromonitor product was instead manually matched to a CNF food or deemed unmatchable, with the unique addition of expert validation to maximize meticulousness in matching. Both steps were independently performed by at least two team members with dietetics expertise.ResultsOf 1,111 Euromonitor products run through the algorithm, an accurate CNF match was offered for 65% of them; missing or zero-calorie data precluded 68 products from being run in the algorithm. Products with 2 or more algorithm-suggested CNF matches had higher match accuracy than those with one (71 vs. 50%, respectively). Overall, inter-rater agreement (reliability) rates were robust for matches chosen among algorithm options (51%) and even higher regarding whether manual selection would be required (71%); among manually selected CNF matches, reliability was 33%. Ultimately, 1,152 (98%) Euromonitor products were matched to a CNF equivalent.ConclusionOur reported matching process successfully bridged a food sales database’s products to their respective CNF matches for use in future nutritional epidemiological studies of branded foods sold in Canada. Our team’s novel utilization of dietetics expertise aided in match validation at both steps, ensuring rigor and quality of resulting match selections.
first_indexed 2024-04-10T10:01:53Z
format Article
id doaj.art-08eb6ffe7ee64e908f744bcdb3e066c8
institution Directory Open Access Journal
issn 2296-861X
language English
last_indexed 2024-04-10T10:01:53Z
publishDate 2023-02-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Nutrition
spelling doaj.art-08eb6ffe7ee64e908f744bcdb3e066c82023-02-16T07:05:02ZengFrontiers Media S.A.Frontiers in Nutrition2296-861X2023-02-01910.3389/fnut.2022.10135161013516Algorithm-based mapping of products in a branded Canadian food and beverage database to their equivalents in Health Canada’s Canadian Nutrient FileSappho Z. Gilbert0Sappho Z. Gilbert1Conor L. Morrison2Qiuyu J. Chen3Jesman Punian4Jodi T. Bernstein5Mahsa Jessri6Mahsa Jessri7Department of Chronic Disease Epidemiology, Yale School of Public Health, New Haven, CT, United StatesFood, Nutrition, and Health Program, Faculty of Land and Food Systems, The University of British Columbia, Vancouver, BC, CanadaDepartment of Statistics, Faculty of Science, The University of British Columbia, Vancouver, BC, CanadaFood, Nutrition, and Health Program, Faculty of Land and Food Systems, The University of British Columbia, Vancouver, BC, CanadaFood, Nutrition, and Health Program, Faculty of Land and Food Systems, The University of British Columbia, Vancouver, BC, CanadaFood, Nutrition, and Health Program, Faculty of Land and Food Systems, The University of British Columbia, Vancouver, BC, CanadaFood, Nutrition, and Health Program, Faculty of Land and Food Systems, The University of British Columbia, Vancouver, BC, CanadaCentre for Health Services and Policy Research (CHSPR) and Health Services and Policy (HSP), Faculty of Medicine, The University of British Columbia, Vancouver, BC, CanadaIntroductionThere is increasing recognition of the value of linking food sales databases to national food composition tables for population nutrition research.ObjectivesExpanding upon automated and manual database mapping approaches in the literature, our aim was to match 1,179 food products in the Canadian data subset of Euromonitor International’s Passport Nutrition to their closest respective equivalents in Health Canada’s Canadian Nutrient File (CNF).MethodsMatching took place in two major steps. First, an algorithm based on thresholds of maximal nutrient difference (between Euromonitor and CNF foods) and fuzzy matching was executed to offer match options. If a nutritionally appropriate match was available among the algorithm suggestions, it was selected. When the suggested set contained no nutritionally sound matches, the Euromonitor product was instead manually matched to a CNF food or deemed unmatchable, with the unique addition of expert validation to maximize meticulousness in matching. Both steps were independently performed by at least two team members with dietetics expertise.ResultsOf 1,111 Euromonitor products run through the algorithm, an accurate CNF match was offered for 65% of them; missing or zero-calorie data precluded 68 products from being run in the algorithm. Products with 2 or more algorithm-suggested CNF matches had higher match accuracy than those with one (71 vs. 50%, respectively). Overall, inter-rater agreement (reliability) rates were robust for matches chosen among algorithm options (51%) and even higher regarding whether manual selection would be required (71%); among manually selected CNF matches, reliability was 33%. Ultimately, 1,152 (98%) Euromonitor products were matched to a CNF equivalent.ConclusionOur reported matching process successfully bridged a food sales database’s products to their respective CNF matches for use in future nutritional epidemiological studies of branded foods sold in Canada. Our team’s novel utilization of dietetics expertise aided in match validation at both steps, ensuring rigor and quality of resulting match selections.https://www.frontiersin.org/articles/10.3389/fnut.2022.1013516/fulldatabase mappingnutritional surveillance and monitoringfood composition tables (FCTs)food supplypublic health nutritionfuzzy matching
spellingShingle Sappho Z. Gilbert
Sappho Z. Gilbert
Conor L. Morrison
Qiuyu J. Chen
Jesman Punian
Jodi T. Bernstein
Mahsa Jessri
Mahsa Jessri
Algorithm-based mapping of products in a branded Canadian food and beverage database to their equivalents in Health Canada’s Canadian Nutrient File
Frontiers in Nutrition
database mapping
nutritional surveillance and monitoring
food composition tables (FCTs)
food supply
public health nutrition
fuzzy matching
title Algorithm-based mapping of products in a branded Canadian food and beverage database to their equivalents in Health Canada’s Canadian Nutrient File
title_full Algorithm-based mapping of products in a branded Canadian food and beverage database to their equivalents in Health Canada’s Canadian Nutrient File
title_fullStr Algorithm-based mapping of products in a branded Canadian food and beverage database to their equivalents in Health Canada’s Canadian Nutrient File
title_full_unstemmed Algorithm-based mapping of products in a branded Canadian food and beverage database to their equivalents in Health Canada’s Canadian Nutrient File
title_short Algorithm-based mapping of products in a branded Canadian food and beverage database to their equivalents in Health Canada’s Canadian Nutrient File
title_sort algorithm based mapping of products in a branded canadian food and beverage database to their equivalents in health canada s canadian nutrient file
topic database mapping
nutritional surveillance and monitoring
food composition tables (FCTs)
food supply
public health nutrition
fuzzy matching
url https://www.frontiersin.org/articles/10.3389/fnut.2022.1013516/full
work_keys_str_mv AT sapphozgilbert algorithmbasedmappingofproductsinabrandedcanadianfoodandbeveragedatabasetotheirequivalentsinhealthcanadascanadiannutrientfile
AT sapphozgilbert algorithmbasedmappingofproductsinabrandedcanadianfoodandbeveragedatabasetotheirequivalentsinhealthcanadascanadiannutrientfile
AT conorlmorrison algorithmbasedmappingofproductsinabrandedcanadianfoodandbeveragedatabasetotheirequivalentsinhealthcanadascanadiannutrientfile
AT qiuyujchen algorithmbasedmappingofproductsinabrandedcanadianfoodandbeveragedatabasetotheirequivalentsinhealthcanadascanadiannutrientfile
AT jesmanpunian algorithmbasedmappingofproductsinabrandedcanadianfoodandbeveragedatabasetotheirequivalentsinhealthcanadascanadiannutrientfile
AT joditbernstein algorithmbasedmappingofproductsinabrandedcanadianfoodandbeveragedatabasetotheirequivalentsinhealthcanadascanadiannutrientfile
AT mahsajessri algorithmbasedmappingofproductsinabrandedcanadianfoodandbeveragedatabasetotheirequivalentsinhealthcanadascanadiannutrientfile
AT mahsajessri algorithmbasedmappingofproductsinabrandedcanadianfoodandbeveragedatabasetotheirequivalentsinhealthcanadascanadiannutrientfile