Optimized Weighted Nearest Neighbours Matching Algorithm for Control Group Selection
An essential criterion for the proper implementation of case-control studies is selecting appropriate case and control groups. In this article, a new simulated annealing-based control group selection method is proposed, which solves the problem of selecting individuals in the control group as a dist...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-12-01
|
Series: | Algorithms |
Subjects: | |
Online Access: | https://www.mdpi.com/1999-4893/14/12/356 |
_version_ | 1827674463345311744 |
---|---|
author | Szabolcs Szekér Ágnes Vathy-Fogarassy |
author_facet | Szabolcs Szekér Ágnes Vathy-Fogarassy |
author_sort | Szabolcs Szekér |
collection | DOAJ |
description | An essential criterion for the proper implementation of case-control studies is selecting appropriate case and control groups. In this article, a new simulated annealing-based control group selection method is proposed, which solves the problem of selecting individuals in the control group as a distance optimization task. The proposed algorithm pairs the individuals in the <i>n</i>-dimensional feature space by minimizing the weighted distances between them. The weights of the dimensions are based on the odds ratios calculated from the logistic regression model fitted on the variables describing the probability of membership of the treated group. For finding the optimal pairing of the individuals, simulated annealing is utilized. The effectiveness of the newly proposed Weighted Nearest Neighbours Control Group Selection with Simulated Annealing (WNNSA) algorithm is presented by two Monte Carlo studies. Results show that the WNNSA method can outperform the widely applied greedy propensity score matching method in feature spaces where only a few covariates characterize individuals and the covariates can only take a few values. |
first_indexed | 2024-03-10T04:40:42Z |
format | Article |
id | doaj.art-98be8316a4354d6abf6dfeabb7217385 |
institution | Directory Open Access Journal |
issn | 1999-4893 |
language | English |
last_indexed | 2024-03-10T04:40:42Z |
publishDate | 2021-12-01 |
publisher | MDPI AG |
record_format | Article |
series | Algorithms |
spelling | doaj.art-98be8316a4354d6abf6dfeabb72173852023-11-23T03:24:54ZengMDPI AGAlgorithms1999-48932021-12-01141235610.3390/a14120356Optimized Weighted Nearest Neighbours Matching Algorithm for Control Group SelectionSzabolcs Szekér0Ágnes Vathy-Fogarassy1Department of Computer Science and Systems Technology, Faculty of Information Technology, University of Pannonia, 8200 Veszprém, HungaryDepartment of Computer Science and Systems Technology, Faculty of Information Technology, University of Pannonia, 8200 Veszprém, HungaryAn essential criterion for the proper implementation of case-control studies is selecting appropriate case and control groups. In this article, a new simulated annealing-based control group selection method is proposed, which solves the problem of selecting individuals in the control group as a distance optimization task. The proposed algorithm pairs the individuals in the <i>n</i>-dimensional feature space by minimizing the weighted distances between them. The weights of the dimensions are based on the odds ratios calculated from the logistic regression model fitted on the variables describing the probability of membership of the treated group. For finding the optimal pairing of the individuals, simulated annealing is utilized. The effectiveness of the newly proposed Weighted Nearest Neighbours Control Group Selection with Simulated Annealing (WNNSA) algorithm is presented by two Monte Carlo studies. Results show that the WNNSA method can outperform the widely applied greedy propensity score matching method in feature spaces where only a few covariates characterize individuals and the covariates can only take a few values.https://www.mdpi.com/1999-4893/14/12/356control group selectionweighted k-nearest neighboursimulated annealinglogistic regressionnegative covariates |
spellingShingle | Szabolcs Szekér Ágnes Vathy-Fogarassy Optimized Weighted Nearest Neighbours Matching Algorithm for Control Group Selection Algorithms control group selection weighted k-nearest neighbour simulated annealing logistic regression negative covariates |
title | Optimized Weighted Nearest Neighbours Matching Algorithm for Control Group Selection |
title_full | Optimized Weighted Nearest Neighbours Matching Algorithm for Control Group Selection |
title_fullStr | Optimized Weighted Nearest Neighbours Matching Algorithm for Control Group Selection |
title_full_unstemmed | Optimized Weighted Nearest Neighbours Matching Algorithm for Control Group Selection |
title_short | Optimized Weighted Nearest Neighbours Matching Algorithm for Control Group Selection |
title_sort | optimized weighted nearest neighbours matching algorithm for control group selection |
topic | control group selection weighted k-nearest neighbour simulated annealing logistic regression negative covariates |
url | https://www.mdpi.com/1999-4893/14/12/356 |
work_keys_str_mv | AT szabolcsszeker optimizedweightednearestneighboursmatchingalgorithmforcontrolgroupselection AT agnesvathyfogarassy optimizedweightednearestneighboursmatchingalgorithmforcontrolgroupselection |