Discriminative machine learning for maximal representative subsampling

Abstract Biased population samples pose a prevalent problem in the social sciences. Therefore, we present two novel methods that are based on positive-unlabeled learning to mitigate bias. Both methods leverage auxiliary information from a representative data set and train machine learning classifier...

Full description

Bibliographic Details
Main Authors: Tony Hauptmann, Sophie Fellenz, Laksan Nathan, Oliver Tüscher, Stefan Kramer
Format: Article
Language:English
Published: Nature Portfolio 2023-11-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-023-48177-3