On the Sampling Size for Inverse Sampling

In the Big Data era, sampling remains a central theme. This paper investigates the characteristics of inverse sampling on two different datasets (real and simulated) to determine when big data become too small for inverse sampling to be used and to examine the impact of the sampling rate of the subs...

Full description

Bibliographic Details
Main Authors: Daniele Cuntrera, Vincenzo Falco, Ornella Giambalvo
Format: Article
Language:English
Published: MDPI AG 2022-11-01
Series:Stats
Subjects:
Online Access:https://www.mdpi.com/2571-905X/5/4/67
Description
Summary:In the Big Data era, sampling remains a central theme. This paper investigates the characteristics of inverse sampling on two different datasets (real and simulated) to determine when big data become too small for inverse sampling to be used and to examine the impact of the sampling rate of the subsamples. We find that the method, using the appropriate subsample size for both the mean and proportion parameters, performs well with a smaller dataset than big data through the simulation study and real-data application. Different settings related to the selection bias severity are considered during the simulation study and real application.
ISSN:2571-905X