Prediction of a Sensitive Feature under Indirect Questioning via Warner’s Randomized Response Technique and Latent Class Model
We investigate the association of a sensitive characteristic or latent variable with observed binary random variables by the randomized response (RR) technique of Warner in his publication (Warner, S.L. <i>J. Am. Stat. Assoc.</i><b>1965</b>, <i>60</i>, 63–69) and...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-01-01
|
Series: | Mathematics |
Subjects: | |
Online Access: | https://www.mdpi.com/2227-7390/11/2/345 |
_version_ | 1797438998452371456 |
---|---|
author | Shen-Ming Lee Phuoc-Loc Tran Truong-Nhat Le Chin-Shang Li |
author_facet | Shen-Ming Lee Phuoc-Loc Tran Truong-Nhat Le Chin-Shang Li |
author_sort | Shen-Ming Lee |
collection | DOAJ |
description | We investigate the association of a sensitive characteristic or latent variable with observed binary random variables by the randomized response (RR) technique of Warner in his publication (Warner, S.L. <i>J. Am. Stat. Assoc.</i><b>1965</b>, <i>60</i>, 63–69) and a latent class model. First, an expectation-maximization (EM) algorithm is provided to easily estimate the parameters of the null and alternative/full models for the association between a sensitive characteristic and an observed categorical random variable under the RR design of Warner’s paper above. The likelihood ratio test (LRT) is utilized to identify observed categorical random variables that are significantly related to the sensitive trait. Another EM algorithm is then presented to estimate the parameters of a latent class model constructed through the sensitive attribute and the observed binary random variables that are obtained from dichotomizing observed categorical random variables selected from the above LRT. Finally, two classification criteria are conducted to predict an individual in the sensitive or non-sensitive group. The practicality of the proposed methodology is illustrated with an actual data set from a survey study of the sexuality of first-year students, except international students, at Feng Chia University in Taiwan in 2016. |
first_indexed | 2024-03-09T11:46:28Z |
format | Article |
id | doaj.art-62fcd90b26684919a0565486910c609e |
institution | Directory Open Access Journal |
issn | 2227-7390 |
language | English |
last_indexed | 2024-03-09T11:46:28Z |
publishDate | 2023-01-01 |
publisher | MDPI AG |
record_format | Article |
series | Mathematics |
spelling | doaj.art-62fcd90b26684919a0565486910c609e2023-11-30T23:20:54ZengMDPI AGMathematics2227-73902023-01-0111234510.3390/math11020345Prediction of a Sensitive Feature under Indirect Questioning via Warner’s Randomized Response Technique and Latent Class ModelShen-Ming Lee0Phuoc-Loc Tran1Truong-Nhat Le2Chin-Shang Li3Department of Statistics, Feng Chia University, Taichung 40724, TaiwanDepartment of Mathematics, College of Natural Science, Can Tho University, Can Tho, VietnamFaculty of Mathematics and Statistics, Ton Duc Thang University, Ho Chi Minh City, VietnamSchool of Nursing, The State University of New York, University at Buffalo, Buffalo, NY 14214, USAWe investigate the association of a sensitive characteristic or latent variable with observed binary random variables by the randomized response (RR) technique of Warner in his publication (Warner, S.L. <i>J. Am. Stat. Assoc.</i><b>1965</b>, <i>60</i>, 63–69) and a latent class model. First, an expectation-maximization (EM) algorithm is provided to easily estimate the parameters of the null and alternative/full models for the association between a sensitive characteristic and an observed categorical random variable under the RR design of Warner’s paper above. The likelihood ratio test (LRT) is utilized to identify observed categorical random variables that are significantly related to the sensitive trait. Another EM algorithm is then presented to estimate the parameters of a latent class model constructed through the sensitive attribute and the observed binary random variables that are obtained from dichotomizing observed categorical random variables selected from the above LRT. Finally, two classification criteria are conducted to predict an individual in the sensitive or non-sensitive group. The practicality of the proposed methodology is illustrated with an actual data set from a survey study of the sexuality of first-year students, except international students, at Feng Chia University in Taiwan in 2016.https://www.mdpi.com/2227-7390/11/2/345bootstrapexpectation-maximization (EM) algorithmlatent classlikelihood ratio testmaximum likelihoodrandomized response |
spellingShingle | Shen-Ming Lee Phuoc-Loc Tran Truong-Nhat Le Chin-Shang Li Prediction of a Sensitive Feature under Indirect Questioning via Warner’s Randomized Response Technique and Latent Class Model Mathematics bootstrap expectation-maximization (EM) algorithm latent class likelihood ratio test maximum likelihood randomized response |
title | Prediction of a Sensitive Feature under Indirect Questioning via Warner’s Randomized Response Technique and Latent Class Model |
title_full | Prediction of a Sensitive Feature under Indirect Questioning via Warner’s Randomized Response Technique and Latent Class Model |
title_fullStr | Prediction of a Sensitive Feature under Indirect Questioning via Warner’s Randomized Response Technique and Latent Class Model |
title_full_unstemmed | Prediction of a Sensitive Feature under Indirect Questioning via Warner’s Randomized Response Technique and Latent Class Model |
title_short | Prediction of a Sensitive Feature under Indirect Questioning via Warner’s Randomized Response Technique and Latent Class Model |
title_sort | prediction of a sensitive feature under indirect questioning via warner s randomized response technique and latent class model |
topic | bootstrap expectation-maximization (EM) algorithm latent class likelihood ratio test maximum likelihood randomized response |
url | https://www.mdpi.com/2227-7390/11/2/345 |
work_keys_str_mv | AT shenminglee predictionofasensitivefeatureunderindirectquestioningviawarnersrandomizedresponsetechniqueandlatentclassmodel AT phuocloctran predictionofasensitivefeatureunderindirectquestioningviawarnersrandomizedresponsetechniqueandlatentclassmodel AT truongnhatle predictionofasensitivefeatureunderindirectquestioningviawarnersrandomizedresponsetechniqueandlatentclassmodel AT chinshangli predictionofasensitivefeatureunderindirectquestioningviawarnersrandomizedresponsetechniqueandlatentclassmodel |