Using machine learning to estimate the incidence rate of intimate partner violence
Abstract It is difficult to accurately estimate the incidence rate of intimate partner violence (IPV) using traditional social survey methods because IPV victims are often reluctant to disclose their experiences, leading to an underestimation of the incidence rate. To address this issue, we applied...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Portfolio
2023-04-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-023-31846-8 |
_version_ | 1797850120254914560 |
---|---|
author | Zhuo Chen Wen Ma Ying Li Wei Guo Senhu Wang Wansu Zhang Yunsong Chen |
author_facet | Zhuo Chen Wen Ma Ying Li Wei Guo Senhu Wang Wansu Zhang Yunsong Chen |
author_sort | Zhuo Chen |
collection | DOAJ |
description | Abstract It is difficult to accurately estimate the incidence rate of intimate partner violence (IPV) using traditional social survey methods because IPV victims are often reluctant to disclose their experiences, leading to an underestimation of the incidence rate. To address this issue, we applied machine learning algorithms to predict the incidence rate of IPV in China based on data from the Third Wave Survey on the Social Status of Women in China (TWSSSCW 2010). Specifically, we examined five unbalanced sample-processing methods and six machine learning algorithms, choosing the random under-sampling ensemble method and the random forest algorithm to impute the missing data. Analysis of the complete data showed that the incidence rates of physical violence, verbal violence, and cold violence were 7.10%, 13.74%, and 21.35%, respectively, which were higher than the incidence rates in the original dataset (4.05%, 11.21%, and 17.95%, respectively). The robustness of our findings was further confirmed by analysis using different training sets. Overall, this study demonstrates that better tools need to be developed to accurately estimate the incidence rates of IPV. It also serves as a useful guide for future research that imputes missing data using machine learning. |
first_indexed | 2024-04-09T18:56:18Z |
format | Article |
id | doaj.art-b2d5efa1c7ab4f998e6bc3542629e370 |
institution | Directory Open Access Journal |
issn | 2045-2322 |
language | English |
last_indexed | 2024-04-09T18:56:18Z |
publishDate | 2023-04-01 |
publisher | Nature Portfolio |
record_format | Article |
series | Scientific Reports |
spelling | doaj.art-b2d5efa1c7ab4f998e6bc3542629e3702023-04-09T11:12:43ZengNature PortfolioScientific Reports2045-23222023-04-011311910.1038/s41598-023-31846-8Using machine learning to estimate the incidence rate of intimate partner violenceZhuo Chen0Wen Ma1Ying Li2Wei Guo3Senhu Wang4Wansu Zhang5Yunsong Chen6School of Social and Behavioral Sciences, Nanjing UniversitySchool of Social and Behavioral Sciences, Nanjing UniversitySchool of Social and Behavioral Sciences, Nanjing UniversitySchool of Social and Behavioral Sciences, Nanjing UniversityDepartment of Sociology and Anthropology, National University of SingaporeSchool of Law, Nanjing UniversitySchool of Social and Behavioral Sciences, Nanjing UniversityAbstract It is difficult to accurately estimate the incidence rate of intimate partner violence (IPV) using traditional social survey methods because IPV victims are often reluctant to disclose their experiences, leading to an underestimation of the incidence rate. To address this issue, we applied machine learning algorithms to predict the incidence rate of IPV in China based on data from the Third Wave Survey on the Social Status of Women in China (TWSSSCW 2010). Specifically, we examined five unbalanced sample-processing methods and six machine learning algorithms, choosing the random under-sampling ensemble method and the random forest algorithm to impute the missing data. Analysis of the complete data showed that the incidence rates of physical violence, verbal violence, and cold violence were 7.10%, 13.74%, and 21.35%, respectively, which were higher than the incidence rates in the original dataset (4.05%, 11.21%, and 17.95%, respectively). The robustness of our findings was further confirmed by analysis using different training sets. Overall, this study demonstrates that better tools need to be developed to accurately estimate the incidence rates of IPV. It also serves as a useful guide for future research that imputes missing data using machine learning.https://doi.org/10.1038/s41598-023-31846-8 |
spellingShingle | Zhuo Chen Wen Ma Ying Li Wei Guo Senhu Wang Wansu Zhang Yunsong Chen Using machine learning to estimate the incidence rate of intimate partner violence Scientific Reports |
title | Using machine learning to estimate the incidence rate of intimate partner violence |
title_full | Using machine learning to estimate the incidence rate of intimate partner violence |
title_fullStr | Using machine learning to estimate the incidence rate of intimate partner violence |
title_full_unstemmed | Using machine learning to estimate the incidence rate of intimate partner violence |
title_short | Using machine learning to estimate the incidence rate of intimate partner violence |
title_sort | using machine learning to estimate the incidence rate of intimate partner violence |
url | https://doi.org/10.1038/s41598-023-31846-8 |
work_keys_str_mv | AT zhuochen usingmachinelearningtoestimatetheincidencerateofintimatepartnerviolence AT wenma usingmachinelearningtoestimatetheincidencerateofintimatepartnerviolence AT yingli usingmachinelearningtoestimatetheincidencerateofintimatepartnerviolence AT weiguo usingmachinelearningtoestimatetheincidencerateofintimatepartnerviolence AT senhuwang usingmachinelearningtoestimatetheincidencerateofintimatepartnerviolence AT wansuzhang usingmachinelearningtoestimatetheincidencerateofintimatepartnerviolence AT yunsongchen usingmachinelearningtoestimatetheincidencerateofintimatepartnerviolence |