Classification of the financial sustainability of health insurance beneficiaries through data mining techniques

Advances in information technologies have led to the storage of large amounts of data by organizations. An analysis of this data through data mining techniques is important support for decision-making. This article aims to apply techniques for the classification of the beneficiaries of an operator o...

Full description

Bibliographic Details
Main Authors: Sílvia Maria Dias Pedro Rebouças, Daniele Adelaide Brandão de Oliveira, Rômulo Alves Soares, Eugénia Maria Dores Maia Ferreira, Maria José Baltazar dos Reis de Pinto Gouveia
Format: Article
Language:English
Published: Research Centre for Tourism, Sustainability and Well-being - CinTurs; University of Algarve 2016-09-01
Series:Journal of Spatial and Organizational Dynamics
Subjects:
Online Access:http://www.cieo.pt/journal/J_3_2016/article4.pdf
_version_ 1818038494357880832
author Sílvia Maria Dias Pedro Rebouças
Daniele Adelaide Brandão de Oliveira
Rômulo Alves Soares
Eugénia Maria Dores Maia Ferreira
Maria José Baltazar dos Reis de Pinto Gouveia
author_facet Sílvia Maria Dias Pedro Rebouças
Daniele Adelaide Brandão de Oliveira
Rômulo Alves Soares
Eugénia Maria Dores Maia Ferreira
Maria José Baltazar dos Reis de Pinto Gouveia
author_sort Sílvia Maria Dias Pedro Rebouças
collection DOAJ
description Advances in information technologies have led to the storage of large amounts of data by organizations. An analysis of this data through data mining techniques is important support for decision-making. This article aims to apply techniques for the classification of the beneficiaries of an operator of health insurance in Brazil, according to their financial sustainability, via their sociodemographic characteristics and their healthcare cost history. Beneficiaries with a loss ratio greater than 0.75 are considered unsustainable. The sample consists of 38875 beneficiaries, active between the years 2011 and 2013. The techniques used were logistic regression and classification trees. The performance of the models was compared to accuracy rates and receiver operating Characteristic curves (ROC curves), by determining the area under the curves (AUC). The results showed that most of the sample is composed of sustainable beneficiaries. The logistic regression model had a 68.43% accuracy rate with AUC of 0.7501, and the classification tree obtained 67.76% accuracy and an AUC of 0.6855. Age and the type of plan were the most important variables related to the profile of the beneficiaries in the classification. The highlights with regard to healthcare costs were annual spending on consultation and on dental insurance.
first_indexed 2024-12-10T07:43:38Z
format Article
id doaj.art-a06546763f30435580025f5ad53bb3e7
institution Directory Open Access Journal
issn 2183-1912
2183-1912
language English
last_indexed 2024-12-10T07:43:38Z
publishDate 2016-09-01
publisher Research Centre for Tourism, Sustainability and Well-being - CinTurs; University of Algarve
record_format Article
series Journal of Spatial and Organizational Dynamics
spelling doaj.art-a06546763f30435580025f5ad53bb3e72022-12-22T01:57:14ZengResearch Centre for Tourism, Sustainability and Well-being - CinTurs; University of AlgarveJournal of Spatial and Organizational Dynamics2183-19122183-19122016-09-01IV3229242Classification of the financial sustainability of health insurance beneficiaries through data mining techniquesSílvia Maria Dias Pedro Rebouças0Daniele Adelaide Brandão de Oliveira1Rômulo Alves Soares2Eugénia Maria Dores Maia Ferreira3Maria José Baltazar dos Reis de Pinto Gouveia4Federal University of CearáFanor Devry BrazilFederal University of Ceará University of Algarve, Faculty of Economics, Research Centre for Spatial and Organizational Dynamics University of AlgarveAdvances in information technologies have led to the storage of large amounts of data by organizations. An analysis of this data through data mining techniques is important support for decision-making. This article aims to apply techniques for the classification of the beneficiaries of an operator of health insurance in Brazil, according to their financial sustainability, via their sociodemographic characteristics and their healthcare cost history. Beneficiaries with a loss ratio greater than 0.75 are considered unsustainable. The sample consists of 38875 beneficiaries, active between the years 2011 and 2013. The techniques used were logistic regression and classification trees. The performance of the models was compared to accuracy rates and receiver operating Characteristic curves (ROC curves), by determining the area under the curves (AUC). The results showed that most of the sample is composed of sustainable beneficiaries. The logistic regression model had a 68.43% accuracy rate with AUC of 0.7501, and the classification tree obtained 67.76% accuracy and an AUC of 0.6855. Age and the type of plan were the most important variables related to the profile of the beneficiaries in the classification. The highlights with regard to healthcare costs were annual spending on consultation and on dental insurance.http://www.cieo.pt/journal/J_3_2016/article4.pdfData MiningLogistic RegressionClassification TreesHealth Insurance
spellingShingle Sílvia Maria Dias Pedro Rebouças
Daniele Adelaide Brandão de Oliveira
Rômulo Alves Soares
Eugénia Maria Dores Maia Ferreira
Maria José Baltazar dos Reis de Pinto Gouveia
Classification of the financial sustainability of health insurance beneficiaries through data mining techniques
Journal of Spatial and Organizational Dynamics
Data Mining
Logistic Regression
Classification Trees
Health Insurance
title Classification of the financial sustainability of health insurance beneficiaries through data mining techniques
title_full Classification of the financial sustainability of health insurance beneficiaries through data mining techniques
title_fullStr Classification of the financial sustainability of health insurance beneficiaries through data mining techniques
title_full_unstemmed Classification of the financial sustainability of health insurance beneficiaries through data mining techniques
title_short Classification of the financial sustainability of health insurance beneficiaries through data mining techniques
title_sort classification of the financial sustainability of health insurance beneficiaries through data mining techniques
topic Data Mining
Logistic Regression
Classification Trees
Health Insurance
url http://www.cieo.pt/journal/J_3_2016/article4.pdf
work_keys_str_mv AT silviamariadiaspedroreboucas classificationofthefinancialsustainabilityofhealthinsurancebeneficiariesthroughdataminingtechniques
AT danieleadelaidebrandaodeoliveira classificationofthefinancialsustainabilityofhealthinsurancebeneficiariesthroughdataminingtechniques
AT romuloalvessoares classificationofthefinancialsustainabilityofhealthinsurancebeneficiariesthroughdataminingtechniques
AT eugeniamariadoresmaiaferreira classificationofthefinancialsustainabilityofhealthinsurancebeneficiariesthroughdataminingtechniques
AT mariajosebaltazardosreisdepintogouveia classificationofthefinancialsustainabilityofhealthinsurancebeneficiariesthroughdataminingtechniques