Assessing the Performance of Random Forests for Modeling Claim Severity in Collision Car Insurance

For calculating non-life insurance premiums, actuaries traditionally rely on separate severity and frequency models using covariates to explain the claims loss exposure. In this paper, we focus on the claim severity. First, we build two reference models, a generalized linear model and a generalized...

Full description

Bibliographic Details
Main Authors: Yves Staudt, Joël Wagner
Format: Article
Language:English
Published: MDPI AG 2021-03-01
Series:Risks
Subjects:
Online Access:https://www.mdpi.com/2227-9091/9/3/53
_version_ 1797541173761409024
author Yves Staudt
Joël Wagner
author_facet Yves Staudt
Joël Wagner
author_sort Yves Staudt
collection DOAJ
description For calculating non-life insurance premiums, actuaries traditionally rely on separate severity and frequency models using covariates to explain the claims loss exposure. In this paper, we focus on the claim severity. First, we build two reference models, a generalized linear model and a generalized additive model, relying on a log-normal distribution of the severity and including the most significant factors. Thereby, we relate the continuous variables to the response in a nonlinear way. In the second step, we tune two random forest models, one for the claim severity and one for the log-transformed claim severity, where the latter requires a transformation of the predicted results. We compare the prediction performance of the different models using the relative error, the root mean squared error and the goodness-of-lift statistics in combination with goodness-of-fit statistics. In our application, we rely on a dataset of a Swiss collision insurance portfolio covering the loss exposure of the period from 2011 to 2015, and including observations from 81 309 settled claims with a total amount of CHF 184 mio. In the analysis, we use the data from 2011 to 2014 for training and from 2015 for testing. Our results indicate that the use of a log-normal transformation of the severity is not leading to performance gains with random forests. However, random forests with a log-normal transformation are the favorite choice for explaining right-skewed claims. Finally, when considering all indicators, we conclude that the generalized additive model has the best overall performance.
first_indexed 2024-03-10T13:11:30Z
format Article
id doaj.art-4725ccd500d94b1a89ada0eecda81b43
institution Directory Open Access Journal
issn 2227-9091
language English
last_indexed 2024-03-10T13:11:30Z
publishDate 2021-03-01
publisher MDPI AG
record_format Article
series Risks
spelling doaj.art-4725ccd500d94b1a89ada0eecda81b432023-11-21T10:42:08ZengMDPI AGRisks2227-90912021-03-01935310.3390/risks9030053Assessing the Performance of Random Forests for Modeling Claim Severity in Collision Car InsuranceYves Staudt0Joël Wagner1Department Alpine Region Development, Institute for Tourism and Leisure, University of Applied Sciences of the Grisons, Comercialstrasse 19, 7000 Chur, SwitzerlandDepartment of Actuarial Science, Faculty of Business and Economics (HEC Lausanne), University of Lausanne, Extranef, 1015 Lausanne, SwitzerlandFor calculating non-life insurance premiums, actuaries traditionally rely on separate severity and frequency models using covariates to explain the claims loss exposure. In this paper, we focus on the claim severity. First, we build two reference models, a generalized linear model and a generalized additive model, relying on a log-normal distribution of the severity and including the most significant factors. Thereby, we relate the continuous variables to the response in a nonlinear way. In the second step, we tune two random forest models, one for the claim severity and one for the log-transformed claim severity, where the latter requires a transformation of the predicted results. We compare the prediction performance of the different models using the relative error, the root mean squared error and the goodness-of-lift statistics in combination with goodness-of-fit statistics. In our application, we rely on a dataset of a Swiss collision insurance portfolio covering the loss exposure of the period from 2011 to 2015, and including observations from 81 309 settled claims with a total amount of CHF 184 mio. In the analysis, we use the data from 2011 to 2014 for training and from 2015 for testing. Our results indicate that the use of a log-normal transformation of the severity is not leading to performance gains with random forests. However, random forests with a log-normal transformation are the favorite choice for explaining right-skewed claims. Finally, when considering all indicators, we conclude that the generalized additive model has the best overall performance.https://www.mdpi.com/2227-9091/9/3/53regression modeldata-driven binningrandom forestperformance analysisseverity modeling
spellingShingle Yves Staudt
Joël Wagner
Assessing the Performance of Random Forests for Modeling Claim Severity in Collision Car Insurance
Risks
regression model
data-driven binning
random forest
performance analysis
severity modeling
title Assessing the Performance of Random Forests for Modeling Claim Severity in Collision Car Insurance
title_full Assessing the Performance of Random Forests for Modeling Claim Severity in Collision Car Insurance
title_fullStr Assessing the Performance of Random Forests for Modeling Claim Severity in Collision Car Insurance
title_full_unstemmed Assessing the Performance of Random Forests for Modeling Claim Severity in Collision Car Insurance
title_short Assessing the Performance of Random Forests for Modeling Claim Severity in Collision Car Insurance
title_sort assessing the performance of random forests for modeling claim severity in collision car insurance
topic regression model
data-driven binning
random forest
performance analysis
severity modeling
url https://www.mdpi.com/2227-9091/9/3/53
work_keys_str_mv AT yvesstaudt assessingtheperformanceofrandomforestsformodelingclaimseverityincollisioncarinsurance
AT joelwagner assessingtheperformanceofrandomforestsformodelingclaimseverityincollisioncarinsurance