Identification of multiple outliers in a generalized linear model with continuous variables

In the statistical analysis of data, a model might be awfully fitted with the presence of outliers. Besides, it has been well established to use residuals for identification of outliers. The asymptotic properties of residuals can be utilized to contribute diagnostic tools. However, it is now evident...

Full description

Bibliographic Details
Main Authors: Loo, Yee Peng, Midi, Habshah, Rana, Md. Sohel, Fitrianto, Anwar
Format: Article
Language:English
Published: Hindawi Publishing Corporation 2016
Subjects:
Online Access:http://psasir.upm.edu.my/id/eprint/54484/1/Identification%20of%20multiple%20outliers%20in%20a%20generalized%20linear%20model%20with%20continuous%20variables.pdf
_version_ 1825931046554697728
author Loo, Yee Peng
Midi, Habshah
Rana, Md. Sohel
Fitrianto, Anwar
author_facet Loo, Yee Peng
Midi, Habshah
Rana, Md. Sohel
Fitrianto, Anwar
author_sort Loo, Yee Peng
collection UPM
description In the statistical analysis of data, a model might be awfully fitted with the presence of outliers. Besides, it has been well established to use residuals for identification of outliers. The asymptotic properties of residuals can be utilized to contribute diagnostic tools. However, it is now evident that most of the existing diagnostic methods have failed in identifying multiple outliers. Therefore, this paper proposed a diagnostic method for the identification of multiple outliers in GLM, where traditionally used outlier detection methods are effortless as they undergo masking or swamping dilemma. Hence, an investigation was carried out to determine the capability of the proposed GSCPR method. The findings obtained from the numerical examples indicated that the performance of the proposed method was satisfactory for the identification of multiple outliers. Meanwhile, in the simulation study, two scenarios were considered to assess the validity of the proposed method. The proposed method consistently displayed higher percentage of correct detection, as well as lower rates of swamping and masking, regardless of the sample size and the contamination levels.
first_indexed 2024-03-06T09:20:48Z
format Article
id upm.eprints-54484
institution Universiti Putra Malaysia
language English
last_indexed 2024-03-06T09:20:48Z
publishDate 2016
publisher Hindawi Publishing Corporation
record_format dspace
spelling upm.eprints-544842018-03-21T01:27:12Z http://psasir.upm.edu.my/id/eprint/54484/ Identification of multiple outliers in a generalized linear model with continuous variables Loo, Yee Peng Midi, Habshah Rana, Md. Sohel Fitrianto, Anwar In the statistical analysis of data, a model might be awfully fitted with the presence of outliers. Besides, it has been well established to use residuals for identification of outliers. The asymptotic properties of residuals can be utilized to contribute diagnostic tools. However, it is now evident that most of the existing diagnostic methods have failed in identifying multiple outliers. Therefore, this paper proposed a diagnostic method for the identification of multiple outliers in GLM, where traditionally used outlier detection methods are effortless as they undergo masking or swamping dilemma. Hence, an investigation was carried out to determine the capability of the proposed GSCPR method. The findings obtained from the numerical examples indicated that the performance of the proposed method was satisfactory for the identification of multiple outliers. Meanwhile, in the simulation study, two scenarios were considered to assess the validity of the proposed method. The proposed method consistently displayed higher percentage of correct detection, as well as lower rates of swamping and masking, regardless of the sample size and the contamination levels. Hindawi Publishing Corporation 2016 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/54484/1/Identification%20of%20multiple%20outliers%20in%20a%20generalized%20linear%20model%20with%20continuous%20variables.pdf Loo, Yee Peng and Midi, Habshah and Rana, Md. Sohel and Fitrianto, Anwar (2016) Identification of multiple outliers in a generalized linear model with continuous variables. Mathematical Problems in Engineering, 2016. art. no. 5840523. pp. 1-9. ISSN 1024-123X; ESSN: 1563-5147 https://www.hindawi.com/journals/mpe/2016/5840523/abs/ Multiple outliers; Generalized linear model; Continuous variables 10.1155/2016/5840523
spellingShingle Multiple outliers; Generalized linear model; Continuous variables
Loo, Yee Peng
Midi, Habshah
Rana, Md. Sohel
Fitrianto, Anwar
Identification of multiple outliers in a generalized linear model with continuous variables
title Identification of multiple outliers in a generalized linear model with continuous variables
title_full Identification of multiple outliers in a generalized linear model with continuous variables
title_fullStr Identification of multiple outliers in a generalized linear model with continuous variables
title_full_unstemmed Identification of multiple outliers in a generalized linear model with continuous variables
title_short Identification of multiple outliers in a generalized linear model with continuous variables
title_sort identification of multiple outliers in a generalized linear model with continuous variables
topic Multiple outliers; Generalized linear model; Continuous variables
url http://psasir.upm.edu.my/id/eprint/54484/1/Identification%20of%20multiple%20outliers%20in%20a%20generalized%20linear%20model%20with%20continuous%20variables.pdf
work_keys_str_mv AT looyeepeng identificationofmultipleoutliersinageneralizedlinearmodelwithcontinuousvariables
AT midihabshah identificationofmultipleoutliersinageneralizedlinearmodelwithcontinuousvariables
AT ranamdsohel identificationofmultipleoutliersinageneralizedlinearmodelwithcontinuousvariables
AT fitriantoanwar identificationofmultipleoutliersinageneralizedlinearmodelwithcontinuousvariables