Identification of multiple outliers in a generalized linear model with continuous variables
In statistical data analysis, a model may fit poorly in the presence of outliers. Residuals are a well-established tool for identifying outliers, and their asymptotic properties can be used to construct diagnostic tools. However, it is now evident...
Main Authors: | Loo, Yee Peng; Midi, Habshah; Rana, Md. Sohel; Fitrianto, Anwar |
---|---|
Format: | Article |
Language: | English |
Published: | Hindawi Publishing Corporation, 2016 |
Subjects: | Multiple outliers; Generalized linear model; Continuous variables |
Online Access: | http://psasir.upm.edu.my/id/eprint/54484/1/Identification%20of%20multiple%20outliers%20in%20a%20generalized%20linear%20model%20with%20continuous%20variables.pdf |
_version_ | 1825931046554697728 |
---|---|
author | Loo, Yee Peng; Midi, Habshah; Rana, Md. Sohel; Fitrianto, Anwar |
collection | UPM |
description | In statistical data analysis, a model may fit poorly in the presence of outliers. Residuals are a well-established tool for identifying outliers, and their asymptotic properties can be used to construct diagnostic tools. However, it is now evident that most existing diagnostic methods fail to identify multiple outliers. This paper therefore proposes a diagnostic method for identifying multiple outliers in a generalized linear model (GLM), a setting in which traditional outlier detection methods are ineffective because they suffer from masking or swamping. An investigation was carried out to determine the capability of the proposed GSCPR method. The numerical examples indicated that the proposed method performed satisfactorily in identifying multiple outliers. In the simulation study, two scenarios were considered to assess the validity of the proposed method. The proposed method consistently achieved a higher percentage of correct detection, as well as lower rates of swamping and masking, regardless of sample size and contamination level. |
first_indexed | 2024-03-06T09:20:48Z |
format | Article |
id | upm.eprints-54484 |
institution | Universiti Putra Malaysia |
language | English |
last_indexed | 2024-03-06T09:20:48Z |
publishDate | 2016 |
publisher | Hindawi Publishing Corporation |
record_format | dspace |
spelling | upm.eprints-54484 2018-03-21T01:27:12Z http://psasir.upm.edu.my/id/eprint/54484/ Loo, Yee Peng and Midi, Habshah and Rana, Md. Sohel and Fitrianto, Anwar (2016) Identification of multiple outliers in a generalized linear model with continuous variables. Mathematical Problems in Engineering, 2016. art. no. 5840523. pp. 1-9. ISSN 1024-123X; ESSN: 1563-5147 https://www.hindawi.com/journals/mpe/2016/5840523/abs/ Article PeerReviewed text en DOI: 10.1155/2016/5840523 |
title | Identification of multiple outliers in a generalized linear model with continuous variables |
topic | Multiple outliers; Generalized linear model; Continuous variables |
url | http://psasir.upm.edu.my/id/eprint/54484/1/Identification%20of%20multiple%20outliers%20in%20a%20generalized%20linear%20model%20with%20continuous%20variables.pdf |