An efficient method of identification of influential observations in multiple linear regression and its application to real data
Influential observations (IOs) are those observations which, either alone or together with several other observations, have a detrimental effect on the computed values of various estimates. As such, it is very important to detect their presence. Several methods have been proposed to identify IOs that in...
Main Authors: | Midi, Habshah; Hendi, Hasan Talib; Uraibi, Hassan; Arasan, Jayanthi; Ismaeel, Shelan Saied |
---|---|
Format: | Article |
Published: | Universiti Kebangsaan Malaysia 2024 |
_version_ | 1811137868923404288 |
---|---|
author | Midi, Habshah; Hendi, Hasan Talib; Uraibi, Hassan; Arasan, Jayanthi; Ismaeel, Shelan Saied |
author_sort | Midi, Habshah |
collection | UPM |
description | Influential observations (IOs) are those observations which, either alone or together with several other observations, have a detrimental effect on the computed values of various estimates. As such, it is very important to detect their presence. Several methods have been proposed to identify IOs, including the fast improvised influential distance (FIID). The FIID method has been shown to be more efficient than some existing methods. Nonetheless, the FIID method has several shortcomings: it is computationally unstable, still suffers from masking and swamping effects, is time-consuming, and does not use a proper cut-off point. As a solution to these problems, a new robust version of the influential distance method (RFIID), which is based on the Reweighted Fast Consistent and High Breakdown (RFCH) estimator, is proposed. The results of real data and a Monte Carlo simulation study indicate that the RFIID is able to correctly separate the IOs from the rest of the data with the least computational running time, the least swamping effect and no masking effect compared to the other methods in this study. |
first_indexed | 2024-09-25T03:41:09Z |
format | Article |
id | upm.eprints-108928 |
institution | Universiti Putra Malaysia |
last_indexed | 2024-09-25T03:41:09Z |
publishDate | 2024 |
publisher | Universiti Kebangsaan Malaysia |
record_format | dspace |
spelling | upm.eprints-108928 2024-05-16T14:04:39Z http://psasir.upm.edu.my/id/eprint/108928/ Universiti Kebangsaan Malaysia 2024-06-20 Article PeerReviewed Midi, Habshah and Hendi, Hasan Talib and Uraibi, Hassan and Arasan, Jayanthi and Ismaeel, Shelan Saied (2024) An efficient method of identification of influential observations in multiple linear regression and its application to real data. Sains Malaysiana, 52 (12). pp. 3589-3602. ISSN 0126-6039; ESSN: 2735-0118 https://www.ukm.my/jsm/pdf_files/SM-PDF-52-12-2023/19.pdf 10.17576/jsm-2023-5212-19 |
title | An efficient method of identification of influential observations in multiple linear regression and its application to real data |