An efficient method of identification of influential observations in multiple linear regression and its application to real data

Bibliographic Details
Main Authors: Midi, Habshah; Hendi, Hasan Talib; Uraibi, Hassan; Arasan, Jayanthi; Ismaeel, Shelan Saied
Format: Article
Published: Universiti Kebangsaan Malaysia 2024
author Midi, Habshah
Hendi, Hasan Talib
Uraibi, Hassan
Arasan, Jayanthi
Ismaeel, Shelan Saied
collection UPM
description Influential observations (IOs) are observations that, either alone or together with several other observations, have a detrimental effect on the computed values of various estimates. It is therefore very important to detect their presence. Several methods have been proposed to identify IOs, including the fast improvised influential distance (FIID). The FIID method has been shown to be more efficient than some existing methods. Nonetheless, the FIID method has several shortcomings: it is computationally unstable, it still suffers from masking and swamping effects, it is time consuming, and it does not use a proper cut-off point. To address these problems, a new robust version of the influential distance method (RFIID), based on the Reweighted Fast Consistent and High Breakdown (RFCH) estimator, is proposed. The results of real data and a Monte Carlo simulation study indicate that the RFIID is able to correctly separate the IOs from the rest of the data with the least computational running time, the least swamping effect and no masking effect compared to the other methods in this study.
first_indexed 2024-09-25T03:41:09Z
format Article
id upm.eprints-108928
institution Universiti Putra Malaysia
last_indexed 2024-09-25T03:41:09Z
publishDate 2024
publisher Universiti Kebangsaan Malaysia
record_format dspace
spelling upm.eprints-108928 2024-05-16T14:04:39Z http://psasir.upm.edu.my/id/eprint/108928/ Midi, Habshah and Hendi, Hasan Talib and Uraibi, Hassan and Arasan, Jayanthi and Ismaeel, Shelan Saied (2024) An efficient method of identification of influential observations in multiple linear regression and its application to real data. Sains Malaysiana, 52 (12). pp. 3589-3602. ISSN 0126-6039; ESSN: 2735-0118 https://www.ukm.my/jsm/pdf_files/SM-PDF-52-12-2023/19.pdf 10.17576/jsm-2023-5212-19 Universiti Kebangsaan Malaysia 2024-06-20 Article PeerReviewed
title An efficient method of identification of influential observations in multiple linear regression and its application to real data