Fast improvised influential distance for the identification of influential observations in multiple linear regression

Influential observations (IO) are those observations that are responsible for misleading conclusions about the fitting of a multiple linear regression model. The existing IO identification methods such as influential distance (ID) is not very successful in detecting IO. It is suspected that the ID e...

Full description

Bibliographic Details
Main Authors: Habshah Midi, Muhammad Sani, Shelan Saied Ismaeel, Jayanthi Arasan
Format: Article
Language:English
Published: Penerbit Universiti Kebangsaan Malaysia 2021
Online Access:http://journalarticle.ukm.my/17568/1/22.pdf
_version_ 1796932677254774784
author Habshah Midi,
Muhammad Sani,
Shelan Saied Ismaeel,
Jayanthi Arasan,
author_facet Habshah Midi,
Muhammad Sani,
Shelan Saied Ismaeel,
Jayanthi Arasan,
author_sort Habshah Midi,
collection UKM
description Influential observations (IO) are those observations that are responsible for misleading conclusions about the fitting of a multiple linear regression model. The existing IO identification methods such as influential distance (ID) is not very successful in detecting IO. It is suspected that the ID employed inefficient method with long computational running time for the identification of the suspected IO at the initial step. Moreover, this method declares good leverage observations as IO, resulting in misleading conclusion. In this paper, we proposed fast improvised influential distance (FIID) that can successfully identify IO, good leverage observations, and regular observations with shorter computational running time. Monte Carlo simulation study and real data examples show that the FIID correctly identify genuine IO in multiple linear regression model with no masking and a negligible swamping rate.
first_indexed 2024-03-06T04:36:06Z
format Article
id ukm.eprints-17568
institution Universiti Kebangsaan Malaysia
language English
last_indexed 2024-03-06T04:36:06Z
publishDate 2021
publisher Penerbit Universiti Kebangsaan Malaysia
record_format dspace
spelling ukm.eprints-175682021-11-15T03:49:46Z http://journalarticle.ukm.my/17568/ Fast improvised influential distance for the identification of influential observations in multiple linear regression Habshah Midi, Muhammad Sani, Shelan Saied Ismaeel, Jayanthi Arasan, Influential observations (IO) are those observations that are responsible for misleading conclusions about the fitting of a multiple linear regression model. The existing IO identification methods such as influential distance (ID) is not very successful in detecting IO. It is suspected that the ID employed inefficient method with long computational running time for the identification of the suspected IO at the initial step. Moreover, this method declares good leverage observations as IO, resulting in misleading conclusion. In this paper, we proposed fast improvised influential distance (FIID) that can successfully identify IO, good leverage observations, and regular observations with shorter computational running time. Monte Carlo simulation study and real data examples show that the FIID correctly identify genuine IO in multiple linear regression model with no masking and a negligible swamping rate. Penerbit Universiti Kebangsaan Malaysia 2021-07 Article PeerReviewed application/pdf en http://journalarticle.ukm.my/17568/1/22.pdf Habshah Midi, and Muhammad Sani, and Shelan Saied Ismaeel, and Jayanthi Arasan, (2021) Fast improvised influential distance for the identification of influential observations in multiple linear regression. Sains Malaysiana, 50 (7). pp. 2085-2094. ISSN 0126-6039 https://www.ukm.my/jsm/malay_journals/jilid50bil7_2021/KandunganJilid50Bil7_2021.html
spellingShingle Habshah Midi,
Muhammad Sani,
Shelan Saied Ismaeel,
Jayanthi Arasan,
Fast improvised influential distance for the identification of influential observations in multiple linear regression
title Fast improvised influential distance for the identification of influential observations in multiple linear regression
title_full Fast improvised influential distance for the identification of influential observations in multiple linear regression
title_fullStr Fast improvised influential distance for the identification of influential observations in multiple linear regression
title_full_unstemmed Fast improvised influential distance for the identification of influential observations in multiple linear regression
title_short Fast improvised influential distance for the identification of influential observations in multiple linear regression
title_sort fast improvised influential distance for the identification of influential observations in multiple linear regression
url http://journalarticle.ukm.my/17568/1/22.pdf
work_keys_str_mv AT habshahmidi fastimprovisedinfluentialdistancefortheidentificationofinfluentialobservationsinmultiplelinearregression
AT muhammadsani fastimprovisedinfluentialdistancefortheidentificationofinfluentialobservationsinmultiplelinearregression
AT shelansaiedismaeel fastimprovisedinfluentialdistancefortheidentificationofinfluentialobservationsinmultiplelinearregression
AT jayanthiarasan fastimprovisedinfluentialdistancefortheidentificationofinfluentialobservationsinmultiplelinearregression