Identification of High Leverage Points in Multiple Linear Regression / Noor Azima Ismail... [et al.]

Outliers with respect to the predictor variables are called high leverage points. The observations that are slightly different from all others can drive to a large difference in the results of regression analysis. In regression analysis, the detection of high leverage points is compulsory, as they w...

Full description

Bibliographic Details
Main Authors: Ismail, Nor Azima, Midi, Prof Dr. Habshah, Mohamad Sobri, Norafefah Mohamad Sobri, Zulkifli, Siti Nurani Zulkifli
Format: Article
Language:English
Published: Unit Penerbitan UiTM Kelantan 2016
Subjects:
Online Access:https://ir.uitm.edu.my/id/eprint/24068/1/4
_version_ 1796901845836234752
author Ismail, Nor Azima
Midi, Prof Dr. Habshah
Mohamad Sobri, Norafefah Mohamad Sobri
Zulkifli, Siti Nurani Zulkifli
author_facet Ismail, Nor Azima
Midi, Prof Dr. Habshah
Mohamad Sobri, Norafefah Mohamad Sobri
Zulkifli, Siti Nurani Zulkifli
author_sort Ismail, Nor Azima
collection UITM
description Outliers with respect to the predictor variables are called high leverage points. The observations that are slightly different from all others can drive to a large difference in the results of regression analysis. In regression analysis, the detection of high leverage points is compulsory, as they will give large impact on the estimation values as well as lead to multicollinearity problems. In this situation, robust regression procedure can be very useful to deal with problems arise due to the existence of high leverage points. The aim of this study is to compare the performance of three methods in detecting high leverage points. At first stage, the two well-known data sets are considered. The first data used is artificial data set generated by Hawkins, Bradu and Kass in 1984 and the second data used is stack loss data by Brownlee in 1965. The second stage of this study is to conduct simulation study whereby the data were generated based on clean and contaminated data. The three sets of measures being considered in this study are Leverage methods Ttwice-the-mean-rule), Generalized Potentials and Diagnostic Robust Generalized Approach (DRGP). The result indicates that DRGP successfully proved its ability as a powerful method of detecting high leverage points as compared to the other two methods using both artificial data sets and simulated data.
first_indexed 2024-03-06T01:56:08Z
format Article
id uitm.eprints-4068
institution Universiti Teknologi MARA
language English
last_indexed 2024-03-06T01:56:08Z
publishDate 2016
publisher Unit Penerbitan UiTM Kelantan
record_format dspace
spelling uitm.eprints-40682020-04-02T05:37:42Z https://ir.uitm.edu.my/id/eprint/24068/ Identification of High Leverage Points in Multiple Linear Regression / Noor Azima Ismail... [et al.] jmcs Ismail, Nor Azima Midi, Prof Dr. Habshah Mohamad Sobri, Norafefah Mohamad Sobri Zulkifli, Siti Nurani Zulkifli Electronic digital computers Computer software Operating systems (Computers) Outliers with respect to the predictor variables are called high leverage points. The observations that are slightly different from all others can drive to a large difference in the results of regression analysis. In regression analysis, the detection of high leverage points is compulsory, as they will give large impact on the estimation values as well as lead to multicollinearity problems. In this situation, robust regression procedure can be very useful to deal with problems arise due to the existence of high leverage points. The aim of this study is to compare the performance of three methods in detecting high leverage points. At first stage, the two well-known data sets are considered. The first data used is artificial data set generated by Hawkins, Bradu and Kass in 1984 and the second data used is stack loss data by Brownlee in 1965. The second stage of this study is to conduct simulation study whereby the data were generated based on clean and contaminated data. The three sets of measures being considered in this study are Leverage methods Ttwice-the-mean-rule), Generalized Potentials and Diagnostic Robust Generalized Approach (DRGP). The result indicates that DRGP successfully proved its ability as a powerful method of detecting high leverage points as compared to the other two methods using both artificial data sets and simulated data. Unit Penerbitan UiTM Kelantan 2016-06 Article PeerReviewed text en https://ir.uitm.edu.my/id/eprint/24068/1/4 Identification of High Leverage Points in Multiple Linear Regression / Noor Azima Ismail... [et al.]. (2016) Journal of Mathematics and Computing Science (JMCS) <https://ir.uitm.edu.my/view/publication/Journal_of_Mathematics_and_Computing_Science_=28JMCS=29/>, 1 (1). ISSN 0128-0767 https://jmcs.com.my/
spellingShingle Electronic digital computers
Computer software
Operating systems (Computers)
Ismail, Nor Azima
Midi, Prof Dr. Habshah
Mohamad Sobri, Norafefah Mohamad Sobri
Zulkifli, Siti Nurani Zulkifli
Identification of High Leverage Points in Multiple Linear Regression / Noor Azima Ismail... [et al.]
title Identification of High Leverage Points in Multiple Linear Regression / Noor Azima Ismail... [et al.]
title_full Identification of High Leverage Points in Multiple Linear Regression / Noor Azima Ismail... [et al.]
title_fullStr Identification of High Leverage Points in Multiple Linear Regression / Noor Azima Ismail... [et al.]
title_full_unstemmed Identification of High Leverage Points in Multiple Linear Regression / Noor Azima Ismail... [et al.]
title_short Identification of High Leverage Points in Multiple Linear Regression / Noor Azima Ismail... [et al.]
title_sort identification of high leverage points in multiple linear regression noor azima ismail et al
topic Electronic digital computers
Computer software
Operating systems (Computers)
url https://ir.uitm.edu.my/id/eprint/24068/1/4
work_keys_str_mv AT ismailnorazima identificationofhighleveragepointsinmultiplelinearregressionnoorazimaismailetal
AT midiprofdrhabshah identificationofhighleveragepointsinmultiplelinearregressionnoorazimaismailetal
AT mohamadsobrinorafefahmohamadsobri identificationofhighleveragepointsinmultiplelinearregressionnoorazimaismailetal
AT zulkiflisitinuranizulkifli identificationofhighleveragepointsinmultiplelinearregressionnoorazimaismailetal