The performance of robust estimator on linear regression model having both continuous and categorical variables with heteroscedastic errors

The ordinary least squares (OLS) technique is often used in practice to estimate the parameters of a multiple linear regression model with both continuous and categorical variables. It has been the most popular technique due to its optimal properties and ease of computation. Nevertheless, in the pr...

Full description

Bibliographic Details
Main Authors: Midi, Habshah, A. Talib, Bashar
Format: Article
Language:English
Published: Universiti Putra Malaysia Press 2008
Online Access:http://psasir.upm.edu.my/id/eprint/12578/1/2.__habshah.pdf
_version_ 1825945075083902976
author Midi, Habshah
A. Talib, Bashar
author_facet Midi, Habshah
A. Talib, Bashar
author_sort Midi, Habshah
collection UPM
description The ordinary least squares (OLS) technique is often used in practice to estimate the parameters of a multiple linear regression model with both continuous and categorical variables. It has been the most popular technique due to its optimal properties and ease of computation. Nevertheless, in the presence of outliers, the OLS can result in very poor estimates. Outliers which arise from bad data points may have unduly effect on the OLS estimates. The problem is further complicated when both outliers and heteroscedasticity or non-constant error variances are present in the data. The influence of outliers and heteroscedasticity cannot be removed or reduced by simply transforming the data using known transformation such as logarithmic transformation. In this paper, we proposed a robust technique to deal with these two problems simultaneously. A robust estimate of scales for each level of categorical variables are first estimated by using robust distance S and M (RDSM) estimates. Then we determine the weighting scheme for each level of the categorical variables and transform the model. The reweighted least squares based on RDSM (RLSRDSM) is then applied to the transformed model. The empirical evidence shows that the proposed method has reduced the heteroscedastic effect to a greater extent.
first_indexed 2024-03-06T07:25:51Z
format Article
id upm.eprints-12578
institution Universiti Putra Malaysia
language English
last_indexed 2024-03-06T07:25:51Z
publishDate 2008
publisher Universiti Putra Malaysia Press
record_format dspace
spelling upm.eprints-125782015-05-27T03:02:01Z http://psasir.upm.edu.my/id/eprint/12578/ The performance of robust estimator on linear regression model having both continuous and categorical variables with heteroscedastic errors Midi, Habshah A. Talib, Bashar The ordinary least squares (OLS) technique is often used in practice to estimate the parameters of a multiple linear regression model with both continuous and categorical variables. It has been the most popular technique due to its optimal properties and ease of computation. Nevertheless, in the presence of outliers, the OLS can result in very poor estimates. Outliers which arise from bad data points may have unduly effect on the OLS estimates. The problem is further complicated when both outliers and heteroscedasticity or non-constant error variances are present in the data. The influence of outliers and heteroscedasticity cannot be removed or reduced by simply transforming the data using known transformation such as logarithmic transformation. In this paper, we proposed a robust technique to deal with these two problems simultaneously. A robust estimate of scales for each level of categorical variables are first estimated by using robust distance S and M (RDSM) estimates. Then we determine the weighting scheme for each level of the categorical variables and transform the model. The reweighted least squares based on RDSM (RLSRDSM) is then applied to the transformed model. The empirical evidence shows that the proposed method has reduced the heteroscedastic effect to a greater extent. Universiti Putra Malaysia Press 2008 Article PeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/12578/1/2.__habshah.pdf Midi, Habshah and A. Talib, Bashar (2008) The performance of robust estimator on linear regression model having both continuous and categorical variables with heteroscedastic errors. Malaysian Journal of Mathematical Sciences, 2 (1). pp. 25-48. ISSN 1823-8343 http://einspem.upm.edu.my/journal/volume2.1.php
spellingShingle Midi, Habshah
A. Talib, Bashar
The performance of robust estimator on linear regression model having both continuous and categorical variables with heteroscedastic errors
title The performance of robust estimator on linear regression model having both continuous and categorical variables with heteroscedastic errors
title_full The performance of robust estimator on linear regression model having both continuous and categorical variables with heteroscedastic errors
title_fullStr The performance of robust estimator on linear regression model having both continuous and categorical variables with heteroscedastic errors
title_full_unstemmed The performance of robust estimator on linear regression model having both continuous and categorical variables with heteroscedastic errors
title_short The performance of robust estimator on linear regression model having both continuous and categorical variables with heteroscedastic errors
title_sort performance of robust estimator on linear regression model having both continuous and categorical variables with heteroscedastic errors
url http://psasir.upm.edu.my/id/eprint/12578/1/2.__habshah.pdf
work_keys_str_mv AT midihabshah theperformanceofrobustestimatoronlinearregressionmodelhavingbothcontinuousandcategoricalvariableswithheteroscedasticerrors
AT atalibbashar theperformanceofrobustestimatoronlinearregressionmodelhavingbothcontinuousandcategoricalvariableswithheteroscedasticerrors
AT midihabshah performanceofrobustestimatoronlinearregressionmodelhavingbothcontinuousandcategoricalvariableswithheteroscedasticerrors
AT atalibbashar performanceofrobustestimatoronlinearregressionmodelhavingbothcontinuousandcategoricalvariableswithheteroscedasticerrors