Alternative Methodology of Location Model for Handling Outliers and Empty Cells Problems: Winsorized Smoothed Location Model

The location model is a familiar basis for discrimination dealing with mixed binary and continuous variables simultaneously. The binary variables create cells while the continuous variables are information that measures the difference between groups in each cell. But, if some of the created cells ar...

Full description

Bibliographic Details
Main Author: Hamid, Hashibah
Format: Article
Language:English
Published: Journliimcms 2019
Subjects:
Online Access:https://repo.uum.edu.my/id/eprint/30805/1/JMCMS%2004%202019%2090-108.pdf
_version_ 1825806248923103232
author Hamid, Hashibah
author_facet Hamid, Hashibah
author_sort Hamid, Hashibah
collection UUM
description The location model is a familiar basis for discrimination dealing with mixed binary and continuous variables simultaneously. The binary variables create cells while the continuous variables are information that measures the difference between groups in each cell. But, if some of the created cells are empty, the classical location model rule is biased and sometimes infeasible. Interestingly, the analyses of previous studies have revealed that non-parametric smoothing approach succeeded in reducing the effects of some empty cells immensely. However, one practical drawback to the use of discrimination methods based on the location model is that the smoothing approach employed, its performance is severe when there are outliers in the data sample. The purpose of this paper is to extend these limitations of the location model with the presence of outliers and empty cells. Accordingly, a new location model rule called Winsorized smoothed location model is developed through the combination of Winsorization and non-parametric smoothing approach to address both issues of outliers and empty cells at once. Results from simulation manifests the improvement of the new rule as the rates of misclassification are dramatically declined even the data contains outliers for all 36 different simulation data settings. Findings from real dataset, full breast cancer, also clearly show that the newly developed Winsorized smoothed location model achieves the best performance compared to over than 10 existing discrimination methods. These revealed that the newly derived rule further enhanced the applicability range of the location model, as previously it was limited to the non-contaminated datasets to achieve tolerable performance. The overall investigation verifying the new rule developed offers practitioners another potential good methodology for discrimination tasks, as the rule very favourably compared to all its competitors except only one
first_indexed 2024-07-04T06:46:30Z
format Article
id uum-30805
institution Universiti Utara Malaysia
language English
last_indexed 2024-07-04T06:46:30Z
publishDate 2019
publisher Journliimcms
record_format eprints
spelling uum-308052024-05-19T09:12:31Z https://repo.uum.edu.my/id/eprint/30805/ Alternative Methodology of Location Model for Handling Outliers and Empty Cells Problems: Winsorized Smoothed Location Model Hamid, Hashibah QA Mathematics The location model is a familiar basis for discrimination dealing with mixed binary and continuous variables simultaneously. The binary variables create cells while the continuous variables are information that measures the difference between groups in each cell. But, if some of the created cells are empty, the classical location model rule is biased and sometimes infeasible. Interestingly, the analyses of previous studies have revealed that non-parametric smoothing approach succeeded in reducing the effects of some empty cells immensely. However, one practical drawback to the use of discrimination methods based on the location model is that the smoothing approach employed, its performance is severe when there are outliers in the data sample. The purpose of this paper is to extend these limitations of the location model with the presence of outliers and empty cells. Accordingly, a new location model rule called Winsorized smoothed location model is developed through the combination of Winsorization and non-parametric smoothing approach to address both issues of outliers and empty cells at once. Results from simulation manifests the improvement of the new rule as the rates of misclassification are dramatically declined even the data contains outliers for all 36 different simulation data settings. Findings from real dataset, full breast cancer, also clearly show that the newly developed Winsorized smoothed location model achieves the best performance compared to over than 10 existing discrimination methods. These revealed that the newly derived rule further enhanced the applicability range of the location model, as previously it was limited to the non-contaminated datasets to achieve tolerable performance. The overall investigation verifying the new rule developed offers practitioners another potential good methodology for discrimination tasks, as the rule very favourably compared to all its competitors except only one Journliimcms 2019 Article PeerReviewed application/pdf en cc4_by https://repo.uum.edu.my/id/eprint/30805/1/JMCMS%2004%202019%2090-108.pdf Hamid, Hashibah (2019) Alternative Methodology of Location Model for Handling Outliers and Empty Cells Problems: Winsorized Smoothed Location Model. Journal of Mechanics of Continua and Mathematical Sciences (04). pp. 90-108. ISSN 0973-8975 https://www.journalimcms.org/special_issue/alternative-methodology-of-location-model-for-handling-outliers-and-empty-cells-problems-winsorized-smoothed-location-model/
spellingShingle QA Mathematics
Hamid, Hashibah
Alternative Methodology of Location Model for Handling Outliers and Empty Cells Problems: Winsorized Smoothed Location Model
title Alternative Methodology of Location Model for Handling Outliers and Empty Cells Problems: Winsorized Smoothed Location Model
title_full Alternative Methodology of Location Model for Handling Outliers and Empty Cells Problems: Winsorized Smoothed Location Model
title_fullStr Alternative Methodology of Location Model for Handling Outliers and Empty Cells Problems: Winsorized Smoothed Location Model
title_full_unstemmed Alternative Methodology of Location Model for Handling Outliers and Empty Cells Problems: Winsorized Smoothed Location Model
title_short Alternative Methodology of Location Model for Handling Outliers and Empty Cells Problems: Winsorized Smoothed Location Model
title_sort alternative methodology of location model for handling outliers and empty cells problems winsorized smoothed location model
topic QA Mathematics
url https://repo.uum.edu.my/id/eprint/30805/1/JMCMS%2004%202019%2090-108.pdf
work_keys_str_mv AT hamidhashibah alternativemethodologyoflocationmodelforhandlingoutliersandemptycellsproblemswinsorizedsmoothedlocationmodel