Variables extraction on large binary variables in discriminant analysis based on mixed variables location model

The natural performance of the location model is a potential tool for allocating an object into one of the two observed groups involving mixtures of continuous and binary variables. In constructing location model, continuous variable is used to estimate parameters while binary variable is utilized t...

Full description

Bibliographic Details
Main Authors: Long, Mei Mei, Hamid, Hashibah, Aziz, Nazrina
Format: Article
Published: IP Publishing LLC 2015
Subjects:
_version_ 1825804586041999360
author Long, Mei Mei
Hamid, Hashibah
Aziz, Nazrina
author_facet Long, Mei Mei
Hamid, Hashibah
Aziz, Nazrina
author_sort Long, Mei Mei
collection UUM
description The natural performance of the location model is a potential tool for allocating an object into one of the two observed groups involving mixtures of continuous and binary variables. In constructing location model, continuous variable is used to estimate parameters while binary variable is utilized to create segmentation in each group. Such segmentation is called as multinomial cells. Basically, the multinomial cells will grow exponentially according to the number of the binary variable.These multinomial cells will become empty when there is no object can be assigned into some of them. Then the occurring of empty cells will lead to unreliable parameter estimation.Consequently, the construction of the discriminant rule based on location model is impossible.Therefore, this paper attempts to discuss how the location model based on maximum likelihood estimation can be constructed even dealing with many measured binary variables. In other word, how is location model able to deal with the issue of many empty cells for classifying an object into correct group? For remedy this problem, this paper adapts nonlinear principal component analysis in order to reduce large binary variables considered in the study.This new strategy can be expected as an alternative discriminant tool practically when large number of binary variables are considered in a classification tasks.
first_indexed 2024-07-04T06:17:57Z
format Article
id uum-21578
institution Universiti Utara Malaysia
last_indexed 2024-07-04T06:17:57Z
publishDate 2015
publisher IP Publishing LLC
record_format eprints
spelling uum-215782017-04-16T06:04:02Z https://repo.uum.edu.my/id/eprint/21578/ Variables extraction on large binary variables in discriminant analysis based on mixed variables location model Long, Mei Mei Hamid, Hashibah Aziz, Nazrina QA Mathematics The natural performance of the location model is a potential tool for allocating an object into one of the two observed groups involving mixtures of continuous and binary variables. In constructing location model, continuous variable is used to estimate parameters while binary variable is utilized to create segmentation in each group. Such segmentation is called as multinomial cells. Basically, the multinomial cells will grow exponentially according to the number of the binary variable.These multinomial cells will become empty when there is no object can be assigned into some of them. Then the occurring of empty cells will lead to unreliable parameter estimation.Consequently, the construction of the discriminant rule based on location model is impossible.Therefore, this paper attempts to discuss how the location model based on maximum likelihood estimation can be constructed even dealing with many measured binary variables. In other word, how is location model able to deal with the issue of many empty cells for classifying an object into correct group? For remedy this problem, this paper adapts nonlinear principal component analysis in order to reduce large binary variables considered in the study.This new strategy can be expected as an alternative discriminant tool practically when large number of binary variables are considered in a classification tasks. IP Publishing LLC 2015 Article PeerReviewed Long, Mei Mei and Hamid, Hashibah and Aziz, Nazrina (2015) Variables extraction on large binary variables in discriminant analysis based on mixed variables location model. AIP Conference Proceedings, 1691. 050014. ISSN 0094-243X http://doi.org/10.1063/1.4937096 doi:10.1063/1.4937096 doi:10.1063/1.4937096
spellingShingle QA Mathematics
Long, Mei Mei
Hamid, Hashibah
Aziz, Nazrina
Variables extraction on large binary variables in discriminant analysis based on mixed variables location model
title Variables extraction on large binary variables in discriminant analysis based on mixed variables location model
title_full Variables extraction on large binary variables in discriminant analysis based on mixed variables location model
title_fullStr Variables extraction on large binary variables in discriminant analysis based on mixed variables location model
title_full_unstemmed Variables extraction on large binary variables in discriminant analysis based on mixed variables location model
title_short Variables extraction on large binary variables in discriminant analysis based on mixed variables location model
title_sort variables extraction on large binary variables in discriminant analysis based on mixed variables location model
topic QA Mathematics
work_keys_str_mv AT longmeimei variablesextractiononlargebinaryvariablesindiscriminantanalysisbasedonmixedvariableslocationmodel
AT hamidhashibah variablesextractiononlargebinaryvariablesindiscriminantanalysisbasedonmixedvariableslocationmodel
AT aziznazrina variablesextractiononlargebinaryvariablesindiscriminantanalysisbasedonmixedvariableslocationmodel