Sensitivity of missing values in classification tree for large sample

Missing values either in predictor or in response variables are a very common problem in statistics and data mining. Cases with missing values are often ignored which results in loss of information and possible bias. The objectives of our research were to investigate the sensitivity of missing data...

Full description

Bibliographic Details
Main Authors: Hasan, Norsida, Adam, Mohd Bakri, Mustapha, Norwati, Abu Bakar, Mohd Rizam
Format: Conference or Workshop Item
Language:English
Published: American Institute of Physics 2011
Online Access:http://psasir.upm.edu.my/id/eprint/57334/1/Sensitivity%20of%20missing%20values%20in%20classification%20tree%20for%20large%20sample.pdf
_version_ 1825931591986184192
author Hasan, Norsida
Adam, Mohd Bakri
Mustapha, Norwati
Abu Bakar, Mohd Rizam
author_facet Hasan, Norsida
Adam, Mohd Bakri
Mustapha, Norwati
Abu Bakar, Mohd Rizam
author_sort Hasan, Norsida
collection UPM
description Missing values either in predictor or in response variables are a very common problem in statistics and data mining. Cases with missing values are often ignored which results in loss of information and possible bias. The objectives of our research were to investigate the sensitivity of missing data in classification tree model for large sample. Data were obtained from one of the high level educational institutions in Malaysia. Students' background data were randomly eliminated and classification tree was used to predict students degree classification. The results showed that for large sample, the structure of the classification tree was sensitive to missing values especially for sample contains more than ten percent missing values.
first_indexed 2024-03-06T09:29:04Z
format Conference or Workshop Item
id upm.eprints-57334
institution Universiti Putra Malaysia
language English
last_indexed 2024-03-06T09:29:04Z
publishDate 2011
publisher American Institute of Physics
record_format dspace
spelling upm.eprints-573342017-09-26T04:07:08Z http://psasir.upm.edu.my/id/eprint/57334/ Sensitivity of missing values in classification tree for large sample Hasan, Norsida Adam, Mohd Bakri Mustapha, Norwati Abu Bakar, Mohd Rizam Missing values either in predictor or in response variables are a very common problem in statistics and data mining. Cases with missing values are often ignored which results in loss of information and possible bias. The objectives of our research were to investigate the sensitivity of missing data in classification tree model for large sample. Data were obtained from one of the high level educational institutions in Malaysia. Students' background data were randomly eliminated and classification tree was used to predict students degree classification. The results showed that for large sample, the structure of the classification tree was sensitive to missing values especially for sample contains more than ten percent missing values. American Institute of Physics 2011 Conference or Workshop Item PeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/57334/1/Sensitivity%20of%20missing%20values%20in%20classification%20tree%20for%20large%20sample.pdf Hasan, Norsida and Adam, Mohd Bakri and Mustapha, Norwati and Abu Bakar, Mohd Rizam (2011) Sensitivity of missing values in classification tree for large sample. In: 5th International Conference on Research and Education in Mathematics (ICREM5), 22-24 Oct. 2011, Bandung, Indonesia. (pp. 374-379). 10.1063/1.4724171
spellingShingle Hasan, Norsida
Adam, Mohd Bakri
Mustapha, Norwati
Abu Bakar, Mohd Rizam
Sensitivity of missing values in classification tree for large sample
title Sensitivity of missing values in classification tree for large sample
title_full Sensitivity of missing values in classification tree for large sample
title_fullStr Sensitivity of missing values in classification tree for large sample
title_full_unstemmed Sensitivity of missing values in classification tree for large sample
title_short Sensitivity of missing values in classification tree for large sample
title_sort sensitivity of missing values in classification tree for large sample
url http://psasir.upm.edu.my/id/eprint/57334/1/Sensitivity%20of%20missing%20values%20in%20classification%20tree%20for%20large%20sample.pdf
work_keys_str_mv AT hasannorsida sensitivityofmissingvaluesinclassificationtreeforlargesample
AT adammohdbakri sensitivityofmissingvaluesinclassificationtreeforlargesample
AT mustaphanorwati sensitivityofmissingvaluesinclassificationtreeforlargesample
AT abubakarmohdrizam sensitivityofmissingvaluesinclassificationtreeforlargesample