Sensitivity of missing values in classification tree for large sample
Missing values either in predictor or in response variables are a very common problem in statistics and data mining. Cases with missing values are often ignored which results in loss of information and possible bias. The objectives of our research were to investigate the sensitivity of missing data...
Main Authors: | , , , |
---|---|
Format: | Conference or Workshop Item |
Language: | English |
Published: |
American Institute of Physics
2011
|
Online Access: | http://psasir.upm.edu.my/id/eprint/57334/1/Sensitivity%20of%20missing%20values%20in%20classification%20tree%20for%20large%20sample.pdf |
_version_ | 1825931591986184192 |
---|---|
author | Hasan, Norsida Adam, Mohd Bakri Mustapha, Norwati Abu Bakar, Mohd Rizam |
author_facet | Hasan, Norsida Adam, Mohd Bakri Mustapha, Norwati Abu Bakar, Mohd Rizam |
author_sort | Hasan, Norsida |
collection | UPM |
description | Missing values either in predictor or in response variables are a very common problem in statistics and data mining. Cases with missing values are often ignored which results in loss of information and possible bias. The objectives of our research were to investigate the sensitivity of missing data in classification tree model for large sample. Data were obtained from one of the high level educational institutions in Malaysia. Students' background data were randomly eliminated and classification tree was used to predict students degree classification. The results showed that for large sample, the structure of the classification tree was sensitive to missing values especially for sample contains more than ten percent missing values. |
first_indexed | 2024-03-06T09:29:04Z |
format | Conference or Workshop Item |
id | upm.eprints-57334 |
institution | Universiti Putra Malaysia |
language | English |
last_indexed | 2024-03-06T09:29:04Z |
publishDate | 2011 |
publisher | American Institute of Physics |
record_format | dspace |
spelling | upm.eprints-573342017-09-26T04:07:08Z http://psasir.upm.edu.my/id/eprint/57334/ Sensitivity of missing values in classification tree for large sample Hasan, Norsida Adam, Mohd Bakri Mustapha, Norwati Abu Bakar, Mohd Rizam Missing values either in predictor or in response variables are a very common problem in statistics and data mining. Cases with missing values are often ignored which results in loss of information and possible bias. The objectives of our research were to investigate the sensitivity of missing data in classification tree model for large sample. Data were obtained from one of the high level educational institutions in Malaysia. Students' background data were randomly eliminated and classification tree was used to predict students degree classification. The results showed that for large sample, the structure of the classification tree was sensitive to missing values especially for sample contains more than ten percent missing values. American Institute of Physics 2011 Conference or Workshop Item PeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/57334/1/Sensitivity%20of%20missing%20values%20in%20classification%20tree%20for%20large%20sample.pdf Hasan, Norsida and Adam, Mohd Bakri and Mustapha, Norwati and Abu Bakar, Mohd Rizam (2011) Sensitivity of missing values in classification tree for large sample. In: 5th International Conference on Research and Education in Mathematics (ICREM5), 22-24 Oct. 2011, Bandung, Indonesia. (pp. 374-379). 10.1063/1.4724171 |
spellingShingle | Hasan, Norsida Adam, Mohd Bakri Mustapha, Norwati Abu Bakar, Mohd Rizam Sensitivity of missing values in classification tree for large sample |
title | Sensitivity of missing values in classification tree for large sample |
title_full | Sensitivity of missing values in classification tree for large sample |
title_fullStr | Sensitivity of missing values in classification tree for large sample |
title_full_unstemmed | Sensitivity of missing values in classification tree for large sample |
title_short | Sensitivity of missing values in classification tree for large sample |
title_sort | sensitivity of missing values in classification tree for large sample |
url | http://psasir.upm.edu.my/id/eprint/57334/1/Sensitivity%20of%20missing%20values%20in%20classification%20tree%20for%20large%20sample.pdf |
work_keys_str_mv | AT hasannorsida sensitivityofmissingvaluesinclassificationtreeforlargesample AT adammohdbakri sensitivityofmissingvaluesinclassificationtreeforlargesample AT mustaphanorwati sensitivityofmissingvaluesinclassificationtreeforlargesample AT abubakarmohdrizam sensitivityofmissingvaluesinclassificationtreeforlargesample |