Classification with degree of importance of attributes for stock market data mining

With the increase of economic globalization and evolution of information technology, financial time series data are being generated and accumulated at an unprecedented pace. As a result, there has been a critical need for automated approaches to effective and efficient utilization of massive amount...

Full description

Bibliographic Details
Main Authors: Khokhar, Rashid Hafeez, Md. Sap, Mohd. Noor
Format: Article
Language:English
Published: Penerbit UTM Press 2004
Subjects:
Online Access:http://eprints.utm.my/3425/1/scan0006.pdf
_version_ 1825909388448104448
author Khokhar, Rashid Hafeez
Md. Sap, Mohd. Noor
author_facet Khokhar, Rashid Hafeez
Md. Sap, Mohd. Noor
author_sort Khokhar, Rashid Hafeez
collection ePrints
description With the increase of economic globalization and evolution of information technology, financial time series data are being generated and accumulated at an unprecedented pace. As a result, there has been a critical need for automated approaches to effective and efficient utilization of massive amount of financial data to support companies and individuals in strategic planning and investment for decisionmaking. Many statistical and data mining techniques have been used to predict time series stock market. However, most statistical and data mining methods suffer from serious drawback due to requiring long training times, results are often hard to understand, and producing inaccurate predictions. We present another modification of fuzzy decision tree (FDT) classification techniques that aims to combine symbolic decision trees in data classification with approximate reasoning offered by fuzzy representation. The intent is to exploit complementary advantages of both: ability to learn from examples, high knowledge comprehensibility of decision trees, and the ability to deal with uncertain information of fuzzy representation. In particular, the proposed predictive fuzzy decision tree is based on the concept of degree of importance of attribute contributing to the classification. We extend this idea with the expressive power of fuzzy reasoning method. After constructing predictive FDT, weighted fuzzy production rules (WFPRs) can be extracted from predictive FDT. The predictive FDT has been tested using three data sets including KLSE, NYSE and LSE. The experimental results show that predictive FDT algorithm can generate a relatively optimal tree without much computation effort (comprehensibility), and WFPRs have a better predictive accuracy of stock market time series data. Many attempts have been made for meaningful prediction from real time stock market data by using data mining and statistical techniques such as Support Vector Machine [1,2], and Linear and Non- Linear Statistical Models [3,4], Neural Networks [5, 6]. Alan Fan et aI., [2] use Support Vector Machine (SVM) to stock market prediction. The SVM is a training algorithm for learning classification and regression rules from data [7]. However the predictive accuracy of SVM achieved by [2] in stock market is relatively lower than other classification applications [8, 9]. Also the existing relationship between the future stock returns and its accounting information, one would expect it to be a weak relationship. Support Vector Regression (SVR) is the extended form of SVM that can be applied in financial time series prediction [8, 9]. In financial data, due to the embedded noise, one must set a suitable margin in order to obtain a good prediction [9]. Haiqin et at, [9] has extended the standard
first_indexed 2024-03-05T18:01:25Z
format Article
id utm.eprints-3425
institution Universiti Teknologi Malaysia - ePrints
language English
last_indexed 2024-03-05T18:01:25Z
publishDate 2004
publisher Penerbit UTM Press
record_format dspace
spelling utm.eprints-34252017-11-01T04:17:36Z http://eprints.utm.my/3425/ Classification with degree of importance of attributes for stock market data mining Khokhar, Rashid Hafeez Md. Sap, Mohd. Noor QA75 Electronic computers. Computer science With the increase of economic globalization and evolution of information technology, financial time series data are being generated and accumulated at an unprecedented pace. As a result, there has been a critical need for automated approaches to effective and efficient utilization of massive amount of financial data to support companies and individuals in strategic planning and investment for decisionmaking. Many statistical and data mining techniques have been used to predict time series stock market. However, most statistical and data mining methods suffer from serious drawback due to requiring long training times, results are often hard to understand, and producing inaccurate predictions. We present another modification of fuzzy decision tree (FDT) classification techniques that aims to combine symbolic decision trees in data classification with approximate reasoning offered by fuzzy representation. The intent is to exploit complementary advantages of both: ability to learn from examples, high knowledge comprehensibility of decision trees, and the ability to deal with uncertain information of fuzzy representation. In particular, the proposed predictive fuzzy decision tree is based on the concept of degree of importance of attribute contributing to the classification. We extend this idea with the expressive power of fuzzy reasoning method. After constructing predictive FDT, weighted fuzzy production rules (WFPRs) can be extracted from predictive FDT. The predictive FDT has been tested using three data sets including KLSE, NYSE and LSE. The experimental results show that predictive FDT algorithm can generate a relatively optimal tree without much computation effort (comprehensibility), and WFPRs have a better predictive accuracy of stock market time series data. Many attempts have been made for meaningful prediction from real time stock market data by using data mining and statistical techniques such as Support Vector Machine [1,2], and Linear and Non- Linear Statistical Models [3,4], Neural Networks [5, 6]. Alan Fan et aI., [2] use Support Vector Machine (SVM) to stock market prediction. The SVM is a training algorithm for learning classification and regression rules from data [7]. However the predictive accuracy of SVM achieved by [2] in stock market is relatively lower than other classification applications [8, 9]. Also the existing relationship between the future stock returns and its accounting information, one would expect it to be a weak relationship. Support Vector Regression (SVR) is the extended form of SVM that can be applied in financial time series prediction [8, 9]. In financial data, due to the embedded noise, one must set a suitable margin in order to obtain a good prediction [9]. Haiqin et at, [9] has extended the standard Penerbit UTM Press 2004-12 Article PeerReviewed application/pdf en http://eprints.utm.my/3425/1/scan0006.pdf Khokhar, Rashid Hafeez and Md. Sap, Mohd. Noor (2004) Classification with degree of importance of attributes for stock market data mining. Jurnal Teknologi Maklumat, 16 (2). pp. 21-43. ISSN 0128-3790
spellingShingle QA75 Electronic computers. Computer science
Khokhar, Rashid Hafeez
Md. Sap, Mohd. Noor
Classification with degree of importance of attributes for stock market data mining
title Classification with degree of importance of attributes for stock market data mining
title_full Classification with degree of importance of attributes for stock market data mining
title_fullStr Classification with degree of importance of attributes for stock market data mining
title_full_unstemmed Classification with degree of importance of attributes for stock market data mining
title_short Classification with degree of importance of attributes for stock market data mining
title_sort classification with degree of importance of attributes for stock market data mining
topic QA75 Electronic computers. Computer science
url http://eprints.utm.my/3425/1/scan0006.pdf
work_keys_str_mv AT khokharrashidhafeez classificationwithdegreeofimportanceofattributesforstockmarketdatamining
AT mdsapmohdnoor classificationwithdegreeofimportanceofattributesforstockmarketdatamining