Classification with degree of importance of attributes for stock market data mining

With the increase of economic globalization and evolution of information technology, financial time series data are being generated and accumulated at an unprecedented pace. As a result, there has been a critical need for automated approaches to effective and efficient utilization of massive amount...

Bibliographic Details
Main Authors:	Khokhar, Rashid Hafeez, Md. Sap, Mohd. Noor
Format:	Article
Language:	English
Published:	Penerbit UTM Press 2004
Subjects:	QA75 Electronic computers. Computer science
Online Access:	http://eprints.utm.my/3425/1/scan0006.pdf

_version_	1825909388448104448
author	Khokhar, Rashid Hafeez; Md. Sap, Mohd. Noor
author_facet	Khokhar, Rashid Hafeez; Md. Sap, Mohd. Noor
author_sort	Khokhar, Rashid Hafeez
collection	ePrints
description	With the increase of economic globalization and the evolution of information technology, financial time series data are being generated and accumulated at an unprecedented pace. As a result, there is a critical need for automated approaches to the effective and efficient use of massive amounts of financial data to support companies and individuals in strategic planning and investment decision-making. Many statistical and data mining techniques have been used to predict stock market time series. However, most of these methods suffer from serious drawbacks: they require long training times, their results are often hard to understand, and they produce inaccurate predictions. We present another modification of fuzzy decision tree (FDT) classification techniques that aims to combine symbolic decision trees in data classification with the approximate reasoning offered by fuzzy representation. The intent is to exploit the complementary advantages of both: the ability to learn from examples and the high knowledge comprehensibility of decision trees, and the ability of fuzzy representation to deal with uncertain information. In particular, the proposed predictive fuzzy decision tree is based on the concept of the degree of importance of each attribute contributing to the classification. We extend this idea with the expressive power of the fuzzy reasoning method. After the predictive FDT is constructed, weighted fuzzy production rules (WFPRs) can be extracted from it. The predictive FDT has been tested on three data sets: KLSE, NYSE and LSE. The experimental results show that the predictive FDT algorithm can generate a relatively optimal tree without much computational effort (comprehensibility), and that WFPRs have better predictive accuracy on stock market time series data.
Many attempts have been made at meaningful prediction from real-time stock market data using data mining and statistical techniques such as Support Vector Machines [1, 2], linear and non-linear statistical models [3, 4], and neural networks [5, 6]. Alan Fan et al. [2] apply the Support Vector Machine (SVM) to stock market prediction. The SVM is a training algorithm for learning classification and regression rules from data [7]. However, the predictive accuracy of SVM achieved by [2] on stock market data is relatively lower than in other classification applications [8, 9]. Also, one would expect the relationship between future stock returns and accounting information to be weak. Support Vector Regression (SVR) is an extended form of SVM that can be applied to financial time series prediction [8, 9]. In financial data, because of the embedded noise, one must set a suitable margin in order to obtain a good prediction [9]. Haiqin et al. [9] have extended the standard
first_indexed	2024-03-05T18:01:25Z
format	Article
id	utm.eprints-3425
institution	Universiti Teknologi Malaysia - ePrints
language	English
last_indexed	2024-03-05T18:01:25Z
publishDate	2004
publisher	Penerbit UTM Press
record_format	dspace
spelling	utm.eprints-3425 2017-11-01T04:17:36Z http://eprints.utm.my/3425/ Classification with degree of importance of attributes for stock market data mining Khokhar, Rashid Hafeez; Md. Sap, Mohd. Noor QA75 Electronic computers. Computer science Penerbit UTM Press 2004-12 Article PeerReviewed application/pdf en http://eprints.utm.my/3425/1/scan0006.pdf Khokhar, Rashid Hafeez and Md. Sap, Mohd. Noor (2004) Classification with degree of importance of attributes for stock market data mining. Jurnal Teknologi Maklumat, 16 (2). pp. 21-43. ISSN 0128-3790
title	Classification with degree of importance of attributes for stock market data mining
topic	QA75 Electronic computers. Computer science
url	http://eprints.utm.my/3425/1/scan0006.pdf