A text mining system for deviation detection in financial documents

Attempts to mine text documents to discover deviations or anomalies have increased in recent years due to the elevated amount of textual data in today's data repositories. Text mining assists in uncovering hidden information contents across multiple documents.Although various text mining tools...

Full description

Bibliographic Details
Main Authors: Kamaruddin, Siti Sakira, Abu Bakar, Azuraliza, Hamdan, Abdul Razak, Mat Nor, Fauzias, Ahmad Nazri, Mohd Zakree, Ali Othman, Zulaiha, Hussein, Ghassan Saleh
Format: Article
Published: IOS Press 2015
Subjects:
_version_ 1825803709088530432
author Kamaruddin, Siti Sakira
Abu Bakar, Azuraliza
Hamdan, Abdul Razak
Mat Nor, Fauzias
Ahmad Nazri, Mohd Zakree
Ali Othman, Zulaiha
Hussein, Ghassan Saleh
author_facet Kamaruddin, Siti Sakira
Abu Bakar, Azuraliza
Hamdan, Abdul Razak
Mat Nor, Fauzias
Ahmad Nazri, Mohd Zakree
Ali Othman, Zulaiha
Hussein, Ghassan Saleh
author_sort Kamaruddin, Siti Sakira
collection UUM
description Attempts to mine text documents to discover deviations or anomalies have increased in recent years due to the elevated amount of textual data in today's data repositories. Text mining assists in uncovering hidden information contents across multiple documents.Although various text mining tools are available, their focus is mainly to assist in data summarization or document classification. These tasks proved to be helpful, however; they do not provide semantic analysis and rigorous textual comparison to detect abnormal sentences that exist in the documents. In this paper, we describe a text mining system that is able to detect sentence deviations from a collection of financial documents.The system implements a dissimilarity function to compare sentences represented as graphs. Our evaluation on the proposed system revolves around experiments using financial statements of a bank. The findings provide valid evidence that the proposed system is able to identify deviating sentences occurring in the documents. The detected deviations can be beneficial for the authorities in order to improve their business decisions.
first_indexed 2024-07-04T06:02:06Z
format Article
id uum-16453
institution Universiti Utara Malaysia
last_indexed 2024-07-04T06:02:06Z
publishDate 2015
publisher IOS Press
record_format eprints
spelling uum-164532016-04-27T04:34:00Z https://repo.uum.edu.my/id/eprint/16453/ A text mining system for deviation detection in financial documents Kamaruddin, Siti Sakira Abu Bakar, Azuraliza Hamdan, Abdul Razak Mat Nor, Fauzias Ahmad Nazri, Mohd Zakree Ali Othman, Zulaiha Hussein, Ghassan Saleh QA76 Computer software Attempts to mine text documents to discover deviations or anomalies have increased in recent years due to the elevated amount of textual data in today's data repositories. Text mining assists in uncovering hidden information contents across multiple documents.Although various text mining tools are available, their focus is mainly to assist in data summarization or document classification. These tasks proved to be helpful, however; they do not provide semantic analysis and rigorous textual comparison to detect abnormal sentences that exist in the documents. In this paper, we describe a text mining system that is able to detect sentence deviations from a collection of financial documents.The system implements a dissimilarity function to compare sentences represented as graphs. Our evaluation on the proposed system revolves around experiments using financial statements of a bank. The findings provide valid evidence that the proposed system is able to identify deviating sentences occurring in the documents. The detected deviations can be beneficial for the authorities in order to improve their business decisions. IOS Press 2015 Article PeerReviewed Kamaruddin, Siti Sakira and Abu Bakar, Azuraliza and Hamdan, Abdul Razak and Mat Nor, Fauzias and Ahmad Nazri, Mohd Zakree and Ali Othman, Zulaiha and Hussein, Ghassan Saleh (2015) A text mining system for deviation detection in financial documents. Intelligent Data Analysis, 19 (s1). S19-S44. ISSN 1088467X http://doi.org/10.3233/IDA-150768 doi:10.3233/IDA-150768 doi:10.3233/IDA-150768
spellingShingle QA76 Computer software
Kamaruddin, Siti Sakira
Abu Bakar, Azuraliza
Hamdan, Abdul Razak
Mat Nor, Fauzias
Ahmad Nazri, Mohd Zakree
Ali Othman, Zulaiha
Hussein, Ghassan Saleh
A text mining system for deviation detection in financial documents
title A text mining system for deviation detection in financial documents
title_full A text mining system for deviation detection in financial documents
title_fullStr A text mining system for deviation detection in financial documents
title_full_unstemmed A text mining system for deviation detection in financial documents
title_short A text mining system for deviation detection in financial documents
title_sort text mining system for deviation detection in financial documents
topic QA76 Computer software
work_keys_str_mv AT kamaruddinsitisakira atextminingsystemfordeviationdetectioninfinancialdocuments
AT abubakarazuraliza atextminingsystemfordeviationdetectioninfinancialdocuments
AT hamdanabdulrazak atextminingsystemfordeviationdetectioninfinancialdocuments
AT matnorfauzias atextminingsystemfordeviationdetectioninfinancialdocuments
AT ahmadnazrimohdzakree atextminingsystemfordeviationdetectioninfinancialdocuments
AT aliothmanzulaiha atextminingsystemfordeviationdetectioninfinancialdocuments
AT husseinghassansaleh atextminingsystemfordeviationdetectioninfinancialdocuments
AT kamaruddinsitisakira textminingsystemfordeviationdetectioninfinancialdocuments
AT abubakarazuraliza textminingsystemfordeviationdetectioninfinancialdocuments
AT hamdanabdulrazak textminingsystemfordeviationdetectioninfinancialdocuments
AT matnorfauzias textminingsystemfordeviationdetectioninfinancialdocuments
AT ahmadnazrimohdzakree textminingsystemfordeviationdetectioninfinancialdocuments
AT aliothmanzulaiha textminingsystemfordeviationdetectioninfinancialdocuments
AT husseinghassansaleh textminingsystemfordeviationdetectioninfinancialdocuments