Conceptually related lexicon clustering based on word context association mining.

Automatic lexicon generation is a useful task in learning text fragment patterns. In our previous work we have focused on text fragment pattern learning through the fuzzy grammar method which inputs include a predefined lexicon and text fragments that represents the expression of the grammar class t...

Full description

Bibliographic Details
Main Authors: Mohd Sharef, Nurfadhlina, Martin, Trevor, Azmi Murad, Masrah Azrifah
Format: Article
Language:English
English
Published: Advanced Institute of Convergence Information Technology 2013
Online Access:http://psasir.upm.edu.my/id/eprint/30613/1/Conceptually%20related%20lexicon%20clustering%20based%20on%20word%20context%20association%20mining.pdf
_version_ 1825947741270835200
author Mohd Sharef, Nurfadhlina
Martin, Trevor
Azmi Murad, Masrah Azrifah
author_facet Mohd Sharef, Nurfadhlina
Martin, Trevor
Azmi Murad, Masrah Azrifah
author_sort Mohd Sharef, Nurfadhlina
collection UPM
description Automatic lexicon generation is a useful task in learning text fragment patterns. In our previous work we have focused on text fragment pattern learning through the fuzzy grammar method which inputs include a predefined lexicon and text fragments that represents the expression of the grammar class to be learned. However, the bottleneck of the success of the fuzzy grammar creation and in common with other text learner often lies in the knowledge acquisition phase; due to the labour intensive text annotation which also demands skills and background knowledge of the text. For this reason, a semi-automated technique called automatic Terminal Grammar Recommender (TGR) is devised to identify conceptually related lexicons in the texts and their related to create terminal grammars by mining associations of words contexts. The approach recognizes that there is a degree of local structure within such text and the technique exploits the local structure without the large computational overhead of deeper analysis. Result from the comparison of the associative words detected by TGR with the definition of a content category tool called General Inquirer on the data from European Central Bank data is reported. Our findings show that our proposed method has managed to reduce the manual effort of identifying conceptually similar lexicons to form terminal grammars. The average of matched generated terminal grammar clusters compared to General Inquirer is 54.85% which indicates that at least half the expensive effort to construct conceptually related lexicon is saved. This hint the potential of word context association mining in automated conceptual lexicon generation.
first_indexed 2024-03-06T08:18:05Z
format Article
id upm.eprints-30613
institution Universiti Putra Malaysia
language English
English
last_indexed 2024-03-06T08:18:05Z
publishDate 2013
publisher Advanced Institute of Convergence Information Technology
record_format dspace
spelling upm.eprints-306132015-10-08T00:26:32Z http://psasir.upm.edu.my/id/eprint/30613/ Conceptually related lexicon clustering based on word context association mining. Mohd Sharef, Nurfadhlina Martin, Trevor Azmi Murad, Masrah Azrifah Automatic lexicon generation is a useful task in learning text fragment patterns. In our previous work we have focused on text fragment pattern learning through the fuzzy grammar method which inputs include a predefined lexicon and text fragments that represents the expression of the grammar class to be learned. However, the bottleneck of the success of the fuzzy grammar creation and in common with other text learner often lies in the knowledge acquisition phase; due to the labour intensive text annotation which also demands skills and background knowledge of the text. For this reason, a semi-automated technique called automatic Terminal Grammar Recommender (TGR) is devised to identify conceptually related lexicons in the texts and their related to create terminal grammars by mining associations of words contexts. The approach recognizes that there is a degree of local structure within such text and the technique exploits the local structure without the large computational overhead of deeper analysis. Result from the comparison of the associative words detected by TGR with the definition of a content category tool called General Inquirer on the data from European Central Bank data is reported. Our findings show that our proposed method has managed to reduce the manual effort of identifying conceptually similar lexicons to form terminal grammars. The average of matched generated terminal grammar clusters compared to General Inquirer is 54.85% which indicates that at least half the expensive effort to construct conceptually related lexicon is saved. This hint the potential of word context association mining in automated conceptual lexicon generation. Advanced Institute of Convergence Information Technology 2013 Article PeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/30613/1/Conceptually%20related%20lexicon%20clustering%20based%20on%20word%20context%20association%20mining.pdf Mohd Sharef, Nurfadhlina and Martin, Trevor and Azmi Murad, Masrah Azrifah (2013) Conceptually related lexicon clustering based on word context association mining. International Journal of Information Processing and Management, 4 (3). pp. 40-50. ISSN 2093-4009; ESSN: 2233-940X 10.4156/ijipm.vol4.issue3.4 English
spellingShingle Mohd Sharef, Nurfadhlina
Martin, Trevor
Azmi Murad, Masrah Azrifah
Conceptually related lexicon clustering based on word context association mining.
title Conceptually related lexicon clustering based on word context association mining.
title_full Conceptually related lexicon clustering based on word context association mining.
title_fullStr Conceptually related lexicon clustering based on word context association mining.
title_full_unstemmed Conceptually related lexicon clustering based on word context association mining.
title_short Conceptually related lexicon clustering based on word context association mining.
title_sort conceptually related lexicon clustering based on word context association mining
url http://psasir.upm.edu.my/id/eprint/30613/1/Conceptually%20related%20lexicon%20clustering%20based%20on%20word%20context%20association%20mining.pdf
work_keys_str_mv AT mohdsharefnurfadhlina conceptuallyrelatedlexiconclusteringbasedonwordcontextassociationmining
AT martintrevor conceptuallyrelatedlexiconclusteringbasedonwordcontextassociationmining
AT azmimuradmasrahazrifah conceptuallyrelatedlexiconclusteringbasedonwordcontextassociationmining