Using Text Mining for Unsupervised Knowledge Extraction and Organization

The progress in digitally generated data aquisition and storage has allowed for a huge growth in information generated in organizations. Around 80% ofthose data are created in non structured format and a significant part of those are texts. Intelligent organization of those textual collection is a m...

Full description

Bibliographic Details
Main Authors: REZENDE, S. O., MARCACINI, R. M., MOURA, M. F.
Format: Article
Language:English
Published: Faculdade Salesiana Maria Auxiliadora 2011-06-01
Series:Sistemas de Informação
Subjects:
Online Access:http://www.fsma.edu.br/si/edicao7/FSMA_SI_2011_1_Principal_3.pdf
_version_ 1818379334558154752
author REZENDE, S. O.
MARCACINI, R. M.
MOURA, M. F.
author_facet REZENDE, S. O.
MARCACINI, R. M.
MOURA, M. F.
author_sort REZENDE, S. O.
collection DOAJ
description The progress in digitally generated data aquisition and storage has allowed for a huge growth in information generated in organizations. Around 80% ofthose data are created in non structured format and a significant part of those are texts. Intelligent organization of those textual collection is a matter of interest for most organizations, for it speed up information search and retrieval. In this context, Text Mining can transform this great amount non structure text data un useful knowledge, that can even be innovative for those organizations. Using unsupervised methods for knowledge extraction and organization has received great attention in literature, because it does not require previous knowledge on the textual collections that are going to be explored. In this article we describe the main techniques and algorithms used for unsupervised knowledege extraction and organization from textual data. The most relevant works in literature are presented and discussed in each phase of the Text Mining process and some existing computational tools are suggested for each task at hand. At last, some examples and applications are present to show the use of Text Mining on real problems.
first_indexed 2024-12-14T02:01:08Z
format Article
id doaj.art-cd4395ef5c3640f8ae2c3098684514dc
institution Directory Open Access Journal
issn 1983-5604
language English
last_indexed 2024-12-14T02:01:08Z
publishDate 2011-06-01
publisher Faculdade Salesiana Maria Auxiliadora
record_format Article
series Sistemas de Informação
spelling doaj.art-cd4395ef5c3640f8ae2c3098684514dc2022-12-21T23:21:00ZengFaculdade Salesiana Maria AuxiliadoraSistemas de Informação1983-56042011-06-017721Using Text Mining for Unsupervised Knowledge Extraction and OrganizationREZENDE, S. O.MARCACINI, R. M.MOURA, M. F.The progress in digitally generated data aquisition and storage has allowed for a huge growth in information generated in organizations. Around 80% ofthose data are created in non structured format and a significant part of those are texts. Intelligent organization of those textual collection is a matter of interest for most organizations, for it speed up information search and retrieval. In this context, Text Mining can transform this great amount non structure text data un useful knowledge, that can even be innovative for those organizations. Using unsupervised methods for knowledge extraction and organization has received great attention in literature, because it does not require previous knowledge on the textual collections that are going to be explored. In this article we describe the main techniques and algorithms used for unsupervised knowledege extraction and organization from textual data. The most relevant works in literature are presented and discussed in each phase of the Text Mining process and some existing computational tools are suggested for each task at hand. At last, some examples and applications are present to show the use of Text Mining on real problems.http://www.fsma.edu.br/si/edicao7/FSMA_SI_2011_1_Principal_3.pdfText MiningDocument ClusteringUnsupervised LearningMetadata ExtractionTopic hierarchy
spellingShingle REZENDE, S. O.
MARCACINI, R. M.
MOURA, M. F.
Using Text Mining for Unsupervised Knowledge Extraction and Organization
Sistemas de Informação
Text Mining
Document Clustering
Unsupervised Learning
Metadata Extraction
Topic hierarchy
title Using Text Mining for Unsupervised Knowledge Extraction and Organization
title_full Using Text Mining for Unsupervised Knowledge Extraction and Organization
title_fullStr Using Text Mining for Unsupervised Knowledge Extraction and Organization
title_full_unstemmed Using Text Mining for Unsupervised Knowledge Extraction and Organization
title_short Using Text Mining for Unsupervised Knowledge Extraction and Organization
title_sort using text mining for unsupervised knowledge extraction and organization
topic Text Mining
Document Clustering
Unsupervised Learning
Metadata Extraction
Topic hierarchy
url http://www.fsma.edu.br/si/edicao7/FSMA_SI_2011_1_Principal_3.pdf
work_keys_str_mv AT rezendeso usingtextminingforunsupervisedknowledgeextractionandorganization
AT marcacinirm usingtextminingforunsupervisedknowledgeextractionandorganization
AT mouramf usingtextminingforunsupervisedknowledgeextractionandorganization