Eigenvector space model to capture features of documents
Eigenvectors are a special set of vectors associated with a linear system of equations. Because of the special property of eigenvector, it has been used a lot for computer vision area. When the eigenvector is applied to information retrieval field, it is possible to obtain properties of documents da...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Editura Fundatiei Romania de Maine
2011-09-01
|
Series: | Annals of Spiru Haret University Economic Series |
Subjects: | |
Online Access: | http://anale.spiruharet.ro/index.php/economics/article/view/559 |
_version_ | 1819077418638376960 |
---|---|
author | Choi DONGJIN Kim PANKOO |
author_facet | Choi DONGJIN Kim PANKOO |
author_sort | Choi DONGJIN |
collection | DOAJ |
description | Eigenvectors are a special set of vectors associated with a linear system of equations. Because of the special property of eigenvector, it has been used a lot for computer vision area. When the eigenvector is applied to information retrieval field, it is possible to obtain properties of documents data corpus. To capture properties of given documents, this paper conducted simple experiments to prove the eigenvector is also possible to use in document analysis. For the experiment, we use short abstract document of Wikipedia provided by DBpedia as a document corpus. To build an original square matrix, the most popular method named tf-idf measurement will be used. After calculating the eigenvectors of original matrix, each vector will be plotted into 3D graph to find what the eigenvector means in document processing. |
first_indexed | 2024-12-21T18:56:53Z |
format | Article |
id | doaj.art-2d4e07a773be498c9f5c439a309ffcf8 |
institution | Directory Open Access Journal |
issn | 2393-1795 |
language | English |
last_indexed | 2024-12-21T18:56:53Z |
publishDate | 2011-09-01 |
publisher | Editura Fundatiei Romania de Maine |
record_format | Article |
series | Annals of Spiru Haret University Economic Series |
spelling | doaj.art-2d4e07a773be498c9f5c439a309ffcf82022-12-21T18:53:35ZengEditura Fundatiei Romania de MaineAnnals of Spiru Haret University Economic Series2393-17952011-09-011137380559Eigenvector space model to capture features of documentsChoi DONGJIN0Kim PANKOO1Department of Computer Engineering Chosun University Gwangju, South KoreaDepartment of Computer Engineering Chosun University Gwangju, South KoreaEigenvectors are a special set of vectors associated with a linear system of equations. Because of the special property of eigenvector, it has been used a lot for computer vision area. When the eigenvector is applied to information retrieval field, it is possible to obtain properties of documents data corpus. To capture properties of given documents, this paper conducted simple experiments to prove the eigenvector is also possible to use in document analysis. For the experiment, we use short abstract document of Wikipedia provided by DBpedia as a document corpus. To build an original square matrix, the most popular method named tf-idf measurement will be used. After calculating the eigenvectors of original matrix, each vector will be plotted into 3D graph to find what the eigenvector means in document processing.http://anale.spiruharet.ro/index.php/economics/article/view/559eigenvector, Vector Space Model, Natural Language Processing, document analyzing, Information Retrieval, text mining |
spellingShingle | Choi DONGJIN Kim PANKOO Eigenvector space model to capture features of documents Annals of Spiru Haret University Economic Series eigenvector, Vector Space Model, Natural Language Processing, document analyzing, Information Retrieval, text mining |
title | Eigenvector space model to capture features of documents |
title_full | Eigenvector space model to capture features of documents |
title_fullStr | Eigenvector space model to capture features of documents |
title_full_unstemmed | Eigenvector space model to capture features of documents |
title_short | Eigenvector space model to capture features of documents |
title_sort | eigenvector space model to capture features of documents |
topic | eigenvector, Vector Space Model, Natural Language Processing, document analyzing, Information Retrieval, text mining |
url | http://anale.spiruharet.ro/index.php/economics/article/view/559 |
work_keys_str_mv | AT choidongjin eigenvectorspacemodeltocapturefeaturesofdocuments AT kimpankoo eigenvectorspacemodeltocapturefeaturesofdocuments |