Eigenvector space model to capture features of documents

Eigenvectors are a special set of vectors associated with a linear system of equations. Because of the special property of eigenvector, it has been used a lot for computer vision area. When the eigenvector is applied to information retrieval field, it is possible to obtain properties of documents da...

Full description

Bibliographic Details
Main Authors: Choi DONGJIN, Kim PANKOO
Format: Article
Language:English
Published: Editura Fundatiei Romania de Maine 2011-09-01
Series:Annals of Spiru Haret University Economic Series
Subjects:
Online Access:http://anale.spiruharet.ro/index.php/economics/article/view/559
_version_ 1819077418638376960
author Choi DONGJIN
Kim PANKOO
author_facet Choi DONGJIN
Kim PANKOO
author_sort Choi DONGJIN
collection DOAJ
description Eigenvectors are a special set of vectors associated with a linear system of equations. Because of the special property of eigenvector, it has been used a lot for computer vision area. When the eigenvector is applied to information retrieval field, it is possible to obtain properties of documents data corpus. To capture properties of given documents, this paper conducted simple experiments to prove the eigenvector is also possible to use in document analysis. For the experiment, we use short abstract document of Wikipedia provided by DBpedia as a document corpus. To build an original square matrix, the most popular method named tf-idf measurement will be used. After calculating the eigenvectors of original matrix, each vector will be plotted into 3D graph to find what the eigenvector means in document processing.
first_indexed 2024-12-21T18:56:53Z
format Article
id doaj.art-2d4e07a773be498c9f5c439a309ffcf8
institution Directory Open Access Journal
issn 2393-1795
language English
last_indexed 2024-12-21T18:56:53Z
publishDate 2011-09-01
publisher Editura Fundatiei Romania de Maine
record_format Article
series Annals of Spiru Haret University Economic Series
spelling doaj.art-2d4e07a773be498c9f5c439a309ffcf82022-12-21T18:53:35ZengEditura Fundatiei Romania de MaineAnnals of Spiru Haret University Economic Series2393-17952011-09-011137380559Eigenvector space model to capture features of documentsChoi DONGJIN0Kim PANKOO1Department of Computer Engineering Chosun University Gwangju, South KoreaDepartment of Computer Engineering Chosun University Gwangju, South KoreaEigenvectors are a special set of vectors associated with a linear system of equations. Because of the special property of eigenvector, it has been used a lot for computer vision area. When the eigenvector is applied to information retrieval field, it is possible to obtain properties of documents data corpus. To capture properties of given documents, this paper conducted simple experiments to prove the eigenvector is also possible to use in document analysis. For the experiment, we use short abstract document of Wikipedia provided by DBpedia as a document corpus. To build an original square matrix, the most popular method named tf-idf measurement will be used. After calculating the eigenvectors of original matrix, each vector will be plotted into 3D graph to find what the eigenvector means in document processing.http://anale.spiruharet.ro/index.php/economics/article/view/559eigenvector, Vector Space Model, Natural Language Processing, document analyzing, Information Retrieval, text mining
spellingShingle Choi DONGJIN
Kim PANKOO
Eigenvector space model to capture features of documents
Annals of Spiru Haret University Economic Series
eigenvector, Vector Space Model, Natural Language Processing, document analyzing, Information Retrieval, text mining
title Eigenvector space model to capture features of documents
title_full Eigenvector space model to capture features of documents
title_fullStr Eigenvector space model to capture features of documents
title_full_unstemmed Eigenvector space model to capture features of documents
title_short Eigenvector space model to capture features of documents
title_sort eigenvector space model to capture features of documents
topic eigenvector, Vector Space Model, Natural Language Processing, document analyzing, Information Retrieval, text mining
url http://anale.spiruharet.ro/index.php/economics/article/view/559
work_keys_str_mv AT choidongjin eigenvectorspacemodeltocapturefeaturesofdocuments
AT kimpankoo eigenvectorspacemodeltocapturefeaturesofdocuments