Toolkit development for high-dimensional data pre-processing, clustering and analysis

In this report, the author documents a software project to design and implement a high-dimensional data processing toolkit. The toolkit, called WordTagger, automatically labels a vocabulary of computer science words to provide the categorical information of the word-space by u...

Bibliographic Details
Main Author: Hu, Yao.
Other Authors: Chen Lihui
Format: Thesis
Language: English
Published: 2014
Subjects:
Online Access: http://hdl.handle.net/10356/55243