Patent semantics : analysis, search and visualization of large text corpora

Thesis (M. Eng. and S.B.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2004.

Bibliographic Details
Main Author: Lucas, Christopher G
Other Authors: Deb K. Roy.
Format: Thesis
Language: eng
Published: Massachusetts Institute of Technology 2006
Subjects: Electrical Engineering and Computer Science.
Online Access: http://hdl.handle.net/1721.1/33146
_version_ 1826212676707024896
author Lucas, Christopher G
author2 Deb K. Roy.
author_facet Deb K. Roy.
Lucas, Christopher G
author_sort Lucas, Christopher G
collection MIT
description Thesis (M. Eng. and S.B.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2004.
first_indexed 2024-09-23T15:32:54Z
format Thesis
id mit-1721.1/33146
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T15:32:54Z
publishDate 2006
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/33146 2019-04-12T07:16:36Z Patent semantics : analysis, search and visualization of large text corpora Lucas, Christopher G Deb K. Roy. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (M. Eng. and S.B.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2004. Includes bibliographical references (leaves 47-48). Patent Semantics is a system for processing text documents by extracting features that capture their semantic content, and for searching, clustering, and relating documents by those same features. It is set apart from existing methodologies by a visualization scheme that integrates retrieval and clustering, giving users a variety of ways to find and relate documents depending on their goals. In addition, the system provides an explanatory mechanism that makes retrieval an understandable process rather than a black box. The system currently works on biochemistry and molecular biology patents, but it is not intrinsically constrained to any document set. by Christopher G. Lucas. M.Eng. and S.B. 2006-06-19T17:45:12Z 2006-06-19T17:45:12Z 2004 2004 Thesis http://hdl.handle.net/1721.1/33146 62256204 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 48 leaves 2507435 bytes 2507882 bytes application/pdf application/pdf application/pdf Massachusetts Institute of Technology
spellingShingle Electrical Engineering and Computer Science.
Lucas, Christopher G
Patent semantics : analysis, search and visualization of large text corpora
title Patent semantics : analysis, search and visualization of large text corpora
title_sort patent semantics analysis search and visualization of large text corpora
topic Electrical Engineering and Computer Science.
url http://hdl.handle.net/1721.1/33146
work_keys_str_mv AT lucaschristopherg patentsemanticsanalysissearchandvisualizationoflargetextcorpora