Annotation persistence over dynamic documents

Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Civil and Environmental Engineering, 2005.

Bibliographic Details
Main Author: Wang, Shaomin, 1969-
Other Authors: Steven R. Lerman.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2006
Subjects:
Online Access:http://hdl.handle.net/1721.1/30191
_version_ 1811088636463022080
author Wang, Shaomin, 1969-
author2 Steven R. Lerman.
author_facet Steven R. Lerman.
Wang, Shaomin, 1969-
author_sort Wang, Shaomin, 1969-
collection MIT
description Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Civil and Environmental Engineering, 2005.
first_indexed 2024-09-23T14:05:08Z
format Thesis
id mit-1721.1/30191
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T14:05:08Z
publishDate 2006
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/301912019-04-11T10:57:38Z Annotation persistence over dynamic documents Wang, Shaomin, 1969- Steven R. Lerman. Massachusetts Institute of Technology. Dept. of Civil and Environmental Engineering. Massachusetts Institute of Technology. Dept. of Civil and Environmental Engineering. Civil and Environmental Engineering. Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Civil and Environmental Engineering, 2005. Includes bibliographical references (p. 212-216). Annotations, as a routine practice of actively engaging with reading materials, are heavily used in the paper world to augment the usefulness of documents. By annotation, we include a large variety of creative manipulations by which the otherwise passive reader becomes actively involved in a document. Annotations in digital form possess many benefits paper annotations do not enjoy, such as annotation searching, annotation multi- referencing, and annotation sharing. The digital form also introduces challenges to the process of annotation. This study looks at one of them, annotation persistence over dynamic documents. With the development of annotation software, users now have the opportunity to annotate documents which they don't own, or to which they don't have write permission. In annotation software, annotations are normally created and saved independently of the document. The owners of the documents being annotated may have no knowledge of the fact that third parties are annotating their documents' contents. When document contents are modified, annotation software faces a difficult situation where annotations need to be reattached. Reattaching annotations in a revised version of a document is a crucial component in annotation system design. Annotation persistence over document versions is a complicated and challenging problem, as documents can go through various changes between versions. In this thesis, we treat annotation persistence over dynamic documents as a specialized information retrieval problem. We then design a scheme to reposition annotations between versions by three mechanisms: the meta-structure information match, the keywords match, and content semantics match. (cont.) Content semantics matching is the determining factor in our annotation persistence scheme design. Latent Semantic Analysis, an innovative information retrieval model, is used to extract and compare document semantics. Two editions of an introductory computer science textbook are used to evaluate the annotation persistence scheme proposed in this study. The evaluation provides substantial evidence that the annotation persistence scheme proposed in this thesis is able to make the right decisions on repositioning annotations based on their degree of modifications, i.e. to reattach annotations if modifications are light, and to orphan annotations if modifications are heavy. by Shaomin Wang. Ph.D. 2006-03-24T18:28:48Z 2006-03-24T18:28:48Z 2005 2005 Thesis http://hdl.handle.net/1721.1/30191 60686450 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 216 p. 12022602 bytes 12050532 bytes application/pdf application/pdf application/pdf Massachusetts Institute of Technology
spellingShingle Civil and Environmental Engineering.
Wang, Shaomin, 1969-
Annotation persistence over dynamic documents
title Annotation persistence over dynamic documents
title_full Annotation persistence over dynamic documents
title_fullStr Annotation persistence over dynamic documents
title_full_unstemmed Annotation persistence over dynamic documents
title_short Annotation persistence over dynamic documents
title_sort annotation persistence over dynamic documents
topic Civil and Environmental Engineering.
url http://hdl.handle.net/1721.1/30191
work_keys_str_mv AT wangshaomin1969 annotationpersistenceoverdynamicdocuments