CourseDiff : a system for identifying and reporting changes to course websites

Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2010.

Bibliographic Details
Main Author: Kopylov, Igor, M. Eng. Massachusetts Institute of Technology
Other Authors: Robert C. Miller.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2011
Subjects:
Online Access:http://hdl.handle.net/1721.1/61165
_version_ 1826199357184016384
author Kopylov, Igor, M. Eng. Massachusetts Institute of Technology
author2 Robert C. Miller.
author_facet Robert C. Miller.
Kopylov, Igor, M. Eng. Massachusetts Institute of Technology
author_sort Kopylov, Igor, M. Eng. Massachusetts Institute of Technology
collection MIT
description Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2010.
first_indexed 2024-09-23T11:18:35Z
format Thesis
id mit-1721.1/61165
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T11:18:35Z
publishDate 2011
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/611652019-04-12T16:02:02Z CourseDiff : a system for identifying and reporting changes to course websites Course Diff : a system for identifying and reporting changes to course websites System for identifying and reporting changes to course websites Kopylov, Igor, M. Eng. Massachusetts Institute of Technology Robert C. Miller. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2010. Cataloged from PDF version of thesis. Includes bibliographical references (p. 63-64). CourseDiff is a prototype system that periodically samples course websites and notifies users via email when it identifies changes to those sites. The system was developed after conducting a study of 120 web pages from 50 MIT course websites sampled for two months during the spring semester of 2009. The study found that only 18% of changes to the HTML content of course website data are actually important to the content of the page. A closer examination of the corpus identified two major sources of trivial changes. The first is automatically generated content that changes on every visit to the page. The second is formatting and whitespace changes that do not affect the page's textual content. Together, these two sources produce over 99% of the trivial changes. CourseDiff implements an algorithm to filter out these trivial changes from the webpages it samples and a change reporting format for the changes that are identified as important. A small user test on part of the CourseDiff interface indicated that the system could feasibly be used by students to track changes to course websites. by Igor Kopylov. M.Eng. 2011-02-23T14:23:00Z 2011-02-23T14:23:00Z 2010 2010 Thesis http://hdl.handle.net/1721.1/61165 698482936 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 64 p. application/pdf Massachusetts Institute of Technology
spellingShingle Electrical Engineering and Computer Science.
Kopylov, Igor, M. Eng. Massachusetts Institute of Technology
CourseDiff : a system for identifying and reporting changes to course websites
title CourseDiff : a system for identifying and reporting changes to course websites
title_full CourseDiff : a system for identifying and reporting changes to course websites
title_fullStr CourseDiff : a system for identifying and reporting changes to course websites
title_full_unstemmed CourseDiff : a system for identifying and reporting changes to course websites
title_short CourseDiff : a system for identifying and reporting changes to course websites
title_sort coursediff a system for identifying and reporting changes to course websites
topic Electrical Engineering and Computer Science.
url http://hdl.handle.net/1721.1/61165
work_keys_str_mv AT kopylovigormengmassachusettsinstituteoftechnology coursediffasystemforidentifyingandreportingchangestocoursewebsites
AT kopylovigormengmassachusettsinstituteoftechnology systemforidentifyingandreportingchangestocoursewebsites