EdVidParse : detecting people and content in educational videos

Thesis: M. Eng. in Computer Science and Engineering, Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2015.

Bibliographic Details
Main Author: Pratusevich, Michele
Other Authors: Robert C. Miller and Antonio Torralba.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2016
Subjects:
Online Access:http://hdl.handle.net/1721.1/100647
_version_ 1826213223248953344
author Pratusevich, Michele
author2 Robert C. Miller and Antonio Torralba.
author_facet Robert C. Miller and Antonio Torralba.
Pratusevich, Michele
author_sort Pratusevich, Michele
collection MIT
description Thesis: M. Eng. in Computer Science and Engineering, Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2015.
first_indexed 2024-09-23T15:45:43Z
format Thesis
id mit-1721.1/100647
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T15:45:43Z
publishDate 2016
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1006472019-04-11T11:06:29Z EdVidParse : detecting people and content in educational videos Detecting people and content in educational videos Pratusevich, Michele Robert C. Miller and Antonio Torralba. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis: M. Eng. in Computer Science and Engineering, Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2015. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Cataloged from student-submitted PDF version of thesis. Includes bibliographical references (pages 61-65). There are thousands of hours of educational content on the Internet, with services like edX, Coursera, Berkeley WebCasts, and others offering hundreds of courses to hundreds of thousands of learners. Consequently, researchers are interested in the effectiveness of video learning. While educational videos vary, they share two common attributes: people and textual content. People are presenting content to learners in the form of text, graphs, charts, tables, and diagrams. With an annotation of people and textual content in an educational video, researchers can study the relationship between video learning and retention. This thesis presents EdVidParse, an automatic tool that takes an educational video and annotates it with bounding boxes around the people and textual content. EdVidParse uses internal features from deep convolutional neural networks to estimate the bounding boxes, achieving a 0.43 AP score on a test set. Three applications of EdVidParse, including identifying the video type, identifying people and textual content for interface design, and removing a person from a picture-in-picture video are presented. EdVidParse provides an easy interface for identifying people and textual content inside educational videos for use in video annotation, interface design, and video reconfiguration. by Michele Pratusevich. M. Eng. in Computer Science and Engineering 2016-01-04T20:01:57Z 2016-01-04T20:01:57Z 2015 2015 Thesis http://hdl.handle.net/1721.1/100647 933247843 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 65 pages application/pdf Massachusetts Institute of Technology
spellingShingle Electrical Engineering and Computer Science.
Pratusevich, Michele
EdVidParse : detecting people and content in educational videos
title EdVidParse : detecting people and content in educational videos
title_full EdVidParse : detecting people and content in educational videos
title_fullStr EdVidParse : detecting people and content in educational videos
title_full_unstemmed EdVidParse : detecting people and content in educational videos
title_short EdVidParse : detecting people and content in educational videos
title_sort edvidparse detecting people and content in educational videos
topic Electrical Engineering and Computer Science.
url http://hdl.handle.net/1721.1/100647
work_keys_str_mv AT pratusevichmichele edvidparsedetectingpeopleandcontentineducationalvideos
AT pratusevichmichele detectingpeopleandcontentineducationalvideos