Deep video-to-video transformations for accessibility applications
This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Language: | eng |
Published: |
Massachusetts Institute of Technology
2019
|
Subjects: | |
Online Access: | https://hdl.handle.net/1721.1/121622 |
_version_ | 1826194958067957760 |
---|---|
author | Banda, Dalitso Hansini. |
author2 | Boris Katz. |
author_facet | Boris Katz. Banda, Dalitso Hansini. |
author_sort | Banda, Dalitso Hansini. |
collection | MIT |
description | This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. |
first_indexed | 2024-09-23T10:04:30Z |
format | Thesis |
id | mit-1721.1/121622 |
institution | Massachusetts Institute of Technology |
language | eng |
last_indexed | 2024-09-23T10:04:30Z |
publishDate | 2019 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/1216222019-08-07T03:04:15Z Deep video-to-video transformations for accessibility applications Banda, Dalitso Hansini. Boris Katz. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Electrical Engineering and Computer Science. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2018 Cataloged from student-submitted PDF version of thesis. Includes bibliographical references (pages 73-79). We develop a class of visual assistive technologies that can learn visual transforms to improve accessibility as an alternative to traditional methods that mostly rely on extracted symbolic information. In this thesis, we mainly focus on how we can apply this class of systems to address photosensitivity. People with photosensitivity may have seizures, migraines or other adverse reactions to certain visual stimuli such as flashing images and alternating patterns. We develop deep learning models that learn to identify and transform video sequences containing such stimuli whilst preserving video quality and content. Using descriptions of the adverse visual stimuli, we train models to learn transforms to remove such stimuli. We show that these deep learning models are able to generalize to real-world examples of images with these problematic stimuli. From our experimental trials, human subjects rated video sequences transformed by our models as having significantly less problematic stimuli than their input. We extend these ideas; we show how these deep transformation networks can be applied in other visual assistive domains through demonstration of an application addressing the problem of emotion recognition in those with the Autism Spectrum Disorder. by Dalitso Hansini Banda. M. Eng. M.Eng. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science 2019-07-15T20:28:44Z 2019-07-15T20:28:44Z 2018 2018 Thesis https://hdl.handle.net/1721.1/121622 1098049248 eng MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission. http://dspace.mit.edu/handle/1721.1/7582 79 pages application/pdf Massachusetts Institute of Technology |
spellingShingle | Electrical Engineering and Computer Science. Banda, Dalitso Hansini. Deep video-to-video transformations for accessibility applications |
title | Deep video-to-video transformations for accessibility applications |
title_full | Deep video-to-video transformations for accessibility applications |
title_fullStr | Deep video-to-video transformations for accessibility applications |
title_full_unstemmed | Deep video-to-video transformations for accessibility applications |
title_short | Deep video-to-video transformations for accessibility applications |
title_sort | deep video to video transformations for accessibility applications |
topic | Electrical Engineering and Computer Science. |
url | https://hdl.handle.net/1721.1/121622 |
work_keys_str_mv | AT bandadalitsohansini deepvideotovideotransformationsforaccessibilityapplications |