Beyond text analysis : image-based evaluation of health-related text readability using style features

Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009.

Bibliographic Details
Main Author: Bafuka, Freddy Nole
Other Authors: William J. Long.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2010
Subjects:
Online Access:http://hdl.handle.net/1721.1/53121
_version_ 1826217169817436160
author Bafuka, Freddy Nole
author2 William J. Long.
author_facet William J. Long.
Bafuka, Freddy Nole
author_sort Bafuka, Freddy Nole
collection MIT
description Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009.
first_indexed 2024-09-23T16:59:08Z
format Thesis
id mit-1721.1/53121
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T16:59:08Z
publishDate 2010
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/531212019-04-11T02:46:37Z Beyond text analysis : image-based evaluation of health-related text readability using style features Image-based evaluation of health-related text readability using style features Image-based evaluation of readability of health-related documents Bafuka, Freddy Nole William J. Long. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009. Includes bibliographical references (p. 70-71). Many studies have shown that the readability of health documents presented to consumers does not match their reading levels. An accurate assessment of the readability of health-related texts is an important step in providing material that match readers' literacy. Current readability measurements depend heavily on text analysis (NLP), but neglect style (text layout). In this study, we show that style properties are important predictors of documents' readability. In particular, we build an automated computer program that uses documents' style to predict their readability score. The style features are extracted by analyzing only one page of the document as an image. The scores produced by our system were tested against scores given by human experts. Our tool shows stronger correlation to experts' scores than the Flesch-Kincaid readability grading method. We provide an end-user program, VisualGrader, which provides a Graphical User Interface to the scoring model. by Freddy Nole Bafuka. M.Eng. 2010-03-25T15:03:53Z 2010-03-25T15:03:53Z 2009 2009 Thesis http://hdl.handle.net/1721.1/53121 503453538 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 71 p. application/pdf Massachusetts Institute of Technology
spellingShingle Electrical Engineering and Computer Science.
Bafuka, Freddy Nole
Beyond text analysis : image-based evaluation of health-related text readability using style features
title Beyond text analysis : image-based evaluation of health-related text readability using style features
title_full Beyond text analysis : image-based evaluation of health-related text readability using style features
title_fullStr Beyond text analysis : image-based evaluation of health-related text readability using style features
title_full_unstemmed Beyond text analysis : image-based evaluation of health-related text readability using style features
title_short Beyond text analysis : image-based evaluation of health-related text readability using style features
title_sort beyond text analysis image based evaluation of health related text readability using style features
topic Electrical Engineering and Computer Science.
url http://hdl.handle.net/1721.1/53121
work_keys_str_mv AT bafukafreddynole beyondtextanalysisimagebasedevaluationofhealthrelatedtextreadabilityusingstylefeatures
AT bafukafreddynole imagebasedevaluationofhealthrelatedtextreadabilityusingstylefeatures
AT bafukafreddynole imagebasedevaluationofreadabilityofhealthrelateddocuments