Flexible, wide-area storage for distributed systems using semantic cues

Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009.

Bibliographic Details
Main Author: Stribling, Jeremy
Other Authors: M. Frans Kaashoek, Robert Morris and Jinyang Li.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2010
Subjects:
Online Access:http://hdl.handle.net/1721.1/54231
_version_ 1826213354154229760
author Stribling, Jeremy
author2 M. Frans Kaashoek, Robert Morris and Jinyang Li.
author_facet M. Frans Kaashoek, Robert Morris and Jinyang Li.
Stribling, Jeremy
author_sort Stribling, Jeremy
collection MIT
description Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009.
first_indexed 2024-09-23T15:47:44Z
format Thesis
id mit-1721.1/54231
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T15:47:44Z
publishDate 2010
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/542312019-04-12T15:53:34Z Flexible, wide-area storage for distributed systems using semantic cues Stribling, Jeremy M. Frans Kaashoek, Robert Morris and Jinyang Li. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Cataloged from student submitted PDF version of thesis. Includes bibliographical references (p. 81-87). There is a growing set of Internet-based services that are too big, or too important, to run at a single site. Examples include Web services for e-mail, video and image hosting, and social networking. Splitting such services over multiple sites can increase capacity, improve fault tolerance, and reduce network delays to clients. These services often need storage infrastructure to share data among the sites. This dissertation explores the use of a new file system (WheelFS) specifically designed to be the storage infrastructure for wide-area distributed services. WheelFS allows applications to adjust the semantics of their data via semantic cues, which provide application control over consistency, failure handling, and file and replica placement. This dissertation describes a particular set of semantic cues that reflect the specific challenges that storing data over the wide-area network entails: high-latency and low-bandwidth links, coupled with increased node and link failures, when compared to local-area networks. By augmenting a familiar POSIX interface with support for semantic cues, WheelFS provides a wide-area distributed storage system intended to help multi-site applications share data and gain fault tolerance, in the form of a distributed file system. Its design allows applications to adjust the tradeoff between prompt visibility of updates from other sites and the ability for sites to operate independently despite failures and long delays. WheelFS is implemented as a user-level file system and is deployed on PlanetLab and Emu-lab. (cont.) Six applications (an all-pairs-pings script, a distributed Web cache, an email service, large file distribution, distributed compilation, and protein sequence alignment software) demonstrate that WheelFS's file system interface simplifies construction of distributed applications by allowing reuse of existing software. These applications would perform poorly with the strict semantics implied by a traditional file system interface, but by providing cues to WheelFS they are able to achieve good performance. Measurements show that applications built on WheelFS deliver comparable performance to services such as CoralCDN and BitTorrent that use specialized wide-area storage systems. by Jeremy Andrew Stribling. Ph.D. 2010-04-26T19:41:05Z 2010-04-26T19:41:05Z 2009 2009 Thesis http://hdl.handle.net/1721.1/54231 606588059 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 87 p. application/pdf Massachusetts Institute of Technology
spellingShingle Electrical Engineering and Computer Science.
Stribling, Jeremy
Flexible, wide-area storage for distributed systems using semantic cues
title Flexible, wide-area storage for distributed systems using semantic cues
title_full Flexible, wide-area storage for distributed systems using semantic cues
title_fullStr Flexible, wide-area storage for distributed systems using semantic cues
title_full_unstemmed Flexible, wide-area storage for distributed systems using semantic cues
title_short Flexible, wide-area storage for distributed systems using semantic cues
title_sort flexible wide area storage for distributed systems using semantic cues
topic Electrical Engineering and Computer Science.
url http://hdl.handle.net/1721.1/54231
work_keys_str_mv AT striblingjeremy flexiblewideareastoragefordistributedsystemsusingsemanticcues