Statistical learning for decision making : interpretability, uncertainty, and inference

Thesis: Ph. D., Massachusetts Institute of Technology, Sloan School of Management, Operations Research Center, 2015.

Bibliographic Details
Main Author:	Letham, Benjamin
Other Authors:	Cynthia Rudin.
Format:	Thesis
Language:	eng
Published:	Massachusetts Institute of Technology 2015
Subjects:	Operations Research Center.
Online Access:	http://hdl.handle.net/1721.1/98569

_version_	1811087536052764672
author	Letham, Benjamin
author2	Cynthia Rudin.
author_facet	Cynthia Rudin. Letham, Benjamin
author_sort	Letham, Benjamin
collection	MIT
description	Thesis: Ph. D., Massachusetts Institute of Technology, Sloan School of Management, Operations Research Center, 2015.
first_indexed	2024-09-23T13:47:39Z
format	Thesis
id	mit-1721.1/98569
institution	Massachusetts Institute of Technology
language	eng
last_indexed	2024-09-23T13:47:39Z
publishDate	2015
publisher	Massachusetts Institute of Technology
record_format	dspace
spelling	mit-1721.1/985692019-04-10T22:19:32Z Statistical learning for decision making : interpretability, uncertainty, and inference Letham, Benjamin Cynthia Rudin. Massachusetts Institute of Technology. Operations Research Center. Massachusetts Institute of Technology. Operations Research Center. Operations Research Center. Thesis: Ph. D., Massachusetts Institute of Technology, Sloan School of Management, Operations Research Center, 2015. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Cataloged from student-submitted PDF version of thesis. Includes bibliographical references (pages 183-196). Data and predictive modeling are an increasingly important part of decision making. Here we present advances in several areas of statistical learning that are important for gaining insight from large amounts of data, and ultimately using predictive models to make better decisions. The first part of the thesis develops methods and theory for constructing interpretable models from association rules. Interpretability is important for decision makers to understand why a prediction is made. First we show how linear mixtures of rules can be used to make sequential predictions. Then we develop Bayesian Rule Lists, a method for learning small, ordered lists of rules. We apply Bayesian Rule Lists to a large database of patient medical histories and produce a simple, interpretable model that solves an important problem in healthcare, with little sacrifice to accuracy. Finally, we prove a uniform generalization bound for decision lists. In the second part of the thesis we focus on decision making from sales transaction data. We develop models and inference procedures for using transaction data to estimate quantities such as willingness-to-pay and lost sales due to stock unavailability. We develop a copula estimation procedure for making optimal bundle pricing decisions. We then develop a Bayesian hierarchical model for inferring demand and substitution behaviors from transaction data with stockouts. We show how posterior sampling can be used to directly incorporate model uncertainty into the decisions that will be made using the model. In the third part of the thesis we propose a method for aggregating relevant information from across the Internet to facilitate informed decision making. Our contributions here include an important theoretical result for Bayesian Sets, a popular method for identifying data that are similar to seed examples. We provide a generalization bound that holds for any data distribution, and moreover is independent of the dimensionality of the feature space. This result justifies the use of Bayesian Sets on high-dimensional problems, and also explains its good performance in settings where its underlying independence assumption does not hold. by Benjamin Letham. Ph. D. 2015-09-17T17:43:07Z 2015-09-17T17:43:07Z 2015 2015 Thesis http://hdl.handle.net/1721.1/98569 920866974 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 196 pages application/pdf Massachusetts Institute of Technology
spellingShingle	Operations Research Center. Letham, Benjamin Statistical learning for decision making : interpretability, uncertainty, and inference
title	Statistical learning for decision making : interpretability, uncertainty, and inference
title_full	Statistical learning for decision making : interpretability, uncertainty, and inference
title_fullStr	Statistical learning for decision making : interpretability, uncertainty, and inference
title_full_unstemmed	Statistical learning for decision making : interpretability, uncertainty, and inference
title_short	Statistical learning for decision making : interpretability, uncertainty, and inference
title_sort	statistical learning for decision making interpretability uncertainty and inference
topic	Operations Research Center.
url	http://hdl.handle.net/1721.1/98569
work_keys_str_mv	AT lethambenjamin statisticallearningfordecisionmakinginterpretabilityuncertaintyandinference

Statistical learning for decision making : interpretability, uncertainty, and inference

Similar Items