Networked interactions, graphical models and econometrics perspectives in data analysis

Thesis: S.M. in Technology and Policy, Massachusetts Institute of Technology, School of Engineering, Institute for Data, Systems, and Society, Technology and Policy Program, September, 2020

Bibliographic Details
Main Author: Seby, Jean-Baptiste.
Other Authors: Chintan Vaishnav and John Tsitsiklis.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2021
Subjects:
Online Access:https://hdl.handle.net/1721.1/129081
_version_ 1826197597647273984
author Seby, Jean-Baptiste.
author2 Chintan Vaishnav and John Tsitsiklis.
author_facet Chintan Vaishnav and John Tsitsiklis.
Seby, Jean-Baptiste.
author_sort Seby, Jean-Baptiste.
collection MIT
description Thesis: S.M. in Technology and Policy, Massachusetts Institute of Technology, School of Engineering, Institute for Data, Systems, and Society, Technology and Policy Program, September, 2020
first_indexed 2024-09-23T10:50:04Z
format Thesis
id mit-1721.1/129081
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T10:50:04Z
publishDate 2021
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1290812022-01-31T18:58:42Z Networked interactions, graphical models and econometrics perspectives in data analysis Seby, Jean-Baptiste. Chintan Vaishnav and John Tsitsiklis. Massachusetts Institute of Technology. Institute for Data, Systems, and Society. Technology and Policy Program. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Institute for Data, Systems, and Society Technology and Policy Program Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Massachusetts Institute of Technology. Engineering Systems Division Institute for Data, Systems, and Society. Technology and Policy Program. Electrical Engineering and Computer Science. Thesis: S.M. in Technology and Policy, Massachusetts Institute of Technology, School of Engineering, Institute for Data, Systems, and Society, Technology and Policy Program, September, 2020 Thesis: S.M. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, September, 2020 Cataloged from student-submitted PDF version of thesis. Includes bibliographical references (pages 231-243). This thesis is composed of two independent parts. In Part I, we study higher-order interactions in both graphical models and networks, i.e., interactions between more than two nodes. In the graphical model setting, we do not assume that interactions are known and our goal is to recover the structure of the graph. Our main contribution is an algebraic criterion that enables us to determine whether a set of observed variables have a single cause or multiple causes. We also prove that this criterion holds in the presence of confounders, i.e., when the causes are hidden. In the network setting, we assume that the structure of the graph is known. Our objective is then to identify what kind of information about data can be learned from the analysis of higher-order interactions. More precisely, using the generalization of the normalized Laplacian and random walks on graphs to simplicial complexes, we study a simplicial notion of PageRank centrality as defined in [Schaub et al., 2018]. Conducting numerical experiments on both synthetic and true data, we find evidence that the so-called edge PageRank is related to the concepts of local and global bridges in networks. In Part II, we analyze the determinants of yield gaps in Semi-Arid Tropics (SAT) regions in India. Analyzing a panel data of households within 30 villages over 6 years in India, we apply a fixed effects estimation method and a quantile regression with fixed effects to identify the most significant explanatory variables of yield gaps for 5 different crops. Using a correlated random effects estimator for unbalanced panel data, we can also estimate coefficients for time-invariant variables. We find that yield gaps determinants are crop specific. In addition to that, soil characteristics show the most significant effects on output rate. When statistically significant, correlations with the type of soil are negative. This result might suggest that the choice of cropping pattern is not necessarily appropriate. Finally, results suggest that unobservable heterogeneity of households is critical in explaining farm productivity. Time-invariant variables hardly explain this heterogeneity for which more research is needed. by Jean-Baptiste Seby. S.M. in Technology and Policy S.M. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science S.M.inTechnologyandPolicy Massachusetts Institute of Technology, School of Engineering, Institute for Data, Systems, and Society, Technology and Policy Program S.M.MassachusettsInstituteofTechnology,DepartmentofElectricalEngineeringandComputerScience 2021-01-06T17:38:47Z 2021-01-06T17:38:47Z 2020 2020 Thesis https://hdl.handle.net/1721.1/129081 1227221368 eng MIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided. http://dspace.mit.edu/handle/1721.1/7582 243 pages application/pdf Massachusetts Institute of Technology
spellingShingle Institute for Data, Systems, and Society.
Technology and Policy Program.
Electrical Engineering and Computer Science.
Seby, Jean-Baptiste.
Networked interactions, graphical models and econometrics perspectives in data analysis
title Networked interactions, graphical models and econometrics perspectives in data analysis
title_full Networked interactions, graphical models and econometrics perspectives in data analysis
title_fullStr Networked interactions, graphical models and econometrics perspectives in data analysis
title_full_unstemmed Networked interactions, graphical models and econometrics perspectives in data analysis
title_short Networked interactions, graphical models and econometrics perspectives in data analysis
title_sort networked interactions graphical models and econometrics perspectives in data analysis
topic Institute for Data, Systems, and Society.
Technology and Policy Program.
Electrical Engineering and Computer Science.
url https://hdl.handle.net/1721.1/129081
work_keys_str_mv AT sebyjeanbaptiste networkedinteractionsgraphicalmodelsandeconometricsperspectivesindataanalysis