Improving Causal Inference and Attribute Prediction Through Visual Information

Causal inference is an active area of research in computer science and statistics as it is used to understand casual conclusions that traditional statistics cannot. A naive way to conclude the cause of an outcome is by using correlations, but this is not always accurate because there may be other va...

Full description

Bibliographic Details
Main Author: Chau, Eileen
Other Authors: Cafarella, Michael
Format: Thesis
Published: Massachusetts Institute of Technology 2024
Online Access:https://hdl.handle.net/1721.1/156837
_version_ 1811090069129265152
author Chau, Eileen
author2 Cafarella, Michael
author_facet Cafarella, Michael
Chau, Eileen
author_sort Chau, Eileen
collection MIT
description Causal inference is an active area of research in computer science and statistics as it is used to understand casual conclusions that traditional statistics cannot. A naive way to conclude the cause of an outcome is by using correlations, but this is not always accurate because there may be other variables that indirectly affect an outcome. Causal inference aims to find the root cause by considering those variables called confounders. Frequently, confounding variables are attributes in existing data, but sometimes they can be missing from the existing data. In those cases, data analysts have to look for confounders from outside sources such as tables, knowledge graphs, and text. Our focus is to look for confounding variables from visual data such as videos and images. Discovering confounders from visual data is a challenge because videos and images are unstructured unlike tables and graphs. Thus, it is difficult to identify features and also extract them from visual data. Additionally, the identified and extracted features must be relevant to the casual question being studied. With the recent advancement in visual language models (VLMs) such as GPT-4V(ision), VLMs can provide a versatile solution to the confounder discovery and feature extraction problem when using visual data. This thesis proposal investigates confounder discovery, feature extraction, and casual inference from visual data by utilizing the power of VLMs.
first_indexed 2024-09-23T14:32:13Z
format Thesis
id mit-1721.1/156837
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T14:32:13Z
publishDate 2024
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1568372024-09-17T03:18:55Z Improving Causal Inference and Attribute Prediction Through Visual Information Chau, Eileen Cafarella, Michael Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Causal inference is an active area of research in computer science and statistics as it is used to understand casual conclusions that traditional statistics cannot. A naive way to conclude the cause of an outcome is by using correlations, but this is not always accurate because there may be other variables that indirectly affect an outcome. Causal inference aims to find the root cause by considering those variables called confounders. Frequently, confounding variables are attributes in existing data, but sometimes they can be missing from the existing data. In those cases, data analysts have to look for confounders from outside sources such as tables, knowledge graphs, and text. Our focus is to look for confounding variables from visual data such as videos and images. Discovering confounders from visual data is a challenge because videos and images are unstructured unlike tables and graphs. Thus, it is difficult to identify features and also extract them from visual data. Additionally, the identified and extracted features must be relevant to the casual question being studied. With the recent advancement in visual language models (VLMs) such as GPT-4V(ision), VLMs can provide a versatile solution to the confounder discovery and feature extraction problem when using visual data. This thesis proposal investigates confounder discovery, feature extraction, and casual inference from visual data by utilizing the power of VLMs. M.Eng. 2024-09-16T13:52:09Z 2024-09-16T13:52:09Z 2024-05 2024-07-11T14:37:06.668Z Thesis https://hdl.handle.net/1721.1/156837 In Copyright - Educational Use Permitted Copyright retained by author(s) https://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Chau, Eileen
Improving Causal Inference and Attribute Prediction Through Visual Information
title Improving Causal Inference and Attribute Prediction Through Visual Information
title_full Improving Causal Inference and Attribute Prediction Through Visual Information
title_fullStr Improving Causal Inference and Attribute Prediction Through Visual Information
title_full_unstemmed Improving Causal Inference and Attribute Prediction Through Visual Information
title_short Improving Causal Inference and Attribute Prediction Through Visual Information
title_sort improving causal inference and attribute prediction through visual information
url https://hdl.handle.net/1721.1/156837
work_keys_str_mv AT chaueileen improvingcausalinferenceandattributepredictionthroughvisualinformation