Improving Causal Inference and Attribute Prediction Through Visual Information
Causal inference is an active area of research in computer science and statistics as it is used to understand casual conclusions that traditional statistics cannot. A naive way to conclude the cause of an outcome is by using correlations, but this is not always accurate because there may be other va...
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Published: |
Massachusetts Institute of Technology
2024
|
Online Access: | https://hdl.handle.net/1721.1/156837 |
_version_ | 1811090069129265152 |
---|---|
author | Chau, Eileen |
author2 | Cafarella, Michael |
author_facet | Cafarella, Michael Chau, Eileen |
author_sort | Chau, Eileen |
collection | MIT |
description | Causal inference is an active area of research in computer science and statistics as it is used to understand casual conclusions that traditional statistics cannot. A naive way to conclude the cause of an outcome is by using correlations, but this is not always accurate because there may be other variables that indirectly affect an outcome. Causal inference aims to find the root cause by considering those variables called confounders. Frequently, confounding variables are attributes in existing data, but sometimes they can be missing from the existing data. In those cases, data analysts have to look for confounders from outside sources such as tables, knowledge graphs, and text. Our focus is to look for confounding variables from visual data such as videos and images. Discovering confounders from visual data is a challenge because videos and images are unstructured unlike tables and graphs. Thus, it is difficult to identify features and also extract them from visual data. Additionally, the identified and extracted features must be relevant to the casual question being studied. With the recent advancement in visual language models (VLMs) such as GPT-4V(ision), VLMs can provide a versatile solution to the confounder discovery and feature extraction problem when using visual data. This thesis proposal investigates confounder discovery, feature extraction, and casual inference from visual data by utilizing the power of VLMs. |
first_indexed | 2024-09-23T14:32:13Z |
format | Thesis |
id | mit-1721.1/156837 |
institution | Massachusetts Institute of Technology |
last_indexed | 2024-09-23T14:32:13Z |
publishDate | 2024 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/1568372024-09-17T03:18:55Z Improving Causal Inference and Attribute Prediction Through Visual Information Chau, Eileen Cafarella, Michael Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Causal inference is an active area of research in computer science and statistics as it is used to understand casual conclusions that traditional statistics cannot. A naive way to conclude the cause of an outcome is by using correlations, but this is not always accurate because there may be other variables that indirectly affect an outcome. Causal inference aims to find the root cause by considering those variables called confounders. Frequently, confounding variables are attributes in existing data, but sometimes they can be missing from the existing data. In those cases, data analysts have to look for confounders from outside sources such as tables, knowledge graphs, and text. Our focus is to look for confounding variables from visual data such as videos and images. Discovering confounders from visual data is a challenge because videos and images are unstructured unlike tables and graphs. Thus, it is difficult to identify features and also extract them from visual data. Additionally, the identified and extracted features must be relevant to the casual question being studied. With the recent advancement in visual language models (VLMs) such as GPT-4V(ision), VLMs can provide a versatile solution to the confounder discovery and feature extraction problem when using visual data. This thesis proposal investigates confounder discovery, feature extraction, and casual inference from visual data by utilizing the power of VLMs. M.Eng. 2024-09-16T13:52:09Z 2024-09-16T13:52:09Z 2024-05 2024-07-11T14:37:06.668Z Thesis https://hdl.handle.net/1721.1/156837 In Copyright - Educational Use Permitted Copyright retained by author(s) https://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology |
spellingShingle | Chau, Eileen Improving Causal Inference and Attribute Prediction Through Visual Information |
title | Improving Causal Inference and Attribute Prediction Through Visual Information |
title_full | Improving Causal Inference and Attribute Prediction Through Visual Information |
title_fullStr | Improving Causal Inference and Attribute Prediction Through Visual Information |
title_full_unstemmed | Improving Causal Inference and Attribute Prediction Through Visual Information |
title_short | Improving Causal Inference and Attribute Prediction Through Visual Information |
title_sort | improving causal inference and attribute prediction through visual information |
url | https://hdl.handle.net/1721.1/156837 |
work_keys_str_mv | AT chaueileen improvingcausalinferenceandattributepredictionthroughvisualinformation |