Improving Causal Inference and Attribute Prediction Through Visual Information

Causal inference is an active area of research in computer science and statistics as it is used to understand casual conclusions that traditional statistics cannot. A naive way to conclude the cause of an outcome is by using correlations, but this is not always accurate because there may be other va...

Szczegółowa specyfikacja

Opis bibliograficzny
1. autor:	Chau, Eileen
Kolejni autorzy:	Cafarella, Michael
Format:	Praca dyplomowa
Wydane:	Massachusetts Institute of Technology 2024
Dostęp online:	https://hdl.handle.net/1721.1/156837

_version_	1826209838798995456
author	Chau, Eileen
author2	Cafarella, Michael
author_facet	Cafarella, Michael Chau, Eileen
author_sort	Chau, Eileen
collection	MIT
description	Causal inference is an active area of research in computer science and statistics as it is used to understand casual conclusions that traditional statistics cannot. A naive way to conclude the cause of an outcome is by using correlations, but this is not always accurate because there may be other variables that indirectly affect an outcome. Causal inference aims to find the root cause by considering those variables called confounders. Frequently, confounding variables are attributes in existing data, but sometimes they can be missing from the existing data. In those cases, data analysts have to look for confounders from outside sources such as tables, knowledge graphs, and text. Our focus is to look for confounding variables from visual data such as videos and images. Discovering confounders from visual data is a challenge because videos and images are unstructured unlike tables and graphs. Thus, it is difficult to identify features and also extract them from visual data. Additionally, the identified and extracted features must be relevant to the casual question being studied. With the recent advancement in visual language models (VLMs) such as GPT-4V(ision), VLMs can provide a versatile solution to the confounder discovery and feature extraction problem when using visual data. This thesis proposal investigates confounder discovery, feature extraction, and casual inference from visual data by utilizing the power of VLMs.
first_indexed	2024-09-23T14:32:13Z
format	Thesis
id	mit-1721.1/156837
institution	Massachusetts Institute of Technology
last_indexed	2024-09-23T14:32:13Z
publishDate	2024
publisher	Massachusetts Institute of Technology
record_format	dspace
spelling	mit-1721.1/1568372024-09-17T03:18:55Z Improving Causal Inference and Attribute Prediction Through Visual Information Chau, Eileen Cafarella, Michael Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Causal inference is an active area of research in computer science and statistics as it is used to understand casual conclusions that traditional statistics cannot. A naive way to conclude the cause of an outcome is by using correlations, but this is not always accurate because there may be other variables that indirectly affect an outcome. Causal inference aims to find the root cause by considering those variables called confounders. Frequently, confounding variables are attributes in existing data, but sometimes they can be missing from the existing data. In those cases, data analysts have to look for confounders from outside sources such as tables, knowledge graphs, and text. Our focus is to look for confounding variables from visual data such as videos and images. Discovering confounders from visual data is a challenge because videos and images are unstructured unlike tables and graphs. Thus, it is difficult to identify features and also extract them from visual data. Additionally, the identified and extracted features must be relevant to the casual question being studied. With the recent advancement in visual language models (VLMs) such as GPT-4V(ision), VLMs can provide a versatile solution to the confounder discovery and feature extraction problem when using visual data. This thesis proposal investigates confounder discovery, feature extraction, and casual inference from visual data by utilizing the power of VLMs. M.Eng. 2024-09-16T13:52:09Z 2024-09-16T13:52:09Z 2024-05 2024-07-11T14:37:06.668Z Thesis https://hdl.handle.net/1721.1/156837 In Copyright - Educational Use Permitted Copyright retained by author(s) https://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle	Chau, Eileen Improving Causal Inference and Attribute Prediction Through Visual Information
title	Improving Causal Inference and Attribute Prediction Through Visual Information
title_full	Improving Causal Inference and Attribute Prediction Through Visual Information
title_fullStr	Improving Causal Inference and Attribute Prediction Through Visual Information
title_full_unstemmed	Improving Causal Inference and Attribute Prediction Through Visual Information
title_short	Improving Causal Inference and Attribute Prediction Through Visual Information
title_sort	improving causal inference and attribute prediction through visual information
url	https://hdl.handle.net/1721.1/156837
work_keys_str_mv	AT chaueileen improvingcausalinferenceandattributepredictionthroughvisualinformation

Improving Causal Inference and Attribute Prediction Through Visual Information

Podobne zapisy