Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond

AbstractA fundamental goal of scientific research is to learn about causal relationships. However, despite its critical role in the life and social sciences, causality has not had the same importance in Natural Language Processing (NLP), which has traditionally placed more emphasis o...

Full description

Bibliographic Details
Main Authors: Amir Feder, Katherine A. Keith, Emaad Manzoor, Reid Pryzant, Dhanya Sridhar, Zach Wood-Doughty, Jacob Eisenstein, Justin Grimmer, Roi Reichart, Margaret E. Roberts, Brandon M. Stewart, Victor Veitch, Diyi Yang
Format: Article
Language:English
Published: The MIT Press 2022-01-01
Series:Transactions of the Association for Computational Linguistics
Online Access:https://direct.mit.edu/tacl/article/doi/10.1162/tacl_a_00511/113490/Causal-Inference-in-Natural-Language-Processing
_version_ 1797990199324573696
author Amir Feder
Katherine A. Keith
Emaad Manzoor
Reid Pryzant
Dhanya Sridhar
Zach Wood-Doughty
Jacob Eisenstein
Justin Grimmer
Roi Reichart
Margaret E. Roberts
Brandon M. Stewart
Victor Veitch
Diyi Yang
author_facet Amir Feder
Katherine A. Keith
Emaad Manzoor
Reid Pryzant
Dhanya Sridhar
Zach Wood-Doughty
Jacob Eisenstein
Justin Grimmer
Roi Reichart
Margaret E. Roberts
Brandon M. Stewart
Victor Veitch
Diyi Yang
author_sort Amir Feder
collection DOAJ
description AbstractA fundamental goal of scientific research is to learn about causal relationships. However, despite its critical role in the life and social sciences, causality has not had the same importance in Natural Language Processing (NLP), which has traditionally placed more emphasis on predictive tasks. This distinction is beginning to fade, with an emerging area of interdisciplinary research at the convergence of causal inference and language processing. Still, research on causality in NLP remains scattered across domains without unified definitions, benchmark datasets and clear articulations of the challenges and opportunities in the application of causal inference to the textual domain, with its unique properties. In this survey, we consolidate research across academic areas and situate it in the broader NLP landscape. We introduce the statistical challenge of estimating causal effects with text, encompassing settings where text is used as an outcome, treatment, or to address confounding. In addition, we explore potential uses of causal inference to improve the robustness, fairness, and interpretability of NLP models. We thus provide a unified overview of causal inference for the NLP community.1
first_indexed 2024-04-11T08:32:44Z
format Article
id doaj.art-ce20dd42414b496493ce8c27322a7d13
institution Directory Open Access Journal
issn 2307-387X
language English
last_indexed 2024-04-11T08:32:44Z
publishDate 2022-01-01
publisher The MIT Press
record_format Article
series Transactions of the Association for Computational Linguistics
spelling doaj.art-ce20dd42414b496493ce8c27322a7d132022-12-22T04:34:27ZengThe MIT PressTransactions of the Association for Computational Linguistics2307-387X2022-01-01101138115810.1162/tacl_a_00511Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and BeyondAmir Feder0Katherine A. Keith1Emaad Manzoor2Reid Pryzant3Dhanya Sridhar4Zach Wood-Doughty5Jacob Eisenstein6Justin Grimmer7Roi Reichart8Margaret E. Roberts9Brandon M. Stewart10Victor Veitch11Diyi Yang12Technion - Israel Institute of Technology, IsraelWilliams College, USAUniversity of Wisconsin - Madison, USAMicrosoft, USAColumbia University, CanadaNorthwestern University, USAGoogle Research, USAStanford University, USATechnion - Israel Institute of Technology, IsraelUniversity of California San Diego, USAPrinceton University, USAGoogle Research, USAGeorgia Tech, USA AbstractA fundamental goal of scientific research is to learn about causal relationships. However, despite its critical role in the life and social sciences, causality has not had the same importance in Natural Language Processing (NLP), which has traditionally placed more emphasis on predictive tasks. This distinction is beginning to fade, with an emerging area of interdisciplinary research at the convergence of causal inference and language processing. Still, research on causality in NLP remains scattered across domains without unified definitions, benchmark datasets and clear articulations of the challenges and opportunities in the application of causal inference to the textual domain, with its unique properties. In this survey, we consolidate research across academic areas and situate it in the broader NLP landscape. We introduce the statistical challenge of estimating causal effects with text, encompassing settings where text is used as an outcome, treatment, or to address confounding. In addition, we explore potential uses of causal inference to improve the robustness, fairness, and interpretability of NLP models. We thus provide a unified overview of causal inference for the NLP community.1https://direct.mit.edu/tacl/article/doi/10.1162/tacl_a_00511/113490/Causal-Inference-in-Natural-Language-Processing
spellingShingle Amir Feder
Katherine A. Keith
Emaad Manzoor
Reid Pryzant
Dhanya Sridhar
Zach Wood-Doughty
Jacob Eisenstein
Justin Grimmer
Roi Reichart
Margaret E. Roberts
Brandon M. Stewart
Victor Veitch
Diyi Yang
Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond
Transactions of the Association for Computational Linguistics
title Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond
title_full Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond
title_fullStr Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond
title_full_unstemmed Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond
title_short Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond
title_sort causal inference in natural language processing estimation prediction interpretation and beyond
url https://direct.mit.edu/tacl/article/doi/10.1162/tacl_a_00511/113490/Causal-Inference-in-Natural-Language-Processing
work_keys_str_mv AT amirfeder causalinferenceinnaturallanguageprocessingestimationpredictioninterpretationandbeyond
AT katherineakeith causalinferenceinnaturallanguageprocessingestimationpredictioninterpretationandbeyond
AT emaadmanzoor causalinferenceinnaturallanguageprocessingestimationpredictioninterpretationandbeyond
AT reidpryzant causalinferenceinnaturallanguageprocessingestimationpredictioninterpretationandbeyond
AT dhanyasridhar causalinferenceinnaturallanguageprocessingestimationpredictioninterpretationandbeyond
AT zachwooddoughty causalinferenceinnaturallanguageprocessingestimationpredictioninterpretationandbeyond
AT jacobeisenstein causalinferenceinnaturallanguageprocessingestimationpredictioninterpretationandbeyond
AT justingrimmer causalinferenceinnaturallanguageprocessingestimationpredictioninterpretationandbeyond
AT roireichart causalinferenceinnaturallanguageprocessingestimationpredictioninterpretationandbeyond
AT margareteroberts causalinferenceinnaturallanguageprocessingestimationpredictioninterpretationandbeyond
AT brandonmstewart causalinferenceinnaturallanguageprocessingestimationpredictioninterpretationandbeyond
AT victorveitch causalinferenceinnaturallanguageprocessingestimationpredictioninterpretationandbeyond
AT diyiyang causalinferenceinnaturallanguageprocessingestimationpredictioninterpretationandbeyond