Fair Is Better than Sensational: Man Is to Doctor as Woman Is to Doctor

Analogies such as man is to king as woman is to X are often used to illustrate the amazing power of word embeddings. Concurrently, they have also been used to expose how strongly human biases are encoded in vector spaces trained on natural language, with examples like man is to computer programmer a...

Bibliographic Details
Main Authors: Nissim, Malvina; van Noord, Rik; van der Goot, Rob
Format: Article
Language: English
Published: The MIT Press 2020-06-01
Series: Computational Linguistics
Online Access: https://www.mitpressjournals.org/doi/abs/10.1162/coli_a_00379
author Nissim, Malvina
van Noord, Rik
van der Goot, Rob
author_facet Nissim, Malvina
van Noord, Rik
van der Goot, Rob
author_sort Nissim, Malvina
collection DOAJ
description Analogies such as man is to king as woman is to X are often used to illustrate the amazing power of word embeddings. Concurrently, they have also been used to expose how strongly human biases are encoded in vector spaces trained on natural language, with examples like man is to computer programmer as woman is to homemaker. Recent work has shown that analogies are in fact not an accurate diagnostic for bias, but this does not mean that they are not used anymore, or that their legacy is fading. Instead of focusing on the intrinsic problems of the analogy task as a bias detection tool, we discuss a series of issues involving implementation as well as subjective choices that might have yielded a distorted picture of bias in word embeddings. We stand by the truth that human biases are present in word embeddings, and, of course, the need to address them. But analogies are not an accurate tool to do so, and the way they have been most often used has exacerbated some possibly non-existing biases and perhaps hidden others. Because they are still widely popular, and some of them have become classics within and outside the NLP community, we deem it important to provide a series of clarifications that should put well-known, and potentially new analogies, into the right perspective.
first_indexed 2024-12-10T17:41:17Z
format Article
id doaj.art-4b710c571bc84a388f0d878371a3b7f2
institution Directory Open Access Journal
issn 0891-2017
1530-9312
language English
last_indexed 2024-12-10T17:41:17Z
publishDate 2020-06-01
publisher The MIT Press
record_format Article
series Computational Linguistics
spelling doaj.art-4b710c571bc84a388f0d878371a3b7f2 | 2022-12-22T01:39:22Z | eng | The MIT Press | Computational Linguistics | 0891-2017 | 1530-9312 | 2020-06-01 | 46 | 2 | 487 | 497 | 10.1162/coli_a_00379 | Fair Is Better than Sensational: Man Is to Doctor as Woman Is to Doctor | Nissim, Malvina | van Noord, Rik | van der Goot, Rob | Analogies such as man is to king as woman is to X are often used to illustrate the amazing power of word embeddings. Concurrently, they have also been used to expose how strongly human biases are encoded in vector spaces trained on natural language, with examples like man is to computer programmer as woman is to homemaker. Recent work has shown that analogies are in fact not an accurate diagnostic for bias, but this does not mean that they are not used anymore, or that their legacy is fading. Instead of focusing on the intrinsic problems of the analogy task as a bias detection tool, we discuss a series of issues involving implementation as well as subjective choices that might have yielded a distorted picture of bias in word embeddings. We stand by the truth that human biases are present in word embeddings, and, of course, the need to address them. But analogies are not an accurate tool to do so, and the way they have been most often used has exacerbated some possibly non-existing biases and perhaps hidden others. Because they are still widely popular, and some of them have become classics within and outside the NLP community, we deem it important to provide a series of clarifications that should put well-known, and potentially new analogies, into the right perspective. | https://www.mitpressjournals.org/doi/abs/10.1162/coli_a_00379
spellingShingle Nissim, Malvina
van Noord, Rik
van der Goot, Rob
Fair Is Better than Sensational: Man Is to Doctor as Woman Is to Doctor
Computational Linguistics
title Fair Is Better than Sensational: Man Is to Doctor as Woman Is to Doctor
title_full Fair Is Better than Sensational: Man Is to Doctor as Woman Is to Doctor
title_fullStr Fair Is Better than Sensational: Man Is to Doctor as Woman Is to Doctor
title_full_unstemmed Fair Is Better than Sensational: Man Is to Doctor as Woman Is to Doctor
title_short Fair Is Better than Sensational: Man Is to Doctor as Woman Is to Doctor
title_sort fair is better than sensational man is to doctor as woman is to doctor
url https://www.mitpressjournals.org/doi/abs/10.1162/coli_a_00379