Judging facts, judging norms: Training machine learning models to judge humans requires a modified approach to labeling data

As governments and industry turn to increased use of automated decision systems, it becomes essential to consider how closely such systems can reproduce human judgment. We identify a core potential failure, finding that annotators label objects differently depending on whether they are being asked a...

Full description

Bibliographic Details
Main Authors: Balagopalan, Aparna, Madras, David, Yang, David H., Hadfield-Menell, Dylan, Hadfield, Gillian K., Ghassemi, Marzyeh
Format: Article
Language:en_US
Published: American Association for the Advancement of Science (AAAS) 2024
Subjects:
Online Access:https://hdl.handle.net/1721.1/153492