Judging facts, judging norms: Training machine learning models to judge humans requires a modified approach to labeling data

As governments and industry turn to increased use of automated decision systems, it becomes essential to consider how closely such systems can reproduce human judgment. We identify a core potential failure, finding that annotators label objects differently depending on whether they are being asked a...

Bibliographic Details
Main Authors:	Balagopalan, Aparna; Madras, David; Yang, David H.; Hadfield-Menell, Dylan; Hadfield, Gillian K.; Ghassemi, Marzyeh
Other Authors:	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format:	Article
Language:	en_US
Published:	American Association for the Advancement of Science (AAAS) 2024
Subjects:	Multidisciplinary
Online Access:	https://hdl.handle.net/1721.1/153492

Similar Items