Transparent Value Alignment: Foundations for Human-Centered Explainable AI in Alignment

Alignment of autonomous agents' values and objectives with those of humans can greatly enhance these agents' ability to act flexibly to safely and reliably meet humans' goals across diverse contexts from space exploration to robotic manufacturing. However, it is often difficult or imp...

Full description

Bibliographic Details
Main Author:	Sanneman, Lindsay
Other Authors:	Shah, Julie A.
Format:	Thesis
Published:	Massachusetts Institute of Technology 2023
Online Access:	https://hdl.handle.net/1721.1/151499

Internet

https://hdl.handle.net/1721.1/151499

Transparent Value Alignment: Foundations for Human-Centered Explainable AI in Alignment

Internet

Similar Items