Transparent Value Alignment: Foundations for Human-Centered Explainable AI in Alignment
Alignment of autonomous agents' values and objectives with those of humans can greatly enhance these agents' ability to act flexibly to safely and reliably meet humans' goals across diverse contexts from space exploration to robotic manufacturing. However, it is often difficult or imp...
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Published: |
Massachusetts Institute of Technology
2023
|
Online Access: | https://hdl.handle.net/1721.1/151499 |