Reversing the logic of generative AI alignment: a pragmatic approach for public interest

The alignment of artificial intelligence (AI) systems with societal values and the public interest is a critical challenge in the field of AI ethics and governance. Traditional approaches, such as Reinforcement Learning with Human Feedback (RLHF) and Constitutional AI, often rely on pre-defined high...

Full description

Bibliographic Details
Main Author:	Gleb Papyshev
Format:	Article
Language:	English
Published:	Cambridge University Press 2025-01-01
Series:	Data & Policy
Subjects:	AI alignment constitutional AI pragmatism public interest reinforcement learning with human feedback
Online Access:	https://www.cambridge.org/core/product/identifier/S2632324925000094/type/journal_article

Internet

https://www.cambridge.org/core/product/identifier/S2632324925000094/type/journal_article

Reversing the logic of generative AI alignment: a pragmatic approach for public interest

Internet

Similar Items