Reversing the logic of generative AI alignment: a pragmatic approach for public interest

The alignment of artificial intelligence (AI) systems with societal values and the public interest is a critical challenge in the field of AI ethics and governance. Traditional approaches, such as Reinforcement Learning with Human Feedback (RLHF) and Constitutional AI, often rely on pre-defined high...

Full description

Bibliographic Details
Main Author: Gleb Papyshev
Format: Article
Language:English
Published: Cambridge University Press 2025-01-01
Series:Data & Policy
Subjects:
Online Access:https://www.cambridge.org/core/product/identifier/S2632324925000094/type/journal_article