Inverse Constitutional AI

Inverse Constitutional AI

The alignment of large language models (LLMs) to human values becomes more and more pressing as their scale and capabilities have grown. One important feature of alignment is understanding the preference datasets that are used to finetune LLMs. Inverse Constitutional AI (ICAI) is presented as a nove...

Bibliografski detalji
Glavni autor:	Kostolansky, Timothy H.
Daljnji autori:	Hadfield-Menell, Dylan
Format:	Disertacija
Izdano:	Massachusetts Institute of Technology 2024
Online pristup:	https://hdl.handle.net/1721.1/156804

Slični predmeti

Image inversion and uncertainty quantification for constitutive laws of pattern formation
od: Zhao, Hongbo, i dr.
Izdano: (2022)

Image inversion and uncertainty quantification for constitutive laws of pattern formation
od: Zhao, Hongbo, i dr.
Izdano: (2021)

A model for an inverse power constitutive law for cerebral compliance
od: Wirth, B, i dr.
Izdano: (2008)

AI-assisted reaction impurity prediction and inverse structure elucidation
od: Mohapatra, Somesh
Izdano: (2024)

Constitution and constitutionalism
od: Moten, Abdul Rashid
Izdano: (2008)

Constitutions, constitutionalism, and the European Union
od: Craig, P
Izdano: (2001)

The European Economic Constitution and the Constitutional Dimension of Private Law
od: Collins, H
Izdano: (2009)

Popular sovereignty, constitutionalism and the Indian constitution
od: Dolcetti, A
Izdano: (2019)

The paradox of constitutionalism or the potential of constitutional theory?
od: Galligan, D
Izdano: (2008)

Constitutions
od: Chou, S
Izdano: (2022)

Constitutional court and constitutional economy: A study on decisions of Indonesian constitutional court
od: Salman, Radian
Izdano: (2009)

On the rate-dependent constitutive response of cortical and trabecular bone
od: Johnson, Timothy Paul Mahal
Izdano: (2011)

Between the people and the constitution: the constitutional role(s) of the legislature
od: Sathanapally, A
Izdano: (2009)

Constitutional directives: morally-committed political constitutionalism
od: Khaitan, T
Izdano: (2019)

Optical resonators constitute a universal spin simulator
od: Verstraelen, Wouter, i dr.
Izdano: (2025)

Inverse Inverse Graphics
od: Chandra, Kartik
Izdano: (2023)

Implied constitutional principles
od: Zhou, H
Izdano: (2012)

Constitutionalism, quasi-constitutionalism, and representative democracy: the case of Bulgaria
Izdano: (2010)

Constitutionalism, counterterrorism, and the courts: Changes in the British constitutional landscape
od: Kavanagh, A
Izdano: (2011)

IBRG constitution
od: Irish in Britain Representation Group, IBRG
Izdano: (1983)

The Constitution of Peoples
od: Skach, C
Izdano: (1970)

Constitutional logic
od: Endicott, T
Izdano: (2003)

Constitutional statutes
od: Ahmed, F, i dr.
Izdano: (2016)

Constitutional Reform
od: McLean, I
Izdano: (2009)

Milestone constitution
od: Silm, Bouchaib.
Izdano: (2008)

Epistocracy and constitutions
od: Bhatia, U
Izdano: (2019)

The balance of the constitution
od: Eakins, R
Izdano: (2022)

Discursive constitutionalism
od: Bui, NS
Izdano: (2023)

Constitutional interpretation
od: Endicott, TAO
Izdano: (2021)

Religion and the Constitution
od: Skach, C
Izdano: (2013)

Legal mosaics: the post-Mubarak Egyptian constitutions, their legal legacies and constitutional heritages
od: McRobie, H
Izdano: (2015)

Acting as Inverse Inverse Planning
od: Chandra, Kartik, i dr.
Izdano: (2023)

Oral tradition and the adat constitution of Negeri Sembilan: A study of traditional Malay constitutional government and constitutional monarchy
od: Haron, Nadzan
Izdano: (2014)

AI Meets Database: AI4DB and DB4AI
od: Li, Guoliang, i dr.
Izdano: (2022)

Bayesian inverse problems and seismic inversion
od: Lim, S
Izdano: (2016)

Kilkenny Association Constitution
od: Kilkenny Association, KILK

Draft constitution and rules
od: Anti-Partition League, APL

Constitutional issues in Australia
od: Anti-Partition League, APL
Izdano: (1937)

Proposed amendments to constitution
od: Irish in Britain Representation Group, IBRG

Understanding Selangor's constitution
od: Bari, Abdul Aziz
Izdano: (2010)