Managing extreme AI risks amid rapid progress

Artificial Intelligence (AI) is progressing rapidly, and companies are shifting their focus to developing generalist AI systems that can autonomously act and pursue goals. Increases in capabilities and autonomy may soon massively amplify AI's impact, with risks that include large-scale social h...

Bibliographic details
Authors:	Bengio, Y, Hinton, G, Yao, A, Song, D, Abbeel, P, Darrell, T, Harari, YN, Zhang, Y-Q, Xue, L, Shalev-Shwartz, S, Hadfield, G, Clune, J, Maharaj, T, Hutter, F, Baydin, AG, McIlraith, S, Gao, Q, Acharya, A, Krueger, D, Dragan, A, Torr, P, Russell, S, Kahneman, D, Brauner, J, Mindermann, S
Format:	Internet publication
Language:	English
Published:	2023

_version_	1826317763077996544
author	Bengio, Y Hinton, G Yao, A Song, D Abbeel, P Darrell, T Harari, YN Zhang, Y-Q Xue, L Shalev-Shwartz, S Hadfield, G Clune, J Maharaj, T Hutter, F Baydin, AG McIlraith, S Gao, Q Acharya, A Krueger, D Dragan, A Torr, P Russell, S Kahneman, D Brauner, J Mindermann, S
author_facet	Bengio, Y Hinton, G Yao, A Song, D Abbeel, P Darrell, T Harari, YN Zhang, Y-Q Xue, L Shalev-Shwartz, S Hadfield, G Clune, J Maharaj, T Hutter, F Baydin, AG McIlraith, S Gao, Q Acharya, A Krueger, D Dragan, A Torr, P Russell, S Kahneman, D Brauner, J Mindermann, S
author_sort	Bengio, Y
collection	OXFORD
description	Artificial Intelligence (AI) is progressing rapidly, and companies are shifting their focus to developing generalist AI systems that can autonomously act and pursue goals. Increases in capabilities and autonomy may soon massively amplify AI's impact, with risks that include large-scale social harms, malicious uses, and an irreversible loss of human control over autonomous AI systems. Although researchers have warned of extreme risks from AI, there is a lack of consensus about how exactly such risks arise, and how to manage them. Society's response, despite promising first steps, is incommensurate with the possibility of rapid, transformative progress that is expected by many experts. AI safety research is lagging. Present governance initiatives lack the mechanisms and institutions to prevent misuse and recklessness, and barely address autonomous systems. In this short consensus paper, we describe extreme risks from upcoming, advanced AI systems. Drawing on lessons learned from other safety-critical technologies, we then outline a comprehensive plan combining technical research and development with proactive, adaptive governance mechanisms for a more commensurate preparation.
first_indexed	2025-03-11T16:59:04Z
format	Internet publication
id	oxford-uuid:79b0f23a-9dbc-43d1-88b6-f90b1318e361
institution	University of Oxford
language	English
last_indexed	2025-03-11T16:59:04Z
publishDate	2023
record_format	dspace
spelling	oxford-uuid:79b0f23a-9dbc-43d1-88b6-f90b1318e361 2025-03-06T16:21:24Z Managing extreme AI risks amid rapid progress Internet publication http://purl.org/coar/resource_type/c_7ad9 uuid:79b0f23a-9dbc-43d1-88b6-f90b1318e361 English Symplectic Elements 2023 Bengio, Y Hinton, G Yao, A Song, D Abbeel, P Darrell, T Harari, YN Zhang, Y-Q Xue, L Shalev-Shwartz, S Hadfield, G Clune, J Maharaj, T Hutter, F Baydin, AG McIlraith, S Gao, Q Acharya, A Krueger, D Dragan, A Torr, P Russell, S Kahneman, D Brauner, J Mindermann, S Artificial Intelligence (AI) is progressing rapidly, and companies are shifting their focus to developing generalist AI systems that can autonomously act and pursue goals. Increases in capabilities and autonomy may soon massively amplify AI's impact, with risks that include large-scale social harms, malicious uses, and an irreversible loss of human control over autonomous AI systems. Although researchers have warned of extreme risks from AI, there is a lack of consensus about how exactly such risks arise, and how to manage them. Society's response, despite promising first steps, is incommensurate with the possibility of rapid, transformative progress that is expected by many experts. AI safety research is lagging. Present governance initiatives lack the mechanisms and institutions to prevent misuse and recklessness, and barely address autonomous systems. In this short consensus paper, we describe extreme risks from upcoming, advanced AI systems. Drawing on lessons learned from other safety-critical technologies, we then outline a comprehensive plan combining technical research and development with proactive, adaptive governance mechanisms for a more commensurate preparation.
title	Managing extreme AI risks amid rapid progress