Will Superintelligent AI End the World? | Eliezer Yudkowsky | TED

309,707 views ・ 2023-07-11

TED

Please double-click on the English subtitles below to play the video.

00:04

Since 2001, I have been working on what we would now call

4334

3503

00:07

the problem of aligning artificial general intelligence:

7879

3629

00:11

how to shape the preferences and behavior

11550

2085

00:13

of a powerful artificial mind such that it does not kill everyone.

13677

4838

00:19

I more or less founded the field two decades ago,

19224

3128

00:22

when nobody else considered it rewarding enough to work on.

22352

2794

00:25

I tried to get this very important project started early

25146

2711

00:27

so we'd be in less of a drastic rush later.

27899

2544

00:31

I consider myself to have failed.

31069

2044

00:33

(Laughter)

33154

1001

00:34

Nobody understands how modern AI systems do what they do.

34197

3337

00:37

They are giant, inscrutable matrices of floating point numbers

37576

2961

00:40

that we nudge in the direction of better performance

40579

2460

00:43

until they inexplicably start working.

43039

2044

00:45

At some point, the companies rushing headlong to scale AI

45083

3378

00:48

will cough out something that's smarter than humanity.

48461

2670

00:51

Nobody knows how to calculate when that will happen.

51131

2544

00:53

My wild guess is that it will happen after zero to two more breakthroughs

53675

3712

00:57

the size of transformers.

57429

1710

00:59

What happens if we build something smarter than us

59139

2335

01:01

that we understand that poorly?

61516

2377

01:03

Some people find it obvious that building something smarter than us

63935

3170

01:07

that we don't understand might go badly.

67105

2294

01:09

Others come in with a very wide range of hopeful thoughts

69441

3879

01:13

about how it might possibly go well.

73361

2044

01:16

Even if I had 20 minutes for this talk and months to prepare it,

76281

3337

01:19

I would not be able to refute all the ways people find to imagine

79659

3087

01:22

that things might go well.

82746

1710

01:24

But I will say that there is no standard scientific consensus

84456

4546

01:29

for how things will go well.

89044

1668

01:30

There is no hope that has been widely persuasive

90712

2252

01:33

and stood up to skeptical examination.

93006

1877

01:35

There is nothing resembling a real engineering plan for us surviving

95592

4421

01:40

that I could critique.

100013

1710

01:41

This is not a good place in which to find ourselves.

101765

2752

01:44

If I had more time,

104517

1210

01:45

I'd try to tell you about the predictable reasons

105727

2294

01:48

why the current paradigm will not work

108063

1960

01:50

to build a superintelligence that likes you

110065

2460

01:52

or is friends with you, or that just follows orders.

112567

3378

01:56

Why, if you press "thumbs up" when humans think that things went right

116613

4379

02:01

or "thumbs down" when another AI system thinks that they went wrong,

121034

3712

02:04

you do not get a mind that wants nice things

124788

3503

02:08

in a way that generalizes well outside the training distribution

128291

4004

02:12

to where the AI is smarter than the trainers.

132337

2461

02:15

You can search for "Yudkowsky list of lethalities" for more.

135674

5130

02:20

(Laughter)

140804

1793

02:22

But to worry, you do not need to believe me

142597

2169

02:24

about exact predictions of exact disasters.

144808

2878

02:27

You just need to expect that things are not going to work great

147727

2962

02:30

on the first really serious, really critical try

150730

3045

02:33

because an AI system smart enough to be truly dangerous

153817

3503

02:37

was meaningfully different from AI systems stupider than that.

157362

3503

02:40

My prediction is that this ends up with us facing down something smarter than us

160907

5005

02:45

that does not want what we want,

165954

1752

02:47

that does not want anything we recognize as valuable or meaningful.

167706

4170

02:52

I cannot predict exactly how a conflict between humanity and a smarter AI would go

172252

4588

02:56

for the same reason I can't predict exactly how you would lose a chess game

176881

3838

03:00

to one of the current top AI chess programs, let's say Stockfish.

180760

3671

03:04

If I could predict exactly where Stockfish could move,

184973

3545

03:08

I could play chess that well myself.

188560

2544

03:11

I can't predict exactly how you'll lose to Stockfish,

191146

2502

03:13

but I can predict who wins the game.

193648

2044

03:16

I do not expect something actually smart to attack us with marching robot armies

196401

4421

03:20

with glowing red eyes

200864

1543

03:22

where there could be a fun movie about us fighting them.

202407

3086

03:25

I expect an actually smarter and uncaring entity

205535

2502

03:28

will figure out strategies and technologies

208079

2002

03:30

that can kill us quickly and reliably and then kill us.

210123

3545

03:34

I am not saying that the problem of aligning superintelligence

214461

2919

03:37

is unsolvable in principle.

217422

1710

03:39

I expect we could figure it out with unlimited time and unlimited retries,

219174

4879

03:44

which the usual process of science assumes that we have.

224095

3796

03:48

The problem here is the part where we don't get to say,

228308

2836

03:51

“Ha ha, whoops, that sure didn’t work.

231186

2294

03:53

That clever idea that used to work on earlier systems

233521

3837

03:57

sure broke down when the AI got smarter, smarter than us.”

237358

3504

04:01

We do not get to learn from our mistakes and try again

241571

2544

04:04

because everyone is already dead.

244157

1960

04:07

It is a large ask

247076

2586

04:09

to get an unprecedented scientific and engineering challenge

249704

3003

04:12

correct on the first critical try.

252707

2002

04:15

Humanity is not approaching this issue with remotely

255084

3045

04:18

the level of seriousness that would be required.

258171

2711

04:20

Some of the people leading these efforts

260924

1918

04:22

have spent the last decade not denying

262884

2502

04:25

that creating a superintelligence might kill everyone,

265386

2920

04:28

but joking about it.

268306

1376

04:30

We are very far behind.

270183

1877

04:32

This is not a gap we can overcome in six months,

272101

2294

04:34

given a six-month moratorium.

274437

1919

04:36

If we actually try to do this in real life,

276689

2503

04:39

we are all going to die.

279234

1668

04:41

People say to me at this point, what's your ask?

281402

2586

04:44

I do not have any realistic plan,

284697

1627

04:46

which is why I spent the last two decades

286324

1960

04:48

trying and failing to end up anywhere but here.

288326

2586

04:51

My best bad take is that we need an international coalition

291913

3670

04:55

banning large AI training runs,

295625

2252

04:57

including extreme and extraordinary measures

297877

3379

05:01

to have that ban be actually and universally effective,

100

301256

3253

05:04

like tracking all GPU sales,

101

304509

2377

05:06

monitoring all the data centers,

102

306928

2127

05:09

being willing to risk a shooting conflict between nations

103

309055

2711

05:11

in order to destroy an unmonitored data center

104

311808

2711

05:14

in a non-signatory country.

105

314561

1793

05:17

I say this, not expecting that to actually happen.

106

317480

3170

05:21

I say this expecting that we all just die.

107

321109

2919

05:24

But it is not my place to just decide on my own

108

324779

3295

05:28

that humanity will choose to die,

109

328074

2211

05:30

to the point of not bothering to warn anyone.

110

330326

2461

05:33

I have heard that people outside the tech industry

111

333204

2336

05:35

are getting this point faster than people inside it.

112

335582

2627

05:38

Maybe humanity wakes up one morning and decides to live.

113

338251

3712

05:43

Thank you for coming to my brief TED talk.

114

343006

2043

05:45

(Laughter)

115

345049

1585

05:46

(Applause and cheers)

116

346676

6965

05:56

Chris Anderson: So, Eliezer, thank you for coming and giving that.

117

356102

4421

06:00

It seems like what you're raising the alarm about is that like,

118

360523

3962

06:04

for this to happen, for an AI to basically destroy humanity,

119

364527

3962

06:08

it has to break out, escape controls of the internet and, you know,

120

368489

5172

06:13

start commanding actual real-world resources.

121

373661

2920

06:16

You say you can't predict how that will happen,

122

376581

2210

06:18

but just paint one or two possibilities.

123

378833

3462

06:22

Eliezer Yudkowsky: OK, so why is this hard?

124

382337

2794

06:25

First, because you can't predict exactly where a smarter chess program will move.

125

385131

3837

06:28

Maybe even more importantly than that,

126

388968

1877

06:30

imagine sending the design for an air conditioner

127

390887

2794

06:33

back to the 11th century.

128

393723

1877

06:35

Even if they -- if it’s enough detail for them to build it,

129

395642

3128

06:38

they will be surprised when cold air comes out

130

398811

2753

06:41

because the air conditioner will use the temperature-pressure relation

131

401564

3670

06:45

and they don't know about that law of nature.

132

405276

2461

06:47

So if you want me to sketch what a superintelligence might do,

133

407779

4629

06:52

I can go deeper and deeper into places

134

412450

2336

06:54

where we think there are predictable technological advancements

135

414827

2962

06:57

that we haven't figured out yet.

136

417789

1626

06:59

And as I go deeper, it will get harder and harder to follow.

137

419415

2836

07:02

It could be super persuasive.

138

422251

1961

07:04

That's relatively easy to understand.

139

424253

1877

07:06

We do not understand exactly how the brain works,

140

426130

2545

07:08

so it's a great place to exploit laws of nature that we do not know about.

141

428716

3504

07:12

Rules of the environment,

142

432261

1502

07:13

invent new technologies beyond that.

143

433805

2669

07:16

Can you build a synthetic virus that gives humans a cold

144

436808

3879

07:20

and then a bit of neurological change and they're easier to persuade?

145

440728

4213

07:24

Can you build your own synthetic biology,

146

444941

3336

07:28

synthetic cyborgs?

147

448319

1585

07:29

Can you blow straight past that

148

449946

2044

07:31

to covalently bonded equivalents of biology,

149

451990

4004

07:36

where instead of proteins that fold up and are held together by static cling,

150

456035

3754

07:39

you've got things that go down much sharper potential energy gradients

151

459789

3378

07:43

and are bonded together?

152

463209

1376

07:44

People have done advanced design work about this sort of thing

153

464585

3546

07:48

for artificial red blood cells that could hold 100 times as much oxygen

154

468172

3879

07:52

if they were using tiny sapphire vessels to store the oxygen.

155

472093

3754

07:55

There's lots and lots of room above biology,

156

475888

2586

07:58

but it gets harder and harder to understand.

157

478474

2211

08:01

CA: So what I hear you saying

158

481519

1543

08:03

is that these terrifying possibilities there

159

483104

2294

08:05

but your real guess is that AIs will work out something more devious than that.

160

485440

5130

08:10

Is that really a likely pathway in your mind?

161

490570

3837

08:14

EY: Which part?

162

494407

1168

08:15

That they're smarter than I am? Absolutely.

163

495616

2002

08:17

CA: Not that they're smarter,

164

497660

1418

08:19

but why would they want to go in that direction?

165

499078

3045

08:22

Like, AIs don't have our feelings of sort of envy and jealousy and anger

166

502123

5422

08:27

and so forth.

167

507587

1168

08:28

So why might they go in that direction?

168

508755

2460

08:31

EY: Because it's convergently implied by almost any of the strange,

169

511215

4296

08:35

inscrutable things that they might end up wanting

170

515511

3379

08:38

as a result of gradient descent

171

518931

1710

08:40

on these "thumbs up" and "thumbs down" things internally.

172

520683

2711

08:44

If all you want is to make tiny little molecular squiggles

173

524604

3920

08:48

or that's like, one component of what you want,

174

528566

2461

08:51

but it's a component that never saturates, you just want more and more of it,

175

531069

3628

08:54

the same way that we would want more and more galaxies filled with life

176

534697

3337

08:58

and people living happily ever after.

177

538034

1793

08:59

Anything that just keeps going,

178

539869

1627

09:01

you just want to use more and more material for that,

179

541537

3045

09:04

that could kill everyone on Earth as a side effect.

180

544624

2669

09:07

It could kill us because it doesn't want us making other superintelligences

181

547293

3545

09:10

to compete with it.

182

550880

1168

09:12

It could kill us because it's using up all the chemical energy on earth

183

552048

4379

09:16

and we contain some chemical potential energy.

184

556469

2586

09:19

CA: So some people in the AI world worry that your views are strong enough

185

559055

6256

09:25

and they would say extreme enough

186

565311

1585

09:26

that you're willing to advocate extreme responses to it.

187

566896

3170

09:30

And therefore, they worry that you could be, you know,

188

570066

3462

09:33

in one sense, a very destructive figure.

189

573528

1918

09:35

Do you draw the line yourself in terms of the measures

190

575488

3170

09:38

that we should take to stop this happening?

191

578699

2753

09:41

Or is actually anything justifiable to stop

192

581494

3253

09:44

the scenarios you're talking about happening?

193

584747

2211

09:47

EY: I don't think that "anything" works.

194

587834

3253

09:51

I think that this takes state actors

195

591087

4379

09:55

and international agreements

196

595508

2753

09:58

and all international agreements by their nature,

197

598302

3254

10:01

tend to ultimately be backed by force

198

601556

1876

10:03

on the signatory countries and on the non-signatory countries,

199

603474

3379

10:06

which is a more extreme measure.

200

606894

1794

10:09

I have not proposed that individuals run out and use violence,

201

609856

2961

10:12

and I think that the killer argument for that is that it would not work.

202

612859

4337

10:18

CA: Well, you are definitely not the only person to propose

203

618489

2961

10:21

that what we need is some kind of international reckoning here

204

621450

4004

10:25

on how to manage this going forward.

205

625496

2044

10:27

Thank you so much for coming here to TED, Eliezer.

206

627540

2419

10:30

(Applause)

207

630001

2085

New videos

06:27

How do drugs make you hallucinate? - Anees Bahji

06:51

The Rise of China's Homegrown Brands — and Why ...

06:16

How important is politeness? ⏲️ 6 Minute English

07:44

North Korea’s secrets revealed by phone: Study:...

17:30

Advanced English Learning: Speaking Practice

03:48

What can you do? Easy English Conversations 💬 ...

08:33

Can AI Help with the Chaos of Family Life? | Av...

12:13

Speak English Confidently: Daily Tricks & Tips 🧠

Original video on YouTube.com

Will Superintelligent AI End the World? | Eliezer Yudkowsky | TED - YouTube

About this website

This site will introduce you to YouTube videos that are useful for learning English. You will see English lessons taught by top-notch teachers from around the world. Double-click on the English subtitles displayed on each video page to play the video from there. The subtitles scroll in sync with the video playback. If you have any comments or requests, please contact us using this contact form.

https://forms.gle/WvT1wiN1qDtmnspy7

Playback speed

Subtitle font size

Will Superintelligent AI End the World? | Eliezer Yudkowsky | TED

New videos

Will Superintelligent AI End the World? | Eliezer Yudkowsky | TED

New videos

Original video on YouTube.com