How to Keep AI Under Control | Max Tegmark | TED

172,007 views ・ 2023-11-02

TED


00:03
Five years ago, I stood on the TED stage and warned about the dangers of superintelligence. I was wrong. It went even worse than I thought.

(Laughter)

00:20
I never thought governments would let AI companies get this far without any meaningful regulation. And the progress of AI went even faster than I predicted.

00:32
Look, I showed this abstract landscape of tasks where the elevation represented how hard it was for AI to do each task at human level, and the sea level represented what AI could do back then. And boy oh boy, has the sea been rising fast ever since. But a lot of these tasks have already gone blub blub blub blub blub blub. And the water is on track to submerge all land, matching human intelligence at all cognitive tasks. This is a definition of artificial general intelligence, AGI, which is the stated goal of companies like OpenAI, Google DeepMind and Anthropic. And these companies are also trying to build superintelligence, leaving human intelligence far behind. And many think it'll only be a few years, maybe, from AGI to superintelligence.

01:24
So when are we going to get AGI? Well, until recently, most AI researchers thought it was at least decades away. And now Microsoft is saying, "Oh, it's almost here." We're seeing sparks of AGI in ChatGPT-4, and the Metaculus betting site is showing the time left to AGI plummeting from 20 years away to three years away in the last 18 months. And leading industry people are now predicting that we have maybe two or three years left until we get outsmarted. So you better stop talking about AGI as a long-term risk, or someone might call you a dinosaur stuck in the past.
02:08
It's really remarkable how AI has progressed recently. Not long ago, robots moved like this.

(Music)

02:18
Now they can dance.

(Music)

02:29
Just last year, Midjourney produced this image. This year, the exact same prompt produces this.

02:39
Deepfakes are getting really convincing.

02:43
(Video) Deepfake Tom Cruise: I'm going to show you some magic. It's the real thing.

(Laughs)

I mean ... It's all ... the real ... thing.

02:58
Max Tegmark: Or is it?
03:02
And Yoshua Bengio now argues that large language models have mastered language and knowledge to the point that they pass the Turing test. I know some skeptics are saying, "Nah, they're just overhyped stochastic parrots that lack a model of the world," but they clearly have a representation of the world. In fact, we recently found that Llama-2 even has a literal map of the world in it. And AI also builds geometric representations of more abstract concepts, like what it thinks is true and false.

03:40
So what's going to happen if we get AGI and superintelligence? If you only remember one thing from my talk, let it be this. AI godfather Alan Turing predicted that the default outcome is the machines take control. The machines take control.

04:04
I know this sounds like science fiction, but, you know, having AI as smart as GPT-4 also sounded like science fiction not long ago. And if you think of AI, if you think of superintelligence in particular, as just another technology, like electricity, you're probably not very worried. But you see, Turing thinks of superintelligence more like a new species. Think of it: we are building creepy, super capable, amoral psychopaths that don't sleep and think much faster than us, can make copies of themselves and have nothing human about them at all. So what could possibly go wrong?
(Laughter)

04:47
And it's not just Turing. OpenAI CEO Sam Altman, who gave us ChatGPT, recently warned that it could be "lights out for all of us." Anthropic CEO Dario Amodei even put a number on this risk: 10-25 percent. And it's not just them. Human extinction from AI went mainstream in May, when all the AGI CEOs and who's who of AI researchers came on and warned about it. And last month, even the number one of the European Union warned about human extinction by AI.

05:21
So let me summarize everything I've said so far in just one slide of cat memes. Three years ago, people were saying it's inevitable, superintelligence, it'll be fine, it's decades away. Last year it was more like, it's inevitable, it'll be fine. Now it's more like, it's inevitable.

(Laughter)

05:47
But let's take a deep breath and try to raise our spirits and cheer ourselves up, because the rest of my talk is going to be about the good news, that it's not inevitable, and we can absolutely do better, alright?

(Applause)

06:02
So ... The real problem is that we lack a convincing plan for AI safety. People are working hard on evals looking for risky AI behavior, and that's good, but clearly not good enough. They're basically training AI to not say bad things rather than not do bad things. Moreover, evals and debugging are really just necessary, not sufficient, conditions for safety. In other words, they can prove the presence of risk, not the absence of risk.

06:42
So let's up our game, alright? Try to see how we can make provably safe AI that we can control. Guardrails try to physically limit harm. But if your adversary is superintelligence or a human using superintelligence against you, right, trying is just not enough. You need to succeed. Harm needs to be impossible. So we need provably safe systems. Provable, not in the weak sense of convincing some judge, but in the strong sense of there being something that's impossible according to the laws of physics. Because no matter how smart an AI is, it can't violate the laws of physics and do what's provably impossible. Steve Omohundro and I wrote a paper about this, and we're optimistic that this vision can really work. So let me tell you a little bit about how.
07:34
There's a venerable field called formal verification, which proves stuff about code. And I'm optimistic that AI will revolutionize the automatic proving business and also revolutionize program synthesis, the ability to automatically write really good code.

07:54
So here is how our vision works. You, the human, write a specification that your AI tool must obey: that it's impossible to log in to your laptop without the correct password, or that a DNA printer cannot synthesize dangerous viruses. Then a very powerful AI creates both your AI tool and a proof that your tool meets your spec.
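To make the idea of a spec concrete, here is a minimal sketch in Python (the names and the exhaustive check are illustrative assumptions, not the actual system): the "no login without the correct password" requirement is written as a predicate over a login function's behavior, separate from whatever implementation the powerful AI produces. A real system would discharge this with a machine-checked proof covering all inputs; the toy enumeration below only stands in for that proof.

```python
from typing import Callable

# The human-written spec: the tool may grant access only when the
# attempted password matches the stored secret, never otherwise.
def meets_spec(login: Callable[[str, str], bool],
               passwords: list[str]) -> bool:
    return all(
        not login(secret, attempt) or attempt == secret
        for secret in passwords
        for attempt in passwords
    )

# A candidate "AI tool": some implementation we didn't write ourselves.
def candidate_login(secret: str, attempt: str) -> bool:
    return attempt == secret

# A real proof would cover ALL strings; this check only enumerates a
# tiny domain to show what the spec asserts.
print(meets_spec(candidate_login, ["hunter2", "letmein", ""]))  # True
```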
08:22
Machine learning is uniquely good at learning algorithms, but once the algorithm has been learned, you can re-implement it in a different computational architecture that's easier to verify.

08:35
Now you might worry: how on earth am I going to understand this powerful AI and the powerful AI tool it built and the proof, if they're all too complicated for any human to grasp? Here is the really great news. You don't have to understand any of that stuff, because it's much easier to verify a proof than to discover it. So you only have to understand or trust your proof-checking code, which could be just a few hundred lines long. And Steve and I envision that such proof checkers get built into all our compute hardware, so it just becomes impossible to run very unsafe code.
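To see why verifying is so much easier than discovering, here is a deliberately tiny proof checker sketched in Python (a hypothetical illustration, not the hardware checkers we envision, which would handle far richer logics): it accepts only proofs built from stated premises and correct applications of modus ponens. Note how little code it takes to check a proof, compared with the search needed to find one.

```python
# Formulas are atoms (strings) or implications ('->', p, q).
# A proof is a list of steps: ("premise", formula) introduces an
# assumption; ("mp", i, j) applies modus ponens to earlier lines.
def check_proof(steps):
    lines = []  # formulas established so far
    for step in steps:
        if step[0] == "premise":
            lines.append(step[1])
        elif step[0] == "mp":
            a, imp = lines[step[1]], lines[step[2]]
            assert imp[0] == "->" and imp[1] == a, "invalid modus ponens"
            lines.append(imp[2])  # the implication's conclusion
        else:
            raise ValueError("unknown rule")
    return lines[-1]  # the theorem proved

# Derive r from the premises p, p -> q, q -> r:
proof = [
    ("premise", "p"),
    ("premise", ("->", "p", "q")),
    ("premise", ("->", "q", "r")),
    ("mp", 0, 1),  # derives q
    ("mp", 3, 2),  # derives r
]
print(check_proof(proof))  # prints: r
```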
09:14
What if the AI, though, isn't able to write that AI tool for you? Then there's another possibility. You train an AI to first just learn to do what you want, and then you use a different AI to extract out the learned algorithm and knowledge for you, like an AI neuroscientist. This is in the spirit of the field of mechanistic interpretability, which is making really impressive rapid progress.

09:44
Provably safe systems are clearly not impossible. Let's look at a simple example of where we first machine-learn an algorithm from data and then distill it out in the form of code that provably meets spec, OK? Let's do it with an algorithm that you probably learned in first grade: addition, where you loop over the digits from right to left, and sometimes you do a carry. We'll do it in binary, as if you were counting on two fingers instead of ten. And we first train a recurrent neural network, never mind the details, to nail the task.

10:23
So now you have this algorithm that you don't understand how it works, in a black box defined by a bunch of tables of numbers that we, in nerd speak, call parameters. Then we use an AI tool we built to automatically distill out from this the learned algorithm in the form of a Python program. And then we use the formal verification tool known as Dafny to prove that this program correctly adds up any numbers, not just the numbers that were in your training data.
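For a flavor of what such a distilled program can look like, here is a hypothetical Python version of the learned algorithm (illustrative, not the code actually extracted in our experiment): a ripple-carry loop over the bits from right to left, exactly the first-grade recipe. A tool like Dafny would prove the "adds correctly" spec for all numbers; the bounded check at the end merely illustrates the property being proved.

```python
def binary_add(a: list[int], b: list[int]) -> list[int]:
    """Add two binary numbers given as bit lists, least significant
    bit first: loop over the digits and sometimes do a carry."""
    n = max(len(a), len(b))
    result, carry = [], 0
    for i in range(n):
        bit_a = a[i] if i < len(a) else 0
        bit_b = b[i] if i < len(b) else 0
        total = bit_a + bit_b + carry
        result.append(total % 2)  # digit written in this column
        carry = total // 2        # carry into the next column
    if carry:
        result.append(carry)
    return result

def to_int(bits: list[int]) -> int:
    return sum(bit << i for i, bit in enumerate(bits))

# Dafny would prove this for ALL numbers, not just the training data;
# here we spot-check a bounded range to illustrate the spec.
for x in range(64):
    for y in range(64):
        a = [int(c) for c in reversed(format(x, "b"))]
        b = [int(c) for c in reversed(format(y, "b"))]
        assert to_int(binary_add(a, b)) == x + y
print("spec holds on the tested range")
```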
10:57
So in summary: provably safe AI, I'm convinced, is possible, but it's going to take time and work. And in the meantime, let's remember that all the AI benefits that most people are excited about actually don't require superintelligence. We can have a long and amazing future with AI.

11:25
So let's not pause AI. Let's just pause the reckless race to superintelligence. Let's stop obsessively training ever-larger models that we don't understand. Let's heed the warning from ancient Greece and not get hubris, like in the story of Icarus. Because artificial intelligence is giving us incredible intellectual wings with which we can do things beyond our wildest dreams, if we stop obsessively trying to fly to the sun.

12:02
Thank you.

(Applause)