The Inside Story of ChatGPT’s Astonishing Potential | Greg Brockman | TED
1,798,027 views ・ 2023-04-20
Translator: Yip Yan Yeung
Reviewer: Grace Man
00:03
We started OpenAI seven years ago
0
3875
2503
七年前,我们成立了 OpenAI,
00:06
because we felt like something
really interesting was happening in AI
1
6378
3712
因为我们认为,AI 领域中
正在上演一些很有趣的事,
00:10
and we wanted to help steer it
in a positive direction.
2
10131
3170
我们想助其走上正轨。
00:15
It's honestly just really amazing to see
3
15220
2085
这个领域自此取得的进步
00:17
how far this whole field
has come since then.
4
17347
3086
着实令人咋舌。
00:20
And it's really gratifying to hear
from people like Raymond
5
20433
3629
我也十分欣慰看到像网友雷蒙德
(Raymond)这样的人们
00:24
who are using the technology
we are building, and others,
6
24104
2836
使用我们正在打造的技术,
还有很多人
00:26
for so many wonderful things.
7
26982
2127
将其用于许多美妙的用途。
00:29
We hear from people who are excited,
8
29150
2503
我们看到有人很兴奋,
00:31
we hear from people who are concerned,
9
31653
1835
我们看到有人很忧虑,
00:33
we hear from people who feel
both those emotions at once.
10
33530
2961
我们看到有人两者兼有。
00:36
And honestly, that's how we feel.
11
36533
2252
说实话,这也是我们的心情。
00:40
Above all, it feels like we're entering
an historic period right now
12
40245
4087
关键是感觉我们
即将进入一个历史性时期,
00:44
where we as a world
are going to define a technology
13
44374
4421
全球各地的人们将要
共同努力定义一项技术,
00:48
that will be so important
for our society going forward.
14
48795
3086
而它对于我们社会
未来的走向至关重要。
00:52
And I believe that we can
manage this for good.
15
52924
2628
我相信我们有能力做到。
00:56
So today, I want to show you
the current state of that technology
16
56845
4171
今天,我想为大家介绍
这项技术的现况
01:01
and some of the underlying
design principles that we hold dear.
17
61016
3086
和其背后我们所珍视的
一些设计原则。
01:09
So the first thing I'm going to show you
18
69983
1918
我要介绍的第一点是
01:11
is what it's like to build
a tool for an AI
19
71943
2086
为 AI (人工智能),
而不是人类打造一个工具,
01:14
rather than building it for a human.
20
74029
1876
是什么样的。
01:17
So we have a new DALL-E model,
which generates images,
21
77574
3545
我们现在有了一个新的 DALL-E
模型,可以生成图片,
01:21
and we are exposing it as an app
for ChatGPT to use on your behalf.
22
81161
4045
它的呈现方式是 ChatGPT
直接替你调用的应用程序。
01:25
And you can do things like ask, you know,
23
85248
2461
你可以执行这样的操作,
比如问一问
01:27
suggest a nice post-TED meal
and draw a picture of it.
24
87751
6631
这场 TED 结束了以后
吃点什么好呢?再画张图吧。
01:35
(Laughter)
25
95216
1419
(笑声)
01:38
Now you get all of the, sort of,
ideation and creative back-and-forth
26
98303
4671
你能看到 ChatGPT
为你提供的这种
构思和发挥创意的往复过程
以及为你处理好所有细节的能力。
01:43
and taking care of the details for you
that you get out of ChatGPT.
27
103016
4004
01:47
And here we go, it's not just
the idea for the meal,
28
107062
2669
看这些回答,已经不仅仅是
关于一顿饭的想法了,
01:49
but a very, very detailed spread.
29
109773
3587
而是非常、
非常详细的大餐细节。
01:54
So let's see what we're going to get.
30
114110
2044
我们来看看能得出什么结果。
01:56
But ChatGPT doesn't just generate
images in this case --
31
116154
3795
但在这种情况下,ChatGPT
不止生成了图片,
01:59
sorry, it doesn't just generate text,
it also generates an image.
32
119991
2836
它不仅生成了文本,
还生成了图片。
02:02
And that is something
that really expands the power
33
122827
2419
这增强了它为了执行你的意图
而替你完成操作的能力。
02:05
of what it can do on your behalf
in terms of carrying out your intent.
34
125246
3504
02:08
And I'll point out,
this is all a live demo.
35
128750
2085
我想提一点,
这都是现场演示。
02:10
This is all generated
by the AI as we speak.
36
130835
2169
这都是在我们的言语之间
由 AI 生成的。
02:13
So I actually don't even know
what we're going to see.
37
133046
2544
其实我都不知道
我们会看到什么样的结果。
02:16
This looks wonderful.
38
136216
2294
看起来非常棒。
02:18
(Applause)
39
138510
3712
(掌声)
02:22
I'm getting hungry just looking at it.
40
142514
1877
看着它我都饿了。
02:24
Now we've extended ChatGPT
with other tools too,
41
144724
2753
我们也把其他工具
接入了 ChatGPT,
02:27
for example, memory.
42
147519
1168
比如记忆。
02:28
You can say "save this for later."
43
148728
2795
你可以输入
“保存,稍后再用”。
02:33
And the interesting thing
about these tools
44
153233
2043
这种工具的有趣之处在于
02:35
is they're very inspectable.
45
155318
1377
检查它们是非常容易的。
02:36
So you get this little pop up here
that says "use the DALL-E app."
46
156695
3128
你可以看到这个弹出插件
写着“调用 DALL-E 应用”。
02:39
And by the way, this is coming to you, all
ChatGPT users, over upcoming months.
47
159823
3712
顺带提一句,
你马上就能用上这个功能了,
每位 ChatGPT 用户,
在几个月内就能用上。
02:43
And you can look under the hood
and see that what it actually did
48
163535
3086
如果你看一下它背后的技术,
看看它到底进行了什么操作,
02:46
was write a prompt
just like a human could.
49
166621
2169
其实就是像人那样
写了个提示(prompt)。
02:48
And so you sort of have
this ability to inspect
50
168790
2628
你可以检查
02:51
how the machine is using these tools,
51
171459
2086
机器是如何使用这些工具的,
02:53
which allows us to provide
feedback to them.
52
173586
2086
这样我们就可以为其提供反馈。
02:55
Now it's saved for later,
53
175714
1209
我们把它存了下来,之后再用,
02:56
and let me show you
what it's like to use that information
54
176965
2878
我来给各位展示一下
如何利用这些信息,
02:59
and to integrate
with other applications too.
55
179884
2503
调用其他应用。
03:02
You can say,
56
182387
2210
你可以这么说:
03:04
“Now make a shopping list
for the tasty thing
57
184639
5506
“请你针对我之前提到的美食,
制作一份购物清单。”
03:10
I was suggesting earlier.”
58
190186
1835
03:12
And make it a little tricky for the AI.
59
192021
2128
我们来给 AI 增加一点难度。
03:16
"And tweet it out for all
the TED viewers out there."
60
196276
4337
“再发条推特
推给所有 TED 观众。”
03:20
(Laughter)
61
200655
2252
(笑声)
03:22
So if you do make this wonderful,
wonderful meal,
62
202949
2461
如果你真的做了这顿美餐,
03:25
I definitely want to know how it tastes.
63
205410
2044
我很想知道味道如何。
03:28
But you can see that ChatGPT
is selecting all these different tools
64
208496
3504
你可以看到,
ChatGPT 选用了各种工具,
03:32
without me having to tell it explicitly
which ones to use in any situation.
65
212000
4379
而我并不需要明确告诉它
什么情况下该用什么工具。
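The tool-use pattern shown here (the model writes a prompt "just like a human could", a runtime executes the chosen tool, and every step stays inspectable) can be sketched in a few lines. This is a hedged illustration only: the tool names, the JSON call format, and the generate() helper are assumptions for the sake of the sketch, not OpenAI's actual plugin interface.

```python
# Minimal sketch of a tool-dispatch loop like the one demoed above.
# The tool names, call format, and generate() helper are hypothetical;
# this is not OpenAI's actual plugin/app interface.

import json

def draw_image(prompt: str) -> str:
    """Placeholder for a DALL-E-style image tool."""
    return f"<image generated from: {prompt!r}>"

def save_memory(note: str) -> str:
    """Placeholder for a 'save this for later' memory tool."""
    return f"saved: {note}"

TOOLS = {"dalle": draw_image, "memory": save_memory}

def generate(messages):
    """Stand-in for a language-model call.

    A real model would be shown the tool descriptions and decide,
    in inspectable text, which tool to invoke and with what prompt."""
    return json.dumps({"tool": "dalle",
                       "args": {"prompt": "a post-TED meal spread"}})

def run_turn(user_message: str) -> str:
    messages = [{"role": "user", "content": user_message}]
    # The model's reply is itself readable text: a tool name plus
    # the prompt it wrote, which a person can inspect and correct.
    call = json.loads(generate(messages))
    tool = TOOLS[call["tool"]]
    return tool(**call["args"])

print(run_turn("Suggest a nice post-TED meal and draw a picture of it."))
```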
03:37
And this, I think, shows a new way
of thinking about the user interface.
66
217088
3879
我认为这体现了一个
看待用户界面的新方式。
03:40
Like, we are so used to thinking of,
well, we have these apps,
67
220967
3796
我们很习惯这么思考:
我们有了一些应用,
03:44
we click between them,
we copy/paste between them,
68
224763
2335
在应用之间点击,
在应用之间复制、粘贴,
03:47
and usually it's a great
experience within an app
69
227098
2294
通常在单个应用内体验良好,
03:49
as long as you kind of know
the menus and know all the options.
70
229434
2961
只要你大概了解菜单
和所有选项就行。
03:52
Yes, I would like you to.
71
232395
1293
是的,就这么做。
(键入提示)
03:53
Yes, please.
72
233730
1126
是的,麻烦了。(键入提示)
03:54
Always good to be polite.
73
234898
1251
礼貌一点总是好的。
03:56
(Laughter)
74
236149
2628
(笑声)
04:00
And by having this unified
language interface on top of tools,
75
240361
5464
有了统一的语言界面,
再加上工具,
04:05
the AI is able to sort of take away
all those details from you.
76
245867
4630
AI 就能从你手中提取各种细节。
04:10
So you don't have to be the one
77
250538
1543
你不需要一字一句地说出
04:12
who spells out every single
sort of little piece
78
252123
2294
04:14
of what's supposed to happen.
79
254459
1543
该产生什么结果。
04:16
And as I said, this is a live demo,
80
256419
1710
如我所说,这是个现场演示,
04:18
so sometimes the unexpected
will happen to us.
81
258129
3379
所以,有时会发生意外。
04:21
But let's take a look at the Instacart
shopping list while we're at it.
82
261549
3420
我们来看一看
这份 Instacart 购物清单。
04:25
And you can see we sent a list
of ingredients to Instacart.
83
265386
3254
你可以看到,我们向
Instacart 发送了一系列商品。
04:29
Here's everything you need.
84
269349
1543
就是你所需的所有商品。
04:30
And the thing that's really interesting
85
270892
1877
很有趣的一点是
04:32
is that the traditional UI
is still very valuable, right?
86
272811
2919
传统 UI (用户界面)
依旧是很有价值的,对吧?
04:35
If you look at this,
87
275772
1877
看一下这个界面,
04:37
you still can click through it
and sort of modify the actual quantities.
88
277690
4296
你还是可以点上去,
修改实际数量。
04:41
And that's something that I think shows
89
281986
1877
我认为这就体现了
04:43
that they're not going away,
traditional UIs.
90
283863
3253
传统 UI 并没有消失。
04:47
It's just we have a new,
augmented way to build them.
91
287158
2795
只是我们有了一个全新、
增强的方式来创建它。
04:49
And now we have a tweet
that's been drafted for our review,
92
289994
2920
我们有了一条草拟好的推特,
等待我们的审阅,
04:52
which is also a very important thing.
93
292956
1793
这也是一件很重要的事。
04:54
We can click “run,” and there we are,
we’re the manager, we’re able to inspect,
94
294749
3712
我们可以点击“运行”,
然后就成了这样,
我们是管理者,可以检查,
04:58
we're able to change the work
of the AI if we want to.
95
298461
2836
要是愿意,我们
也可以修改 AI 的成果。
05:02
And so after this talk,
you will be able to access this yourself.
96
302924
5964
在这场演讲之后,
你自己也可以试一下。
05:17
And there we go.
97
317647
1710
有了。
05:19
Cool.
98
319816
1126
酷。
05:22
Thank you, everyone.
99
322485
1168
谢谢大家。
05:23
(Applause)
100
323653
3003
(掌声)
05:29
So we’ll cut back to the slides.
101
329367
1627
我们再回到片子。
05:32
Now, the important thing
about how we build this,
102
332954
3587
我们是怎么把它做出来的,
其中的关键之处
05:36
it's not just about building these tools.
103
336583
2210
不仅仅是
怎么做出这些工具的问题。
05:38
It's about teaching
the AI how to use them.
104
338793
2252
还要教会 AI 使用它们。
05:41
Like, what do we even want it to do
105
341087
1710
我们问了这些高层级的问题,
05:42
when we ask these very
high-level questions?
106
342839
2419
到底想让它做些什么呢?
05:45
And to do this, we use an old idea.
107
345258
2669
要回答这个问题,
我们用了一个老方法。
05:48
If you go back to Alan Turing's 1950 paper
on the Turing test, he says,
108
348261
3337
你可以翻阅
艾伦·图灵(Alan Turing)
1950 年关于图灵测试的
论文,其中写道,
05:51
you'll never program an answer to this.
109
351598
2043
你是编不出一个程序
来回答这个问题的,
05:53
Instead, you can learn it.
110
353683
1627
而是通过学习。
05:55
You could build a machine,
like a human child,
111
355351
2169
你可以做一台机器,
就像人类儿童那样,
05:57
and then teach it through feedback.
112
357520
2127
通过不断反馈指导它。
05:59
Have a human teacher who provides
rewards and punishments
113
359689
2711
在它尝试、做出对的事
和不对的事的时候,
06:02
as it tries things out and does things
that are either good or bad.
114
362400
3212
有那么一位人类老师
给它奖励和惩罚。
06:06
And this is exactly how we train ChatGPT.
115
366237
2002
这就是我们训练
ChatGPT 的方式。
06:08
It's a two-step process.
116
368239
1168
这个过程有两步。
06:09
First, we produce what Turing
would have called a child machine
117
369449
3086
首先,我们做了一台
图灵所称的“子计算机”,
采用了无监督学习过程。
06:12
through an unsupervised learning process.
118
372535
1960
06:14
We just show it the whole world,
the whole internet
119
374495
2461
我们给它看了整个世界,
整个互联网,
06:16
and say, “Predict what comes next
in text you’ve never seen before.”
120
376956
3212
然后要求:“这是你之前没见过的内容,
请预测接下来该输出什么文本。”
06:20
And this process imbues it
with all sorts of wonderful skills.
121
380168
3044
这个过程让它掌握了
各种神奇的技能。
06:23
For example, if you're shown
a math problem,
122
383212
2086
比如,你给它一个数学问题,
06:25
the only way to actually
complete that math problem,
123
385298
2544
要做出这道数学题的唯一方式,
06:27
to say what comes next,
124
387884
1334
输出接下来的答案,
06:29
that green nine up there,
125
389218
1293
也就是这个绿色的 9,
06:30
is to actually solve the math problem.
126
390511
2294
只有通过真正解出这个数学问题。
06:34
But we actually have to do
a second step, too,
127
394432
2169
但其实我们也得进行第二步,
06:36
which is to teach the AI
what to do with those skills.
128
396601
2544
即指导 AI 使用这些技能。
06:39
And for this, we provide feedback.
129
399187
1668
为了达到这个目的,
我们会给予反馈。
06:40
We have the AI try out multiple things,
give us multiple suggestions,
130
400855
3253
我们让 AI 尝试做几件事,
给我们提几个建议,
06:44
and then a human rates them, says
“This one’s better than that one.”
131
404150
3212
然后人类会为其评分,
表示:“这个比那个好。”
06:47
And this reinforces not just the specific
thing that the AI said,
132
407362
3086
这不仅强化了
AI 刚说的那个答案,
06:50
but very importantly, the whole process
that the AI used to produce that answer.
133
410448
3795
更重要的是,还强化了
AI 生成这个答案的整个过程。
06:54
And this allows it to generalize.
134
414243
1585
这样 AI 就可以泛化。
06:55
It allows it to teach,
to sort of infer your intent
135
415828
2419
它可以指导、推测你的意图,
06:58
and apply it in scenarios
that it hasn't seen before,
136
418247
2503
在没有出现过的情境下应用,
07:00
that it hasn't received feedback.
137
420750
1585
且是在没有接受过
反馈的情况下。
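The two steps described here can be summarized as two training signals. The sketch below is a simplified illustration, not OpenAI's training code: toy probabilities and a Bradley-Terry-style pairwise loss stand in for next-token prediction and for learning from human preference comparisons.

```python
# Hedged sketch of the two training steps described above.
# The numbers are toy stand-ins, not anything from OpenAI's systems.

import math

# Step 1: unsupervised pretraining. The signal is "predict what comes
# next": the loss is the negative log-probability the model assigns
# to the true next token.
def next_token_loss(prob_of_true_next_token: float) -> float:
    return -math.log(prob_of_true_next_token)

print(next_token_loss(0.9))   # confident and right: small loss
print(next_token_loss(0.01))  # confident and wrong: large loss

# Step 2: learning from human feedback. A rater says "this answer is
# better than that one", and the process that produced the preferred
# answer is reinforced. A common simplification is a pairwise
# (Bradley-Terry-style) loss on a reward model's scores.
def preference_loss(score_preferred: float, score_rejected: float) -> float:
    return -math.log(1 / (1 + math.exp(-(score_preferred - score_rejected))))

print(preference_loss(2.0, 0.5))  # model already agrees with the rater
print(preference_loss(0.5, 2.0))  # model disagrees: larger loss
```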
07:02
Now, sometimes the things
we have to teach the AI
138
422669
2460
有时我们需要教给 AI 的东西
07:05
are not what you'd expect.
139
425171
1543
不是你意料之中的那些。
07:06
For example, when we first showed
GPT-4 to Khan Academy,
140
426756
3086
比如,我们第一次把 GPT-4
给可汗学院看的时候,
07:09
they said, "Wow, this is so great,
141
429884
1627
他们说:“哇,太棒了,
07:11
We're going to be able to teach
students wonderful things.
142
431552
2753
我们可以给学生教点好东西了。
07:14
Only one problem, it doesn't
double-check students' math.
143
434347
3462
只是有一个问题,它不会
检查学生的数学做得对不对。
07:17
If there's some bad math in there,
144
437809
1626
如果数学做得不对,
07:19
it will happily pretend that one plus one
equals three and run with it."
145
439477
3462
它会愉快地默认一加一等于三,
然后就这么运行下去。”
07:23
So we had to collect some feedback data.
146
443523
2294
于是我们就得收集一些反馈数据。
07:25
Sal Khan himself was very kind
147
445858
1544
萨尔·可汗(Sal Khan)
本人很不错,
07:27
and offered 20 hours of his own time
to provide feedback to the machine
148
447443
3337
他贡献了 20 个小时的私人时间
和我们的团队一起为计算机
07:30
alongside our team.
149
450780
1501
提供反馈。
07:32
And over the course of a couple of months
we were able to teach the AI that,
150
452323
3587
在几个月的时间里,
我们已经可以这么教会 AI:
07:35
"Hey, you really should
push back on humans
151
455910
2044
“嘿,你真的得
在这种特定的场合下,
07:37
in this specific kind of scenario."
152
457954
2044
反驳人类。”
07:41
And we've actually made lots and lots
of improvements to the models this way.
153
461416
4921
我们已经以这种方式
对模型进行了大量改良。
07:46
And when you push
that thumbs down in ChatGPT,
154
466379
2544
你在 ChatGPT 里
点击“踩”的时候,
07:48
that actually is kind of like sending up
a bat signal to our team to say,
155
468965
3462
其实是给我们的团队
发送了一个信号:
07:52
“Here’s an area of weakness
where you should gather feedback.”
156
472427
2919
“这块儿有缺陷,
你们得收集一些反馈。”
07:55
And so when you do that,
157
475388
1168
你这么点击的时候,
07:56
that's one way that we really
listen to our users
158
476597
2294
这就会是我们真正倾听用户、
07:58
and make sure we're building something
that's more useful for everyone.
159
478933
3378
确保我们创造的东西
会对每个人更有用的方式。
08:02
Now, providing high-quality
feedback is a hard thing.
160
482895
3754
提供高质量的反馈是件难事。
08:07
If you think about asking a kid
to clean their room,
161
487025
2460
想象一下让孩子打扫房间,
08:09
if all you're doing
is inspecting the floor,
162
489485
2711
如果你做的唯一一件事
只是检查地面,
08:12
you don't know if you're just teaching
them to stuff all the toys in the closet.
163
492196
3796
就不会知道你是不是只教会了他们
把所有玩具塞进衣柜里。
08:15
This is a nice DALL-E-generated
image, by the way.
164
495992
2627
顺带说一句,这是一张
由 DALL-E 生成的美图。
08:19
And the same sort
of reasoning applies to AI.
165
499912
4713
同样的道理也适用于 AI。
08:24
As we move to harder tasks,
166
504667
1794
如果要执行更困难的任务,
08:26
we will have to scale our ability
to provide high-quality feedback.
167
506502
3796
我们就得提高
提供高质量反馈的能力。
08:30
But for this, the AI itself
is happy to help.
168
510882
3879
但就这一点,
AI 本身就乐意为之。
08:34
It's happy to help us provide
even better feedback
169
514761
2335
它很乐意持续帮助我们
提供更好的反馈,
08:37
and to scale our ability to supervise
the machine as time goes on.
170
517138
3587
提高我们监督计算机的能力。
08:40
And let me show you what I mean.
171
520767
1543
我来给你看一下
我说的是什么意思。
08:42
For example, you can ask GPT-4
a question like this,
172
522810
4546
比如,你可以问 GPT-4
这么一个问题:
08:47
of how much time passed
between these two foundational blogs
173
527356
3295
这两篇关于无监督学习
和基于人类反馈学习的
08:50
on unsupervised learning
174
530693
1668
基础性博客之间
08:52
and learning from human feedback.
175
532403
1794
相差了多长时间,
08:54
And the model says two months passed.
176
534197
2460
模型说两个月。
08:57
But is it true?
177
537075
1167
但说的是真的吗?
08:58
Like, these models
are not 100-percent reliable,
178
538284
2252
这些模型不是 100% 可靠的,
09:00
although they’re getting better
every time we provide some feedback.
179
540536
3921
虽然每次我们提供了反馈以后,
它们都会优化。
09:04
But we can actually use
the AI to fact-check.
180
544457
3086
但其实我们可以利用 AI
进行事实核查。
09:07
And it can actually check its own work.
181
547543
1877
它都可以核查自己的结果。
09:09
You can say, fact-check this for me.
182
549462
1877
你可以要求它,
帮我核对一下是否属实。
09:12
Now, in this case, I've actually
given the AI a new tool.
183
552757
3670
在这个例子中,我给了
AI 一个新工具。
09:16
This one is a browsing tool
184
556427
1710
这是个浏览工具,
09:18
where the model can issue search queries
and click into web pages.
185
558137
3879
模型可以搜索查询,点进网页。
09:22
And it actually writes out
its whole chain of thought as it does it.
186
562016
3253
在搜索的过程中,
它会写出完整的思考过程。
09:25
It says, I’m just going to search for this
and it actually does the search.
187
565269
3587
它会写出,我要搜这个,
然后它真的就去搜这个了。
09:28
It then finds the publication date
and the search results.
188
568856
3128
然后它会找到
发表时间和搜索结果。
09:32
It then is issuing another search query.
189
572026
1919
然后它会再进行一次搜索。
09:33
It's going to click into the blog post.
190
573945
1877
点进博客。
09:35
And all of this you could do,
but it’s a very tedious task.
191
575822
2877
你也可以做这些事,
但是这很冗长乏味。
09:38
It's not a thing
that humans really want to do.
192
578741
2211
这是人类不太想做的事。
09:40
It's much more fun
to be in the driver's seat,
193
580952
2169
掌控一切要有趣得多,
09:43
to be in this manager's position
where you can, if you want,
194
583162
2836
当管理者要有趣得多,
只要你想,你也可以
09:45
triple-check the work.
195
585998
1210
再查一遍结果。
09:47
And out come citations
196
587208
1501
你可以看到这些引用,
09:48
so you can actually go
197
588709
1168
这样你就可以
09:49
and very easily verify any piece
of this whole chain of reasoning.
198
589919
3754
非常轻松地验证
整条逻辑中的任何环节。
09:53
And it actually turns out
two months was wrong.
199
593673
2210
结果“两个月”是错误的。
09:55
Two months and one week,
200
595883
2169
两个月加一周,
09:58
that was correct.
201
598094
1251
这才是对的。
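The check the browsing tool performed amounts to subtracting two publication dates. A minimal sketch, with placeholder dates (the talk does not state the actual ones):

```python
# Sketch of the fact-check: find the two publication dates and compute
# the gap. The dates below are placeholders, not taken from the talk.

from datetime import date

unsupervised_post = date(2017, 4, 6)     # hypothetical date
human_feedback_post = date(2017, 6, 13)  # hypothetical date

gap = human_feedback_post - unsupervised_post
print(f"{gap.days} days, i.e. roughly {gap.days // 30} months "
      f"and {gap.days % 30 // 7} week(s)")
```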
10:00
(Applause)
202
600888
3837
(掌声)
10:07
And we'll cut back to the slide.
203
607645
1502
我们再回到片子。
10:09
And so the thing that's so interesting to me
about this whole process
204
609147
3920
我觉得整个过程中很有趣的是
10:13
is that it’s this many-step collaboration
between a human and an AI.
205
613067
3962
这个过程是人类和 AI 之间
进行的多步合作。
10:17
Because a human, using
this fact-checking tool
206
617029
2461
因为人类
利用这个事实核查工具,
10:19
is doing it in order to produce data
207
619532
2210
为的是产生数据,
10:21
for another AI to become
more useful to a human.
208
621742
3170
让 AI 对人类更加有用。
10:25
And I think this really shows
the shape of something
209
625454
2545
我认为这充分展示了
10:28
that we should expect to be
much more common in the future,
210
628040
3087
未来会越来越常见的一种情况:
10:31
where we have humans
and machines kind of very carefully
211
631127
2711
在一个具体的问题和
我们设想的解决方法中,
10:33
and delicately designed
in how they fit into a problem
212
633880
3503
人类和计算机的定位分工
都会被非常谨慎、精心地安排好。
10:37
and how we want
to solve that problem.
213
637425
1918
10:39
We make sure that the humans are providing
the management, the oversight,
214
639385
3462
我们得保证人类
提供管理、全局视角、
10:42
the feedback,
215
642847
1168
反馈,
10:44
and the machines are operating
in a way that's inspectable
216
644015
2752
计算机以一种可被检查、
10:46
and trustworthy.
217
646809
1126
值得信任的方式运行。
10:47
And together we're able to actually create
even more trustworthy machines.
218
647977
3503
通过合作,我们可以
创造出更值得信任的计算机。
10:51
And I think that over time,
if we get this process right,
219
651522
2711
我认为随着时间的推移,
如果我们让这个过程走上正轨,
10:54
we will be able to solve
impossible problems.
220
654233
2127
我们就能解决
不可能被解决的问题。
10:56
And to give you a sense
of just how impossible I'm talking,
221
656360
3963
为了让大家理解我说的
“不可能被解决”是什么意思,
11:00
I think we're going to be able
to rethink almost every aspect
222
660323
2878
我认为,我们可以重新思考
11:03
of how we interact with computers.
223
663242
2378
我们与计算机互动的
几乎每一个方面。
11:05
For example, think about spreadsheets.
224
665620
2502
比如,表格。
11:08
They've been around in some form since,
we'll say, 40 years ago with VisiCalc.
225
668122
4379
它们从大约 40 年前
VisiCalc 时期就长这样了。
11:12
I don't think they've really
changed that much in that time.
226
672543
2878
我觉得过了这么多年
也没发生什么变化。
11:16
And here is a specific spreadsheet
of all the AI papers on the arXiv
227
676214
5922
这张表格记录了
arXiv 上过去 30 年里
11:22
for the past 30 years.
228
682178
1168
所有与 AI 有关的论文。
11:23
There's about 167,000 of them.
229
683346
1960
大约有 167000 篇。
11:25
And you can see there the data right here.
230
685348
2878
你可以看到这些数据。
11:28
But let me show you the ChatGPT take
on how to analyze a data set like this.
231
688267
3837
但我想给你展示一下 ChatGPT
是如何分析这样的数据集的。
11:37
So we can give ChatGPT
access to yet another tool,
232
697318
3837
我们再给 ChatGPT
加一个工具,
11:41
this one a Python interpreter,
233
701197
1460
这是个 Python 解释器,
11:42
so it’s able to run code,
just like a data scientist would.
234
702657
4004
它可以像数据科学家那样运行代码。
11:46
And so you can just
literally upload a file
235
706661
2335
你只需要上传一个文件,
11:48
and ask questions about it.
236
708996
1335
问相关的问题。
11:50
And very helpfully, you know, it knows
the name of the file and it's like,
237
710373
3545
它很有用,可以读取
文件名然后这样运行:
11:53
"Oh, this is CSV,"
comma-separated value file,
238
713960
2419
“哦,这是个 CSV 文件,
逗号分隔值文件(CSV),
11:56
"I'll parse it for you."
239
716420
1335
我来给你解析一下。”
11:57
The only information here
is the name of the file,
240
717755
2794
然后留下的信息只有文件名、
12:00
the column names like you saw
and then the actual data.
241
720591
3671
你刚看到的列名和实际数据。
12:04
And from that it's able to infer
what these columns actually mean.
242
724262
4504
从这些信息中,它可以推测出
这些列都代表着什么意思。
12:08
Like, that semantic information
wasn't in there.
243
728766
2294
这里并没有语义信息。
12:11
It has to sort of, put together
its world knowledge of knowing that,
244
731102
3211
所以它就得搜罗
它知道的所有信息,
12:14
“Oh yeah, arXiv is a site
that people submit papers
245
734355
2502
“哦,arXiv 是
人们提交论文的网站,
12:16
and therefore that's what these things are
and that these are integer values
246
736857
3587
所以这些条目指的是论文,
而且这些数都是整数,
12:20
and so therefore it's a number
of authors in the paper,"
247
740486
2628
所以这个数值就是
论文作者的数量。”
12:23
like all of that, that’s work
for a human to do,
248
743114
2252
等等操作,
这些都是人类要做的事,
12:25
and the AI is happy to help with it.
249
745408
1751
而 AI 乐意效劳。
12:27
Now I don't even know what I want to ask.
250
747159
2002
我都不知道我该问什么。
12:29
So fortunately, you can ask the machine,
251
749203
3003
好就好在你可以问计算机:
12:32
"Can you make some exploratory graphs?"
252
752248
1877
“你可以帮我做一些
探索性图表吗?”
12:37
And once again, this is a super high-level
instruction with lots of intent behind it.
253
757461
4004
同样地,这是个非常高层级的指示,
背后还有很多深层的涵义。
12:41
But I don't even know what I want.
254
761507
1668
但我都不知道自己想要什么。
12:43
And the AI kind of has to infer
what I might be interested in.
255
763175
2920
AI 就得推测我可能想要什么。
12:46
And so it comes up
with some good ideas, I think.
256
766137
2294
于是,它就给出了几个
我认为还不错的点子。
12:48
So a histogram of the number
of authors per paper,
257
768472
2336
每篇论文作者数量的直方图、
12:50
time series of papers per year,
word cloud of the paper titles.
258
770850
2961
每年论文的时间序列、
论文标题的词云。
12:53
All of that, I think,
will be pretty interesting to see.
259
773853
2627
我觉得这些都还挺有趣的。
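The three exploratory plots ChatGPT proposed are a few lines of standard data-science code. A hedged sketch, assuming a file name and column names ("title", "year", "num_authors") that are not given in the talk:

```python
# Sketch of the three exploratory plots ChatGPT proposed. The file name
# and column names are assumptions; the actual arXiv spreadsheet is not
# reproduced in this transcript.

import pandas as pd
import matplotlib.pyplot as plt
from collections import Counter

df = pd.read_csv("arxiv_ai_papers.csv")  # hypothetical file

# 1. Histogram of the number of authors per paper.
df["num_authors"].plot(kind="hist", bins=20, title="Authors per paper")
plt.show()

# 2. Time series of papers per year.
df["year"].value_counts().sort_index().plot(title="Papers per year")
plt.show()

# 3. A crude stand-in for a word cloud: the most common title words.
words = Counter(w.lower() for t in df["title"] for w in t.split())
print(words.most_common(25))
```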
12:56
And the great thing is,
it can actually do it.
260
776522
2169
妙就妙在它真的可以做出来。
12:58
Here we go, a nice bell curve.
261
778691
1460
来了,精美的正态分布曲线。
13:00
You see that three
is kind of the most common.
262
780151
2169
你可以看到,3 是最常见的数量。
13:02
It's going to then make this nice plot
of the papers per year.
263
782320
5797
让它给每年的论文
都做这么一张漂亮的图。
13:08
Something crazy
is happening in 2023, though.
264
788117
2294
2023 年有点奇葩。
13:10
Looks like we were on an exponential
and it dropped off the cliff.
265
790411
3128
看起来我们经历了一个
指数增长然后跳水。
13:13
What could be going on there?
266
793539
1460
发生了什么事?
13:14
By the way, all this
is Python code, you can inspect.
267
794999
2753
顺带提一句,这些都是
Python 代码,你是可以检查的。
13:17
And then we'll see word cloud.
268
797752
1459
我们来看词云。
13:19
So you can see all these wonderful things
that appear in these titles.
269
799253
3378
你可以看到这些标题中的奇妙内容。
13:23
But I'm pretty unhappy
about this 2023 thing.
270
803090
2127
我对这 2023 年的现象
不太满意。
13:25
It makes this year look really bad.
271
805634
1711
这一年看起来太差了。
13:27
Of course, the problem is
that the year is not over.
272
807345
2877
当然,今年还没过完。
13:30
So I'm going to push back on the machine.
273
810222
2878
所以我要把问题抛回给计算机。
13:33
[Waitttt that's not fair!!!
274
813142
1585
[等等!这不公平!
13:34
2023 isn't over.
275
814727
1293
2023 年还没过完呢。
13:38
What percentage of papers in 2022
were even posted by April 13?]
276
818481
5088
2022 年有多大比例的论文是在
4 月 13 日以前发表的?]
13:44
So April 13 was the cut-off
date I believe.
277
824695
2294
我就把 4 月 13 日
当作截止日期。
13:47
Can you use that to make
a fair projection?
278
827656
4922
[你可以用这个结论
生成一个公平的图表吗?]
13:54
So we'll see, this is
the kind of ambitious one.
279
834747
2294
我们来看看,这题有点难度。
13:57
(Laughter)
280
837083
1126
(笑声)
13:59
So you know,
281
839877
1251
同样地,我这么做是因为
我对计算机有更大的企图。
14:01
again, I feel like there was more I wanted
out of the machine here.
282
841128
3921
14:05
I really wanted it to notice this thing,
283
845049
2502
我想让它注意到这一点,
14:07
maybe it's a little bit
of an overreach for it
284
847593
3128
虽然对它来说,机智地猜出
14:10
to have sort of, inferred magically
that this is what I wanted.
285
850763
3378
我这样的意图可能有点太难了。
14:14
But I inject my intent,
286
854183
1627
但我表明了我的意图,
14:15
I provide this additional piece
of, you know, guidance.
287
855810
4754
又为它提供了一些指导。
14:20
And under the hood,
288
860564
1168
如果看它底层的运作,
14:21
the AI is just writing code again,
so if you want to inspect what it's doing,
289
861774
3629
AI 只是又写了一遍代码,
所以如果你想检查一下它在干什么,
14:25
it's very possible.
290
865403
1251
是很容易做到的。
14:26
And now, it does the correct projection.
291
866654
3628
这次,它生成了正确的图表。
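The "fair projection" asked for here is simple arithmetic: find what fraction of 2022's papers had been posted by April 13, then scale 2023's partial count by that fraction. A sketch with made-up counts:

```python
# Sketch of the fair projection. The counts below are invented
# placeholders; only the scaling logic reflects what was asked for.

papers_2022_total = 10_000        # hypothetical
papers_2022_by_apr13 = 2_800      # hypothetical
papers_2023_by_apr13 = 3_100      # hypothetical

fraction_by_apr13 = papers_2022_by_apr13 / papers_2022_total
projected_2023_total = papers_2023_by_apr13 / fraction_by_apr13
print(f"Projected 2023 papers: {projected_2023_total:.0f}")
```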
14:30
(Applause)
292
870282
5005
(掌声)
14:35
If you noticed, it even updates the title.
293
875287
2169
你仔细看的话,
它还更新了标题。
14:37
I didn't ask for that,
but it knows what I want.
294
877498
2336
我可没要求它这么做,
但它知道我想要什么。
14:41
Now we'll cut back to the slide again.
295
881794
2544
我们再回到片子。
14:45
This slide shows a parable
of how I think we ...
296
885714
4880
这张片子讲了一个故事,
表示我认为我们……
14:51
A vision of how we may end up
using this technology in the future.
297
891220
3212
我们未来使用这项技术的愿景。
14:54
A person brought
his very sick dog to the vet,
298
894849
3712
一个人带着他病重的狗狗去看兽医,
14:58
and the veterinarian made a bad call
to say, “Let’s just wait and see.”
299
898561
3336
兽医做了个愚蠢的决定,说:
“我们就等着观望吧。”
15:01
And the dog would not
be here today had he listened.
300
901897
2795
如果他当时就这么信了兽医,
狗狗现在已经走了。
15:05
In the meanwhile,
he provided the blood test,
301
905401
2252
与此同时,他把验血结果、
15:07
like, the full medical records, to GPT-4,
302
907653
2586
完整病历提交给了 GPT-4,
15:10
which said, "I am not a vet,
you need to talk to a professional,
303
910281
3170
它说:“我不是个兽医,
你得去找专业人士咨询,
15:13
here are some hypotheses."
304
913492
1710
以下是一些假设。”
15:15
He brought that information
to a second vet
305
915786
2127
他带着这些信息
去找了第二位兽医,
15:17
who used it to save the dog's life.
306
917913
1835
兽医利用这些信息
拯救了狗狗的生命。
15:21
Now, these systems, they're not perfect.
307
921292
2252
这些系统不是完美的。
15:23
You cannot overly rely on them.
308
923544
2336
你不能过度依赖它们。
15:25
But this story, I think, shows
309
925880
3712
但我认为这个故事体现了
15:29
that a human with a medical professional
310
929592
3044
人类和医疗专家,
15:32
and with ChatGPT
as a brainstorming partner
311
932678
2461
再加上 ChatGPT
作为头脑风暴的搭档,
15:35
was able to achieve an outcome
that would not have happened otherwise.
312
935181
3295
可以创造出一些前所未有的成果。
15:38
I think this is something
we should all reflect on,
313
938476
2419
我认为,这是我们所有人
在考虑将这些系统融入世界之时
15:40
think about as we consider
how to integrate these systems
314
940895
2711
该反思、考虑的一点。
15:43
into our world.
315
943606
1167
15:44
And one thing I believe really deeply,
316
944815
1835
我坚信的一点是,
15:46
is that getting AI right is going
to require participation from everyone.
317
946650
4338
让 AI 走上正道
需要每个人的参与。
15:50
And that's for deciding
how we want it to slot in,
318
950988
2377
参与决定该如何
让 AI 加入这个世界,
15:53
that's for setting the rules of the road,
319
953365
1961
参与规划路该怎么走,
15:55
for what an AI will and won't do.
320
955367
2044
参与规定 AI
该做什么,不该做什么。
15:57
And if there's one thing
to take away from this talk,
321
957453
2502
这场演讲能给你带来的
一个收获就是
15:59
it's that this technology
just looks different.
322
959997
2211
这项技术就是与众不同。
16:02
Just different from anything
people had anticipated.
323
962208
2502
与人们意料中的任何事物都不一样。
16:04
And so we all have to become literate.
324
964710
1835
所以,我们都得对它有所了解。
16:06
And that's, honestly, one
of the reasons we released ChatGPT.
325
966587
2961
说实话,这也是我们
发布 ChatGPT 的原因之一。
16:09
Together, I believe that we can
achieve the OpenAI mission
326
969548
3128
我相信,我们能一起努力
达成 OpenAI 的使命,
16:12
of ensuring that artificial
general intelligence
327
972718
2252
让通用人工智能
16:14
benefits all of humanity.
328
974970
1877
造福全人类。
16:16
Thank you.
329
976847
1168
谢谢。
16:18
(Applause)
330
978057
6965
(掌声)
16:33
(Applause ends)
331
993322
1168
(掌声结束)
16:34
Chris Anderson: Greg.
332
994532
1334
克里斯·安德森(Chris Anderson):
格雷格(Greg)。
16:36
Wow.
333
996242
1167
哇。
16:37
I mean ...
334
997868
1126
我……
16:39
I suspect that within every mind out here
335
999662
3753
我怀疑在场的每个人
16:43
there's a feeling of reeling.
336
1003457
2503
都有天旋地转的感觉。
16:46
Like, I suspect that a very large
number of people viewing this,
337
1006001
3379
我怀疑观看这场演讲的很多观众,
16:49
you look at that and you think,
“Oh my goodness,
338
1009421
2419
看到这些内容,都会心想:“天哪,
16:51
pretty much every single thing
about the way I work, I need to rethink."
339
1011882
3462
我得重新考虑一下
我日常操作中的每一件事。”
16:55
Like, there's just
new possibilities there.
340
1015386
2002
这是新的可能啊。
16:57
Am I right?
341
1017388
1168
对吧?
16:58
Who thinks that they're having to rethink
the way that we do things?
342
1018597
3337
谁认为得重新思考一下
自己行事的方式?
17:01
Yeah, I mean, it's amazing,
343
1021976
1543
太神奇了,
17:03
but it's also really scary.
344
1023561
2002
但也太可怕了。
17:05
So let's talk, Greg, let's talk.
345
1025604
1585
我们来谈谈吧,格雷格。
17:08
I mean, I guess
my first question actually is just
346
1028524
2377
我想问的第一个问题就是
17:10
how the hell have you done this?
347
1030901
1585
你们到底是怎么走到这一步的?
17:12
(Laughter)
348
1032486
1251
(笑声)
17:13
OpenAI has a few hundred employees.
349
1033737
2962
OpenAI 有几百名员工。
17:16
Google has thousands of employees
working on artificial intelligence.
350
1036740
4755
谷歌有几千名员工
专攻人工智能。
17:21
Why is it you who's come up
with this technology
351
1041996
3503
为什么是你们想出了这项
17:25
that shocked the world?
352
1045541
1168
震惊世界的技术?
17:26
Greg Brockman: I mean, the truth is,
353
1046709
1751
格雷格·布罗克曼
(Greg Brockman):其实,
17:28
we're all building on shoulders
of giants, right, there's no question.
354
1048502
3295
我们是站在巨人的肩膀上前进,
毋庸置疑。
如果你看看计算的进步、
17:31
If you look at the compute progress,
355
1051797
1752
算法的进步、数据的进步,
17:33
the algorithmic progress,
the data progress,
356
1053549
2085
这些都是覆盖整个行业的。
17:35
all of those are really industry-wide.
357
1055634
1835
但我认为,在 OpenAI 内部,
我们汲取过往的经验
做出了很多慎重的决定。
17:37
But I think within OpenAI,
358
1057469
1252
17:38
we made a lot of very deliberate
choices from the early days.
359
1058762
2878
首先就是直面现实。
17:41
And the first one was just
to confront reality as it lays.
360
1061640
2711
我们对此苦思冥想:
17:44
And that we just thought
really hard about like:
361
1064393
2294
17:46
What is it going to take
to make progress here?
362
1066687
2210
在这个领域前进,
要付出什么代价?
17:48
We tried a lot of things that didn't work,
so you only see the things that did.
363
1068939
3754
我们尝试了很多无效的东西,
而你们只会看到有效的东西。
我认为最重要的一点就是
召集一个团队,
17:52
And I think that the most important thing
has been to get teams of people
364
1072693
3462
17:56
who are very different from each other
to work together harmoniously.
365
1076196
3254
互不相同却要和谐地合作。
17:59
CA: Can we have the water,
by the way, just brought here?
366
1079450
2711
CA: 我们准备了水,拿过来吧。
18:02
I think we're going to need it,
it's a dry-mouth topic.
367
1082202
3170
我觉得我们得喝点水,
这是个令人津津乐道的话题。
18:06
But isn't there something also
just about the fact
368
1086665
2795
但是,在这些语言模型中,
18:09
that you saw something
in these language models
369
1089501
4755
你有没有发现
18:14
that meant that if you continue
to invest in them and grow them,
370
1094256
3921
如果你继续对其投入、研发,
18:18
that something
at some point might emerge?
371
1098218
3129
到了某个节点,
会出现一些情况?
18:21
GB: Yes.
372
1101847
1126
GB: 有。
18:23
And I think that, I mean, honestly,
373
1103015
2836
老实说,我认为
18:25
I think the story there
is pretty illustrative, right?
374
1105893
2544
我刚说的故事已经
很直观了,对吧?
18:28
I think that high level, deep learning,
375
1108437
2002
我认为从高层级来看,
深度学习,
18:30
like we always knew that was
what we wanted to be,
376
1110481
2335
我们一直知道
这就是我们想要的结果,
就得有个深度学习实验室,
那我们该怎么把它做出来呢?
18:32
was a deep learning lab,
and exactly how to do it?
377
1112858
2419
我觉得早期我们并不知道答案。
18:35
I think that in the early days,
we didn't know.
378
1115277
2211
我们尝试了很多,
18:37
We tried a lot of things,
379
1117529
1210
一位同事训练了一个模型,
18:38
and one person was working
on training a model
380
1118739
2336
用于预测
亚马逊评论中的下一个字母,
18:41
to predict the next character
in Amazon reviews,
381
1121075
2877
18:43
and he got a result where --
this is a syntactic process,
382
1123994
4755
他得出了这样一个结果——
这是个处理句法的过程,
18:48
you expect, you know, the model
will predict where the commas go,
383
1128749
3086
你想要的结果是
模型预测出逗号在什么位置,
18:51
where the nouns and verbs are.
384
1131835
1627
名词和动词在什么位置。
18:53
But he actually got a state-of-the-art
sentiment analysis classifier out of it.
385
1133504
4337
但他其实由此做出了一个
最先进的情感分析分类器。
18:57
This model could tell you
if a review was positive or negative.
386
1137883
2961
这个模型可以判定
这条评论是正面还是负面的。
19:00
I mean, today we are just like,
come on, anyone can do that.
387
1140886
3378
放到现在,我们会觉得,
人人都能做到啊。
19:04
But this was the first time
that you saw this emergence,
388
1144306
3087
但这是你第一次见到这样的东西,
19:07
this sort of semantics that emerged
from this underlying syntactic process.
389
1147434
5005
由底层的句法处理进程
产生的语义。
19:12
And there we knew,
you've got to scale this thing,
390
1152481
2336
由此,我们知道,
必须得扩大这项技术的规模,
19:14
you've got to see where it goes.
391
1154858
1544
必须得见证它未来的发展。
CA: 我认为这有助于解开
19:16
CA: So I think this helps explain
392
1156402
1626
让观众困惑的谜题,
19:18
the riddle that baffles
everyone looking at this,
393
1158028
2544
因为这些模型被称为预测机。
19:20
because these things are described
as prediction machines.
394
1160572
2753
但我们看到它们就感觉……
19:23
And yet, what we're seeing
out of them feels ...
395
1163367
2669
19:26
it just feels impossible that that
could come from a prediction machine.
396
1166036
3420
就感觉预测机怎么可能
可以产生这样的结果。
19:29
Just the stuff you showed us just now.
397
1169456
2378
就是你刚才给我们看的东西。
19:31
And the key idea of emergence
is that when you get more of a thing,
398
1171875
3838
新兴事物的关键就在于
一件东西出现得越多,
19:35
suddenly different things emerge.
399
1175754
1585
突然间就会出现很多别的东西。
19:37
It happens all the time, ant colonies,
single ants run around,
400
1177339
3045
这是个普遍现象,蚁群,
一只蚂蚁跑来跑去,
19:40
when you bring enough of them together,
401
1180384
1877
如果你把足够多的蚂蚁放在一起,
19:42
you get these ant colonies that show
completely emergent, different behavior.
402
1182302
3629
就会出现蚁群,呈现出完全
没有见过的、异于寻常的行为。
19:45
Or a city where a few houses together,
it's just houses together.
403
1185973
3086
一座城市,几栋房子建在一起,
就只是几栋建在一起的房子而已。
19:49
But as you grow the number of houses,
404
1189059
1794
但如果你增加房子的数量,
19:50
things emerge, like suburbs
and cultural centers and traffic jams.
405
1190894
4588
就会出现别的东西,
比如郊区、文化中心、堵车。
19:57
Give me one moment for you
when you saw just something pop
406
1197276
3211
你有没有经历过这么一个瞬间,
眼前突然出现了什么东西,
20:00
that just blew your mind
407
1200529
1668
让你震撼,
20:02
that you just did not see coming.
408
1202197
1627
出乎意料?
20:03
GB: Yeah, well,
409
1203824
1209
GB: 嗯,
20:05
so you can try this in ChatGPT,
if you add 40-digit numbers --
410
1205075
3462
你可以在 ChatGPT 里试一试,
输入 40 位数字……
20:08
CA: 40-digit?
411
1208537
1168
CA: 40 位?
20:09
GB: 40-digit numbers,
the model will do it,
412
1209705
2169
GB: 40 位数,
模型是可以运算的,
20:11
which means it's really learned
an internal circuit for how to do it.
413
1211915
3254
也就是说它已经学会了
运算这些数字的内部逻辑。
20:15
And the really interesting
thing is actually,
414
1215210
2127
很有趣的一点是
20:17
if you have it add like a 40-digit number
plus a 35-digit number,
415
1217337
3212
如果你试试将 40 位数
和 35 位数相加,
20:20
it'll often get it wrong.
416
1220591
1710
它通常会算错。
20:22
And so you can see that it's really
learning the process,
417
1222676
2795
你可以看出,
它在学习这个过程,
20:25
but it hasn't fully generalized, right?
418
1225471
1876
但还没有完全泛化,对吧?
20:27
It's like you can't memorize
the 40-digit addition table,
419
1227389
2711
你不可能记住 40 位加法表,
20:30
that's more atoms
than there are in the universe.
420
1230100
2294
这比宇宙中的原子数都多。
所以它得学习一些通用的东西,
20:32
So it had to have learned
something general,
421
1232394
2086
但它还没有完全学会,
20:34
but that it hasn't really
fully yet learned that,
422
1234480
2377
20:36
Oh, I can sort of generalize this
to adding arbitrary numbers
423
1236899
2961
把它泛化到可以算出任意长度的
20:39
of arbitrary lengths.
424
1239902
1167
任意数字之和。
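One way to probe this behavior is to compare the model's answer against exact integer arithmetic. A small sketch, where ask_model() is a hypothetical stand-in for a ChatGPT call:

```python
# Sketch of the addition probe described here: generate a 40-digit and
# a 35-digit number and check the model's sum against Python's exact
# integer arithmetic. ask_model() is a hypothetical stand-in.

import random

def random_n_digit(n: int) -> int:
    return random.randint(10 ** (n - 1), 10 ** n - 1)

def ask_model(question: str) -> str:
    """Hypothetical stand-in for a ChatGPT call."""
    raise NotImplementedError

a, b = random_n_digit(40), random_n_digit(35)
expected = a + b
# answer = ask_model(f"What is {a} + {b}? Reply with just the number.")
# print("correct" if int(answer) == expected else f"wrong, expected {expected}")
print(a, "+", b, "=", expected)
```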
20:41
CA: So what's happened here
425
1241111
1335
CA: 所以眼下的情况是
20:42
is that you've allowed it to scale up
426
1242488
1793
你拓展了它的能力,
20:44
and look at an incredible
number of pieces of text.
427
1244281
2419
能处理很大量的文本。
20:46
And it is learning things
428
1246742
1209
它在学习
20:47
that you didn't know that it was
going to be capable of learning.
429
1247951
3379
你都不知道它能学会的东西。
20:51
GB: Well, yeah, and it’s more nuanced, too.
430
1251371
2002
GB: 对,也有点微妙。
20:53
So one science that we’re starting
to really get good at
431
1253415
2878
我们逐渐掌握的一项技能是
20:56
is predicting some of these
emergent capabilities.
432
1256335
2586
预测这些新兴的性能。
20:58
And to do that actually,
433
1258962
1335
要做到这一点,
21:00
one of the things I think
is very undersung in this field
434
1260339
2711
我认为这个领域中
有件不太受到重视的事,
21:03
is sort of engineering quality.
435
1263050
1501
那就是工程质量。
21:04
Like, we had to rebuild our entire stack.
436
1264551
2044
我们要重建整个技术栈。
21:06
When you think about building a rocket,
437
1266637
1877
你可以想象一下造火箭,
21:08
every tolerance has to be incredibly tiny.
438
1268555
2211
每一项的容错率都得非常小。
21:10
Same is true in machine learning.
439
1270766
1626
机器学习也是一样的。
21:12
You have to get every single piece
of the stack engineered properly,
440
1272434
3212
堆栈里的每一个元素
都得正确地编写,
21:15
and then you can start
doing these predictions.
441
1275646
2210
之后才可以进行预测。
21:17
There are all these incredibly
smooth scaling curves.
442
1277856
2503
这些都是非常平滑的规模曲线。
21:20
They tell you something deeply
fundamental about intelligence.
443
1280359
2919
它们体现了关于智能
非常基础的特质。
如果你去读一读我们的
GPT-4 博文,
21:23
If you look at our GPT-4 blog post,
444
1283320
1710
你就会看到这些曲线。
21:25
you can see all of these curves in there.
445
1285030
1960
我们现在可以开始预测了。
21:26
And now we're starting
to be able to predict.
446
1286990
2127
比如,我们可以预测
代码问题的性能。
21:29
So we were able to predict, for example,
the performance on coding problems.
447
1289117
3713
21:32
We basically look at some models
448
1292871
1585
我们会去看一些
21:34
that are 10,000 times
or 1,000 times smaller.
449
1294456
2461
小一万倍或一千倍的模型。
21:36
And so there's something about this
that is actually smooth scaling,
450
1296959
3211
虽然还处于早期,
21:40
even though it's still early days.
451
1300170
2044
它们的增长还是平滑的。
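The kind of extrapolation described, fitting a smooth trend on much smaller models and reading off a prediction for a larger one, can be sketched as a log-linear fit. The compute and score pairs below are invented for illustration:

```python
# Toy sketch of predicting a large model's performance from much
# smaller ones. The compute/score pairs are invented; the point is
# fitting a smooth curve in log-compute and extrapolating.

import numpy as np

compute = np.array([1e18, 1e19, 1e20, 1e21])   # hypothetical training compute
score   = np.array([0.12, 0.22, 0.34, 0.45])   # hypothetical benchmark scores

# Fit score as a linear function of log10(compute).
slope, intercept = np.polyfit(np.log10(compute), score, 1)

target_compute = 1e24  # a model far larger than those measured
predicted = slope * np.log10(target_compute) + intercept
print(f"Predicted score at {target_compute:.0e} FLOPs: {predicted:.2f}")
```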
21:42
CA: So here is, one of the big fears then,
452
1302756
2544
CA: 最大的恐惧
21:45
that arises from this.
453
1305300
1252
来源于此。
21:46
If it’s fundamental
to what’s happening here,
454
1306593
2127
如果一定会发生这种情况:
21:48
that as you scale up,
455
1308720
1210
随着规模的扩大,
21:49
things emerge that
456
1309930
2419
会出现你可能有一定信心预测,
21:52
you can maybe predict
in some level of confidence,
457
1312349
4171
21:56
but it's capable of surprising you.
458
1316562
2544
但依然会吓你一跳的事。
22:00
Why isn't there just a huge risk
of something truly terrible emerging?
459
1320816
4463
那不是会有很大风险
会发生糟糕透顶的事吗?
22:05
GB: Well, I think all of these
are questions of degree
460
1325320
2545
GB: 我认为这些问题
都是有关程度、
22:07
and scale and timing.
461
1327865
1209
规模和时机的。
22:09
And I think one thing people miss, too,
462
1329116
1877
我认为大家还忽略了一点,
22:10
is sort of the integration with the world
is also this incredibly emergent,
463
1330993
3587
那就是与世界接轨
也是一件新兴的、
22:14
sort of, very powerful thing too.
464
1334621
1585
很强大的事。
这也就是我们认为逐步部署
如此重要的原因之一。
22:16
And so that's one of the reasons
that we think it's so important
465
1336248
3045
22:19
to deploy incrementally.
466
1339293
1167
你可以通过这场演讲看出
22:20
And so I think that what we kind of see
right now, if you look at this talk,
467
1340502
3629
我关注的一大重点就是
提供非常高质量的反馈。
22:24
a lot of what I focus on is providing
really high-quality feedback.
468
1344131
3170
我们如今进行的任务,
你都可以进行检查,对吧?
22:27
Today, the tasks that we do,
you can inspect them, right?
469
1347301
2711
你可以很轻松地看看那道数学题,
然后发现不对, 不对,不对,
22:30
It's very easy to look at that math
problem and be like, no, no, no,
470
1350012
3211
机器,7 才是正确答案。
22:33
machine, seven was the correct answer.
471
1353265
1835
但就算是总结一本书
都是难以监管的事。
22:35
But even summarizing a book,
like, that's a hard thing to supervise.
472
1355100
3212
你如何得知这篇摘要
是一篇好摘要呢?
22:38
Like, how do you know
if this book summary is any good?
473
1358312
2586
你得看完整本书。
22:40
You have to read the whole book.
474
1360939
1543
没人想干这事儿。
22:42
No one wants to do that.
475
1362482
1168
(笑声)
22:43
(Laughter)
476
1363692
1293
22:44
And so I think that the important thing
will be that we take this step by step.
477
1364985
4296
所以,我认为重要的是
我们要一步一步来。
22:49
And that we say, OK,
as we move on to book summaries,
478
1369323
2544
我们在处理书籍摘要时,
22:51
we have to supervise this task properly.
479
1371867
1960
我们得适当地
对这个任务进行监管。
22:53
We have to build up
a track record with these machines
480
1373827
2586
我们得建立起
这些计算机的历史记录,
22:56
that they're able to actually
carry out our intent.
481
1376413
2586
确认它们是否能
真正执行我们的意图。
22:59
And I think we're going to have to produce
even better, more efficient,
482
1379041
3336
我觉得我们得创造出
更好、更高效、
23:02
more reliable ways of scaling this,
483
1382419
1710
更可靠的拓展方式,
23:04
sort of like making the machine
be aligned with you.
484
1384129
2878
让计算机与你保持一致。
23:07
CA: So we're going to hear
later in this session,
485
1387049
2294
CA: 这场演讲之后会提到,
23:09
there are critics who say that,
486
1389343
1543
有人评论说
23:10
you know, there's no real
understanding inside,
487
1390928
4587
其内部根本没有理解,
23:15
the system is going to always --
488
1395557
1627
系统一直会……
23:17
we're never going to know
that it's not generating errors,
489
1397225
3212
我们看不到
它不会产生任何差错、
23:20
that it doesn't have
common sense and so forth.
490
1400479
2210
它没有常识等等。
23:22
Is it your belief, Greg,
that it is true at any one moment,
491
1402689
4088
格雷格,你会不会
有那么一瞬间认为这说的是对的,
23:26
but that the expansion of the scale
and the human feedback
492
1406818
3629
但又十分自信你刚谈到的
规模的扩大和人类的反馈
23:30
that you talked about is basically
going to take it on that journey
493
1410489
4963
会将其带上
23:35
of actually getting to things
like truth and wisdom and so forth,
494
1415494
3837
真正触及真相、
智慧等等的道路?
23:39
with a high degree of confidence.
495
1419331
1627
23:40
Can you be sure of that?
496
1420999
1335
你确定吗?
23:42
GB: Yeah, well, I think that the OpenAI,
I mean, the short answer is yes,
497
1422334
3462
GB: 我觉得 OpenAI,
简而言之答案是肯定的,
23:45
I believe that is where we're headed.
498
1425796
1793
我相信这就是我们前进的方向。
23:47
And I think that the OpenAI approach
here has always been just like,
499
1427631
3211
OpenAI 的方式一直就是
23:50
let reality hit you in the face, right?
500
1430842
1877
被现实打脸,对吧?
23:52
It's like this field is the field
of broken promises,
501
1432719
2503
这个领域就像是个
“不讲信用”的领域,
23:55
of all these experts saying
X is going to happen, Y is how it works.
502
1435263
3212
专家们都说 X 要发生了,
就是按 Y 这么做的。
人们一直在说神经网络
运行不了 70 年。
23:58
People have been saying neural nets
aren't going to work for 70 years.
503
1438475
3337
他们说的还不一定对呢。
24:01
They haven't been right yet.
504
1441812
1376
也许说得对,
也许 71 年是对的,
24:03
They might be right
maybe 70 years plus one
505
1443188
2044
或者你想说多少年就多少年。
24:05
or something like that is what you need.
506
1445232
1918
但我认为我们的方式一直是
24:07
But I think that our approach
has always been,
507
1447192
2169
你得把这项技术推向极致,
24:09
you've got to push to the limits
of this technology
508
1449361
2419
见证这种情况的真正发生。
因为这才会告诉你,哦,
这就是我们走进新时代的方式。
24:11
to really see it in action,
509
1451822
1293
24:13
because that tells you then, oh, here's
how we can move on to a new paradigm.
510
1453115
3670
我们还没有物尽其用呢。
24:16
And we just haven't exhausted
the fruit here.
511
1456785
2127
CA: 你们所处的立场
还挺有争议的,
24:18
CA: I mean, it's quite
a controversial stance you've taken,
512
1458954
2794
正确的方式就是把它公之于众,
24:21
that the right way to do this
is to put it out there in public
513
1461748
2920
再乘势而上,
24:24
and then harness all this, you know,
514
1464710
1751
不仅仅是你们团队提供反馈,
24:26
instead of just your team giving feedback,
515
1466461
2002
24:28
the world is now giving feedback.
516
1468463
2461
整个世界都在提供反馈。
24:30
But ...
517
1470924
1168
但是……
24:33
If, you know, bad things
are going to emerge,
518
1473135
3753
如果发生了坏事,
24:36
it is out there.
519
1476930
1168
就这么发生了。
24:38
So, you know, the original story
that I heard on OpenAI
520
1478140
2919
我听说 OpenAI 的起源故事是
24:41
when you were founded as a nonprofit,
521
1481101
1793
你们刚开始是作为
非营利性机构成立的,
24:42
well you were there as the great
sort of check on the big companies
522
1482894
4463
你们是查大企业
24:47
doing their unknown,
possibly evil thing with AI.
523
1487399
3837
利用 AI 行不可告人的
不轨之事的利器。
24:51
And you were going to build models
that sort of, you know,
524
1491278
4755
你们做出了一些模型,
24:56
somehow held them accountable
525
1496033
1418
让大企业负起责任,
24:57
and was capable of slowing
the field down, if need be.
526
1497492
4380
也在必要的时候
降低这个领域的发展速度。
25:01
Or at least that's kind of what I heard.
527
1501872
1960
至少我是这么听说的。
25:03
And yet, what's happened,
arguably, is the opposite.
528
1503832
2461
但不得不说,
真正发生的是相反的情况。
25:06
That your release of GPT,
especially ChatGPT,
529
1506334
5673
你们发布了 GPT,
尤其是 ChatGPT,
25:12
sent such shockwaves
through the tech world
530
1512049
2002
掀起了科技界的巨浪,
25:14
that now Google and Meta and so forth
are all scrambling to catch up.
531
1514051
3795
让谷歌、Meta 等等公司
现在手忙脚乱地追赶。
25:17
And some of their criticisms have been,
532
1517888
2085
有些批评是这么说的,
25:20
you are forcing us to put this out here
without proper guardrails or we die.
533
1520015
4963
你在没有适当保护的情况下逼我们
要么接受它的存在,要么死。
25:25
You know, how do you, like,
534
1525020
2794
那你怎么能
25:27
make the case that what you have done
is responsible here and not reckless.
535
1527814
3754
确保你的所作所为
是负责任的、不鲁莽的。
25:31
GB: Yeah, we think about these
questions all the time.
536
1531568
3128
GB: 我们一直在想这些问题。
25:34
Like, seriously all the time.
537
1534738
1418
真的就是一直在想。
25:36
And I don't think we're always
going to get it right.
538
1536198
2711
我们不能保证
做的一直都是对的。
25:38
But one thing I think
has been incredibly important,
539
1538909
2460
但我认为有一个非常重要的点,
从一开始,在我们思考
25:41
from the very beginning,
when we were thinking
540
1541411
2169
如何创造通用人工智能,
25:43
about how to build
artificial general intelligence,
541
1543580
2419
让它造福全人类的时候,
25:45
actually have it benefit all of humanity,
542
1545999
2002
你该怎么达到这个目标呢?对吧?
25:48
like, how are you
supposed to do that, right?
543
1548001
2127
我们默认会采取这样的计划,
悄悄做,
25:50
And that default plan of being,
well, you build in secret,
544
1550170
2711
25:52
you get this super powerful thing,
545
1552923
1626
做出了一个超牛的东西,
然后考虑考虑它的安全性,
按下“运行”,
25:54
and then you figure out the safety of it
and then you push “go,”
546
1554549
3003
然后希望自己做成了。
25:57
and you hope you got it right.
547
1557552
1460
我不知道如何执行这个计划。
25:59
I don't know how to execute that plan.
548
1559012
1835
也许有人知道。
26:00
Maybe someone else does.
549
1560889
1168
但对我来说,这太吓人了,
感觉不太对。
26:02
But for me, that was always terrifying,
it didn't feel right.
550
1562099
2877
所以我们转而采用这种方式,
26:04
And so I think that this
alternative approach
551
1564976
2128
这就是我能看到的
唯一一条途径,
26:07
is the only other path that I see,
552
1567104
2043
26:09
which is that you do let
reality hit you in the face.
553
1569147
2503
就让现实打你的脸。
26:11
And I think you do give people
time to give input.
554
1571691
2336
你也给了人们时间,
让他们提供输入。
26:14
You do have, before these
machines are perfect,
555
1574027
2211
在计算机达到完美状态之前,
26:16
before they are super powerful,
that you actually have the ability
556
1576279
3128
在它们变得超级强大之前,
你有机会
看到它们真正发生的情况。
26:19
to see them in action.
557
1579407
1168
26:20
And we've seen it from GPT-3, right?
558
1580617
1752
我们已经从 GPT-3 身上
看到了,对吧?
26:22
GPT-3, we really were afraid
559
1582369
1376
GPT-3,我们真的很害怕
26:23
that the number one thing
people were going to do with it
560
1583745
2711
人们最想用它做的事
26:26
was generate misinformation,
try to tip elections.
561
1586456
2336
就是生成虚假信息,
试图左右选举。
26:28
Instead, the number one thing
was generating Viagra spam.
562
1588834
2711
其实他们最想做的是
生成壮阳药钓鱼邮件。
26:31
(Laughter)
563
1591545
3169
(笑声)
26:36
CA: So Viagra spam is bad,
but there are things that are much worse.
564
1596007
3212
CA: 壮阳药钓鱼邮件确实不太好,
但是还有更糟糕的事。
26:39
Here's a thought experiment for you.
565
1599219
1752
我们来做一个思想实验。
26:40
Suppose you're sitting in a room,
566
1600971
1710
假设你坐在一个房间里,
26:42
there's a box on the table.
567
1602681
1668
桌子上有个盒子。
26:44
You believe that in that box
is something that,
568
1604349
3003
你认为那个盒子里的东西,
26:47
there's a very strong chance
it's something absolutely glorious
569
1607394
2961
非常有可能是个
很让人喜欢的东西,
26:50
that's going to give beautiful gifts
to your family and to everyone.
570
1610397
3920
可以是送给家人、
送给各位的精美礼品。
26:54
But there's actually also a one percent
thing in the small print there
571
1614359
3629
但也有 1% 的可能性
上面写了个小小的
26:58
that says: “Pandora.”
572
1618029
1877
“潘多拉”。
26:59
And there's a chance
573
1619906
1669
有可能会向世界
释放出无法想象的灾厄。
27:01
that this actually could unleash
unimaginable evils on the world.
574
1621616
4088
27:06
Do you open that box?
575
1626538
1543
你会打开这个盒子吗?
27:08
GB: Well, so, absolutely not.
576
1628123
1460
GB: 当然不会。
27:09
I think you don't do it that way.
577
1629624
1919
我觉得你不会这么做的。
27:12
And honestly, like, I'll tell you a story
that I haven't actually told before,
578
1632210
3796
老实说,我来给你讲一个
我之前没说过的故事,
27:16
which is that shortly
after we started OpenAI,
579
1636006
2586
我们创立 OpenAI 之后不久,
27:18
I remember I was in Puerto Rico
for an AI conference.
580
1638592
2711
我记得当时我在波多黎各
参加一场 AI 会议。
27:21
I'm sitting in the hotel room just
looking out over this wonderful water,
581
1641344
3462
我坐在酒店房间里看海景,
27:24
all these people having a good time.
582
1644806
1752
所有人都玩得很开心。
就在此刻,思考一下:
27:26
And you think about it for a moment,
583
1646558
1752
你现在可以打开
那个潘多拉魔盒,
27:28
if you could choose for basically
that Pandora’s box
584
1648310
4504
27:32
to be five years away
585
1652814
2711
你可以选择在 5 年后
27:35
or 500 years away,
586
1655567
1585
或者 500 年以后,
27:37
which would you pick, right?
587
1657194
1501
你会怎么选?
27:38
On the one hand you're like,
well, maybe for you personally,
588
1658737
2836
一方面你可能觉得
对你自己来说,
27:41
it's better to have it be five years away.
589
1661573
2002
5 年以后比较好。
27:43
But if it gets to be 500 years away
and people get more time to get it right,
590
1663617
3628
但如果得在 500 年以后打开,
人们也有更多的时间让它走上正轨,
27:47
which do you pick?
591
1667287
1168
你会选哪个?
27:48
And you know, I just
really felt it in the moment.
592
1668496
2336
当时我真的觉得该这么选。
27:50
I was like, of course
you do the 500 years.
593
1670874
2002
我觉得你就得选 500 年以后。
27:53
My brother was in the military at the time
594
1673293
2002
我的兄弟当时在服役,
27:55
and like, he puts his life on the line
in a much more real way
595
1675295
2961
他那么实在地冒着生命危险,
27:58
than any of us typing things in computers
596
1678256
2628
比与此同时我们这些
在电脑上打打字、
28:00
and developing this
technology at the time.
597
1680926
2585
开发技术的人要实在得多。
28:03
And so, yeah, I'm really sold
on the you've got to approach this right.
598
1683511
4547
我真的确信人们得
“正视这项技术”这种说法。
28:08
But I don't think that's quite
playing the field as it truly lies.
599
1688058
3628
但我认为这与它所处的领域不符。
28:11
Like, if you look at the whole
history of computing,
600
1691686
2670
纵观整个计算机技术史,
28:14
I really mean it when I say
that this is an industry-wide
601
1694397
4463
我可以认真地说,
这是一场涉及整个行业,
28:18
or even just almost like
602
1698902
1543
或更像是
28:20
a human-development-
of-technology-wide shift.
603
1700487
3336
“人类的技术进步”的变革。
28:23
And the more that you sort of,
don't put together the pieces
604
1703865
4088
你越不把这些现有的
线索、信息整合在一起,
28:27
that are there, right,
605
1707994
1293
28:29
we're still making faster computers,
606
1709329
1752
我们就还在制造更快的计算机、
28:31
we're still improving the algorithms,
all of these things, they are happening.
607
1711081
3670
改进算法,
如此种种都是进行时。
28:34
And if you don't put them together,
you get an overhang,
608
1714793
2627
如果你不整合它们,
你就处在一种“悬空”的状态,
28:37
which means that if someone does,
609
1717420
1627
意思就是如果有人动手做了,
28:39
or the moment that someone does manage
to connect to the circuit,
610
1719089
3086
或者在某一个瞬间,
有人真的打通了这条路,
28:42
then you suddenly have
this very powerful thing,
611
1722175
2252
你就突然间有了一个
非常强大的东西,
28:44
no one's had any time to adjust,
612
1724427
1544
没有人有时间去调整,
28:46
who knows what kind
of safety precautions you get.
613
1726012
2336
谁知道会有什么样的
安全防护措施呢。
28:48
And so I think
that one thing I take away
614
1728390
1918
所以我认为
我学到的一点就是
28:50
is like, even you think about development
of other sort of technologies,
615
1730308
3837
就算你想的是开发其他技术,
28:54
think about nuclear weapons,
616
1734187
1376
比如核武器,
28:55
people talk about being
like a zero to one,
617
1735563
2002
这都是人类力所能及的
28:57
sort of, change in what humans could do.
618
1737565
2628
零到一的改变。
29:00
But I actually think
that if you look at capability,
619
1740235
2461
但如果你看看人类的能力,
29:02
it's been quite smooth over time.
620
1742696
1585
其实一直以来都是很平滑的。
29:04
And so the history, I think,
of every technology we've developed
621
1744281
3670
历史上我们研发出的每一项技术,
29:07
has been, you've got
to do it incrementally
622
1747993
2002
都是一步一步来的,
29:10
and you've got to figure out
how to manage it
623
1750036
2127
每一次你要增强这种能力的时候,
29:12
for each moment that you're increasing it.
624
1752163
2461
你都得考虑如何掌控它。
29:14
CA: So what I'm hearing is that you ...
625
1754666
2252
CA: 我可以这么理解……
29:16
the model you want us to have
626
1756918
1668
你想我们以后使用的模型
29:18
is that we have birthed
this extraordinary child
627
1758628
2795
是我们孕育出的神奇小孩,
29:21
that may have superpowers
628
1761423
2544
它会拥有带领人类
走向新世界的超能力。
29:24
that take humanity to a whole new place.
629
1764009
2544
29:26
It is our collective responsibility
to provide the guardrails
630
1766594
5005
我们都有义务为这个孩子
29:31
for this child
631
1771641
1210
保驾护航,
29:32
to collectively teach it to be wise
and not to tear us all down.
632
1772892
5047
齐心协力让它更智慧,
不要毁灭我们自己。
29:37
Is that basically the model?
633
1777939
1377
这个模型是这样吗?
29:39
GB: I think it's true.
634
1779357
1168
GB: 是的。
29:40
And I think it's also important
to say this may shift, right?
635
1780567
2878
不得不说这种情况
也可能会改变,对吧?
29:43
We've got to take each step
as we encounter it.
636
1783445
3253
我们要随机应变地向前迈进。
29:46
And I think it's incredibly
important today
637
1786740
2002
我认为如今很重要的一点是,
29:48
that we all do get literate
in this technology,
638
1788783
2878
我们都得对这项技术有所了解,
29:51
figure out how to provide the feedback,
639
1791661
1919
搞明白如何提供反馈,
29:53
decide what we want from it.
640
1793621
1377
搞清楚我们到底想从中获得什么。
29:54
And my hope is that that will
continue to be the best path,
641
1794998
3128
我希望,我们依旧
走在最优的道路上,
29:58
but it's so good we're honestly
having this debate
642
1798168
2377
但我也很高兴我们
坦诚地讨论这个话题,
30:00
because we wouldn't otherwise
if it weren't out there.
643
1800545
2628
因为如果这项技术不存在,
我们也不会有这样的讨论了。
30:03
CA: Greg Brockman, thank you so much
for coming to TED and blowing our minds.
644
1803631
3629
CA: 格雷格·布罗克曼,
感谢你来到 TED,
让我们大开眼界。
30:07
(Applause)
645
1807302
1626
(掌声)