Can a robot pass a university entrance exam? | Noriko Arai

202,616 views ・ 2017-09-13

TED


请双击下面的英文字幕来播放视频。

翻译人员: heng gao 校对人员: Jack Zhang
00:13
Today, I'm going to talk about AI and us.
0
13014
3660
今天我将和大家聊聊人工智能和我们
00:18
AI researchers have always said
1
18206
2143
曾经,人工智能方面的研究人员 总是告诉我们,
00:20
that we humans do not need to worry,
2
20373
2594
我们人类不需要担心,
00:22
because only menial jobs will be taken over by machines.
3
22991
3580
因为机器只能从事体力劳动,
00:27
Is that really true?
4
27274
1603
可真的是那样吗?
00:30
They have also said that AI will create new jobs,
5
30365
3827
他们还说人工智能可以创造 新的工作机会,
00:34
so those who lose their jobs will find a new one.
6
34216
3411
因此,那些因其失去工作的人 可以找到新的工作。
00:38
Of course.
7
38264
1355
这当然对的。
00:39
But the real question is:
8
39643
2172
但是,真正的问题是:
00:41
How many of those who may lose their jobs to AI
9
41839
4105
那些因为人工智能出现 而失去工作的人中
00:45
will be able to land a new one,
10
45968
2489
有多少人能够找到一份新的工作?
00:48
especially when AI is smart enough to learn better than most of us?
11
48481
5850
尤其是,当人工智足够聪明到 比大多数人类做得更好时。
00:55
Let me ask you a question:
12
55397
2185
我想问大家一个问题:
00:58
How many of you think
13
58666
1798
你们中有多少人认为
01:00
that AI will pass the entrance examination of a top university by 2020?
14
60488
6094
到2020年,人工智能可以通过 顶尖大学的入学考试。
01:07
Oh, so many. OK.
15
67836
2457
哦,好多。好的
01:10
So some of you may say, "Of course, yes!"
16
70317
4358
你们中有人可能会说: "当然了,这是肯定的!“
01:15
Now singularity is the issue.
17
75369
2134
现在的问题是什么时候 会到达这个科技奇点。
01:18
And some others may say, "Maybe,
18
78590
3095
其他一些人会说:”可能吧,
01:21
because AI already won against a top Go player."
19
81709
4508
因为人工智能已经赢了 一个顶级的围棋选手。“
01:27
And others may say, "No, never. Uh-uh."
20
87213
3681
然后其他人可能说: ”不会的,这不可能发生。“
01:32
That means we do not know the answer yet, right?
21
92195
3593
这就说明我们现在其实并不知道答案 对不对?
01:36
So that was the reason why I started Todai Robot Project,
22
96268
4960
这就是我为什么发起了 “东大”机器人项目,
01:41
making an AI which passes the entrance examination
23
101252
3872
创造一个可以通过日本顶尖大学
01:45
of the University of Tokyo,
24
105148
2589
(东京大学)的入学考试
01:47
the top university in Japan.
25
107761
2537
的机器人。
01:51
This is our Todai Robot.
26
111464
2548
这就是我们的“东大”机器人。
01:56
And, of course, the brain of the robot is working in the remote server.
27
116131
5810
当然,机器人的大脑是在远程服务器中工作。
02:02
It is now writing a 600-word essay
28
122747
4222
图中他正在写一篇600字的短文,
02:06
on maritime trade in the 17th century.
29
126993
4119
主题是17世纪的海运贸易。
02:11
How does that sound?
30
131136
1765
这听起来怎么样?
02:14
Why did I take the entrance exam as its benchmark?
31
134113
4104
为什么我要选择入学考试作为 他的衡量标准呢?
02:19
Because I thought we had to study the performance of AI
32
139098
4741
因为,我认为我们要研究 人工智能的表现,
02:23
in comparison to humans,
33
143863
2114
就一定要将它与人类进行对比
02:26
especially on the skills and expertise
34
146001
2860
尤其是在那些我们认为只有人类可以
02:28
which are believed to be acquired only by humans
35
148885
4088
而且只能通过教育,
02:32
and only through education.
36
152997
2335
才能获得的技能和知识。
02:35
To enter Todai, the University of Tokyo,
37
155782
4043
为了进入东大——东京大学
02:39
you have to pass two different types of exams.
38
159849
4421
你需要通过两个不同类型的考试。
02:44
The first one is a national standardized test
39
164294
3760
第一个是全国标准测试,
02:48
in multiple-choice style.
40
168078
2403
考试是以多项选择的形式。
02:50
You have to take seven subjects
41
170505
2455
你需要考7个科目,
02:52
and achieve a high score --
42
172984
1955
并且获得一个较高的分数。
02:54
I would say like an 85 percent or more accuracy rate --
43
174963
4772
我估计大概有84%的人, 准确的说或者有更多,
02:59
to be allowed to take the second stage written test
44
179759
4087
能有机会参加第二阶段
03:03
prepared by Todai.
45
183870
2159
由东大准备的笔试。
03:06
So let me first explain how modern AI works,
46
186994
5317
首先,让我向大家解释一下 现代人工智能是如何工作的,
03:12
taking the "Jeopardy!" challenge as an example.
47
192335
3069
就以”危险边缘!“挑战游戏作例子。
03:17
Here is a typical "Jeopardy!" question:
48
197539
3079
这有一个典型的”危险边缘!" 游戏问题:
03:20
"Mozart's last symphony shares its name with this planet."
49
200642
4461
“莫扎特的最后一部交响曲与这颗星球 有相同的名字。”
03:26
Interestingly, a "Jeopardy!" question always asks,
50
206195
4013
有趣的是,“危险边缘!”游戏
总是以提问的方式出现,而且总是以 “这个”什么东西作为结尾。
03:30
always ends with "this" something:
51
210232
3328
03:33
"this" planet, "this" country,
52
213584
2827
这颗星球,这个国家,
03:36
"this" rock musician, and so on.
53
216435
2608
这个摇滚明星,等等。
03:39
In other words, "Jeopardy!" doesn't ask many different types of questions,
54
219067
4299
换而言之,“危险边缘!"游戏 并不会问不同类型的问题,
03:43
but a single type,
55
223390
1837
而是只有一种形式,
03:45
which we call "factoid questions."
56
225251
2536
我们称之为“事实问答”。
03:48
By the way, do you know the answer?
57
228975
2167
对了,你们知道这个问题的答案吗?
03:53
If you do not know the answer and if you want to know the answer,
58
233980
4055
如果你们不知道这个问题的答案, 但是如果你们想知道答案,
03:58
what would you do?
59
238059
1287
你们会怎么做?
04:00
You Google, right? Of course.
60
240160
3132
你们会用谷歌搜索,对不对? 当然是这样。
04:03
Why not?
61
243316
1480
为什么不呢?
04:04
But you have to pick appropriate keywords
62
244820
3592
但是,你必须选择合适的关键字,
04:08
like "Mozart," "last" and "symphony" to search.
63
248436
4364
比如“莫扎特”,“最后”, “交响曲”来进行搜索。
04:13
The machine basically does the same.
64
253462
2400
人工智能基本上也是这样做的。
04:16
Then this Wikipedia page will be ranked top.
65
256457
4660
然后关于这个的维基百科页面 会被排在第一个位置。
04:21
Then the machine reads the page.
66
261840
1908
接下来,机器将阅读这个页面,
04:23
No, uh-uh.
67
263772
1171
不,不是这样。
04:25
Unfortunately, none of the modern AIs,
68
265470
3462
不幸的是,没有任何现存的人工智能
04:28
including Watson, Siri and Todai Robot,
69
268956
3968
包括 Watson(IBM创造的人工智能), Siri 和东大机器人,
04:32
is able to read.
70
272948
1661
能够进行阅读。
04:35
But they are very good at searching and optimizing.
71
275437
3800
但是,他们非常擅长搜索和 寻找最优解。
04:40
It will recognize
72
280158
2023
他们可以识别到一个位置,
04:42
that the keywords "Mozart," "last" and "symphony"
73
282866
2935
这些关键字 “莫扎特”,“最后”和 “交响曲”
04:45
are appearing heavily around here.
74
285825
2903
在这个位置附近出现的频率非常高。
04:49
So if it can find a word which is a planet
75
289790
4375
因此,如果他可以在这个位置附近
04:54
and which is co-occurring with these keywords,
76
294189
3648
找到一个代表星球的单词,
04:57
that must be the answer.
77
297861
1989
那么这个词就一定是答案了。
05:00
This is how Watson finds the answer "Jupiter," in this case.
78
300762
5186
因此,Watson 找到了答案是“木星”。
05:08
Our Todai Robot works similarly, but a bit smarter
79
308433
4049
我们的东大机器人工作方式与这个相似,但稍稍聪明点,
05:12
in answering history yes-no questions,
80
312506
3239
对于回答历史方面的“是-否”问题时,
05:16
like, "'Charlemagne repelled the Magyars.' Is this sentence true or false?"
81
316560
5663
比如,“查理曼击退了马扎尔人。” 这句话是真的还是假的?
05:23
Our robot starts producing a factoid question,
82
323181
4073
我们的机器人自己首先 会提出一个事实问题,
05:27
like: "Charlemagne repelled [this person type]" by itself.
83
327278
4899
比如:“查理曼击退【……人】”
05:32
Then, "Avars" but not "Magyars" is ranked top.
84
332995
4732
然后,如果是“阿尔瓦人”而不是 “马扎儿人”被排在了第一个。
05:38
This sentence is likely to be false.
85
338357
3049
这句话就很可能就是错误的。
05:42
Our robot does not read, does not understand,
86
342772
4860
我们的机器人并不会阅读, 也无法理解,
05:48
but it is statistically correct in many cases.
87
348335
4144
但是在统计意义上 大多数情况下是对的。
05:54
For the second stage written test,
88
354147
2508
对于第二阶段的笔试,
05:56
it is required to write a 600-word essay like this one:
89
356679
5106
要求考试者写一篇600字的短文,比如这个:
06:01
[Discuss the rise and fall of the maritime trade
90
361809
2278
【讨论一下十七世纪东亚和东南亚
海运贸易的兴起和衰落……】
正如我之前展示的,
06:04
in East and Southeast Asia in the 17th century ...]
91
364111
2422
06:06
and as I have shown earlier,
92
366557
1387
我们的机器人从课本和维基百科上 摘抄句子,
06:07
our robot took the sentences from the textbooks and Wikipedia,
93
367968
4194
06:12
combined them together,
94
372186
1961
并将它们组合在一起,
06:14
and optimized it to produce an essay
95
374171
3619
然后以最优解的方式生成一篇短文,
06:17
without understanding a thing.
96
377814
2207
尽管他自己完全不理解 自己写的是啥。
06:20
(Laughter)
97
380045
1737
(笑声)
06:21
But surprisingly, it wrote a better essay
98
381806
4895
但令人吃惊的是,他比大多数学生
06:26
than most of the students.
99
386725
1561
写的文章还要好。
06:28
(Laughter)
100
388310
2391
(笑声)
06:30
How about mathematics?
101
390725
1529
那把人格智能应用到数学方面会怎么样呢?
06:33
A fully automatic math-solving machine
102
393354
3158
自从“人工智能”这个词诞生开始,
06:36
has been a dream
103
396536
1631
完全自动的解决数学问题的机器,
06:38
since the birth of the word "artificial intelligence,"
104
398191
4679
就一直是人们梦寐以求的。
06:43
but it has stayed at the level of arithmetic for a long, long time.
105
403785
6007
但是他已经停留在只能做算术的水平 很长时间了。
06:51
Last year, we finally succeeded in developing a system
106
411530
5350
去年,我们终于成功开发了一个系统
06:56
which solved pre-university-level problems from end to end,
107
416904
5173
它可以从头到尾求解
07:02
like this one.
108
422101
1262
像这样的大学入学水准的题目。
07:05
This is the original problem written in Japanese,
109
425648
4002
这道题是由日语写的,
07:09
and we had to teach it 2,000 mathematical axioms
110
429674
4397
我们需要教它2000个数学公理,
07:14
and 8,000 Japanese words
111
434095
2774
和8000日文单词,
07:16
to make it accept the problems written in natural language.
112
436893
4558
它才能识别用我们用日文写的问题。
07:22
And it is now translating the original problems
113
442234
3542
现在他正将原始的问题
07:25
into machine-readable formulas.
114
445800
3139
翻译成机器可识别的公式。
07:30
Weird, but it is now ready to solve it, I think.
115
450578
6099
这听起来有点怪, 但我想它现在可以解这道题了。
07:36
Go and solve it.
116
456701
1411
开始解这道题。
07:38
Yes! It is now executing symbolic computation.
117
458818
4284
好的,现在它在正在进行符号计算。
07:44
Even more weird,
118
464030
1580
这可能看起来可能更奇怪。
07:45
but probably this is the most fun part for the machine.
119
465634
4825
但对于这台机器而言, 这可能是最有趣的部分。
07:50
(Laughter)
120
470483
2351
(笑声)
07:52
Now it outputs a perfect answer,
121
472858
2815
现在它输出了一份完美的答案。
07:55
though its proof is impossible to read, even for mathematicians.
122
475697
4707
尽管他的证明过程对数学家来说 也是不可能读懂的。
08:02
Anyway, last year our robot was among the top one percent
123
482773
6961
但不管怎么说,我们的机器人去年在 第二阶段的数学笔试中,
08:10
in the second stage written exam in mathematics.
124
490199
3633
排名前1%。
08:14
(Applause)
125
494652
3210
(鼓掌)
08:18
Thank you.
126
498412
1311
谢谢!
08:19
So, did it enter Todai?
127
499747
2471
那么,他考入东大了吗?
08:22
No, not as I expected.
128
502981
3058
没有,并不象我期望的那样。
08:26
Why?
129
506783
1399
为什么呢?
08:28
Because it doesn't understand any meaning.
130
508206
2639
因为它无法理解任何意义。
08:32
Let me show you a typical error it made in the English test.
131
512308
4079
让我给你们展示一个它在英文考试中 犯的一个典型错误。
08:36
[Nate: We're almost at the bookstore. Just a few more minutes.
132
516411
2977
【Nate:我们再过几分钟就要到书店了。
08:39
Sunil: Wait. ______ . Nate: Thank you! That always happens ...]
133
519412
3039
Sunil: 等一下。_______. Nate: 谢谢! 这经常发生……。】
08:42
Two people are talking.
134
522475
1151
两个人在谈话。
08:43
For us, who can understand the situation --
135
523650
2054
对于我们这些能够理解语境的人,
08:45
[1. "We walked for a long time." 2. "We're almost there."
136
525704
2773
A.”我们走了很长时间了。“ B. ”我们马上到那了。“
08:48
3. "Your shoes look expensive." 4. "Your shoelace is untied."]
137
528501
3032
C.”你的鞋看起来很贵。“ D.”你的鞋带开了。“
08:51
it is obvious number four is the correct answer, right?
138
531557
2873
很明显D是正确答案,对吧?
08:54
But Todai Robot chose number two,
139
534454
2238
但是,东大机器人选了B答案。
08:56
even after learning 15 billion English sentences
140
536716
5360
而且还是在使用深度学习技术
09:02
using deep learning technologies.
141
542100
2728
学习了150亿个英文句子之后。
09:07
OK, so now you might understand what I said:
142
547600
4172
好的, 你们现在应该能理解我之前说的:
09:12
modern AIs do not read,
143
552399
2648
现代人工智能不能阅读,
09:15
do not understand.
144
555071
1413
无法理解意义。
09:17
They only disguise as if they do.
145
557516
3169
他们只是装作他们好像能这样做。
09:24
This is the distribution graph
146
564867
2981
这是一张50万学生成绩的分布图,
09:27
of half a million students who took the same exam as Todai Robot.
147
567872
5777
他们参加了和东大机器人一样的考试。
09:34
Now our Todai Robot is among the top 20 percent,
148
574558
5165
我们的东大机器人排在前20%,
09:40
and it was capable to pass
149
580986
2415
而且他现在可以通过
09:43
more than 60 percent of the universities in Japan --
150
583425
3941
超过60%的日本大学的考试,
09:47
but not Todai.
151
587390
1377
除了东京大学。
09:50
But see how it is beyond the volume zone
152
590116
4025
但是,我们可以看到 他处在能够成为白领的人群比例
09:54
of to-be white-collar workers.
153
594165
2864
之上。
10:00
You might think I was delighted.
154
600060
2858
你们可能认为我应该感到高兴。
10:03
After all, my robot was surpassing students everywhere.
155
603939
3971
毕竟我的机器人 超过了多有地方的学生。
10:09
Instead, I was alarmed.
156
609022
2691
但是相反地,我感到恐慌。
10:13
How on earth could this unintelligent machine outperform students --
157
613086
5607
这个不智能的机器到底为什么 比我们的学生,我们的孩子,
10:18
our children?
158
618717
1292
表现的还好?
10:20
Right?
159
620033
1153
是不是?
10:22
I decided to investigate what was going on in the human world.
160
622101
4402
我决定调查一下我们这到底是为什么。
10:28
I took hundreds of sentences from high school textbooks
161
628542
4729
我从高中的课本中摘了几百段句子,
10:33
and made easy multiple-choice quizzes,
162
633859
3313
制作了简单的多项选择问卷,
10:37
and asked thousands of high school students to answer.
163
637196
4143
然后让上千名学生回答。
10:42
Here is an example:
164
642690
1176
这是有一个例子:
10:43
[Buddhism spread to ... , Christianity to ... and Oceania,
165
643890
2818
[佛教传播到...., 基督教传播到....和大洋洲,
10:46
and Islam to ...]
166
646732
1151
伊斯兰教传播到....,]
10:47
Of course, the original problems are written in Japanese,
167
647907
2740
当然,问题都是用日文写的,
10:50
their mother tongue.
168
650671
1155
他们的母语。
10:51
[ ______ has spread to Oceania.
169
651850
1515
[ ________传播到了大洋洲。
10:53
1. Hinduism 2. Christianity 3. Islam 4. Buddhism ]
170
653389
2417
A.印度教 B. 基督教 C.伊斯兰教 D. 佛教】
10:55
Obviously, Christianity is the answer, isn't it?
171
655830
2299
很明显,答案是基督教,对不对?
10:58
It's written!
172
658153
1214
上边写着呢。
11:01
And Todai Robot chose the correct answer, too.
173
661482
4026
而东大机器人也选择了正确的答案。
11:06
But one-third of junior high school students
174
666758
4879
但是有1/3的初中学生
11:11
failed to answer this question.
175
671661
2612
并没有回答对。
11:16
Do you think it is only the case in Japan?
176
676456
3159
你认为这种情况只发生在日本吗?
11:19
I do not think so,
177
679639
1976
我不这么认为,
11:21
because Japan is always ranked among the top in OECD PISA tests,
178
681639
6371
因为日本人总是在经合组织的 国际学生评估测试中名列前茅。
11:28
measuring 15-year-old students' performance in mathematics,
179
688034
3927
这个项目每3年
11:31
science and reading
180
691985
1964
会评估15岁学生
11:33
every three years.
181
693973
1636
在数学、科学和阅读方面的表现。
11:39
We have been believing
182
699390
2053
我们一直相信
11:41
that everybody can learn
183
701467
2043
每个人都能够学习
11:43
and learn well,
184
703534
1905
而且能够学好,
11:45
as long as we provide good learning materials
185
705463
3697
只要我们在网上提供
11:49
free on the web
186
709184
1455
免费的,好的学习材料,
11:50
so that they can access through the internet.
187
710663
3069
这样他们就可以通过网络 获取这些资料。
11:53
But such wonderful materials may benefit only those who can read well,
188
713756
5859
但是这些优质的学习资料可能 只会使那些理解能力强的人受益,
12:00
and the percentage of those who can read well
189
720534
3935
而那些理解能力强的人所占的比例
12:04
may be much less than we expected.
190
724493
3378
可能远远比我们认为的要少。
12:10
How we humans will coexist with AI
191
730040
4241
基于真实的证据,
我们将如何与人工智能和平共存?
12:14
is something we have to think about carefully,
192
734305
3522
是我们人类需要认真思考一个问题。
12:17
based on solid evidence.
193
737851
2137
12:21
At the same time, we have to think in a hurry
194
741063
3977
同时,我们要赶快思考这个问题,
12:25
because time is running out.
195
745064
2402
因为,留给我们的时间十分有限。
12:28
Thank you.
196
748106
1162
谢谢!
12:29
(Applause)
197
749626
3933
(鼓掌)
12:34
Chris Anderson: Noriko, thank you.
198
754211
2080
Chris Anderson: Noriko, 谢谢你。
12:36
Noriko Arai: Thank you.
199
756315
1765
Noriko Arai: 谢谢!
12:38
CA: In your talk, you so beautifully give us a sense of how AIs think,
200
758104
5304
CA: 你的出色的演讲, 让我们感受到了人工智能是如何思考
12:43
what they can do amazingly
201
763432
1564
它们能够做什么令人吃惊的事,
12:45
and what they can't do.
202
765020
1695
以及什么事情是它们做不了。
12:46
But -- do I read you right,
203
766739
1494
但是,我不知道我理解的对不对
12:48
that you think we really need quite an urgent revolution in education
204
768257
5270
你认为我们在教育方面急切地 需要进行改革,
12:53
to help kids do the things that humans can do better than AIs?
205
773551
4155
以帮助孩子们从事那些人类能够比 人工智能做的更好的工作。
12:57
NA: Yes, yes, yes.
206
777730
1328
NA:是, 是的。
12:59
Because we humans can understand the meaning.
207
779082
4035
因为我们人类能够理解其中的意义。
13:03
That is something which is very, very lacking in AI.
208
783141
4906
这对人工智能来说非常缺乏的。
13:08
But most of the students just pack the knowledge
209
788071
4368
但是,大多数学生仅仅是 被动的接受知识(填鸭式教育),
13:12
without understanding the meaning of the knowledge,
210
792463
3903
而并没有理解知识背后的意义。
13:16
so that is not knowledge, that is just memorizing,
211
796390
2809
因此,那些根本不是知识, 那些只是记忆而已。
13:19
and AI can do the same thing.
212
799223
2450
而人工智能也能做同样的事。
13:21
So we have to think about a new type of education.
213
801697
3631
因此,我们要思考一种 新的教育方式。
13:25
CA: A shift from knowledge, rote knowledge, to meaning.
214
805352
3289
CA: 从死记硬背转向理解意义。
13:28
NA: Mm-hmm.
215
808665
1151
NA: 嗯嗯。
13:29
CA: Well, there's a challenge for the educators. Thank you so much.
216
809840
3240
CA: 好的,这是对现场教育者 的一个挑战。非常感谢你。
13:33
NA: Thank you very much. Thank you.
217
813104
1698
NA: 谢谢你们。
13:34
(Applause)
218
814826
1185
关于本网站

这个网站将向你介绍对学习英语有用的YouTube视频。你将看到来自世界各地的一流教师教授的英语课程。双击每个视频页面上显示的英文字幕,即可从那里播放视频。字幕会随着视频的播放而同步滚动。如果你有任何意见或要求,请使用此联系表与我们联系。

https://forms.gle/WvT1wiN1qDtmnspy7