Can a robot pass a university entrance exam? | Noriko Arai

205,788 views ・ 2017-09-13

TED


請雙擊下方英文字幕播放視頻。

譯者: Judy Wan-Ling CHEN 審譯者: Wilde Luo
今天我要跟大家聊聊 「人工智慧與你我」,
00:13
Today, I'm going to talk about AI and us.
0
13014
3660
00:18
AI researchers have always said
1
18206
2143
人工智慧研究專家常說,
00:20
that we humans do not need to worry,
2
20373
2594
人類不需要擔心,
00:22
because only menial jobs will be taken over by machines.
3
22991
3580
因為機器只會取代那些 乏味枯燥的粗活。
00:27
Is that really true?
4
27274
1603
是真的嗎?
00:30
They have also said that AI will create new jobs,
5
30365
3827
他們也說過人工智慧 可以創造新的工作,
00:34
so those who lose their jobs will find a new one.
6
34216
3411
因此那些失業的人還是可以再就業。
00:38
Of course.
7
38264
1355
當然。
00:39
But the real question is:
8
39643
2172
只是問題的癥結是:
00:41
How many of those who may lose their jobs to AI
9
41839
4105
多少人因為人工智慧失業後
00:45
will be able to land a new one,
10
45968
2489
能夠真的找到新的工作,
00:48
especially when AI is smart enough to learn better than most of us?
11
48481
5850
尤其當人工智慧已經成熟到 比大多數人都能更有效率地學習?
00:55
Let me ask you a question:
12
55397
2185
讓我問你們一個問題:
你們之中認為在 2020 年之前
00:58
How many of you think
13
58666
1798
01:00
that AI will pass the entrance examination of a top university by 2020?
14
60488
6094
人工智慧將可以通過 頂尖大學的入學測驗的人,請舉手。
01:07
Oh, so many. OK.
15
67836
2457
哇,好多。好的。
01:10
So some of you may say, "Of course, yes!"
16
70317
4358
相當多人會認為:「那當然!」
01:15
Now singularity is the issue.
17
75369
2134
現在「人工智慧的奇點」 是個熱門話題。
01:18
And some others may say, "Maybe,
18
78590
3095
也有很多人會說:「有可能,
01:21
because AI already won against a top Go player."
19
81709
4508
人工智慧都已經打敗過 世界頂尖的圍棋高手了。」
01:27
And others may say, "No, never. Uh-uh."
20
87213
3681
也有一些人持不同看法: 「絕對不可能。嗯,就是這樣。」
01:32
That means we do not know the answer yet, right?
21
92195
3593
這說明了我們其實還沒有 真正的答案,不是嗎?
01:36
So that was the reason why I started Todai Robot Project,
22
96268
4960
這就是我啟動 「東大機器人計畫」的始末,
01:41
making an AI which passes the entrance examination
23
101252
3872
打造一個人工智慧機器人,
通過東京大學的入學考,
01:45
of the University of Tokyo,
24
105148
2589
01:47
the top university in Japan.
25
107761
2537
東大是全日本最頂尖的學府。
01:51
This is our Todai Robot.
26
111464
2548
為您介紹東大機器人。
當然,它正接受遠端伺服器遙控著。
01:56
And, of course, the brain of the robot is working in the remote server.
27
116131
5810
02:02
It is now writing a 600-word essay
28
122747
4222
正在寫一篇 600 字的論文,
02:06
on maritime trade in the 17th century.
29
126993
4119
闡述 17 世紀的海上貿易。
聽起來如何?
02:11
How does that sound?
30
131136
1765
為什麼當初要將入學考試 作為一個標竿?
02:14
Why did I take the entrance exam as its benchmark?
31
134113
4104
因為我想,我們有必要研究人工智慧
02:19
Because I thought we had to study the performance of AI
32
139098
4741
02:23
in comparison to humans,
33
143863
2114
相較人類的表現,
尤其那些有規模 跟特殊性的專長領域,
02:26
especially on the skills and expertise
34
146001
2860
02:28
which are believed to be acquired only by humans
35
148885
4088
向來我們都相信 這些技能和知識惟有人類
02:32
and only through education.
36
152997
2335
透過教育才能獲得。
02:35
To enter Todai, the University of Tokyo,
37
155782
4043
要考上日本的 第一學府,東京大學,
02:39
you have to pass two different types of exams.
38
159849
4421
必須通過兩項測驗。
第一個是日本高考, 全國標準化的測驗,
02:44
The first one is a national standardized test
39
164294
3760
全選擇題的題型。
02:48
in multiple-choice style.
40
168078
2403
02:50
You have to take seven subjects
41
170505
2455
你必須在七大科目中
02:52
and achieve a high score --
42
172984
1955
都獲得高分——
02:54
I would say like an 85 percent or more accuracy rate --
43
174963
4772
大概要達到 85% 以上的正確率——
02:59
to be allowed to take the second stage written test
44
179759
4087
才可以獲准進入第二階段筆試,
03:03
prepared by Todai.
45
183870
2159
由東大命題。
03:06
So let me first explain how modern AI works,
46
186994
5317
先讓我說明一下 當代的人工智慧運作的方式,
03:12
taking the "Jeopardy!" challenge as an example.
47
192335
3069
舉一個 「Jeopardy!」 《危險邊緣》 的例子。
03:17
Here is a typical "Jeopardy!" question:
48
197539
3079
典型的一道題目如下:
03:20
"Mozart's last symphony shares its name with this planet."
49
200642
4461
「莫札特最後創作的交響曲 跟這個星球同名。」
03:26
Interestingly, a "Jeopardy!" question always asks,
50
206195
4013
有趣的是,
《危險邊緣》的例子總是有
03:30
always ends with "this" something:
51
210232
3328
「這個」字眼在裡頭:
03:33
"this" planet, "this" country,
52
213584
2827
「這個」星球、「這個」國家,
03:36
"this" rock musician, and so on.
53
216435
2608
「這個」搖滾樂手 或「這個」什麼什麼。
也就是說,《危險邊緣》 沒有太多類型的題目,
03:39
In other words, "Jeopardy!" doesn't ask many different types of questions,
54
219067
4299
03:43
but a single type,
55
223390
1837
幾乎只有這一類,
03:45
which we call "factoid questions."
56
225251
2536
我們稱作「趣味小問題」。
03:48
By the way, do you know the answer?
57
228975
2167
有沒有人剛好知道答案呀?
03:53
If you do not know the answer and if you want to know the answer,
58
233980
4055
假如我們不知道答案, 但是又想要知道,
怎麼辦?
03:58
what would you do?
59
238059
1287
當然是上網搜尋呀——
04:00
You Google, right? Of course.
60
240160
3132
04:03
Why not?
61
243316
1480
為甚麼不呢?
04:04
But you have to pick appropriate keywords
62
244820
3592
但要挑對關鍵字,
04:08
like "Mozart," "last" and "symphony" to search.
63
248436
4364
譬如輸入像「莫札特」、 「最後」跟「交響曲」去搜尋。
04:13
The machine basically does the same.
64
253462
2400
基本上機器人也是這麼做。
04:16
Then this Wikipedia page will be ranked top.
65
256457
4660
維基百科的頁面就會出現在最上頭。
所以機器人就開始 「讀」這個頁面嗎?
04:21
Then the machine reads the page.
66
261840
1908
04:23
No, uh-uh.
67
263772
1171
並不是喔──
04:25
Unfortunately, none of the modern AIs,
68
265470
3462
不幸的是,所有當代的智慧機器人,
04:28
including Watson, Siri and Todai Robot,
69
268956
3968
無論是 IBM 的華生、蘋果的 Siri 或是東大機器人,
04:32
is able to read.
70
272948
1661
它們都沒有「閱讀」的能力。
04:35
But they are very good at searching and optimizing.
71
275437
3800
但是它們在搜尋跟 得到最佳化結果上很在行。
它會找到關鍵字像是
04:40
It will recognize
72
280158
2023
04:42
that the keywords "Mozart," "last" and "symphony"
73
282866
2935
「莫札特」、「最後」 跟「交響曲」,
04:45
are appearing heavily around here.
74
285825
2903
重複地出現在這一帶。
04:49
So if it can find a word which is a planet
75
289790
4375
接著繼續尋找屬於星球的詞彙,
04:54
and which is co-occurring with these keywords,
76
294189
3648
是跟前述這些關鍵字同時出現的,
04:57
that must be the answer.
77
297861
1989
那鐵定就是答案了。
05:00
This is how Watson finds the answer "Jupiter," in this case.
78
300762
5186
華生就是這樣找到「木星」的。
05:08
Our Todai Robot works similarly, but a bit smarter
79
308433
4049
東大機器人的運作方式很接近, 但在回答歷史科目的
05:12
in answering history yes-no questions,
80
312506
3239
判斷題上表現稍好,
05:16
like, "'Charlemagne repelled the Magyars.' Is this sentence true or false?"
81
316560
5663
例如: 查理曼大帝擊敗馬札爾人,
對還是錯?
05:23
Our robot starts producing a factoid question,
82
323181
4073
機器人自動轉換為一道 趣味小問題,
05:27
like: "Charlemagne repelled [this person type]" by itself.
83
327278
4899
變成:「 查理曼大帝 擊敗了這一種人」。
05:32
Then, "Avars" but not "Magyars" is ranked top.
84
332995
4732
結果最上頭出現了「阿瓦爾人」 而非「馬札爾人」。
05:38
This sentence is likely to be false.
85
338357
3049
所以這個陳述句很可能是錯誤的。
05:42
Our robot does not read, does not understand,
86
342772
4860
機器人不會閱讀,也不了解,
05:48
but it is statistically correct in many cases.
87
348335
4144
但從統計學角度評估, 卻具有高準確度。
至於第二階段的筆試,
05:54
For the second stage written test,
88
354147
2508
05:56
it is required to write a 600-word essay like this one:
89
356679
5106
受測者必須寫一篇 600 字的論文,
如這一道題:
06:01
[Discuss the rise and fall of the maritime trade
90
361809
2278
(闡述 17 世紀時東亞與東南亞 海上貿易的興衰……)
如同我稍早展示過的,
06:04
in East and Southeast Asia in the 17th century ...]
91
364111
2422
06:06
and as I have shown earlier,
92
366557
1387
我們的機器人將教科書 與維基百科的句子
06:07
our robot took the sentences from the textbooks and Wikipedia,
93
367968
4194
06:12
combined them together,
94
372186
1961
併在一起,
優化後形成一篇文章,
06:14
and optimized it to produce an essay
95
374171
3619
06:17
without understanding a thing.
96
377814
2207
完全不懂字裡行間的意涵。
06:20
(Laughter)
97
380045
1737
(笑聲)
06:21
But surprisingly, it wrote a better essay
98
381806
4895
但是令人驚訝的是, 機器人這樣寫出來的文章,
06:26
than most of the students.
99
386725
1561
居然比大多數的學生好。
06:28
(Laughter)
100
388310
2391
(笑聲)
06:30
How about mathematics?
101
390725
1529
那麼數學呢?
06:33
A fully automatic math-solving machine
102
393354
3158
能全自動處理數學問題的機器人,
06:36
has been a dream
103
396536
1631
是大家都夢寐以求的,
06:38
since the birth of the word "artificial intelligence,"
104
398191
4679
打從「人工智慧」的概念 問世以來就是如此。
06:43
but it has stayed at the level of arithmetic for a long, long time.
105
403785
6007
但是,它曾經 長期停滯在算術的階段。
06:51
Last year, we finally succeeded in developing a system
106
411530
5350
去年我們總算成功發展一套系統,
06:56
which solved pre-university-level problems from end to end,
107
416904
5173
可以從頭到尾地 解決中等教育程度的數學題目,
07:02
like this one.
108
422101
1262
像這一題。
07:05
This is the original problem written in Japanese,
109
425648
4002
原文是日文。
07:09
and we had to teach it 2,000 mathematical axioms
110
429674
4397
我們必須先教會機器人 2,000 個數學公理,
與 8,000 個日文字,
07:14
and 8,000 Japanese words
111
434095
2774
07:16
to make it accept the problems written in natural language.
112
436893
4558
才能讓機器人 看懂原文的數學題目。
07:22
And it is now translating the original problems
113
442234
3542
它現在正在翻譯原來的題目
07:25
into machine-readable formulas.
114
445800
3139
成為機器人的語言。
07:30
Weird, but it is now ready to solve it, I think.
115
450578
6099
很怪,不過應該可以開始計算了。
07:36
Go and solve it.
116
456701
1411
開始解題。
07:38
Yes! It is now executing symbolic computation.
117
458818
4284
沒錯!它正在進行符號運算。
更怪了,
07:44
Even more weird,
118
464030
1580
07:45
but probably this is the most fun part for the machine.
119
465634
4825
但或許機器人會覺得這個才好玩。
07:50
(Laughter)
120
470483
2351
(笑聲)
07:52
Now it outputs a perfect answer,
121
472858
2815
好了,它產出了一個完美的解。
07:55
though its proof is impossible to read, even for mathematicians.
122
475697
4707
儘管連數學家都證實了, 完全沒有人看得懂。
08:02
Anyway, last year our robot was among the top one percent
123
482773
6961
無論如何,去年我們的機器人 在第二階段的數學表現中
被歸類在排名前 1% 高分的群組中。
08:10
in the second stage written exam in mathematics.
124
490199
3633
08:14
(Applause)
125
494652
3210
(掌聲)
08:18
Thank you.
126
498412
1311
謝謝。
08:19
So, did it enter Todai?
127
499747
2471
所以它最終有沒有考上東大呢?
08:22
No, not as I expected.
128
502981
3058
它並沒有如預期的金榜題名。
08:26
Why?
129
506783
1399
為什麼?
08:28
Because it doesn't understand any meaning.
130
508206
2639
因為它根本什麼也不懂。
08:32
Let me show you a typical error it made in the English test.
131
512308
4079
讓我展示一個在英文科的典型錯誤。
08:36
[Nate: We're almost at the bookstore. Just a few more minutes.
132
516411
2977
(奈特:我們快到書店了, 再過幾分鐘就到了。
桑妮:等一下。______。 奈特:謝謝!每次都這樣……)
08:39
Sunil: Wait. ______ . Nate: Thank you! That always happens ...]
133
519412
3039
兩個人在對話。
08:42
Two people are talking.
134
522475
1151
我們都明白發生了什麽——
08:43
For us, who can understand the situation --
135
523650
2054
(選項:1. 我們走了很久的路 2. 我們幾乎快到了
08:45
[1. "We walked for a long time." 2. "We're almost there."
136
525704
2773
3. 你的鞋子看起來好昂貴 4. 你的鞋帶鬆了)
很明顯地,標準答案 是選 4,同意嗎?
08:48
3. "Your shoes look expensive." 4. "Your shoelace is untied."]
137
528501
3032
08:51
it is obvious number four is the correct answer, right?
138
531557
2873
可是東大機器人選 2,
08:54
But Todai Robot chose number two,
139
534454
2238
08:56
even after learning 15 billion English sentences
140
536716
5360
就算已經學習了 150 億個英文詞句,
還透過深度學習技術。
09:02
using deep learning technologies.
141
542100
2728
09:07
OK, so now you might understand what I said:
142
547600
4172
好吧,現在你可能明白我所說的:
09:12
modern AIs do not read,
143
552399
2648
當代人工智慧沒有辦法閱讀,
不能理解。
09:15
do not understand.
144
555071
1413
09:17
They only disguise as if they do.
145
557516
3169
它們只是佯裝成什麽都懂。
09:24
This is the distribution graph
146
564867
2981
這個分佈圖,
代表跟東大機器人一起接受入學 考試的其他 50 萬名考生的成績。
09:27
of half a million students who took the same exam as Todai Robot.
147
567872
5777
09:34
Now our Todai Robot is among the top 20 percent,
148
574558
5165
機器人排名其中的前 20%,
09:40
and it was capable to pass
149
580986
2415
可以考進日本
09:43
more than 60 percent of the universities in Japan --
150
583425
3941
超過六成的大學──
09:47
but not Todai.
151
587390
1377
但就是考不上東大。
可是看看被它超越的廣大區塊,
09:50
But see how it is beyond the volume zone
152
590116
4025
所謂的白領階級。
09:54
of to-be white-collar workers.
153
594165
2864
你可能會猜想我應該很開心。
10:00
You might think I was delighted.
154
600060
2858
10:03
After all, my robot was surpassing students everywhere.
155
603939
3971
畢竟我的機器人正在 全面性地超越學生。
其實不然,我很驚恐。
10:09
Instead, I was alarmed.
156
609022
2691
這個一點都不聰明的機器人居然 表現得比學生們──
10:13
How on earth could this unintelligent machine outperform students --
157
613086
5607
也是我們的孩子們,更好?
10:18
our children?
158
618717
1292
怎麼可以?
10:20
Right?
159
620033
1153
我決定深入調查 人類世界究竟發生了什麼事。
10:22
I decided to investigate what was going on in the human world.
160
622101
4402
10:28
I took hundreds of sentences from high school textbooks
161
628542
4729
我收集了上百個 高中教科書裡頭的詞句,
10:33
and made easy multiple-choice quizzes,
162
633859
3313
然後編成簡單的選擇題測驗,
10:37
and asked thousands of high school students to answer.
163
637196
4143
讓上千位高中生接受測試。
10:42
Here is an example:
164
642690
1176
這是其中一個範例:
10:43
[Buddhism spread to ... , Christianity to ... and Oceania,
165
643890
2818
題目都是用他們的母語 ──日文寫的。
10:46
and Islam to ...]
166
646732
1151
10:47
Of course, the original problems are written in Japanese,
167
647907
2740
(題目:______傳播到了大洋洲。
10:50
their mother tongue.
168
650671
1155
10:51
[ ______ has spread to Oceania.
169
651850
1515
1. 印度教 2. 基督教 3. 伊斯蘭教 4. 佛教)
10:53
1. Hinduism 2. Christianity 3. Islam 4. Buddhism ]
170
653389
2417
顯而易見的答案是基督教,對吧?
10:55
Obviously, Christianity is the answer, isn't it?
171
655830
2299
都包含在題目所給信息裡了。
10:58
It's written!
172
658153
1214
11:01
And Todai Robot chose the correct answer, too.
173
661482
4026
東大機器人也選出了正確的答案。
11:06
But one-third of junior high school students
174
666758
4879
但是有三分之一的國中生
11:11
failed to answer this question.
175
671661
2612
無法回答這個問題。
11:16
Do you think it is only the case in Japan?
176
676456
3159
你以為這個問題只存在於日本嗎?
11:19
I do not think so,
177
679639
1976
我不這麼認為,
11:21
because Japan is always ranked among the top in OECD PISA tests,
178
681639
6371
日本總是在國際學生能力評估計劃 測驗中名列前茅。
那是一套衡量 15 歲青少年在數學、
11:28
measuring 15-year-old students' performance in mathematics,
179
688034
3927
11:31
science and reading
180
691985
1964
科學與閱讀素質的測驗,
11:33
every three years.
181
693973
1636
每三年考一次。
11:39
We have been believing
182
699390
2053
我們一直相信,
11:41
that everybody can learn
183
701467
2043
每個人都能學習,
11:43
and learn well,
184
703534
1905
並且學得出色,
11:45
as long as we provide good learning materials
185
705463
3697
只要我們提供高質量的學習資料,
11:49
free on the web
186
709184
1455
這些免費資源,
11:50
so that they can access through the internet.
187
710663
3069
讓他們透過網路取得使用。
11:53
But such wonderful materials may benefit only those who can read well,
188
713756
5859
但這些優質的資料 只會讓那些能夠有效閱讀的人受益,
12:00
and the percentage of those who can read well
189
720534
3935
而能夠有效閱讀的人所佔的比例,
12:04
may be much less than we expected.
190
724493
3378
可能遠低於我們的預期。
人類要如何與人工智慧共存
12:10
How we humans will coexist with AI
191
730040
4241
12:14
is something we have to think about carefully,
192
734305
3522
是我們要謹慎思考的課題,
12:17
based on solid evidence.
193
737851
2137
客觀考量各項可靠證據。
同時,我們也要加緊腳步思考,
12:21
At the same time, we have to think in a hurry
194
741063
3977
因為所剩時間不多了。
12:25
because time is running out.
195
745064
2402
謝謝。
12:28
Thank you.
196
748106
1162
12:29
(Applause)
197
749626
3933
(掌聲)
克里斯 · 安德森:紀子,謝謝你。
12:34
Chris Anderson: Noriko, thank you.
198
754211
2080
新井紀子:謝謝。
12:36
Noriko Arai: Thank you.
199
756315
1765
克里斯:您方才向我們 說明了人工智慧的運作方式。
12:38
CA: In your talk, you so beautifully give us a sense of how AIs think,
200
758104
5304
12:43
what they can do amazingly
201
763432
1564
它們可以完美勝任的,
以及無法勝任的工作。
12:45
and what they can't do.
202
765020
1695
12:46
But -- do I read you right,
203
766739
1494
但是,我的解讀是否是正確的,
12:48
that you think we really need quite an urgent revolution in education
204
768257
5270
你認為我們急需教育改革,
12:53
to help kids do the things that humans can do better than AIs?
205
773551
4155
以協助學子在特定的領域中 讓人工智慧難以望其項背?
12:57
NA: Yes, yes, yes.
206
777730
1328
紀子:是的。
因為我們人類可以理解意義。
12:59
Because we humans can understand the meaning.
207
779082
4035
而這在人工智慧中是相當缺乏的。
13:03
That is something which is very, very lacking in AI.
208
783141
4906
但是大多數的學生都只會囫圇吞棗,
13:08
But most of the students just pack the knowledge
209
788071
4368
13:12
without understanding the meaning of the knowledge,
210
792463
3903
而非深入理解知識,
13:16
so that is not knowledge, that is just memorizing,
211
796390
2809
這就只是單純記憶的動作,
13:19
and AI can do the same thing.
212
799223
2450
人工智慧也辦得到。
13:21
So we have to think about a new type of education.
213
801697
3631
所以我們應該要思考新型態的教育。
13:25
CA: A shift from knowledge, rote knowledge, to meaning.
214
805352
3289
克里斯:從「死記硬背」 到「深入理解」的轉變。
13:28
NA: Mm-hmm.
215
808665
1151
紀子:嗯嗯嗯。
13:29
CA: Well, there's a challenge for the educators. Thank you so much.
216
809840
3240
克里斯:我想這是給 教育家的一大挑戰,
再次感謝您。
紀子:謝謝,非常謝謝你們。
13:33
NA: Thank you very much. Thank you.
217
813104
1698
13:34
(Applause)
218
814826
1185
(掌聲)
關於本網站

本網站將向您介紹對學習英語有用的 YouTube 視頻。 您將看到來自世界各地的一流教師教授的英語課程。 雙擊每個視頻頁面上顯示的英文字幕,從那裡播放視頻。 字幕與視頻播放同步滾動。 如果您有任何意見或要求,請使用此聯繫表與我們聯繫。

https://forms.gle/WvT1wiN1qDtmnspy7