Rajesh Rao: Computing a Rosetta Stone for the Indus script

1,196,598 views ・ 2011-06-28

TED


请双击下面的英文字幕来播放视频。

翻译人员: Jenny Yang 校对人员: Lan Xu
00:15
I'd like to begin with a thought experiment.
0
15260
3000
首先,我们来做个思考的实验
00:19
Imagine that it's 4,000 years into the future.
1
19260
3000
想像距今4000年后
00:22
Civilization as we know it
2
22260
2000
我们所了解的文明
00:24
has ceased to exist --
3
24260
2000
不复存在
00:26
no books,
4
26260
2000
没有书籍
00:28
no electronic devices,
5
28260
3000
没有电子设备
00:31
no Facebook or Twitter.
6
31260
3000
没有脸书没有推特
00:34
All knowledge of the English language and the English alphabet
7
34260
3000
英语和英语字母
00:37
has been lost.
8
37260
2000
也都消失殆尽
00:39
Now imagine archeologists
9
39260
2000
现在再来想想考古学家们
00:41
digging through the rubble of one of our cities.
10
41260
2000
在我们城市残砖碎瓦中挖掘
00:43
What might they find?
11
43260
2000
他们会发现什么呢?
00:45
Well perhaps some rectangular pieces of plastic
12
45260
3000
可能会发现长方形的塑胶
00:48
with strange symbols on them.
13
48260
3000
上面印着奇怪的符号
00:51
Perhaps some circular pieces of metal.
14
51260
3000
或者发现一些圆形的金属碎片
00:54
Maybe some cylindrical containers
15
54260
2000
也可能是些圆柱型的容器
00:56
with some symbols on them.
16
56260
2000
上头有些符号
00:58
And perhaps one archeologist becomes an instant celebrity
17
58260
3000
还可能有某个考古学家在
01:01
when she discovers --
18
61260
2000
北美的某个地方
01:03
buried in the hills somewhere in North America --
19
63260
2000
发现了大量相似的符号后
01:05
massive versions of these same symbols.
20
65260
3000
一夜成名
01:10
Now let's ask ourselves,
21
70260
2000
让我们来问问我们自己
01:12
what could such artifacts say about us
22
72260
3000
我们的这些残存给4000年后
01:15
to people 4,000 years into the future?
23
75260
3000
的人们展示了一个怎样的我们
01:18
This is no hypothetical question.
24
78260
2000
这不是假设
01:20
In fact, this is exactly the kind of question we're faced with
25
80260
3000
事实上,这正是我们面对
01:23
when we try to understand the Indus Valley civilization,
26
83260
3000
4000年前存在的印度河流域文明时
01:26
which existed 4,000 years ago.
27
86260
2000
面对的问题
01:28
The Indus civilization was roughly contemporaneous
28
88260
3000
古印度河流域文明和较为人所熟知的
01:31
with the much better known Egyptian and the Mesopotamian civilizations,
29
91260
3000
埃及以及美所不达米亚文化大概处于同时期
01:34
but it was actually much larger than either of these two civilizations.
30
94260
3000
但其实它的范围超过这两个文明中的任何一个
01:37
It occupied the area
31
97260
2000
它涵盖了
01:39
of approximately one million square kilometers,
32
99260
2000
大约一百万平方公里
01:41
covering what is now Pakistan,
33
101260
2000
包括今天的巴基斯坦
01:43
Northwestern India
34
103260
2000
印度的西北部
01:45
and parts of Afghanistan and Iran.
35
105260
2000
部分阿富汗和伊朗
01:47
Given that it was such a vast civilization,
36
107260
2000
如此广大的文明
01:49
you might expect to find really powerful rulers, kings,
37
109260
4000
你可能会期待有强大的统治者,国王
01:53
and huge monuments glorifying these powerful kings.
38
113260
3000
以及用来纪念王者的巨大纪念碑
01:56
In fact,
39
116260
2000
事实上
01:58
what archeologists have found is none of that.
40
118260
2000
考古学者没有发现那些
02:00
They've found small objects such as these.
41
120260
3000
他们发现了这样一些小物件
02:03
Here's an example of one of these objects.
42
123260
3000
这就是其中的一样
02:06
Well obviously this is a replica.
43
126260
2000
当然这只是个复制品
02:08
But who is this person?
44
128260
3000
但这是谁呢?
02:11
A king? A god?
45
131260
2000
是一个国王?一个神?
02:13
A priest?
46
133260
2000
还是某个祭祀?
02:15
Or perhaps an ordinary person
47
135260
2000
或者是一个普通人
02:17
like you or me?
48
137260
2000
如你和我?
02:19
We don't know.
49
139260
2000
我们不知道
02:21
But the Indus people also left behind artifacts with writing on them.
50
141260
3000
但是古印度流域的人们也留下了背面写有字符的手工制品
02:24
Well no, not pieces of plastic,
51
144260
2000
不是在塑料片上
02:26
but stone seals, copper tablets,
52
146260
3000
而是在石头图章,铜片上
02:29
pottery and, surprisingly,
53
149260
2000
陶器,还有令人惊讶的
02:31
one large sign board,
54
151260
2000
在很大的符号石板上
02:33
which was found buried near the gate of a city.
55
153260
2000
掩埋在城门附近的
02:35
Now we don't know if it says Hollywood,
56
155260
2000
所以我们不知道上面是否写了好莱坞
02:37
or even Bollywood for that matter.
57
157260
2000
甚至是宝莱坞
02:39
In fact, we don't even know
58
159260
2000
事实上,我们不知道
02:41
what any of these objects say,
59
161260
2000
上面写的任何东西
02:43
and that's because the Indus script is undeciphered.
60
163260
3000
因为我们还不能破译这些印度河文字
02:46
We don't know what any of these symbols mean.
61
166260
2000
我们不知道这些符号是什么意思
02:48
The symbols are most commonly found on seals.
62
168260
3000
这些符号大多数出现在印章上
02:51
So you see up there one such object.
63
171260
2000
你们可以看到这个物品
02:53
It's the square object with the unicorn-like animal on it.
64
173260
3000
这个方形的物品上有个像是独角兽的动物
02:56
Now that's a magnificent piece of art.
65
176260
2000
这是一件很精致的艺术品
02:58
So how big do you think that is?
66
178260
2000
你觉得这块有多大呢?
03:00
Perhaps that big?
67
180260
2000
这么大?
03:02
Or maybe that big?
68
182260
2000
也许这么大?
03:04
Well let me show you.
69
184260
2000
让我来给你看看
03:07
Here's a replica of one such seal.
70
187260
3000
这是这枚印章的复制品
03:10
It's only about one inch by one inch in size --
71
190260
2000
只有一平方英吋大
03:12
pretty tiny.
72
192260
2000
很小
03:14
So what were these used for?
73
194260
2000
那这是干什么用的呢?
03:16
We know that these were used for stamping clay tags
74
196260
3000
我们知道这是用来盖在黏土标签上的
03:19
that were attached to bundles of goods that were sent from one place to the other.
75
199260
3000
一捆捆的货物被附上标签,从一地被运往另一地
03:22
So you know those packing slips you get on your FedEx boxes?
76
202260
3000
你知道联邦快递盒子里的那种装箱单吧?
03:25
These were used to make those kinds of packing slips.
77
205260
3000
这些就是用来做那些送货单的
03:28
You might wonder what these objects contain
78
208260
3000
你也许会想知道这些东西
03:31
in terms of their text.
79
211260
2000
包涵了什么文字含义
03:33
Perhaps they're the name of the sender
80
213260
2000
也许是寄件人的名字
03:35
or some information about the goods
81
215260
2000
或者是某些货物的信息
03:37
that are being sent from one place to the other -- we don't know.
82
217260
3000
从一地被送往别处-我们不知道
03:40
We need to decipher the script to answer that question.
83
220260
2000
我们需要破解这段文字才能回答那个问题
03:42
Deciphering the script
84
222260
2000
破解这段文字
03:44
is not just an intellectual puzzle;
85
224260
2000
不只是破解一个智力难题
03:46
it's actually become a question
86
226260
2000
事实上这个问题
03:48
that's become deeply intertwined
87
228260
2000
深深地和南亚的政治
03:50
with the politics and the cultural history of South Asia.
88
230260
3000
文化历史交织在一起
03:53
In fact, the script has become a battleground of sorts
89
233260
3000
而对这些文字的争论已经形成了一个战场
03:56
between three different groups of people.
90
236260
2000
三群持不同见解团体各持己见
03:58
First, there's a group of people
91
238260
2000
第一群人
04:00
who are very passionate in their belief
92
240260
2000
热切地坚信
04:02
that the Indus script
93
242260
2000
这些印度河流域的字符
04:04
does not represent a language at all.
94
244260
2000
完全不代表语言
04:06
These people believe that the symbols
95
246260
2000
这些人认为这些符号
04:08
are very similar to the kind of symbols you find on traffic signs
96
248260
3000
和交通标示的符号相似
04:11
or the emblems you find on shields.
97
251260
3000
也像盾上面的一些象征符号
04:14
There's a second group of people
98
254260
2000
第二群人
04:16
who believe that the Indus script represents an Indo-European language.
99
256260
3000
相信印度河文字代表着某个印欧语言
04:19
If you look at a map of India today,
100
259260
2000
如果你看看印度今日的版图
04:21
you'll see that most of the languages spoken in North India
101
261260
3000
很多在印度北部使用的语言
04:24
belong to the Indo-European language family.
102
264260
3000
都是属于印欧语系
04:27
So some people believe that the Indus script
103
267260
2000
所以一些人相信古印度河文字
04:29
represents an ancient Indo-European language such as Sanskrit.
104
269260
3000
是某种古老的印欧语,就像梵语一样
04:32
There's a last group of people
105
272260
2000
最后一群人
04:34
who believe that the Indus people
106
274260
3000
相信古印度人
04:37
were the ancestors of people living in South India today.
107
277260
3000
是现居南印度的人的祖先
04:40
These people believe that the Indus script
108
280260
2000
这些人觉得古印度文字
04:42
represents an ancient form
109
282260
2000
是达罗毗荼语系下的
04:44
of the Dravidian language family,
110
284260
2000
一种古老的文字形式
04:46
which is the language family spoken in much of South India today.
111
286260
3000
现今南印度多数人皆使用达罗毗荼语
04:49
And the proponents of this theory
112
289260
2000
支持这一理论的人
04:51
point to that small pocket of Dravidian-speaking people in the North,
113
291260
3000
指出印度北部有一小部份使用达罗毗荼语的人
04:54
actually near Afghanistan,
114
294260
2000
事实上靠近阿富汗
04:56
and they say that perhaps, sometime in the past,
115
296260
3000
他们说或许,过去曾经有一段时间
04:59
Dravidian languages were spoken all over India
116
299260
3000
整个印度都使用达罗毗荼语
05:02
and that this suggests
117
302260
2000
这也意味着
05:04
that the Indus civilization is perhaps also Dravidian.
118
304260
3000
印度河文明或许就是达罗毗荼文明
05:07
Which of these hypotheses can be true?
119
307260
3000
这些假设哪个是真的呢?
05:10
We don't know, but perhaps if you deciphered the script,
120
310260
2000
我们不知道, 但如果你能解开这些文字符号
05:12
you would be able to answer this question.
121
312260
2000
你就能回答这个问题
05:14
But deciphering the script is a very challenging task.
122
314260
2000
但是破解文字密码是一项很具有挑战性的任务
05:16
First, there's no Rosetta Stone.
123
316260
2000
首先, 我们没有罗塞塔石碑
05:18
I don't mean the software;
124
318260
2000
我不是指那软件
05:20
I mean an ancient artifact
125
320260
2000
我是指那件文物
05:22
that contains in the same text
126
322260
2000
上面用我们知道和不知道的文字
05:24
both a known text and an unknown text.
127
324260
3000
书写了同样的内容。
05:27
We don't have such an artifact for the Indus script.
128
327260
3000
印度河文化中并没有发现同样的古物
05:30
And furthermore, we don't even know what language they spoke.
129
330260
3000
而且,我们不知道他们说的是什么语言
05:33
And to make matters even worse,
130
333260
2000
更糟的是
05:35
most of the text that we have are extremely short.
131
335260
2000
大部份的文字都相当短
05:37
So as I showed you, they're usually found on these seals
132
337260
2000
就像我刚才给大家看的,最常发现的是在印章类的东西上
05:39
that are very, very tiny.
133
339260
2000
他们都非常非常地小
05:41
And so given these formidable obstacles,
134
341260
2000
给我们解读带来了难以克服的障碍
05:43
one might wonder and worry
135
343260
2000
有些人或许会担心
05:45
whether one will ever be able to decipher the Indus script.
136
345260
3000
是否真的有谁可以解开古印度河文字之谜呢?
05:48
In the rest of my talk,
137
348260
2000
在我接下来的演讲中
05:50
I'd like to tell you about how I learned to stop worrying
138
350260
2000
我要来告诉各位我怎么学会停止担心
05:52
and love the challenge posed by the Indus script.
139
352260
2000
并喜欢上解读古印度河文字所带来的挑战
05:54
I've always been fascinated by the Indus script
140
354260
3000
我对古印度河文字的痴迷
05:57
ever since I read about it in a middle school textbook.
141
357260
2000
从在中学课本中读到便开始
05:59
And why was I fascinated?
142
359260
2000
我为什么会着迷呢?
06:01
Well it's the last major undeciphered script in the ancient world.
143
361260
4000
因为这是历史上目前唯一未被破解的重要的古文字
06:05
My career path led me to become a computational neuroscientist,
144
365260
3000
我的职业生涯让我成为一个计算机神经科学家
06:08
so in my day job,
145
368260
2000
我白天的工作
06:10
I create computer models of the brain
146
370260
2000
是在电脑上模拟大脑
06:12
to try to understand how the brain makes predictions,
147
372260
3000
试图了借大脑是怎样预知事物的
06:15
how the brain makes decisions,
148
375260
2000
大脑是怎样做决定的,
06:17
how the brain learns and so on.
149
377260
2000
大脑是怎样学习的等等
06:19
But in 2007, my path crossed again with the Indus script.
150
379260
3000
但在2007年,我与古印度河文字重逢
06:22
That's when I was in India,
151
382260
2000
当时在印度
06:24
and I had the wonderful opportunity
152
384260
2000
我有幸得到机会
06:26
to meet with some Indian scientists
153
386260
2000
跟一些印度科学家会面
06:28
who were using computer models to try to analyze the script.
154
388260
3000
他们在用计算机模型分析这些文字
06:31
And so it was then that I realized
155
391260
2000
那个时候我认识到
06:33
there was an opportunity for me to collaborate with these scientists,
156
393260
3000
这是我跟这些科学家合作的一个机会
06:36
and so I jumped at that opportunity.
157
396260
2000
所以我立刻就抓住了这个机会
06:38
And I'd like to describe some of the results that we have found.
158
398260
2000
我想在这里介绍我们发现的一些结果
06:40
Or better yet, let's all collectively decipher.
159
400260
3000
或者更好的是,让我们一起来破解吧
06:43
Are you ready?
160
403260
2000
你们准备好了吗?
06:45
The first thing that you need to do when you have an undeciphered script
161
405260
3000
当你想要破解文字符号时,首先你要
06:48
is try to figure out the direction of writing.
162
408260
2000
分辨书写的方向
06:50
Here are two texts that contain some symbols on them.
163
410260
3000
这里有两段相同符号的文字
06:53
Can you tell me
164
413260
2000
你能告诉我
06:55
if the direction of writing is right to left or left to right?
165
415260
3000
书写的方向是从右到左还是从左到右?
06:58
I'll give you a couple of seconds.
166
418260
3000
我给大家几秒钟想一想
07:01
Okay. Right to left, how many? Okay.
167
421260
3000
好, 从右到左,有多少人同意?好
07:04
Okay. Left to right?
168
424260
2000
从左到右呢?
07:06
Oh, it's almost 50/50. Okay.
169
426260
2000
噢,大约是一半一半
07:08
The answer is:
170
428260
2000
答案是:
07:10
if you look at the left-hand side of the two texts,
171
430260
2000
如果你看这两段文字的左边
07:12
you'll notice that there's a cramping of signs,
172
432260
3000
你会注意到左边的符号挤在一起
07:15
and it seems like 4,000 years ago,
173
435260
2000
好像是4000年前
07:17
when the scribe was writing from right to left,
174
437260
2000
当书写者从右写到左时
07:19
they ran out of space.
175
439260
2000
因为最后没地方了写了
07:21
And so they had to cram the sign.
176
441260
2000
只能缩小间距
07:23
One of the signs is also below the text on the top.
177
443260
2000
其中有一个符号也出现在下面的文字上面
07:25
This suggests the direction of writing
178
445260
2000
意味着书写的方向
07:27
was probably from right to left,
179
447260
2000
是从右到左的
07:29
and so that's one of the first things we know,
180
449260
2000
所以我们知道的第一件事是
07:31
that directionality is a very key aspect of linguistic scripts.
181
451260
3000
书写方向是语言符号中一个很关键的方面
07:34
And the Indus script now has
182
454260
2000
因此古印度河文字就有了这个
07:36
this particular property.
183
456260
2000
特别的特征
07:38
What other properties of language does the script show?
184
458260
2000
这文字符号中还有什么其他语言特征?
07:40
Languages contain patterns.
185
460260
2000
语言会有一定模式
07:42
If I give you the letter Q
186
462260
2000
如果我给你一个字母Q
07:44
and ask you to predict the next letter, what do you think that would be?
187
464260
3000
然后请你猜下一个字母,你们觉得会是什么呢?
07:47
Most of you said U, which is right.
188
467260
2000
大部份人都会说U,那是对的
07:49
Now if I asked you to predict one more letter,
189
469260
2000
现在我请你们再猜下个字母
07:51
what do you think that would be?
190
471260
2000
你们觉得会是什么呢?
07:53
Now there's several thoughts. There's E. It could be I. It could be A,
191
473260
3000
现在这边有很多种答案,有人说E,可能是I, 也可能是一个A,
07:56
but certainly not B, C or D, right?
192
476260
3000
但是肯定不是B,C,或者D,对吗?
07:59
The Indus script also exhibits similar kinds of patterns.
193
479260
3000
印度河文字也有类似的模式
08:02
There's a lot of text that start with this diamond-shaped symbol.
194
482260
3000
有很多段文字的开头都有菱形符号
08:05
And this in turn tends to be followed
195
485260
2000
接着常常是跟着一个
08:07
by this quotation marks-like symbol.
196
487260
2000
有点象引号那样的符号
08:09
And this is very similar to a Q and U example.
197
489260
2000
这就跟Q跟U的例子很相似
08:11
This symbol can in turn be followed
198
491260
2000
这符号之后
08:13
by these fish-like symbols and some other signs,
199
493260
3000
会有着类似鱼等其他的符号
08:16
but never by these other signs at the bottom.
200
496260
2000
但从来不会是最底下出现的这些符号
08:18
And furthermore, there's some signs
201
498260
2000
进一步来说,有些符号
08:20
that really prefer the end of texts,
202
500260
2000
常常出现在文章结尾
08:22
such as this jar-shaped sign,
203
502260
2000
像是这壶型的记号
08:24
and this sign, in fact, happens to be
204
504260
2000
事实上,这记号刚好是
08:26
the most frequently occurring sign in the script.
205
506260
2000
在这些文句中最常出现的记号
08:28
Given such patterns, here was our idea.
206
508260
3000
看出这些惯用模式后,我们有个想法
08:31
The idea was to use a computer
207
511260
2000
就是用电脑
08:33
to learn these patterns,
208
513260
2000
去了解这些图案
08:35
and so we gave the computer the existing texts.
209
515260
3000
我们将这些现存的文字符号输入电脑
08:38
And the computer learned a statistical model
210
518260
2000
然后电脑以某种统计模型
08:40
of which symbols tend to occur together
211
520260
2000
算出哪些符号会一起出现
08:42
and which symbols tend to follow each other.
212
522260
2000
而哪些符号常以先后顺序出现
08:44
Given the computer model,
213
524260
2000
使用此电脑模式
08:46
we can test the model by essentially quizzing it.
214
526260
3000
我们可以用考试的方法来测试这个模式
08:49
So we could deliberately erase some symbols,
215
529260
2000
我们可以故意去掉某些符号
08:51
and we can ask it to predict the missing symbols.
216
531260
3000
我们让计算机来预测缺少的字符
08:54
Here are some examples.
217
534260
3000
这是一些例子
09:00
You may regard this
218
540260
2000
你们可以把这个看作
09:02
as perhaps the most ancient game
219
542260
2000
最古老的
09:04
of Wheel of Fortune.
220
544260
3000
幸运大转盘游戏(一个填字游戏)
09:08
What we found
221
548260
2000
我们发现
09:10
was that the computer was successful in 75 percent of the cases
222
550260
2000
计算机预测正确字符的
09:12
in predicting the correct symbol.
223
552260
2000
成功率是75%
09:14
In the rest of the cases,
224
554260
2000
剩下的例子中
09:16
typically the second best guess or third best guess was the right answer.
225
556260
3000
正确答案不是在第二选择就是在第三选择中
09:19
There's also practical use
226
559260
2000
这种预测方法
09:21
for this particular procedure.
227
561260
2000
会派到实际的用处
09:23
There's a lot of these texts that are damaged.
228
563260
2000
很多文字符号都已经损坏
09:25
Here's an example of one such text.
229
565260
2000
这里有个例子
09:27
And we can use the computer model now to try to complete this text
230
567260
3000
我们可以使用电脑模式来修复这段文字内容
09:30
and make a best guess prediction.
231
570260
2000
并做出最好的猜测
09:32
Here's an example of a symbol that was predicted.
232
572260
3000
这就是一个预测的例子
09:35
And this could be really useful as we try to decipher the script
233
575260
2000
这非常有用, 在我们破解字符的过程中
09:37
by generating more data that we can analyze.
234
577260
3000
让我们得到了更多可供分析的数据
09:40
Now here's one other thing you can do with the computer model.
235
580260
3000
还有另一件事也是可以利用电脑模式来做的
09:43
So imagine a monkey
236
583260
2000
想象有个猴子
09:45
sitting at a keyboard.
237
585260
2000
坐在键盘上
09:47
I think you might get a random jumble of letters that looks like this.
238
587260
3000
我猜会得到一堆像这样随机跑出的字母
09:50
Such a random jumble of letters
239
590260
2000
这些随机混乱的字母
09:52
is said to have a very high entropy.
240
592260
2000
说明了很高的熵
09:54
This is a physics and information theory term.
241
594260
2000
这是物理学和信息理论的说法
09:56
But just imagine it's a really random jumble of letters.
242
596260
3000
但想像这是相当随机混乱的字母排列
09:59
How many of you have ever spilled coffee on a keyboard?
243
599260
4000
你们中多少人曾经将咖啡溅在键盘上?
10:03
You might have encountered the stuck-key problem --
244
603260
2000
你也许会碰到键盘卡住的问题
10:05
so basically the same symbol being repeated over and over again.
245
605260
3000
所以同一个字符会一再重复
10:08
This kind of a sequence is said to have a very low entropy
246
608260
3000
这类的数列可以说有着非常低的熵
10:11
because there's no variation at all.
247
611260
2000
因为它们一点都没变化
10:13
Language, on the other hand, has an intermediate level of entropy;
248
613260
3000
语言,从别个角度说,有着中间程度的熵
10:16
it's neither too rigid,
249
616260
2000
它不是很严格
10:18
nor is it too random.
250
618260
2000
也不是很随机
10:20
What about the Indus script?
251
620260
2000
那印度河字符呢?
10:22
Here's a graph that plots the entropies of a whole bunch of sequences.
252
622260
4000
这个图表显示了一大堆顺序的熵值
10:26
At the very top you find the uniformly random sequence,
253
626260
2000
你会在上面发现那些随机顺序
10:28
which is a random jumble of letters --
254
628260
2000
那是一些杂乱随机放置的字母
10:30
and interestingly, we also find
255
630260
2000
有趣的是, 我们同时发现
10:32
the DNA sequence from the human genome and instrumental music.
256
632260
3000
人类基因的DNA排序和乐器演奏的音乐
10:35
And both of these are very, very flexible,
257
635260
2000
它们都非常非常灵活
10:37
which is why you find them in the very high range.
258
637260
2000
所以他们处在高程度的范围里
10:39
At the lower end of the scale,
259
639260
2000
在低端范围里
10:41
you find a rigid sequence, a sequence of all A's,
260
641260
2000
你可以发现很严格的顺序, 比如全是字母A的排列
10:43
and you also find a computer program,
261
643260
2000
你还会发现一个计算机程序
10:45
in this case in the language Fortran,
262
645260
2000
是用FORTRAN语言写的
10:47
which obeys really strict rules.
263
647260
2000
它遵循严格的规则
10:49
Linguistic scripts
264
649260
2000
语言文字
10:51
occupy the middle range.
265
651260
2000
占中间段
10:53
Now what about the Indus script?
266
653260
2000
那印度河文字呢?
10:55
We found that the Indus script
267
655260
2000
我们发现印度河流域的文字
10:57
actually falls within the range of the linguistic scripts.
268
657260
2000
的数值正好落在语言文字段里
10:59
When this result was first published,
269
659260
2000
这个结果刚公布的时候
11:01
it was highly controversial.
270
661260
3000
引起了很大的争议
11:04
There were people who raised a hue and cry,
271
664260
3000
有人提出强烈反对
11:07
and these people were the ones who believed
272
667260
2000
这些批评的人认为
11:09
that the Indus script does not represent language.
273
669260
3000
古印度河文字根本不是文字
11:12
I even started to get some hate mail.
274
672260
2000
我甚至收到一些恶意信函
11:14
My students said
275
674260
2000
我的学生说
11:16
that I should really seriously consider getting some protection.
276
676260
3000
我应该要严肃地考虑怎样保护自身安危
11:19
Who'd have thought
277
679260
2000
谁会想到
11:21
that deciphering could be a dangerous profession?
278
681260
2000
解读古文字会变成一种危险职业
11:23
What does this result really show?
279
683260
2000
这结果说明了什么呢?
11:25
It shows that the Indus script
280
685260
2000
说明了古印度河文字
11:27
shares an important property of language.
281
687260
2000
有着重要的语言特征
11:29
So, as the old saying goes,
282
689260
2000
所以,就像老话说的那样
11:31
if it looks like a linguistic script
283
691260
2000
如果这看起来像是语言性文字
11:33
and it acts like a linguistic script,
284
693260
2000
且它也有语言性文字的功用
11:35
then perhaps we may have a linguistic script on our hands.
285
695260
3000
恐怕我们手里捧着的大概就是一种语言文字
11:38
What other evidence is there
286
698260
2000
有什么其他证据显示
11:40
that the script could actually encode language?
287
700260
2000
这类文字事实上包涵语言
11:42
Well linguistic scripts can actually encode multiple languages.
288
702260
3000
语言字符其实可以被包括在多种语言中
11:45
So for example, here's the same sentence written in English
289
705260
3000
比如,这是同样一句话写成英语
11:48
and the same sentence written in Dutch
290
708260
2000
这是这句话写成荷兰语
11:50
using the same letters of the alphabet.
291
710260
2000
用到同样的字母
11:52
If you don't know Dutch and you only know English
292
712260
3000
如果你不懂荷兰语只知道英语
11:55
and I give you some words in Dutch,
293
715260
2000
我给你看一些荷兰语单词
11:57
you'll tell me that these words contain
294
717260
2000
你会说这些词包涵了
11:59
some very unusual patterns.
295
719260
2000
一些独特的模式
12:01
Some things are not right,
296
721260
2000
有些地方不对
12:03
and you'll say these words are probably not English words.
297
723260
3000
你也会说这些不是英文单词
12:06
The same thing happens in the case of the Indus script.
298
726260
2000
这同样也发生在印度河文字中
12:08
The computer found several texts --
299
728260
2000
计算机也发现一些文字
12:10
two of them are shown here --
300
730260
2000
这里有两三个
12:12
that have very unusual patterns.
301
732260
2000
他们具有很独特的模式
12:14
So for example the first text:
302
734260
2000
比如,第一个例子
12:16
there's a doubling of this jar-shaped sign.
303
736260
3000
这个壶形字符被重复
12:19
This sign is the most frequently-occurring sign
304
739260
2000
这个符号是印度河文字里
12:21
in the Indus script,
305
741260
2000
最常见的一个记号
12:23
and it's only in this text
306
743260
2000
而只在这段文字里
12:25
that it occurs as a doubling pair.
307
745260
2000
它以成对的模式出现
12:27
Why is that the case?
308
747260
2000
为什么呢?
12:29
We went back and looked at where these particular texts were found,
309
749260
3000
我们回去找这些特殊的文字是在哪里找到的
12:32
and it turns out that they were found
310
752260
2000
我们发现它们出现在离
12:34
very, very far away from the Indus Valley.
311
754260
2000
印度河谷十分遥远的地方
12:36
They were found in present day Iraq and Iran.
312
756260
3000
它们是在现在的伊朗和伊拉克被发现的
12:39
And why were they found there?
313
759260
2000
它们怎么会出现在那里呢?
12:41
What I haven't told you is that
314
761260
2000
我还没有告诉你们
12:43
the Indus people were very, very enterprising.
315
763260
2000
印度人相当会做生意
12:45
They used to trade with people pretty far away from where they lived,
316
765260
3000
他们曾跑到离他们居住很远的地方去做交易
12:48
and so in this case, they were traveling by sea
317
768260
3000
在这个例子中,他们跨越海洋
12:51
all the way to Mesopotamia, present-day Iraq.
318
771260
3000
到了两河流域,也就是今天的伊拉克
12:54
And what seems to have happened here
319
774260
2000
看起来
12:56
is that the Indus traders, the merchants,
320
776260
3000
印度的生意人,商人
12:59
were using this script to write a foreign language.
321
779260
3000
在用这些文字记录外语
13:02
It's just like our English and Dutch example.
322
782260
2000
就好比英语和荷兰语的例子
13:04
And that would explain why we have these strange patterns
323
784260
2000
这可以解释为什么我们会发现这些奇怪的规律
13:06
that are very different from the kinds of patterns you see in the text
324
786260
3000
它们和在印度河流域发现的文字的规律
13:09
that are found within the Indus Valley.
325
789260
3000
非常不同
13:12
This suggests that the same script, the Indus script,
326
792260
2000
这说明印度河文字,同样的文字
13:14
could be used to write different languages.
327
794260
3000
可能被用来记录不同的语言
13:17
The results we have so far seem to point to the conclusion
328
797260
3000
根据我们现有的研究结果指向的结论来看
13:20
that the Indus script probably does represent language.
329
800260
3000
印度河流域的这些字符代表语言
13:23
If it does represent language,
330
803260
2000
而如果它的确代表语言
13:25
then how do we read the symbols?
331
805260
2000
那么我们怎样来解读这些符号
13:27
That's our next big challenge.
332
807260
2000
这是对我们最大的挑战
13:29
So you'll notice that many of the symbols
333
809260
2000
你注意到很多这些符号
13:31
look like pictures of humans, of insects,
334
811260
2000
看上去像人和昆虫的图画
13:33
of fishes, of birds.
335
813260
3000
或者鱼,鸟
13:36
Most ancient scripts
336
816260
2000
很多古代的字符
13:38
use the rebus principle,
337
818260
2000
都使用了画谜原则
13:40
which is, using pictures to represent words.
338
820260
3000
也就是用图画代表字词
13:43
So as an example, here's a word.
339
823260
3000
比如, 这是一个词
13:46
Can you write it using pictures?
340
826260
2000
你能用图画来书写吗?
13:48
I'll give you a couple seconds.
341
828260
2000
我给你们几秒钟
13:50
Got it?
342
830260
2000
画好了吗
13:52
Okay. Great.
343
832260
2000
13:54
Here's my solution.
344
834260
2000
这是我的答案
13:56
You could use the picture of a bee followed by a picture of a leaf --
345
836260
2000
你可以用一片树叶(leaf)跟了一只蜜蜂(bee)
13:58
and that's "belief," right.
346
838260
2000
那就是"相信“(发音跟bee加leaf接近)对吗?
14:00
There could be other solutions.
347
840260
2000
也可能有其他的答案
14:02
In the case of the Indus script,
348
842260
2000
对印度河字符来说
14:04
the problem is the reverse.
349
844260
2000
我们的面临着反向的挑战
14:06
You have to figure out the sounds of each of these pictures
350
846260
3000
你得找到每幅图的发声
14:09
such that the entire sequence makes sense.
351
849260
2000
整个排列秩序意义通顺
14:11
So this is just like a crossword puzzle,
352
851260
3000
所以这就跟填字游戏一样
14:14
except that this is the mother of all crossword puzzles
353
854260
3000
只不过这是所有填字游戏的鼻祖
14:17
because the stakes are so high if you solve it.
354
857260
4000
因为破解它的价值太可观了
14:21
My colleagues, Iravatham Mahadevan and Asko Parpola,
355
861260
3000
我的同事, Iravatham Mahadevan 和 Asko Parpola
14:24
have been making some headway on this particular problem.
356
864260
2000
已经在这个特殊的问题上取得了一定的进展
14:26
And I'd like to give you a quick example of Parpola's work.
357
866260
2000
我给你你们看一个Parpola研究的例子
14:28
Here's a really short text.
358
868260
2000
这是一段很短的文字
14:30
It contains seven vertical strokes followed by this fish-like sign.
359
870260
3000
它里面有7个竖笔,跟了一个象鱼的形状
14:33
And I want to mention that these seals were used
360
873260
2000
我得提一下这些章用于
14:35
for stamping clay tags
361
875260
2000
印在黏土标签上
14:37
that were attached to bundles of goods,
362
877260
2000
然后附在一捆捆的货物上
14:39
so it's quite likely that these tags, at least some of them,
363
879260
3000
所以这些标签,至少其中的一些
14:42
contain names of merchants.
364
882260
2000
有商品的名字
14:44
And it turns out that in India
365
884260
2000
而在印度
14:46
there's a long tradition
366
886260
2000
这是一个久远的传统
14:48
of names being based on horoscopes
367
888260
2000
名字是根据星座
14:50
and star constellations present at the time of birth.
368
890260
3000
和生日的时候展示的星宿来定的
14:53
In Dravidian languages,
369
893260
2000
在达罗毗荼语系中
14:55
the word for fish is "meen"
370
895260
2000
鱼的发音是“meen”
14:57
which happens to sound just like the word for star.
371
897260
3000
而这个字的发音跟星星刚好相同
15:00
And so seven stars
372
900260
2000
所以七颗星星
15:02
would stand for "elu meen,"
373
902260
2000
代表“elu meen”
15:04
which is the Dravidian word
374
904260
2000
就是达罗毗荼语的
15:06
for the Big Dipper star constellation.
375
906260
2000
北斗七星
15:08
Similarly, there's another sequence of six stars,
376
908260
3000
同样地,这里有六颗星
15:11
and that translates to "aru meen,"
377
911260
2000
可以听作“aru meen”
15:13
which is the old Dravidian name
378
913260
2000
是一个达罗毗荼语的旧名词
15:15
for the star constellation Pleiades.
379
915260
2000
代表昴宿星团
15:17
And finally, there's other combinations,
380
917260
3000
最后, 还有其他的组合
15:20
such as this fish sign with something that looks like a roof on top of it.
381
920260
3000
比如鱼的符号上有个顶
15:23
And that could be translated into "mey meen,"
382
923260
3000
可以说成“MEY meen”
15:26
which is the old Dravidian name for the planet Saturn.
383
926260
3000
在旧达罗毗荼语中指土星
15:29
So that was pretty exciting.
384
929260
2000
让人感到振奋的是
15:31
It looks like we're getting somewhere.
385
931260
2000
我们似乎有所进展
15:33
But does this prove
386
933260
2000
但这样是否就能证明
15:35
that these seals contain Dravidian names
387
935260
2000
这些印章上达罗毗荼语的名字
15:37
based on planets and star constellations?
388
937260
2000
是基于星球和星座呢?
15:39
Well not yet.
389
939260
2000
还不能
15:41
So we have no way of validating
390
941260
2000
我们没有办法验证
15:43
these particular readings,
391
943260
2000
这些解读方式是正确的
15:45
but if more and more of these readings start making sense,
392
945260
3000
但是如果越来越多的解读开始有道理
15:48
and if longer and longer sequences
393
948260
2000
而越来越长的排序
15:50
appear to be correct,
394
950260
2000
显示出正确性
15:52
then we know that we are on the right track.
395
952260
2000
那我们就知道我们走上了正道
15:54
Today,
396
954260
2000
今天
15:56
we can write a word such as TED
397
956260
3000
我们可以用埃及象形文字以及楔形文字
15:59
in Egyptian hieroglyphics and in cuneiform script,
398
959260
3000
写下的TED这样的名字
16:02
because both of these were deciphered
399
962260
2000
因为人们在十九世纪
16:04
in the 19th century.
400
964260
2000
就破解了这两种文字符号
16:06
The decipherment of these two scripts
401
966260
2000
这两种文字的破解
16:08
enabled these civilizations to speak to us again directly.
402
968260
3000
使得这些古文明能与我们再次直接的交流
16:11
The Mayans
403
971260
2000
玛雅文明
16:13
started speaking to us in the 20th century,
404
973260
2000
也在20世纪开始与我们对话
16:15
but the Indus civilization remains silent.
405
975260
3000
而印度河流域的文明至今保持沉默
16:18
Why should we care?
406
978260
2000
我们为什么在乎呢?
16:20
The Indus civilization does not belong
407
980260
2000
印度河流域的文化不仅仅属于
16:22
to just the South Indians or the North Indians
408
982260
2000
南印度或者是北印度
16:24
or the Pakistanis;
409
984260
2000
或者是巴基斯坦
16:26
it belongs to all of us.
410
986260
2000
它是属于我们大家的
16:28
These are our ancestors --
411
988260
2000
他们是我们的祖先
16:30
yours and mine.
412
990260
2000
你们的也是我们的
16:32
They were silenced
413
992260
2000
因为在历史发展中不幸
16:34
by an unfortunate accident of history.
414
994260
2000
偶然地沉寂了
16:36
If we decipher the script,
415
996260
2000
如果我们破解了这些字符
16:38
we would enable them to speak to us again.
416
998260
2000
我们可以让他们再一次和我们交流
16:40
What would they tell us?
417
1000260
3000
他们会告诉我们什么呢?
16:43
What would we find out about them? About us?
418
1003260
3000
我们会在他们身上和我们自己身上发现什么呢?
16:46
I can't wait to find out.
419
1006260
3000
我迫不及待地想要找到答案
16:49
Thank you.
420
1009260
2000
谢谢
16:51
(Applause)
421
1011260
4000
(掌声)
关于本网站

这个网站将向你介绍对学习英语有用的YouTube视频。你将看到来自世界各地的一流教师教授的英语课程。双击每个视频页面上显示的英文字幕,即可从那里播放视频。字幕会随着视频的播放而同步滚动。如果你有任何意见或要求,请使用此联系表与我们联系。

https://forms.gle/WvT1wiN1qDtmnspy7