Rajesh Rao: Computing a Rosetta Stone for the Indus script

1,218,633 views ・ 2011-06-28

TED


请双击下面的英文字幕来播放视频。

翻译人员: Jenny Yang 校对人员: Lan Xu
00:15
I'd like to begin with a thought experiment.
0
15260
3000
首先,我们来做个思考的实验
00:19
Imagine that it's 4,000 years into the future.
1
19260
3000
想像距今4000年后
00:22
Civilization as we know it
2
22260
2000
我们所了解的文明
00:24
has ceased to exist --
3
24260
2000
不复存在
00:26
no books,
4
26260
2000
没有书籍
00:28
no electronic devices,
5
28260
3000
没有电子设备
00:31
no Facebook or Twitter.
6
31260
3000
没有脸书没有推特
00:34
All knowledge of the English language and the English alphabet
7
34260
3000
英语和英语字母
00:37
has been lost.
8
37260
2000
也都消失殆尽
00:39
Now imagine archeologists
9
39260
2000
现在再来想想考古学家们
00:41
digging through the rubble of one of our cities.
10
41260
2000
在我们城市残砖碎瓦中挖掘
00:43
What might they find?
11
43260
2000
他们会发现什么呢?
00:45
Well perhaps some rectangular pieces of plastic
12
45260
3000
可能会发现长方形的塑胶
00:48
with strange symbols on them.
13
48260
3000
上面印着奇怪的符号
00:51
Perhaps some circular pieces of metal.
14
51260
3000
或者发现一些圆形的金属碎片
00:54
Maybe some cylindrical containers
15
54260
2000
也可能是些圆柱型的容器
00:56
with some symbols on them.
16
56260
2000
上头有些符号
00:58
And perhaps one archeologist becomes an instant celebrity
17
58260
3000
还可能有某个考古学家在
01:01
when she discovers --
18
61260
2000
北美的某个地方
01:03
buried in the hills somewhere in North America --
19
63260
2000
发现了大量相似的符号后
01:05
massive versions of these same symbols.
20
65260
3000
一夜成名
01:10
Now let's ask ourselves,
21
70260
2000
让我们来问问我们自己
01:12
what could such artifacts say about us
22
72260
3000
我们的这些残存给4000年后
01:15
to people 4,000 years into the future?
23
75260
3000
的人们展示了一个怎样的我们
01:18
This is no hypothetical question.
24
78260
2000
这不是假设
01:20
In fact, this is exactly the kind of question we're faced with
25
80260
3000
事实上,这正是我们面对
01:23
when we try to understand the Indus Valley civilization,
26
83260
3000
4000年前存在的印度河流域文明时
01:26
which existed 4,000 years ago.
27
86260
2000
面对的问题
01:28
The Indus civilization was roughly contemporaneous
28
88260
3000
古印度河流域文明和较为人所熟知的
01:31
with the much better known Egyptian and the Mesopotamian civilizations,
29
91260
3000
埃及以及美所不达米亚文化大概处于同时期
01:34
but it was actually much larger than either of these two civilizations.
30
94260
3000
但其实它的范围超过这两个文明中的任何一个
01:37
It occupied the area
31
97260
2000
它涵盖了
01:39
of approximately one million square kilometers,
32
99260
2000
大约一百万平方公里
01:41
covering what is now Pakistan,
33
101260
2000
包括今天的巴基斯坦
01:43
Northwestern India
34
103260
2000
印度的西北部
01:45
and parts of Afghanistan and Iran.
35
105260
2000
部分阿富汗和伊朗
01:47
Given that it was such a vast civilization,
36
107260
2000
如此广大的文明
01:49
you might expect to find really powerful rulers, kings,
37
109260
4000
你可能会期待有强大的统治者,国王
01:53
and huge monuments glorifying these powerful kings.
38
113260
3000
以及用来纪念王者的巨大纪念碑
01:56
In fact,
39
116260
2000
事实上
01:58
what archeologists have found is none of that.
40
118260
2000
考古学者没有发现那些
02:00
They've found small objects such as these.
41
120260
3000
他们发现了这样一些小物件
02:03
Here's an example of one of these objects.
42
123260
3000
这就是其中的一样
02:06
Well obviously this is a replica.
43
126260
2000
当然这只是个复制品
02:08
But who is this person?
44
128260
3000
但这是谁呢?
02:11
A king? A god?
45
131260
2000
是一个国王?一个神?
02:13
A priest?
46
133260
2000
还是某个祭祀?
02:15
Or perhaps an ordinary person
47
135260
2000
或者是一个普通人
02:17
like you or me?
48
137260
2000
如你和我?
02:19
We don't know.
49
139260
2000
我们不知道
02:21
But the Indus people also left behind artifacts with writing on them.
50
141260
3000
但是古印度流域的人们也留下了背面写有字符的手工制品
02:24
Well no, not pieces of plastic,
51
144260
2000
不是在塑料片上
02:26
but stone seals, copper tablets,
52
146260
3000
而是在石头图章,铜片上
02:29
pottery and, surprisingly,
53
149260
2000
陶器,还有令人惊讶的
02:31
one large sign board,
54
151260
2000
在很大的符号石板上
02:33
which was found buried near the gate of a city.
55
153260
2000
掩埋在城门附近的
02:35
Now we don't know if it says Hollywood,
56
155260
2000
所以我们不知道上面是否写了好莱坞
02:37
or even Bollywood for that matter.
57
157260
2000
甚至是宝莱坞
02:39
In fact, we don't even know
58
159260
2000
事实上,我们不知道
02:41
what any of these objects say,
59
161260
2000
上面写的任何东西
02:43
and that's because the Indus script is undeciphered.
60
163260
3000
因为我们还不能破译这些印度河文字
02:46
We don't know what any of these symbols mean.
61
166260
2000
我们不知道这些符号是什么意思
02:48
The symbols are most commonly found on seals.
62
168260
3000
这些符号大多数出现在印章上
02:51
So you see up there one such object.
63
171260
2000
你们可以看到这个物品
02:53
It's the square object with the unicorn-like animal on it.
64
173260
3000
这个方形的物品上有个像是独角兽的动物
02:56
Now that's a magnificent piece of art.
65
176260
2000
这是一件很精致的艺术品
02:58
So how big do you think that is?
66
178260
2000
你觉得这块有多大呢?
03:00
Perhaps that big?
67
180260
2000
这么大?
03:02
Or maybe that big?
68
182260
2000
也许这么大?
03:04
Well let me show you.
69
184260
2000
让我来给你看看
03:07
Here's a replica of one such seal.
70
187260
3000
这是这枚印章的复制品
03:10
It's only about one inch by one inch in size --
71
190260
2000
只有一平方英吋大
03:12
pretty tiny.
72
192260
2000
很小
03:14
So what were these used for?
73
194260
2000
那这是干什么用的呢?
03:16
We know that these were used for stamping clay tags
74
196260
3000
我们知道这是用来盖在黏土标签上的
03:19
that were attached to bundles of goods that were sent from one place to the other.
75
199260
3000
一捆捆的货物被附上标签,从一地被运往另一地
03:22
So you know those packing slips you get on your FedEx boxes?
76
202260
3000
你知道联邦快递盒子里的那种装箱单吧?
03:25
These were used to make those kinds of packing slips.
77
205260
3000
这些就是用来做那些送货单的
03:28
You might wonder what these objects contain
78
208260
3000
你也许会想知道这些东西
03:31
in terms of their text.
79
211260
2000
包涵了什么文字含义
03:33
Perhaps they're the name of the sender
80
213260
2000
也许是寄件人的名字
03:35
or some information about the goods
81
215260
2000
或者是某些货物的信息
03:37
that are being sent from one place to the other -- we don't know.
82
217260
3000
从一地被送往别处-我们不知道
03:40
We need to decipher the script to answer that question.
83
220260
2000
我们需要破解这段文字才能回答那个问题
03:42
Deciphering the script
84
222260
2000
破解这段文字
03:44
is not just an intellectual puzzle;
85
224260
2000
不只是破解一个智力难题
03:46
it's actually become a question
86
226260
2000
事实上这个问题
03:48
that's become deeply intertwined
87
228260
2000
深深地和南亚的政治
03:50
with the politics and the cultural history of South Asia.
88
230260
3000
文化历史交织在一起
03:53
In fact, the script has become a battleground of sorts
89
233260
3000
而对这些文字的争论已经形成了一个战场
03:56
between three different groups of people.
90
236260
2000
三群持不同见解团体各持己见
03:58
First, there's a group of people
91
238260
2000
第一群人
04:00
who are very passionate in their belief
92
240260
2000
热切地坚信
04:02
that the Indus script
93
242260
2000
这些印度河流域的字符
04:04
does not represent a language at all.
94
244260
2000
完全不代表语言
04:06
These people believe that the symbols
95
246260
2000
这些人认为这些符号
04:08
are very similar to the kind of symbols you find on traffic signs
96
248260
3000
和交通标示的符号相似
04:11
or the emblems you find on shields.
97
251260
3000
也像盾上面的一些象征符号
04:14
There's a second group of people
98
254260
2000
第二群人
04:16
who believe that the Indus script represents an Indo-European language.
99
256260
3000
相信印度河文字代表着某个印欧语言
04:19
If you look at a map of India today,
100
259260
2000
如果你看看印度今日的版图
04:21
you'll see that most of the languages spoken in North India
101
261260
3000
很多在印度北部使用的语言
04:24
belong to the Indo-European language family.
102
264260
3000
都是属于印欧语系
04:27
So some people believe that the Indus script
103
267260
2000
所以一些人相信古印度河文字
04:29
represents an ancient Indo-European language such as Sanskrit.
104
269260
3000
是某种古老的印欧语,就像梵语一样
04:32
There's a last group of people
105
272260
2000
最后一群人
04:34
who believe that the Indus people
106
274260
3000
相信古印度人
04:37
were the ancestors of people living in South India today.
107
277260
3000
是现居南印度的人的祖先
04:40
These people believe that the Indus script
108
280260
2000
这些人觉得古印度文字
04:42
represents an ancient form
109
282260
2000
是达罗毗荼语系下的
04:44
of the Dravidian language family,
110
284260
2000
一种古老的文字形式
04:46
which is the language family spoken in much of South India today.
111
286260
3000
现今南印度多数人皆使用达罗毗荼语
04:49
And the proponents of this theory
112
289260
2000
支持这一理论的人
04:51
point to that small pocket of Dravidian-speaking people in the North,
113
291260
3000
指出印度北部有一小部份使用达罗毗荼语的人
04:54
actually near Afghanistan,
114
294260
2000
事实上靠近阿富汗
04:56
and they say that perhaps, sometime in the past,
115
296260
3000
他们说或许,过去曾经有一段时间
04:59
Dravidian languages were spoken all over India
116
299260
3000
整个印度都使用达罗毗荼语
05:02
and that this suggests
117
302260
2000
这也意味着
05:04
that the Indus civilization is perhaps also Dravidian.
118
304260
3000
印度河文明或许就是达罗毗荼文明
05:07
Which of these hypotheses can be true?
119
307260
3000
这些假设哪个是真的呢?
05:10
We don't know, but perhaps if you deciphered the script,
120
310260
2000
我们不知道, 但如果你能解开这些文字符号
05:12
you would be able to answer this question.
121
312260
2000
你就能回答这个问题
05:14
But deciphering the script is a very challenging task.
122
314260
2000
但是破解文字密码是一项很具有挑战性的任务
05:16
First, there's no Rosetta Stone.
123
316260
2000
首先, 我们没有罗塞塔石碑
05:18
I don't mean the software;
124
318260
2000
我不是指那软件
05:20
I mean an ancient artifact
125
320260
2000
我是指那件文物
05:22
that contains in the same text
126
322260
2000
上面用我们知道和不知道的文字
05:24
both a known text and an unknown text.
127
324260
3000
书写了同样的内容。
05:27
We don't have such an artifact for the Indus script.
128
327260
3000
印度河文化中并没有发现同样的古物
05:30
And furthermore, we don't even know what language they spoke.
129
330260
3000
而且,我们不知道他们说的是什么语言
05:33
And to make matters even worse,
130
333260
2000
更糟的是
05:35
most of the text that we have are extremely short.
131
335260
2000
大部份的文字都相当短
05:37
So as I showed you, they're usually found on these seals
132
337260
2000
就像我刚才给大家看的,最常发现的是在印章类的东西上
05:39
that are very, very tiny.
133
339260
2000
他们都非常非常地小
05:41
And so given these formidable obstacles,
134
341260
2000
给我们解读带来了难以克服的障碍
05:43
one might wonder and worry
135
343260
2000
有些人或许会担心
05:45
whether one will ever be able to decipher the Indus script.
136
345260
3000
是否真的有谁可以解开古印度河文字之谜呢?
05:48
In the rest of my talk,
137
348260
2000
在我接下来的演讲中
05:50
I'd like to tell you about how I learned to stop worrying
138
350260
2000
我要来告诉各位我怎么学会停止担心
05:52
and love the challenge posed by the Indus script.
139
352260
2000
并喜欢上解读古印度河文字所带来的挑战
05:54
I've always been fascinated by the Indus script
140
354260
3000
我对古印度河文字的痴迷
05:57
ever since I read about it in a middle school textbook.
141
357260
2000
从在中学课本中读到便开始
05:59
And why was I fascinated?
142
359260
2000
我为什么会着迷呢?
06:01
Well it's the last major undeciphered script in the ancient world.
143
361260
4000
因为这是历史上目前唯一未被破解的重要的古文字
06:05
My career path led me to become a computational neuroscientist,
144
365260
3000
我的职业生涯让我成为一个计算机神经科学家
06:08
so in my day job,
145
368260
2000
我白天的工作
06:10
I create computer models of the brain
146
370260
2000
是在电脑上模拟大脑
06:12
to try to understand how the brain makes predictions,
147
372260
3000
试图了借大脑是怎样预知事物的
06:15
how the brain makes decisions,
148
375260
2000
大脑是怎样做决定的,
06:17
how the brain learns and so on.
149
377260
2000
大脑是怎样学习的等等
06:19
But in 2007, my path crossed again with the Indus script.
150
379260
3000
但在2007年,我与古印度河文字重逢
06:22
That's when I was in India,
151
382260
2000
当时在印度
06:24
and I had the wonderful opportunity
152
384260
2000
我有幸得到机会
06:26
to meet with some Indian scientists
153
386260
2000
跟一些印度科学家会面
06:28
who were using computer models to try to analyze the script.
154
388260
3000
他们在用计算机模型分析这些文字
06:31
And so it was then that I realized
155
391260
2000
那个时候我认识到
06:33
there was an opportunity for me to collaborate with these scientists,
156
393260
3000
这是我跟这些科学家合作的一个机会
06:36
and so I jumped at that opportunity.
157
396260
2000
所以我立刻就抓住了这个机会
06:38
And I'd like to describe some of the results that we have found.
158
398260
2000
我想在这里介绍我们发现的一些结果
06:40
Or better yet, let's all collectively decipher.
159
400260
3000
或者更好的是,让我们一起来破解吧
06:43
Are you ready?
160
403260
2000
你们准备好了吗?
06:45
The first thing that you need to do when you have an undeciphered script
161
405260
3000
当你想要破解文字符号时,首先你要
06:48
is try to figure out the direction of writing.
162
408260
2000
分辨书写的方向
06:50
Here are two texts that contain some symbols on them.
163
410260
3000
这里有两段相同符号的文字
06:53
Can you tell me
164
413260
2000
你能告诉我
06:55
if the direction of writing is right to left or left to right?
165
415260
3000
书写的方向是从右到左还是从左到右?
06:58
I'll give you a couple of seconds.
166
418260
3000
我给大家几秒钟想一想
07:01
Okay. Right to left, how many? Okay.
167
421260
3000
好, 从右到左,有多少人同意?好
07:04
Okay. Left to right?
168
424260
2000
从左到右呢?
07:06
Oh, it's almost 50/50. Okay.
169
426260
2000
噢,大约是一半一半
07:08
The answer is:
170
428260
2000
答案是:
07:10
if you look at the left-hand side of the two texts,
171
430260
2000
如果你看这两段文字的左边
07:12
you'll notice that there's a cramping of signs,
172
432260
3000
你会注意到左边的符号挤在一起
07:15
and it seems like 4,000 years ago,
173
435260
2000
好像是4000年前
07:17
when the scribe was writing from right to left,
174
437260
2000
当书写者从右写到左时
07:19
they ran out of space.
175
439260
2000
因为最后没地方了写了
07:21
And so they had to cram the sign.
176
441260
2000
只能缩小间距
07:23
One of the signs is also below the text on the top.
177
443260
2000
其中有一个符号也出现在下面的文字上面
07:25
This suggests the direction of writing
178
445260
2000
意味着书写的方向
07:27
was probably from right to left,
179
447260
2000
是从右到左的
07:29
and so that's one of the first things we know,
180
449260
2000
所以我们知道的第一件事是
07:31
that directionality is a very key aspect of linguistic scripts.
181
451260
3000
书写方向是语言符号中一个很关键的方面
07:34
And the Indus script now has
182
454260
2000
因此古印度河文字就有了这个
07:36
this particular property.
183
456260
2000
特别的特征
07:38
What other properties of language does the script show?
184
458260
2000
这文字符号中还有什么其他语言特征?
07:40
Languages contain patterns.
185
460260
2000
语言会有一定模式
07:42
If I give you the letter Q
186
462260
2000
如果我给你一个字母Q
07:44
and ask you to predict the next letter, what do you think that would be?
187
464260
3000
然后请你猜下一个字母,你们觉得会是什么呢?
07:47
Most of you said U, which is right.
188
467260
2000
大部份人都会说U,那是对的
07:49
Now if I asked you to predict one more letter,
189
469260
2000
现在我请你们再猜下个字母
07:51
what do you think that would be?
190
471260
2000
你们觉得会是什么呢?
07:53
Now there's several thoughts. There's E. It could be I. It could be A,
191
473260
3000
现在这边有很多种答案,有人说E,可能是I, 也可能是一个A,
07:56
but certainly not B, C or D, right?
192
476260
3000
但是肯定不是B,C,或者D,对吗?
07:59
The Indus script also exhibits similar kinds of patterns.
193
479260
3000
印度河文字也有类似的模式
08:02
There's a lot of text that start with this diamond-shaped symbol.
194
482260
3000
有很多段文字的开头都有菱形符号
08:05
And this in turn tends to be followed
195
485260
2000
接着常常是跟着一个
08:07
by this quotation marks-like symbol.
196
487260
2000
有点象引号那样的符号
08:09
And this is very similar to a Q and U example.
197
489260
2000
这就跟Q跟U的例子很相似
08:11
This symbol can in turn be followed
198
491260
2000
这符号之后
08:13
by these fish-like symbols and some other signs,
199
493260
3000
会有着类似鱼等其他的符号
08:16
but never by these other signs at the bottom.
200
496260
2000
但从来不会是最底下出现的这些符号
08:18
And furthermore, there's some signs
201
498260
2000
进一步来说,有些符号
08:20
that really prefer the end of texts,
202
500260
2000
常常出现在文章结尾
08:22
such as this jar-shaped sign,
203
502260
2000
像是这壶型的记号
08:24
and this sign, in fact, happens to be
204
504260
2000
事实上,这记号刚好是
08:26
the most frequently occurring sign in the script.
205
506260
2000
在这些文句中最常出现的记号
08:28
Given such patterns, here was our idea.
206
508260
3000
看出这些惯用模式后,我们有个想法
08:31
The idea was to use a computer
207
511260
2000
就是用电脑
08:33
to learn these patterns,
208
513260
2000
去了解这些图案
08:35
and so we gave the computer the existing texts.
209
515260
3000
我们将这些现存的文字符号输入电脑
08:38
And the computer learned a statistical model
210
518260
2000
然后电脑以某种统计模型
08:40
of which symbols tend to occur together
211
520260
2000
算出哪些符号会一起出现
08:42
and which symbols tend to follow each other.
212
522260
2000
而哪些符号常以先后顺序出现
08:44
Given the computer model,
213
524260
2000
使用此电脑模式
08:46
we can test the model by essentially quizzing it.
214
526260
3000
我们可以用考试的方法来测试这个模式
08:49
So we could deliberately erase some symbols,
215
529260
2000
我们可以故意去掉某些符号
08:51
and we can ask it to predict the missing symbols.
216
531260
3000
我们让计算机来预测缺少的字符
08:54
Here are some examples.
217
534260
3000
这是一些例子
09:00
You may regard this
218
540260
2000
你们可以把这个看作
09:02
as perhaps the most ancient game
219
542260
2000
最古老的
09:04
of Wheel of Fortune.
220
544260
3000
幸运大转盘游戏(一个填字游戏)
09:08
What we found
221
548260
2000
我们发现
09:10
was that the computer was successful in 75 percent of the cases
222
550260
2000
计算机预测正确字符的
09:12
in predicting the correct symbol.
223
552260
2000
成功率是75%
09:14
In the rest of the cases,
224
554260
2000
剩下的例子中
09:16
typically the second best guess or third best guess was the right answer.
225
556260
3000
正确答案不是在第二选择就是在第三选择中
09:19
There's also practical use
226
559260
2000
这种预测方法
09:21
for this particular procedure.
227
561260
2000
会派到实际的用处
09:23
There's a lot of these texts that are damaged.
228
563260
2000
很多文字符号都已经损坏
09:25
Here's an example of one such text.
229
565260
2000
这里有个例子
09:27
And we can use the computer model now to try to complete this text
230
567260
3000
我们可以使用电脑模式来修复这段文字内容
09:30
and make a best guess prediction.
231
570260
2000
并做出最好的猜测
09:32
Here's an example of a symbol that was predicted.
232
572260
3000
这就是一个预测的例子
09:35
And this could be really useful as we try to decipher the script
233
575260
2000
这非常有用, 在我们破解字符的过程中
09:37
by generating more data that we can analyze.
234
577260
3000
让我们得到了更多可供分析的数据
09:40
Now here's one other thing you can do with the computer model.
235
580260
3000
还有另一件事也是可以利用电脑模式来做的
09:43
So imagine a monkey
236
583260
2000
想象有个猴子
09:45
sitting at a keyboard.
237
585260
2000
坐在键盘上
09:47
I think you might get a random jumble of letters that looks like this.
238
587260
3000
我猜会得到一堆像这样随机跑出的字母
09:50
Such a random jumble of letters
239
590260
2000
这些随机混乱的字母
09:52
is said to have a very high entropy.
240
592260
2000
说明了很高的熵
09:54
This is a physics and information theory term.
241
594260
2000
这是物理学和信息理论的说法
09:56
But just imagine it's a really random jumble of letters.
242
596260
3000
但想像这是相当随机混乱的字母排列
09:59
How many of you have ever spilled coffee on a keyboard?
243
599260
4000
你们中多少人曾经将咖啡溅在键盘上?
10:03
You might have encountered the stuck-key problem --
244
603260
2000
你也许会碰到键盘卡住的问题
10:05
so basically the same symbol being repeated over and over again.
245
605260
3000
所以同一个字符会一再重复
10:08
This kind of a sequence is said to have a very low entropy
246
608260
3000
这类的数列可以说有着非常低的熵
10:11
because there's no variation at all.
247
611260
2000
因为它们一点都没变化
10:13
Language, on the other hand, has an intermediate level of entropy;
248
613260
3000
语言,从别个角度说,有着中间程度的熵
10:16
it's neither too rigid,
249
616260
2000
它不是很严格
10:18
nor is it too random.
250
618260
2000
也不是很随机
10:20
What about the Indus script?
251
620260
2000
那印度河字符呢?
10:22
Here's a graph that plots the entropies of a whole bunch of sequences.
252
622260
4000
这个图表显示了一大堆顺序的熵值
10:26
At the very top you find the uniformly random sequence,
253
626260
2000
你会在上面发现那些随机顺序
10:28
which is a random jumble of letters --
254
628260
2000
那是一些杂乱随机放置的字母
10:30
and interestingly, we also find
255
630260
2000
有趣的是, 我们同时发现
10:32
the DNA sequence from the human genome and instrumental music.
256
632260
3000
人类基因的DNA排序和乐器演奏的音乐
10:35
And both of these are very, very flexible,
257
635260
2000
它们都非常非常灵活
10:37
which is why you find them in the very high range.
258
637260
2000
所以他们处在高程度的范围里
10:39
At the lower end of the scale,
259
639260
2000
在低端范围里
10:41
you find a rigid sequence, a sequence of all A's,
260
641260
2000
你可以发现很严格的顺序, 比如全是字母A的排列
10:43
and you also find a computer program,
261
643260
2000
你还会发现一个计算机程序
10:45
in this case in the language Fortran,
262
645260
2000
是用FORTRAN语言写的
10:47
which obeys really strict rules.
263
647260
2000
它遵循严格的规则
10:49
Linguistic scripts
264
649260
2000
语言文字
10:51
occupy the middle range.
265
651260
2000
占中间段
10:53
Now what about the Indus script?
266
653260
2000
那印度河文字呢?
10:55
We found that the Indus script
267
655260
2000
我们发现印度河流域的文字
10:57
actually falls within the range of the linguistic scripts.
268
657260
2000
的数值正好落在语言文字段里
10:59
When this result was first published,
269
659260
2000
这个结果刚公布的时候
11:01
it was highly controversial.
270
661260
3000
引起了很大的争议
11:04
There were people who raised a hue and cry,
271
664260
3000
有人提出强烈反对
11:07
and these people were the ones who believed
272
667260
2000
这些批评的人认为
11:09
that the Indus script does not represent language.
273
669260
3000
古印度河文字根本不是文字
11:12
I even started to get some hate mail.
274
672260
2000
我甚至收到一些恶意信函
11:14
My students said
275
674260
2000
我的学生说
11:16
that I should really seriously consider getting some protection.
276
676260
3000
我应该要严肃地考虑怎样保护自身安危
11:19
Who'd have thought
277
679260
2000
谁会想到
11:21
that deciphering could be a dangerous profession?
278
681260
2000
解读古文字会变成一种危险职业
11:23
What does this result really show?
279
683260
2000
这结果说明了什么呢?
11:25
It shows that the Indus script
280
685260
2000
说明了古印度河文字
11:27
shares an important property of language.
281
687260
2000
有着重要的语言特征
11:29
So, as the old saying goes,
282
689260
2000
所以,就像老话说的那样
11:31
if it looks like a linguistic script
283
691260
2000
如果这看起来像是语言性文字
11:33
and it acts like a linguistic script,
284
693260
2000
且它也有语言性文字的功用
11:35
then perhaps we may have a linguistic script on our hands.
285
695260
3000
恐怕我们手里捧着的大概就是一种语言文字
11:38
What other evidence is there
286
698260
2000
有什么其他证据显示
11:40
that the script could actually encode language?
287
700260
2000
这类文字事实上包涵语言
11:42
Well linguistic scripts can actually encode multiple languages.
288
702260
3000
语言字符其实可以被包括在多种语言中
11:45
So for example, here's the same sentence written in English
289
705260
3000
比如,这是同样一句话写成英语
11:48
and the same sentence written in Dutch
290
708260
2000
这是这句话写成荷兰语
11:50
using the same letters of the alphabet.
291
710260
2000
用到同样的字母
11:52
If you don't know Dutch and you only know English
292
712260
3000
如果你不懂荷兰语只知道英语
11:55
and I give you some words in Dutch,
293
715260
2000
我给你看一些荷兰语单词
11:57
you'll tell me that these words contain
294
717260
2000
你会说这些词包涵了
11:59
some very unusual patterns.
295
719260
2000
一些独特的模式
12:01
Some things are not right,
296
721260
2000
有些地方不对
12:03
and you'll say these words are probably not English words.
297
723260
3000
你也会说这些不是英文单词
12:06
The same thing happens in the case of the Indus script.
298
726260
2000
这同样也发生在印度河文字中
12:08
The computer found several texts --
299
728260
2000
计算机也发现一些文字
12:10
two of them are shown here --
300
730260
2000
这里有两三个
12:12
that have very unusual patterns.
301
732260
2000
他们具有很独特的模式
12:14
So for example the first text:
302
734260
2000
比如,第一个例子
12:16
there's a doubling of this jar-shaped sign.
303
736260
3000
这个壶形字符被重复
12:19
This sign is the most frequently-occurring sign
304
739260
2000
这个符号是印度河文字里
12:21
in the Indus script,
305
741260
2000
最常见的一个记号
12:23
and it's only in this text
306
743260
2000
而只在这段文字里
12:25
that it occurs as a doubling pair.
307
745260
2000
它以成对的模式出现
12:27
Why is that the case?
308
747260
2000
为什么呢?
12:29
We went back and looked at where these particular texts were found,
309
749260
3000
我们回去找这些特殊的文字是在哪里找到的
12:32
and it turns out that they were found
310
752260
2000
我们发现它们出现在离
12:34
very, very far away from the Indus Valley.
311
754260
2000
印度河谷十分遥远的地方
12:36
They were found in present day Iraq and Iran.
312
756260
3000
它们是在现在的伊朗和伊拉克被发现的
12:39
And why were they found there?
313
759260
2000
它们怎么会出现在那里呢?
12:41
What I haven't told you is that
314
761260
2000
我还没有告诉你们
12:43
the Indus people were very, very enterprising.
315
763260
2000
印度人相当会做生意
12:45
They used to trade with people pretty far away from where they lived,
316
765260
3000
他们曾跑到离他们居住很远的地方去做交易
12:48
and so in this case, they were traveling by sea
317
768260
3000
在这个例子中,他们跨越海洋
12:51
all the way to Mesopotamia, present-day Iraq.
318
771260
3000
到了两河流域,也就是今天的伊拉克
12:54
And what seems to have happened here
319
774260
2000
看起来
12:56
is that the Indus traders, the merchants,
320
776260
3000
印度的生意人,商人
12:59
were using this script to write a foreign language.
321
779260
3000
在用这些文字记录外语
13:02
It's just like our English and Dutch example.
322
782260
2000
就好比英语和荷兰语的例子
13:04
And that would explain why we have these strange patterns
323
784260
2000
这可以解释为什么我们会发现这些奇怪的规律
13:06
that are very different from the kinds of patterns you see in the text
324
786260
3000
它们和在印度河流域发现的文字的规律
13:09
that are found within the Indus Valley.
325
789260
3000
非常不同
13:12
This suggests that the same script, the Indus script,
326
792260
2000
这说明印度河文字,同样的文字
13:14
could be used to write different languages.
327
794260
3000
可能被用来记录不同的语言
13:17
The results we have so far seem to point to the conclusion
328
797260
3000
根据我们现有的研究结果指向的结论来看
13:20
that the Indus script probably does represent language.
329
800260
3000
印度河流域的这些字符代表语言
13:23
If it does represent language,
330
803260
2000
而如果它的确代表语言
13:25
then how do we read the symbols?
331
805260
2000
那么我们怎样来解读这些符号
13:27
That's our next big challenge.
332
807260
2000
这是对我们最大的挑战
13:29
So you'll notice that many of the symbols
333
809260
2000
你注意到很多这些符号
13:31
look like pictures of humans, of insects,
334
811260
2000
看上去像人和昆虫的图画
13:33
of fishes, of birds.
335
813260
3000
或者鱼,鸟
13:36
Most ancient scripts
336
816260
2000
很多古代的字符
13:38
use the rebus principle,
337
818260
2000
都使用了画谜原则
13:40
which is, using pictures to represent words.
338
820260
3000
也就是用图画代表字词
13:43
So as an example, here's a word.
339
823260
3000
比如, 这是一个词
13:46
Can you write it using pictures?
340
826260
2000
你能用图画来书写吗?
13:48
I'll give you a couple seconds.
341
828260
2000
我给你们几秒钟
13:50
Got it?
342
830260
2000
画好了吗
13:52
Okay. Great.
343
832260
2000
13:54
Here's my solution.
344
834260
2000
这是我的答案
13:56
You could use the picture of a bee followed by a picture of a leaf --
345
836260
2000
你可以用一片树叶(leaf)跟了一只蜜蜂(bee)
13:58
and that's "belief," right.
346
838260
2000
那就是"相信“(发音跟bee加leaf接近)对吗?
14:00
There could be other solutions.
347
840260
2000
也可能有其他的答案
14:02
In the case of the Indus script,
348
842260
2000
对印度河字符来说
14:04
the problem is the reverse.
349
844260
2000
我们的面临着反向的挑战
14:06
You have to figure out the sounds of each of these pictures
350
846260
3000
你得找到每幅图的发声
14:09
such that the entire sequence makes sense.
351
849260
2000
整个排列秩序意义通顺
14:11
So this is just like a crossword puzzle,
352
851260
3000
所以这就跟填字游戏一样
14:14
except that this is the mother of all crossword puzzles
353
854260
3000
只不过这是所有填字游戏的鼻祖
14:17
because the stakes are so high if you solve it.
354
857260
4000
因为破解它的价值太可观了
14:21
My colleagues, Iravatham Mahadevan and Asko Parpola,
355
861260
3000
我的同事, Iravatham Mahadevan 和 Asko Parpola
14:24
have been making some headway on this particular problem.
356
864260
2000
已经在这个特殊的问题上取得了一定的进展
14:26
And I'd like to give you a quick example of Parpola's work.
357
866260
2000
我给你你们看一个Parpola研究的例子
14:28
Here's a really short text.
358
868260
2000
这是一段很短的文字
14:30
It contains seven vertical strokes followed by this fish-like sign.
359
870260
3000
它里面有7个竖笔,跟了一个象鱼的形状
14:33
And I want to mention that these seals were used
360
873260
2000
我得提一下这些章用于
14:35
for stamping clay tags
361
875260
2000
印在黏土标签上
14:37
that were attached to bundles of goods,
362
877260
2000
然后附在一捆捆的货物上
14:39
so it's quite likely that these tags, at least some of them,
363
879260
3000
所以这些标签,至少其中的一些
14:42
contain names of merchants.
364
882260
2000
有商品的名字
14:44
And it turns out that in India
365
884260
2000
而在印度
14:46
there's a long tradition
366
886260
2000
这是一个久远的传统
14:48
of names being based on horoscopes
367
888260
2000
名字是根据星座
14:50
and star constellations present at the time of birth.
368
890260
3000
和生日的时候展示的星宿来定的
14:53
In Dravidian languages,
369
893260
2000
在达罗毗荼语系中
14:55
the word for fish is "meen"
370
895260
2000
鱼的发音是“meen”
14:57
which happens to sound just like the word for star.
371
897260
3000
而这个字的发音跟星星刚好相同
15:00
And so seven stars
372
900260
2000
所以七颗星星
15:02
would stand for "elu meen,"
373
902260
2000
代表“elu meen”
15:04
which is the Dravidian word
374
904260
2000
就是达罗毗荼语的
15:06
for the Big Dipper star constellation.
375
906260
2000
北斗七星
15:08
Similarly, there's another sequence of six stars,
376
908260
3000
同样地,这里有六颗星
15:11
and that translates to "aru meen,"
377
911260
2000
可以听作“aru meen”
15:13
which is the old Dravidian name
378
913260
2000
是一个达罗毗荼语的旧名词
15:15
for the star constellation Pleiades.
379
915260
2000
代表昴宿星团
15:17
And finally, there's other combinations,
380
917260
3000
最后, 还有其他的组合
15:20
such as this fish sign with something that looks like a roof on top of it.
381
920260
3000
比如鱼的符号上有个顶
15:23
And that could be translated into "mey meen,"
382
923260
3000
可以说成“MEY meen”
15:26
which is the old Dravidian name for the planet Saturn.
383
926260
3000
在旧达罗毗荼语中指土星
15:29
So that was pretty exciting.
384
929260
2000
让人感到振奋的是
15:31
It looks like we're getting somewhere.
385
931260
2000
我们似乎有所进展
15:33
But does this prove
386
933260
2000
但这样是否就能证明
15:35
that these seals contain Dravidian names
387
935260
2000
这些印章上达罗毗荼语的名字
15:37
based on planets and star constellations?
388
937260
2000
是基于星球和星座呢?
15:39
Well not yet.
389
939260
2000
还不能
15:41
So we have no way of validating
390
941260
2000
我们没有办法验证
15:43
these particular readings,
391
943260
2000
这些解读方式是正确的
15:45
but if more and more of these readings start making sense,
392
945260
3000
但是如果越来越多的解读开始有道理
15:48
and if longer and longer sequences
393
948260
2000
而越来越长的排序
15:50
appear to be correct,
394
950260
2000
显示出正确性
15:52
then we know that we are on the right track.
395
952260
2000
那我们就知道我们走上了正道
15:54
Today,
396
954260
2000
今天
15:56
we can write a word such as TED
397
956260
3000
我们可以用埃及象形文字以及楔形文字
15:59
in Egyptian hieroglyphics and in cuneiform script,
398
959260
3000
写下的TED这样的名字
16:02
because both of these were deciphered
399
962260
2000
因为人们在十九世纪
16:04
in the 19th century.
400
964260
2000
就破解了这两种文字符号
16:06
The decipherment of these two scripts
401
966260
2000
这两种文字的破解
16:08
enabled these civilizations to speak to us again directly.
402
968260
3000
使得这些古文明能与我们再次直接的交流
16:11
The Mayans
403
971260
2000
玛雅文明
16:13
started speaking to us in the 20th century,
404
973260
2000
也在20世纪开始与我们对话
16:15
but the Indus civilization remains silent.
405
975260
3000
而印度河流域的文明至今保持沉默
16:18
Why should we care?
406
978260
2000
我们为什么在乎呢?
16:20
The Indus civilization does not belong
407
980260
2000
印度河流域的文化不仅仅属于
16:22
to just the South Indians or the North Indians
408
982260
2000
南印度或者是北印度
16:24
or the Pakistanis;
409
984260
2000
或者是巴基斯坦
16:26
it belongs to all of us.
410
986260
2000
它是属于我们大家的
16:28
These are our ancestors --
411
988260
2000
他们是我们的祖先
16:30
yours and mine.
412
990260
2000
你们的也是我们的
16:32
They were silenced
413
992260
2000
因为在历史发展中不幸
16:34
by an unfortunate accident of history.
414
994260
2000
偶然地沉寂了
16:36
If we decipher the script,
415
996260
2000
如果我们破解了这些字符
16:38
we would enable them to speak to us again.
416
998260
2000
我们可以让他们再一次和我们交流
16:40
What would they tell us?
417
1000260
3000
他们会告诉我们什么呢?
16:43
What would we find out about them? About us?
418
1003260
3000
我们会在他们身上和我们自己身上发现什么呢?
16:46
I can't wait to find out.
419
1006260
3000
我迫不及待地想要找到答案
16:49
Thank you.
420
1009260
2000
谢谢
16:51
(Applause)
421
1011260
4000
(掌声)
关于本网站

这个网站将向你介绍对学习英语有用的YouTube视频。你将看到来自世界各地的一流教师教授的英语课程。双击每个视频页面上显示的英文字幕,即可从那里播放视频。字幕会随着视频的播放而同步滚动。如果你有任何意见或要求,请使用此联系表与我们联系。

https://forms.gle/WvT1wiN1qDtmnspy7


This website was created in October 2020 and last updated on June 12, 2025.

It is now archived and preserved as an English learning resource.

Some information may be out of date.

隐私政策

eng.lish.video

Developer's Blog