The race to sequence the human genome - Tien Nguyen

598,701 views ・ 2015-10-12

TED-Ed


请双击下面的英文字幕来播放视频。

翻译人员: Aiting Ye 校对人员: Yolanda Zhang
00:06
Packed inside every cell in your body is a set of genetic instructions,
0
6741
4561
我们身体每一个细胞里都有一组
00:11
3.2 billion base pairs long.
1
11302
2930
长达32亿组碱基对的遗传指令。
00:14
Deciphering these directions would be a monumental task
2
14232
3114
要解读这些指令是一项无比艰巨的任务,
00:17
but could offer unprecedented insight about the human body.
3
17346
4664
但对我们了解自身有着深远的意义。
00:22
In 1990, a consortium of 20 international research centers
4
22010
3852
1990年,由20个国际研究中心组成的
00:25
embarked on the world's largest biological collaboration
5
25862
3632
合作团队开始着手完成
00:29
to accomplish this mission.
6
29494
2188
这项全世界最浩大的生物工程。
00:31
The Human Genome Project proposed to sequence the entire human genome
7
31682
3496
人类基因组项目预测这项基因测序工程
00:35
over 15 years with $3 billion of public funds.
8
35178
5039
需要长达15年的时间,耗资30亿美元。
00:40
Then, seven years before its scheduled completion,
9
40217
2541
然而,在项目预计完成的7年前,
00:42
a private company called Celera announced that they could accomplish the same goal
10
42758
4538
一个叫做Celera的私人企业宣布
00:47
in just three years and at a fraction of the cost.
11
47296
4495
他们可以用更少的资金, 在三年内就完成这一项目。
00:51
The two camps discussed a joint venture, but talks quickly fell apart
12
51791
3686
这两个团队曾试图展开合作, 但是谈判最终因为
00:55
as disagreements arose over legal and ethical issues of genetic property.
13
55477
5048
对研究结果 在法律和伦理上的分歧而失败。
01:00
And so the race began.
14
60525
2699
于是他们之间的竞争开始了。
01:03
Though both teams used the same technology to sequence the entire human genome,
15
63224
3978
尽管两个团队在基因测序方面 采用了同样的技术手段,
01:07
it was their strategies that made all the difference.
16
67202
3147
他们测序的策略却截然不同。
01:10
Their paths diverged in the most critical of steps:
17
70349
3585
区别就在于以下关键几步:
01:13
the first one.
18
73934
1778
首先,
01:15
In the Human Genome Project's approach,
19
75712
2052
人类基因组计划的方案是
01:17
the genome was first divided into smaller, more manageable chunks
20
77764
4601
把整个基因图谱分为更小, 更易操作的片段,
01:22
about 150,000 base pairs long
21
82365
3306
每个片段都由15万个碱基对组成,
01:25
that overlapped each other a little bit on both ends.
22
85671
3081
相邻片段首尾均存在小部分重叠。
01:28
Each of these fragments of DNA
23
88752
1513
每个DNA片段都被注入到
01:30
was inserted inside a bacterial artificial chromosome
24
90265
3208
人工培育的细菌染色体中,
01:33
where they were cloned and fingerprinted.
25
93473
2669
并被复制,从而获得指纹谱图。
01:36
The fingerprints showed scientists where the fragments overlapped
26
96142
3408
指纹图谱可以向科学家们展现
01:39
without knowing the actual sequence.
27
99550
2124
那些未知序列中的重叠部分。
01:41
Using the overlapping bits as a guide,
28
101674
2117
利用这些重叠的小片段作为线索,
01:43
the researchers marked each fragment's place in the genome
29
103791
3049
研究者们在染色体中对各个片段做记号,
01:46
to create a contiguous map,
30
106840
2509
以获得一幅延续性的图谱,
01:49
a process that took about six years.
31
109349
2609
这个过程持续了六年之久。
01:51
The cloned fragments were sequenced in labs around the world
32
111958
3683
全世界所有对这些基因片段 进行测序的实验室
01:55
following one of the project's two major principles:
33
115641
3466
都遵循着以下两项准则:
01:59
that collaboration on our shared heritage was open to all nations.
34
119107
5098
研究成果属于全人类,并且对世界各国公开。
02:04
In each case, the fragments were arbitrarily broken up
35
124205
2679
所有实验中,基因片段都被任意分割为
02:06
into small, overlapping pieces about 1,000 base pairs long.
36
126884
4752
更小的,有重叠部分的1000个碱基对。
02:11
Then, using a technology called the Sanger method,
37
131636
2669
随后,他们运用“桑格测序法” (注:双脱氧链终止法)
02:14
each piece was sequenced letter by letter.
38
134305
2668
将每个片段内的碱基进行逐一测序 (即A,T,C,G)。
02:16
This rigorous map-based approach called hierarchical shotgun sequencing
39
136973
5357
这一严格的图谱测序法被称作 “分级散弹枪测序法”,
02:22
minimized the risk of misassembly,
40
142330
2633
可以将错误组合风险降至最低,
02:24
a huge hazard of sequencing genomes with many repetitive portions,
41
144963
4009
这些重复组合的基因有着巨大风险,
02:28
like the human genome.
42
148972
1749
例如人类基因组。
02:30
The consortium's "better safe than sorry" approach
43
150721
2397
人类基因组计划的这项 “宁稳妥,勿遗憾”的原则,
02:33
contrasted starkly with Celera's strategy called whole genome shotgun sequencing.
44
153118
5750
与Celera公司的“全基因组散弹枪测序法” 形成鲜明对比。
02:38
It hinged on skipping the mapping phase entirely,
45
158868
3655
因为Celera公司完全跳过图谱阶段,
02:42
a faster, though foolhardy, approach according to some.
46
162523
3532
在有些人看来这是一个有勇无谋的策略。
02:46
The entire genome was directly chopped up
47
166055
2699
他们将整个基因组直接切成
02:48
into a giant heap of small, overlapping bits.
48
168754
3793
许多小而重叠的片段。
02:52
Once these bits were sequenced via the Sanger method,
49
172547
2881
一旦这些小片段完成“桑格测序”,
02:55
Celera would take the formidable risk of reconstructing the genome
50
175428
4079
Celera公司会采取风险极高的方法,
02:59
using just the overlaps.
51
179507
2733
也就是用那些重叠部分来直接重组基因。
03:02
But perhaps their decision wasn't such a gamble
52
182240
2307
可或许他们的策略并非是一场豪赌,
03:04
because guess whose freshly completed map was available online for free?
53
184547
5725
因为猜猜看是谁首先完成 可以在网上免费获得的图谱呢?
03:10
The Human Genome Consortium,
54
190272
1617
人类基因组计划
03:11
in accordance with the project's second major principle
55
191889
2648
研究中的第二项准则是,
03:14
which held that all of the project's data
56
194537
2123
要将研究中所收集的资料
03:16
would be shared publicly within 24 hours of collection.
57
196660
4888
在24小时之内公布于众。
03:21
So in 1998, scientists around the world
58
201548
2542
因此1998年,世界各国的科学家
03:24
were furiously sequencing lines of genetic code
59
204090
3179
运用实践证明过的“桑格测序法”
03:27
using the tried and true, yet laborious, Sanger method.
60
207269
4078
对各种遗传基因展开“疯狂”的测序。
03:31
Finally, after three exhausting years of continuous sequencing and assembling,
61
211347
4338
最终,经过3年艰苦漫长的 测序和重组,
03:35
the verdict was in.
62
215685
1675
比赛有结果了。
03:37
In February 2001, both groups simultaneously published
63
217360
3724
2001年二月的时候,双方同时发布了
03:41
working drafts of more than 90% of the human genome,
64
221084
4370
超过90%的人类基因组草图,
03:45
several years ahead of the consortium's schedule.
65
225454
2792
都比原先预测的进度早了好几年。
03:48
The race ended in a tie.
66
228246
2433
比赛打平了。
03:50
The Human Genome Project's practice of immediately sharing its data
67
230679
4145
人类基因组计划这种及时分享数据的
03:54
was an unusual one.
68
234824
1802
做法并不常见。
03:56
It is more typical for scientists to closely guard their data
69
236626
3049
科学家们更倾向于在他们可以
03:59
until they are able to analyze it and publish their conclusions.
70
239675
3812
分析并且发布结果的时候再公布研究数据。
04:03
Instead, the Human Genome Project accelerated the pace of research
71
243487
4072
然而,人类基因组计划的这种做法 加速了研究过程,
04:07
and created an international collaboration on an unprecedented scale.
72
247559
5147
并且促成了研究领域一项 空前的国际合作。
04:12
Since then, robust investment in both the public and private sector
73
252706
3822
自此,在公共和私人领域的 研究得到深入开发,
04:16
has led to the identification of many disease related genes
74
256528
4305
使很多与基因相关的疾病 得以被检测出来,
04:20
and remarkable advances in sequencing technology.
75
260833
3848
同时测序方法也被不断完善。
04:24
Today, a person's genome can be sequenced in just a few days.
76
264681
4699
如今,一个人的全部基因测序 只需要几天就能完成。
04:29
However, reading the genome is only the first step.
77
269380
3239
但是,能够解读基因只是第一步而已。
04:32
We're a long way away from understanding what most of our genes do
78
272619
3889
要了解大多数基因的功能 以及它们是如何被控制的,
04:36
and how they are controlled.
79
276508
1826
我们还有很漫长的路要走。
04:38
Those are some of the challenges
80
278334
1576
这些工作将要交给我们下一代
04:39
for the next generation of ambitious research initiatives.
81
279910
3314
充满进取心的研究者来完成了。
关于本网站

这个网站将向你介绍对学习英语有用的YouTube视频。你将看到来自世界各地的一流教师教授的英语课程。双击每个视频页面上显示的英文字幕,即可从那里播放视频。字幕会随着视频的播放而同步滚动。如果你有任何意见或要求,请使用此联系表与我们联系。

https://forms.gle/WvT1wiN1qDtmnspy7