How to sequence the human genome - Mark J. Kiel

如何对人类基因组进行测序 - 马克.J.基尔

1,501,141 views ・ 2013-12-09

TED-Ed


请双击下面的英文字幕来播放视频。

翻译人员: Tian Ye 校对人员: Yuanqing Edberg
00:06
You've probably heard of the human genome,
0
6704
2076
你大概听说过人类基因组,
00:08
the huge collection of genes
1
8780
1613
它是巨量的基因总和
00:10
inside each and every one of your cells.
2
10393
3397
在你的每一个细胞里都有一份。
00:13
You probably also know
3
13790
454
你很可能也知道
00:14
that we've sequenced the human genome,
4
14244
2583
我们已经对人类基因组进行了测序,
00:16
but what does that actually mean?
5
16827
2166
但这到底是什么意思?
00:18
How do you sequence someone's genome?
6
18993
3166
怎样对一个人的基因组进行测序呢?
00:22
Let's back up a bit.
7
22159
1636
我们先讲点背景知识。
00:23
What is a genome?
8
23795
1667
什么是基因组?
00:25
Well, a genome is all the genes plus some extra
9
25462
5058
基因组,就是全部的基因再加上点别的
00:30
that make up an organism.
10
30520
2064
共同组成了有机体。
00:32
Genes are made up of DNA,
11
32584
1998
基因由DNA组成,
00:34
and DNA is made up of long, paired strands
12
34582
3096
而DNA是成对的长链,
00:37
of A's,
13
37678
1236
由A(腺嘌呤),
00:38
T's,
14
38914
1170
T(胸腺嘧啶),
00:40
C's,
15
40084
1886
C(胞嘧啶),
00:41
and G's.
16
41970
862
和G(鸟嘌呤)组成。
00:42
Your genome is the code
17
42832
1518
你的基因组是一组代码,
00:44
that your cells use to know how to behave.
18
44350
3512
你的细胞要通过它来知道自己该做什么。
00:47
Cells interacting together make tissues.
19
47862
3453
细胞交织在一起成为组织,
00:51
Tissues cooperating with each other make organs.
20
51315
3132
不同的组织协调构成了器官,
00:54
Organs cooperating with each other
21
54447
1964
器官间协同工作
00:56
make an organism,
22
56411
1499
则形成了有机体,
00:57
you!
23
57910
1328
就是你!
00:59
So, you are who you are
24
59238
2760
所以,你之所以是你
01:01
in large part because of your genome.
25
61998
2623
很大程度上原因在于你的基因组。
01:04
The first human genome
26
64621
1427
第一个人类基因组测序
01:06
was sequenced ten years ago
27
66048
2112
完成于十年前,
01:08
and was no easy task.
28
68160
1836
非常艰难。
01:09
It took two decades to complete,
29
69996
2913
它花了20年才完成,
01:12
required the effort of hundreds of scientists
30
72909
2767
需要来自几十个国家的
01:15
across dozens of countries,
31
75676
1565
几百位科学家付出了辛劳,
01:17
and cost over three billion dollars.
32
77241
3233
而且费用超过30亿美元。
01:20
But some day very soon,
33
80474
1658
但不久的一天,
01:22
it will be possible to know the sequence of letters
34
82132
2186
搞清组成你这个人的基因组的
01:24
that make up your own personal genome
35
84318
2422
字母排列将成为可能,
01:26
all in a matter of minutes
36
86740
1753
只要几分钟,
01:28
and for less than the cost
37
88493
1328
费用还不及
01:29
of a pretty nice birthday present.
38
89821
2610
一份不错的生日礼物贵。
01:32
How is that possible?
39
92431
1892
这怎么可能呢?
01:34
Let's take a closer look.
40
94323
2000
让我来解释一下。
01:36
Knowing the sequence of the billions of letters
41
96323
2583
搞清组成你的基因组的
01:38
that make up your genome
42
98906
1476
几十亿个字母排列,
01:40
is the goal of genome sequencing.
43
100382
2526
是基因组测序的目标。
01:42
A genome is both really, really big
44
102908
3541
基因组既很大很大,
01:46
and very, very small.
45
106449
2953
又很小很小。
01:49
The individual letters of DNA,
46
109402
2190
DNA的每个字母,
01:51
the A's, T's, G's, and C's,
47
111592
3185
就是A、T、G、C,
01:54
are only eight or ten atoms wide,
48
114777
2990
宽度只有8到10个原子,
01:57
and they're all packed together into a clump,
49
117767
3540
而他们全部缠在一起成为一团,
02:01
like a ball of yarn.
50
121307
1741
像一团毛线。
02:03
So, to get all that information
51
123048
2107
所以,要从这么小的地方
02:05
out of that tiny space,
52
125155
2252
获取所有这些信息,
02:07
scientists first have to break
53
127407
1586
科学家首先要把
02:08
the long string of DNA down into smaller pieces.
54
128993
5061
DNA的长链打断为许多短链。
02:14
Each of these pieces is then separated in space
55
134054
2996
然后把每一段分离出来
02:17
and sequenced individually,
56
137050
1858
单独测序,
02:18
but how?
57
138908
1428
怎么测?
02:20
It's helpful to remember
58
140336
1159
这样想一下会更容易理解:
02:21
that DNA binds to other DNA
59
141495
2753
DNA会和其它DNA交合
02:24
if the sequences are the exact opposite of each other.
60
144248
3749
如果彼此的序列刚好互补的话。
02:27
A's bind to T's,
61
147997
1810
A配T,
02:29
and T's bind to A's.
62
149807
2581
T配A。
02:32
G's bind to C's,
63
152388
1829
G配C,
02:34
and C's to G's.
64
154217
2376
C配G。
02:36
If the A-T-G-C sequence of two pieces of DNA
65
156593
4007
如果两片DNA的序列ATGC
02:40
are exact opposites,
66
160600
2080
刚好互补
02:42
they stick together.
67
162680
1003
它们就会结合在一起。
02:43
Because the genome pieces
68
163683
1725
因为基因组片段
02:45
are so very small,
69
165408
1780
非常小,
02:47
we need some way to increase
70
167188
1625
我们需要某些办法
02:48
the signal we can detect
71
168813
1613
来放大信号,这样才能检测到
02:50
from each of the individual letters.
72
170426
2277
每个字母是什么。
02:52
In the most common method,
73
172703
1490
科学家最常用的方法,
02:54
scientists use enzymes to make thousands of copies
74
174193
2996
是用酶来合成几千份
02:57
of each genome piece.
75
177189
2187
基因副本
02:59
So, we now have thousands of replicas
76
179376
2458
这样我们就有了每一个基因组
03:01
of each of the genome pieces,
77
181834
2092
几千份复制品,
03:03
all with the same sequence
78
183926
1627
全部都有一样的
03:05
of A's, T's, G's, and C's.
79
185553
4187
ATGC序列,
03:09
But we have to read them all somehow.
80
189740
2618
但我们还得把全部的序列读出来。
03:12
To do this, we need to make
81
192358
1756
为了做到这一点,我们需要
03:14
a batch of special letters,
82
194114
1846
制作一批特别字母,
03:15
each with a distinct color.
83
195960
2002
每种都有特殊的颜色。
03:17
A mixture of these special colored letters and enzymes
84
197962
2885
把这些带颜色标记的字母和酶混合在一起,
03:20
are then added to the genome
85
200847
1442
再加入到
03:22
we're trying to read.
86
202289
1636
我们试图阅读的基因组里
03:23
At each spot on the genome,
87
203925
1630
在基因组的每一个点上,
03:25
one of the special letters
88
205555
1318
有一个特殊字母
03:26
binds to its opposite letter,
89
206873
1934
和对应的字母结合,
03:28
so we now have a double-stranded piece of DNA
90
208807
3202
这样就有了一段双链DNA,
03:32
with a colorful spot at each letter.
91
212009
2585
并且在每个字母上都有颜色标记。
03:34
Scientists then take pictures
92
214594
1796
然后科学家
03:36
of each snippet of genome.
93
216390
2375
对每一小段基因组照相,
03:38
Seeing the order of the colors
94
218765
1584
看到了颜色的排列顺序,
03:40
allows us to read the sequence.
95
220349
3250
我们就可以读出序列。
03:43
The sequences of each
96
223599
1415
这样几百万条DNA片段
03:45
of these millions of pieces of DNA
97
225014
2230
的序列
03:47
are stitched together using computer programs
98
227244
2117
是由计算机程序拼接一起的
03:49
to create a complete sequence of the entire genome.
99
229361
3507
这样就得到了整个基因组的完整序列。
03:52
This isn't the only way
100
232868
1058
这并非读取DNA片段序列字母的
03:53
to read the letter sequences of pieces of DNA,
101
233926
2601
唯一方法
03:56
but it's one of the most common.
102
236527
1762
却是最常用的方法之一。
03:58
Of course, just reading the letters in the genome
103
238289
2708
当然,仅仅是读取了基因组中的字母
04:00
doesn't tell us much.
104
240997
1633
还远远不够。
04:02
It's kind of like looking through a book
105
242630
1998
就好像你在翻一本书,
04:04
written in a language you don't speak.
106
244628
2548
书中所用的语言你并不懂,
04:07
You can recognize all the letters
107
247176
1708
你能读出书里的每个字母,
04:08
but still have no idea what's going on.
108
248884
2663
却仍然不理解书里说什么。
04:11
So, the next step is to decipher
109
251547
2055
所以,下一步就是解码,
04:13
what the sequence means,
110
253602
1859
序列到底是什么意思,
04:15
how your genome and my genome are different.
111
255461
3261
为什么你的基因组和我的不一样。
04:18
Interpreting the genes of the genome
112
258722
1917
阐释基因组中的不同基因的功能,
04:20
is the part scientists are still working on.
113
260639
2816
是科学家正在进行的工作。
04:23
While not every difference is consequential,
114
263455
2666
虽然并不是每一处不同之处都意义重大,
04:26
the sum of these differences
115
266121
1558
但这些不同之处组合在一起
04:27
is responsible for differences
116
267679
1538
决定了人与人之间的不同。
04:29
in how we look,
117
269217
1213
它决定了我们怎么看,
04:30
what we like,
118
270430
1001
我们喜欢什么,
04:31
how we act,
119
271431
1192
我们如何反应,
04:32
and even how likely we are to get sick
120
272623
2221
甚至我们患上疾病的可能性,
04:34
or respond to specific medicines.
121
274844
2465
或是对特定药物的反应。
04:37
Better understanding of how disparities
122
277309
1668
更好地了解
04:38
between our genomes
123
278977
1233
我们各自的基因组有哪些差别,
04:40
account for these differences
124
280210
1599
认识到这些差别
04:41
is sure to change the way we think
125
281809
1965
必将改变我们的观念:
04:43
not only about how doctors treat their patients,
126
283774
2766
不光是医生应当如何治疗病人,
04:46
but also how we treat each other.
127
286540
2583
而且是我们该如何。
关于本网站

这个网站将向你介绍对学习英语有用的YouTube视频。你将看到来自世界各地的一流教师教授的英语课程。双击每个视频页面上显示的英文字幕,即可从那里播放视频。字幕会随着视频的播放而同步滚动。如果你有任何意见或要求,请使用此联系表与我们联系。

https://forms.gle/WvT1wiN1qDtmnspy7