The beauty of data visualization | David McCandless

395,940 views ・ 2010-08-23

TED


请双击下面的英文字幕来播放视频。

翻译人员: Juyao Dong 校对人员: Zhu Jie
00:15
It feels like we're all suffering
0
15260
2000
我们好像总是被超负荷的信息
00:17
from information overload or data glut.
1
17260
3000
和过多的数据折磨.
00:20
And the good news is there might be an easy solution to that,
2
20260
2000
好消息是也许我们能有种办法解决这个问题,
00:22
and that's using our eyes more.
3
22260
2000
那就是更多的用我们的眼睛.
00:24
So, visualizing information, so that we can see
4
24260
2000
把信息视觉化,我们就能看出
00:26
the patterns and connections that matter
5
26260
3000
其中的规律和联系
00:29
and then designing that information so it makes more sense,
6
29260
3000
然后再设计这些信息,让它更能有意义,
00:32
or it tells a story,
7
32260
2000
或者让它来说明一个故事,
00:34
or allows us to focus only on the information that's important.
8
34260
3000
或者让我们能只关注那些重要的信息.
00:38
Failing that, visualized information can just look really cool.
9
38260
3000
如果不能做到这一点,至少视觉化的信息看起来特别酷.
00:41
So, let's see.
10
41260
2000
让我们来看看.
00:45
This is the $Billion Dollar o-Gram,
11
45260
2000
这是十亿美元图,
00:47
and this image arose
12
47260
2000
我在看到媒体的十亿美元的报道时
00:49
out of frustration I had
13
49260
2000
很沮丧,
00:51
with the reporting of billion-dollar amounts in the press.
14
51260
2000
于是我就画了这幅图.
00:53
That is, they're meaningless without context:
15
53260
3000
如果没有文字辅助,它是没意义的.
00:56
500 billion for this pipeline,
16
56260
2000
五千亿用于运输管道的建设
00:58
20 billion for this war.
17
58260
2000
二百亿用于战争.
01:00
It doesn't make any sense, so the only way to understand it
18
60260
2000
这样并不能说明问题,所以唯一能理解它的方法
01:02
is visually and relatively.
19
62260
2000
就是视觉化,相关化.
01:04
So I scraped a load of reported figures
20
64260
2000
所以我从不同的信息渠道
01:06
from various news outlets
21
66260
2000
收集了很多报道的数据
01:08
and then scaled the boxes according to those amounts.
22
68260
3000
然后根据数量画出了这些方框.
01:11
And the colors here represent the motivation behind the money.
23
71260
3000
颜色代表了这部分钱的用途.
01:14
So purple is "fighting,"
24
74260
3000
紫色代表战乱
01:17
and red is "giving money away," and green is "profiteering."
25
77260
3000
红色是把钱捐赠出去,绿色代表牟取暴利.
01:20
And what you can see straight away
26
80260
2000
你能直接看出来的是
01:22
is you start to have a different relationship to the numbers.
27
82260
2000
你开始和这些数字有了不同的关系.
01:24
You can literally see them.
28
84260
2000
你能真正的看见它们了.
01:26
But more importantly, you start to see
29
86260
2000
但更重要的是,你能开始看到
01:28
patterns and connections between numbers
30
88260
2000
这些数字中的规律和联系了
01:30
that would otherwise be scattered across multiple news reports.
31
90260
3000
如果不通过画图,这些信息都将只能散落在不同的新闻报道中.
01:33
Let me point out some that I really like.
32
93260
3000
让我来讲讲我很喜欢的一个例子.
01:36
This is OPEC's revenue, this green box here --
33
96260
2000
这是石油输出国组织的财政收益,绿色的方格这里
01:38
780 billion a year.
34
98260
2000
是7800亿每年.
01:40
And this little pixel in the corner -- three billion --
35
100260
3000
这个角落里的小像素是三十个亿
01:43
that's their climate change fund.
36
103260
3000
这是他们的气候变化基金.
01:46
Americans, incredibly generous people --
37
106260
2000
美国,这个特别慷慨的国家
01:48
over 300 billion a year, donated to charity every year,
38
108260
3000
每年捐赠多于3000个亿
01:51
compared with the amount of foreign aid
39
111260
2000
相比于其它17个工业化最发达的国家
01:53
given by the top 17 industrialized nations
40
113260
2000
一共对外捐出的
01:55
at 120 billion.
41
115260
2000
1200亿.
01:57
Then of course,
42
117260
2000
当然,
01:59
the Iraq War, predicted to cost just 60 billion
43
119260
2000
在2003年的预算中
02:01
back in 2003.
44
121260
3000
伊拉克战争只要花掉600亿
02:04
And it mushroomed slightly. Afghanistan and Iraq mushroomed now
45
124260
3000
这项实际的花销比预算稍多了一点, 但阿富汗战争的投入
02:07
to 3,000 billion.
46
127260
3000
迅速增加到了3万亿.
02:10
So now it's great
47
130260
2000
现在就很方便了,
02:12
because now we have this texture, and we can add numbers to it as well.
48
132260
2000
因为我们有了这个结构,我们还能往里面加数字.
02:14
So we could say, well, a new figure comes out ... let's see African debt.
49
134260
3000
我们能说一个新的图形出现了.....让我们来看下非洲的债务.
02:17
How much of this diagram do you think might be taken up
50
137260
2000
你们认为在这张图里非洲
02:19
by the debt that Africa owes to the West?
51
139260
2000
欠西方国家的债务能占多大比例呢?
02:21
Let's take a look.
52
141260
2000
我们来看一看.
02:23
So there it is:
53
143260
2000
这就是了.
02:25
227 billion is what Africa owes.
54
145260
2000
非洲的债务一共是二千二百七十亿.
02:27
And the recent financial crisis,
55
147260
2000
最近的这场金融危机--
02:29
how much of this diagram might that figure take up?
56
149260
2000
在这张图里能占多大的比例呢?
02:31
What has that cost the world? Let's take a look at that.
57
151260
3000
这场危机又给世界带来了多大损失呢?让我们来看这里.
02:34
Dooosh -- Which I think is the appropriate sound effect
58
154260
3000
Doosh(拟声词,相似于"哇"),我想应该是配合这么多钱
02:37
for that much money:
59
157260
2000
的恰当的声音效果.
02:39
11,900 billion.
60
159260
4000
十一万九千个亿.
02:45
So, by visualizing this information,
61
165260
2000
所以,通过视觉化这些信息,
02:47
we turned it into a landscape
62
167260
2000
我们把它们转化成了一个直观景象
02:49
that you can explore with your eyes,
63
169260
2000
这样你就能只通过眼睛探索,
02:51
a kind of map really, a sort of information map.
64
171260
2000
真的类似于一个地图,一种信息的地图.
02:53
And when you're lost in information,
65
173260
2000
当你在信息里迷失的时候,
02:55
an information map is kind of useful.
66
175260
3000
这样的地图是很有帮助的.
02:58
So I want to show you another landscape now.
67
178260
2000
我现在想展示另一番景象.
03:00
We need to imagine what a landscape
68
180260
2000
我们来想象下世界的恐惧会是
03:02
of the world's fears might look like.
69
182260
3000
什么样的一番景象.
03:05
Let's take a look.
70
185260
2000
让我们来看一下.
03:07
This is Mountains Out of Molehills,
71
187260
2000
这是摩尔山之外的一座"山",
03:09
a timeline of global media panic.
72
189260
2000
是全球医疗恐惧的时间线.
03:11
(Laughter)
73
191260
2000
(笑声)
03:13
So, I'll label this for you in a second.
74
193260
2000
我等下会把这些标记出来.
03:15
But the height here, I want to point out,
75
195260
2000
但我想指出这里的高度
03:17
is the intensity of certain fears
76
197260
2000
是媒体报道里
03:19
as reported in the media.
77
199260
2000
人们的恐惧程度.
03:21
Let me point them out.
78
201260
2000
让我来指出来.
03:23
So this, swine flu -- pink.
79
203260
4000
这里,猪流感,标记成粉色.
03:27
Bird flu.
80
207260
2000
禽流感.
03:29
SARS -- brownish here. Remember that one?
81
209260
3000
非典--棕色一样的颜色.记住这一条.
03:32
The millennium bug,
82
212260
3000
千禧虫--
03:35
terrible disaster.
83
215260
2000
可怕的灾难.
03:37
These little green peaks
84
217260
2000
这些小的绿色的峰
03:39
are asteroid collisions.
85
219260
2000
是对小行星碰撞的恐惧.
03:41
(Laughter)
86
221260
2000
(笑声)
03:43
And in summer, here, killer wasps.
87
223260
2000
夏天的时候,这点是对杀人蜂的恐惧.
03:45
(Laughter)
88
225260
8000
(笑声)
03:53
So these are what our fears look like
89
233260
2000
所以这就是媒体描述里
03:55
over time in our media.
90
235260
2000
我们的恐惧"长的模样",
03:57
But what I love -- and I'm a journalist --
91
237260
2000
但我最喜欢的是--我是个记者--
03:59
and what I love is finding hidden patterns; I love being a data detective.
92
239260
3000
我最喜欢的是找出隐藏着的规律. 我喜欢做个数据侦探.
04:02
And there's a very interesting and odd pattern hidden in this data
93
242260
3000
而且这些数据里有很有趣又很奇怪的规律
04:05
that you can only see when you visualize it.
94
245260
2000
你只能通过视觉化数据才能看出这些规律.
04:07
Let me highlight it for you.
95
247260
2000
我来强调一下.
04:09
See this line, this is a landscape for violent video games.
96
249260
3000
看这条线.这是暴力视频游戏的"景象".
04:12
As you can see, there's a kind of odd, regular pattern in the data,
97
252260
3000
正如你能看到的,是个有点奇怪的长方形的样子,
04:15
twin peaks every year.
98
255260
2000
每年都有两个差不多大小的峰.
04:17
If we look closer, we see those peaks occur
99
257260
2000
如果我们仔细看看,我们能看出这些峰
04:19
at the same month every year.
100
259260
3000
每年出现的时间是一样的.
04:22
Why?
101
262260
2000
为什么呢?
04:24
Well, November, Christmas video games come out,
102
264260
2000
十一月的时候,是圣诞视频游戏发放的日子,
04:26
and there may well be an upsurge in the concern about their content.
103
266260
3000
就很可能引起关于其内容的恐慌.
04:29
But April isn't a particularly massive month
104
269260
3000
但四月对游戏而言
04:32
for video games.
105
272260
2000
并不是个很特别的月份
04:34
Why April?
106
274260
2000
为什么会是四月呢?
04:36
Well, in April 1999 was the Columbine shooting,
107
276260
3000
其实,1999年的四月发生了哥伦拜恩枪击案,
04:39
and since then, that fear
108
279260
2000
从那以后,
04:41
has been remembered by the media
109
281260
2000
人们总是能通过媒体想起这份恐惧
04:43
and echoes through the group mind gradually through the year.
110
283260
2000
在这一年里,这份恐惧也总是渐渐环绕在这个群体里.
04:45
You have retrospectives, anniversaries,
111
285260
3000
你总会回顾,纪念,
04:48
court cases, even copy-cat shootings,
112
288260
3000
也总有诉讼甚至是只单纯模仿别人的枪击案,
04:51
all pushing that fear into the agenda.
113
291260
3000
所有这些都能引起人们的恐惧,而体现在这日历里.
04:54
And there's another pattern here as well. Can you spot it?
114
294260
2000
而且这里还有另一个规律.你看出来了吗?
04:56
See that gap there? There's a gap,
115
296260
2000
看见这个间断了吗?这里有一个间断.
04:58
and it affects all the other stories.
116
298260
2000
这一点影响了其它所有部分.
05:00
Why is there a gap there?
117
300260
2000
为什么这里会有一个间断呢?
05:02
You see where it starts? September 2001,
118
302260
3000
看到它是从什么时候开始的吗?2001年9月(911发生的时间)
05:05
when we had something very real
119
305260
2000
那个月里发了一件真的
05:07
to be scared about.
120
307260
2000
很令人恐惧的事情.
05:09
So, I've been working as a data journalist for about a year,
121
309260
3000
我做数据统计记者已经有一年了,
05:12
and I keep hearing a phrase
122
312260
2000
我总是能听到这样一个说法
05:14
all the time, which is this:
123
314260
3000
说:
05:17
"Data is the new oil."
124
317260
2000
"数据是新的石油."
05:19
Data is the kind of ubiquitous resource
125
319260
3000
而且数据是一种无法不在的资源
05:22
that we can shape to provide new innovations and new insights,
126
322260
3000
我们可以用它来开创新的革命和用新眼光来洞察世界,
05:25
and it's all around us, and it can be mined very easily.
127
325260
3000
而且它就在我们身边,很容易被挖掘.
05:28
It's not a particularly great metaphor in these times,
128
328260
3000
而且在这个时代这也不是什么很特别的比喻
05:31
especially if you live around the Gulf of Mexico,
129
331260
3000
尤其是如果你生活在墨西哥湾,
05:34
but I would, perhaps, adapt this metaphor slightly,
130
334260
2000
但我想,或者可以对这个比喻稍加改动,
05:36
and I would say that data is the new soil.
131
336260
3000
我要说数据是新土壤.
05:40
Because for me, it feels like a fertile, creative medium.
132
340260
3000
因为对我而言,它更像个可以施肥,能创造出东西来的媒体.
05:43
Over the years, online,
133
343260
2000
经过这么多年,在网上
05:45
we've laid down
134
345260
3000
我们已经积攒下
05:48
a huge amount of information and data,
135
348260
2000
巨大数量的信息和数据,
05:50
and we irrigate it with networks and connectivity,
136
350260
2000
而且我们用网络和连接来浇灌它们,
05:52
and it's been worked and tilled by unpaid workers and governments.
137
352260
3000
不收取工资的人们和政府一直在努力耕作这片"土壤".
05:55
And, all right, I'm kind of milking the metaphor a little bit.
138
355260
3000
好吧,我有点夸大这个比喻了.
05:58
But it's a really fertile medium,
139
358260
3000
但它真的是一个可以繁殖的媒体,
06:01
and it feels like visualizations, infographics, data visualizations,
140
361260
3000
它想让被视觉化,把咨询图像化,数据可视化,
06:04
they feel like flowers blooming from this medium.
141
364260
3000
它们就像是从土壤里长出的花朵.
06:07
But if you look at it directly,
142
367260
2000
但如果你直接看这些数据,
06:09
it's just a lot of numbers and disconnected facts.
143
369260
2000
它们就只是一堆数字和没有关联的事件.
06:11
But if you start working with it and playing with it in a certain way,
144
371260
3000
但如果你开始分析它们,并用一种方式处理它们,
06:14
interesting things can appear and different patterns can be revealed.
145
374260
3000
有趣的事就会发生,不同的规律也会显现出来.
06:17
Let me show you this.
146
377260
2000
让我来展示下这个.
06:19
Can you guess what this data set is?
147
379260
3000
你能猜出这是关于什么的数据吗?
06:22
What rises twice a year,
148
382260
2000
有什么东西每年要升两次?
06:24
once in Easter
149
384260
2000
一次在复活节
06:26
and then two weeks before Christmas,
150
386260
2000
另一次在圣诞前的两周,
06:28
has a mini peak every Monday,
151
388260
2000
每个周一都有一个小峰
06:30
and then flattens out over the summer?
152
390260
2000
并在夏天的时候一直保持平坦.
06:32
I'll take answers.
153
392260
2000
我来听听你们的想法.
06:34
(Audience: Chocolate.) David McCandless: Chocolate.
154
394260
2000
(观众:巧克力).David:巧克力.
06:36
You might want to get some chocolate in.
155
396260
3000
你或许会想加一点巧克力进去.
06:39
Any other guesses?
156
399260
2000
还有别的猜想呢?
06:41
(Audience: Shopping.) DM: Shopping.
157
401260
2000
(观众:购物).David:购物.
06:43
Yeah, retail therapy might help.
158
403260
3000
嗯,购物疗法也许会有帮助.
06:46
(Audience: Sick leave.)
159
406260
2000
(观众:病假)
06:48
DM: Sick leave. Yeah, you'll definitely want to take some time off.
160
408260
2000
David:病假.嗯,你一定想休息几天.
06:50
Shall we see?
161
410260
2000
我们来看答案吧?
06:53
(Laughter)
162
413260
8000
(笑声)
07:01
(Applause)
163
421260
3000
(掌声)
07:04
So, the information guru Lee Byron and myself,
164
424260
3000
这里的信息是,Lee Byron和我自己
07:07
we scraped 10,000 status Facebook updates
165
427260
3000
搜集了一万个Facebook的状态
07:10
for the phrase "break-up" and "broken-up"
166
430260
2000
寻找包括"分手"和"分手了"的字样的更新
07:12
and this is the pattern we found --
167
432260
2000
这就是我们收集的结果--
07:14
people clearing out for Spring Break,
168
434260
2000
人们在春假期间"清理"关系.
07:16
(Laughter)
169
436260
5000
(笑声)
07:21
coming out of very bad weekends on a Monday,
170
441260
2000
接着的周一从过得很糟的周末中走出来
07:23
being single over the summer,
171
443260
3000
一个暑假里都保持着单身.
07:26
and then the lowest day of the year, of course: Christmas Day.
172
446260
3000
当然一年中分手最少的一天是:圣诞.
07:29
Who would do that?
173
449260
3000
谁会这么做呢?
07:32
So there's a titanic amount of data out there now,
174
452260
2000
现在外面有极多的数据,
07:34
unprecedented.
175
454260
2000
史无前例的多.
07:37
But if you ask the right kind of question,
176
457260
2000
但如果你问正确的问题,
07:39
or you work it in the right kind of way,
177
459260
2000
或者你用正确的方式处理它,
07:41
interesting things can emerge.
178
461260
3000
有趣的事就会发生.
07:44
So information is beautiful. Data is beautiful.
179
464260
3000
所以信息是漂亮的,数据也是漂亮的.
07:47
I wonder if I could make my life beautiful.
180
467260
3000
我想我能不能把我的生活也弄得很漂亮.
07:50
And here's my visual C.V.
181
470260
2000
这是我的视觉化的简历.
07:52
I'm not quite sure I've succeeded.
182
472260
2000
我还不太确定这份简历是不是成功.
07:54
Pretty blocky, the colors aren't that great.
183
474260
2000
有很多的小块,颜色也不是特别的好.
07:56
But I wanted to convey something to you.
184
476260
3000
但我想传递些东西给你们.
07:59
I started as a programmer,
185
479260
2000
我从一个程序员开始做起,
08:01
and then I worked as a writer for many years, about 20 years,
186
481260
2000
之后我做了很多年作家,大约有二十年,
08:03
in print, online and then in advertising,
187
483260
2000
在杂志上,网上之后还有广告中,
08:05
and only recently have I started designing.
188
485260
3000
只在最近这一段我才开始进行设计.
08:08
And I've never been to design school.
189
488260
2000
而且我从来没去过设计学校,
08:10
I've never studied art or anything.
190
490260
3000
我从没学校艺术或这类的东西.
08:13
I just kind of learned through doing.
191
493260
2000
我就仅仅是从做的过程中来学.
08:15
And when I started designing,
192
495260
2000
当我开始设计的时候,
08:17
I discovered an odd thing about myself.
193
497260
2000
我发现了关于我自己的一件奇怪的事.
08:19
I already knew how to design,
194
499260
2000
我已经知道怎么去设计了,
08:21
but it wasn't like I was amazingly brilliant at it,
195
501260
3000
不是说我奇迹般的对设计特别在行,
08:24
but more like I was sensitive
196
504260
2000
而更像是我对关于网格,空间
08:26
to the ideas of grids and space
197
506260
2000
以及排列和版面设计之类的
08:28
and alignment and typography.
198
508260
2000
特别的敏感.
08:30
It's almost like being exposed
199
510260
2000
就好像是暴露在媒体
08:32
to all this media over the years
200
512260
2000
这么多年之后,
08:34
had instilled a kind of dormant design literacy in me.
201
514260
3000
我被潜移默化地注入了设计的能力.
08:37
And I don't feel like I'm unique.
202
517260
2000
而且我不觉得自己是唯一一个.
08:39
I feel that everyday, all of us now
203
519260
2000
我感觉每一天,我们所有人
08:41
are being blasted by information design.
204
521260
3000
都对信息设计特别的不满.
08:44
It's being poured into our eyes through the Web,
205
524260
2000
它们从网上一股脑灌入我们眼里
08:46
and we're all visualizers now;
206
526260
2000
而且我们都是视觉型的人,
08:48
we're all demanding a visual aspect
207
528260
2000
我们都想从视觉的角度
08:50
to our information.
208
530260
3000
来看我们的信息.
08:53
There's something almost quite magical about visual information.
209
533260
3000
这就是关于视觉化信息的神奇的地方.
08:56
It's effortless, it literally pours in.
210
536260
3000
不需要努力的,它就自己进来了.
08:59
And if you're navigating a dense information jungle,
211
539260
3000
如果你在密集的信息丛林里搜寻,
09:02
coming across a beautiful graphic
212
542260
2000
突然看到一个漂亮的图画,
09:04
or a lovely data visualization,
213
544260
2000
或者是一个可爱的视觉化的数据,
09:06
it's a relief, it's like coming across a clearing in the jungle.
214
546260
3000
那将会是怎样的释然啊! 就好像是在森林里碰到了开阔的地方.
09:09
I was curious about this, so it led me
215
549260
2000
我对这很好奇,所以它引导我
09:11
to the work of a Danish physicist
216
551260
2000
关注一个丹麦的物理学家
09:13
called Tor Norretranders,
217
553260
2000
名叫Tor Norretranders,
09:15
and he converted the bandwidth of the senses into computer terms.
218
555260
3000
他把感观的带宽转化成了电脑术语.
09:19
So here we go. This is your senses,
219
559260
2000
过程是这样的.这是你的感观,
09:21
pouring into your senses every second.
220
561260
2000
周围世界每一秒都往你的感观里注入东西.
09:23
Your sense of sight is the fastest.
221
563260
3000
你的视觉是最快的.
09:26
It has the same bandwidth as a computer network.
222
566260
3000
它有和电脑网络一样的带宽.
09:29
Then you have touch, which is about the speed of a USB key.
223
569260
3000
你还有触觉,它有和USB接口一样的速度.
09:32
And then you have hearing and smell,
224
572260
2000
你的听觉和嗅觉,
09:34
which has the throughput of a hard disk.
225
574260
2000
有和硬盘一样的处理速度.
09:36
And then you have poor old taste,
226
576260
2000
你还有可怜的,古老的味觉,
09:38
which is like barely the throughput of a pocket calculator.
227
578260
3000
它仅仅只有口袋计算器那样的工作效率.
09:41
And that little square in the corner, a naught .7 percent,
228
581260
3000
那角落里小小的正方形,占百分之0.7的比例,
09:44
that's the amount we're actually aware of.
229
584260
3000
这就是我们真正能感知的部分.
09:47
So a lot of your vision --
230
587260
2000
所以你很多的想像--
09:49
the bulk of it is visual, and it's pouring in.
231
589260
2000
大部分是视觉的,而且它是自己大批进入的.
09:51
It's unconscious.
232
591260
2000
潜意识里进行的.
09:53
The eye is exquisitely sensitive
233
593260
3000
眼睛对颜色变化中的规律,
09:56
to patterns in variations in color, shape and pattern.
234
596260
3000
形状和模式是特别敏感的.
09:59
It loves them, and it calls them beautiful.
235
599260
2000
如果眼睛喜欢它们,它就把它们叫作漂亮.
10:01
It's the language of the eye.
236
601260
2000
这就是眼睛的语言.
10:03
If you combine the language of the eye with the language of the mind,
237
603260
2000
而且你把眼睛的语言和头脑的语言联系在一起,
10:05
which is about words and numbers and concepts,
238
605260
3000
关于词语,数字和概念这些,
10:08
you start speaking two languages simultaneously,
239
608260
3000
你将能同时讲两种语言,
10:11
each enhancing the other.
240
611260
3000
每一种都能加强另一种.
10:14
So, you have the eye, and then you drop in the concepts.
241
614260
3000
所以你有眼睛,然后你想到了概念,
10:17
And that whole thing -- it's two languages
242
617260
2000
这个过程,其实是两种语言,
10:19
both working at the same time.
243
619260
2000
它们同时在一起工作.
10:21
So we can use this new kind of language, if you like,
244
621260
2000
所以我们能用这种新的语言,如果你喜欢的话,
10:23
to alter our perspective or change our views.
245
623260
3000
来改变我们的视角或者我们的观点.
10:26
Let me ask you a simple question
246
626260
2000
让我来问一个简单的问题
10:28
with a really simple answer:
247
628260
2000
它有个特别简单的答案.
10:30
Who has the biggest military budget?
248
630260
2000
谁拥有最多的军队预算?
10:32
It's got to be America, right?
249
632260
2000
这一定是美国,对吧?
10:34
Massive. 609 billion in 2008 --
250
634260
2000
非常多的.在2008年有六千零九十亿
10:36
607, rather.
251
636260
2000
六千零七十亿实际上是.
10:38
So massive, in fact, that it can contain
252
638260
2000
这么多,事实上,它包含了
10:40
all the other military budgets in the world inside itself.
253
640260
3000
所有其它国家的军队预算在里面.
10:43
Gobble, gobble, gobble, gobble, gobble.
254
643260
2000
格格,格格,格格,格格,格格(火鸡的叫声)
10:45
Now, you can see Africa's total debt there
255
645260
2000
现在你能拿整个非洲的债务
10:47
and the U.K. budget deficit for reference.
256
647260
2000
和英国的预算赤字来比较一下.
10:49
So that might well chime
257
649260
2000
这很可能
10:51
with your view that America
258
651260
2000
和你印象中的美国形象一致,
10:53
is a sort of warmongering military machine,
259
653260
3000
那个好战的,有军队机器,
10:56
out to overpower the world
260
656260
2000
强大到要用它的巨大的工业军事合成体
10:58
with its huge industrial-military complex.
261
658260
3000
来压倒世界的国家.
11:01
But is it true that America has the biggest military budget?
262
661260
3000
但美国真的有最多的军事预算吗?
11:04
Because America is an incredibly rich country.
263
664260
2000
因为它是个特别富有的国家.
11:06
In fact, it's so massively rich
264
666260
2000
事实上,它是这么有富有
11:08
that it can contain the four other
265
668260
2000
它能把四个在它之后的
11:10
top industrialized nations' economies
266
670260
2000
工业国家的经济收入
11:12
inside itself, it's so vastly rich.
267
672260
3000
全包含在它之内.如此的富有.
11:15
So its military budget is bound to be enormous.
268
675260
3000
它的军队预算是和经济联系在一起的
11:18
So, to be fair and to alter our perspective,
269
678260
2000
所以,为了公平为了改变我们的视角,
11:20
we have to bring in another data set,
270
680260
2000
我们得引入另一组数据,
11:22
and that data set is GDP, or the country's earnings.
271
682260
2000
这就是国民生产总值,或者是一个国家的收入.
11:24
Who has the biggest budget as a proportion of GDP?
272
684260
2000
哪个国家的军队预算是占国民生产总值最多的呢?
11:26
Let's have a look.
273
686260
2000
让我们来看一看.
11:28
That changes the picture considerably.
274
688260
3000
这把格局改变了很多.
11:31
Other countries pop into view that you, perhaps, weren't considering,
275
691260
3000
其它你很可能没想到的国家一下子出现了,
11:34
and American drops into eighth.
276
694260
2000
美国跌入了第八位.
11:36
Now you can also do this with soldiers.
277
696260
2000
你也同样可以对士兵进行这样的运算.
11:38
Who has the most soldiers? It's got to be China.
278
698260
2000
哪个国家有最多的士兵?一定是中国.
11:40
Of course, 2.1 million.
279
700260
2000
当然,二百一十万.
11:42
Again, chiming with your view
280
702260
2000
同样的,和你对中国,
11:44
that China has a militarized regime
281
704260
2000
那个军事化的政权,准备着
11:46
ready to, you know, mobilize its enormous forces.
282
706260
2000
要发挥其强大的武装力量的印象一致.
11:48
But of course, China has an enormous population.
283
708260
3000
但不可否认,中国有巨大的人口,
11:51
So if we do the same,
284
711260
2000
如果我们做同样的处理,
11:53
we see a radically different picture.
285
713260
2000
我们看到一个彻底不同的景象.
11:55
China drops to 124th.
286
715260
2000
中国落到了124位.
11:57
It actually has a tiny army
287
717260
2000
它其实只有很小的一支部队,
11:59
when you take other data into consideration.
288
719260
3000
如果你把人口考虑进去的话.
12:02
So, absolute figures, like the military budget,
289
722260
2000
所以,绝对的数字,就像是军队预算,
12:04
in a connected world,
290
724260
2000
在一个紧密联系的世界里
12:06
don't give you the whole picture.
291
726260
2000
好像不能给你一个完整的格局.
12:08
They're not as true as they could be.
292
728260
2000
它们并不是它们能达到的最真实的状态.
12:10
We need relative figures that are connected to other data
293
730260
3000
我们需要相对的数据,这些数据和其它数据相联系
12:13
so that we can see a fuller picture,
294
733260
2000
我们才能看到一个更全面的世界.
12:15
and then that can lead to us changing our perspective.
295
735260
2000
而这一点能改变我们的视角.
12:17
As Hans Rosling, the master,
296
737260
2000
就像Hans Rosling, 校长
12:19
my master, said,
297
739260
3000
我的校长说的那样,
12:22
"Let the dataset change your mindset."
298
742260
3000
"让数据模式改变你的心理模式."
12:26
And if it can do that, maybe it can also change your behavior.
299
746260
3000
如果它能做到这一点,也许它能同时改变你的行为.
12:29
Take a look at this one.
300
749260
2000
来看一下这个.
12:31
I'm a bit of a health nut.
301
751260
2000
我有一点健康迷.
12:33
I love taking supplements and being fit,
302
753260
3000
我喜欢吃一些保养品来保持健康
12:36
but I can never understand what's going on in terms of evidence.
303
756260
3000
但我从来不能用证据证明这些东西是不是真的有用.
12:39
There's always conflicting evidence.
304
759260
2000
总是会有相当矛盾的证据.
12:41
Should I take vitamin C? Should I be taking wheatgrass?
305
761260
2000
我是应该吃维他命C?还是吃小麦草素?
12:43
This is a visualization of all the evidence
306
763260
2000
这里把所有这些关于
12:45
for nutritional supplements.
307
765260
2000
营养补品的证据都视觉化了.
12:47
This kind of diagram is called a balloon race.
308
767260
3000
这样的图表叫作热气球竞赛.
12:50
So the higher up the image,
309
770260
2000
图上越高的地方,
12:52
the more evidence there is for each supplement.
310
772260
3000
有越多的关于这些补品的证据.
12:55
And the bubbles correspond to popularity as regards to Google hits.
311
775260
3000
这些气泡的大小对应于GOOGLE上这样补品的受欢迎程度.
12:58
So you can immediately apprehend
312
778260
3000
这样你能马上就领会效力
13:01
the relationship between efficacy and popularity,
313
781260
3000
和受欢迎程度间的关系.
13:04
but you can also, if you grade the evidence,
314
784260
3000
而且同时,如果你把证据分级别的话,
13:07
do a "worth it" line.
315
787260
2000
画出一条"值得"的线.
13:09
So supplements above this line are worth investigating,
316
789260
3000
所以在这条线以上的补品是那些值得投资的,
13:12
but only for the conditions listed below,
317
792260
3000
但只是针对下面列出的这些症状.
13:15
and then the supplements below the line
318
795260
3000
对于这条线以下的补品,
13:18
are perhaps not worth investigating.
319
798260
2000
就很可能不太值得投资了.
13:20
Now this image constitutes a huge amount of work.
320
800260
3000
这幅图凝结了我们大量的工作.
13:23
We scraped like 1,000 studies from PubMed,
321
803260
3000
我们挖掘了Pubmed上一千个左右的研究,
13:26
the biomedical database,
322
806260
2000
Pubmed是一个生物医学的数据库,
13:28
and we compiled them and graded them all.
323
808260
3000
我们收集了这些研究并把他们分了级别.
13:31
And it was incredibly frustrating for me
324
811260
2000
这让我相当沮丧
13:33
because I had a book of 250 visualizations to do for my book,
325
813260
3000
因为我的书需要250个可视化图像
13:36
and I spent a month doing this,
326
816260
2000
我用了一个月做这幅图,
13:38
and I only filled two pages.
327
818260
2000
却只能填充两页纸.
13:40
But what it points to
328
820260
2000
但重点是
13:42
is that visualizing information like this
329
822260
2000
像这样把信息视觉化的过程
13:44
is a form of knowledge compression.
330
824260
2000
是把知识压缩的过程.
13:46
It's a way of squeezing an enormous amount
331
826260
2000
这是一种从巨大数量的信息
13:48
of information and understanding
332
828260
2000
中压榨出精华
13:50
into a small space.
333
830260
2000
并理解的过程.
13:52
And once you've curated that data, and once you've cleaned that data,
334
832260
2000
一旦你整理了这些数据,或者说清理了这些数据,
13:54
and once it's there,
335
834260
2000
一旦它在那了
13:56
you can do cool stuff like this.
336
836260
2000
你就能做像这样很酷的事了.
13:58
So I converted this into an interactive app,
337
838260
2000
所以我用这个原理开发出了一个互动的应用程序,
14:00
so I can now generate this application online --
338
840260
2000
我现在能在网上生成这个程序--
14:02
this is the visualization online --
339
842260
2000
并在网上进行视觉化--
14:04
and I can say, "Yeah, brilliant."
340
844260
2000
我能称赞它"真好,太棒了"
14:06
So it spawns itself.
341
846260
2000
它自动开始工作.
14:08
And then I can say, "Well, just show me the stuff
342
848260
2000
然后我就能说"嗯,给我看下那些
14:10
that affects heart health."
343
850260
2000
影响心脏健康的东西"
14:12
So let's filter that out.
344
852260
2000
它就开始筛选
14:14
So heart is filtered out, so I can see if I'm curious about that.
345
854260
3000
心脏被选出去了,我对这很好奇.
14:17
I think, "No, no. I don't want to take any synthetics,
346
857260
2000
我想"不,不,我不想吃任何合成的药品
14:19
I just want to see plants and --
347
859260
3000
我只想用植物--
14:22
just show me herbs and plants. I've got all the natural ingredients."
348
862260
3000
就只显示那些草本和植物.我有所有自然的营养成分."
14:25
Now this app is spawning itself
349
865260
2000
现在这个程序又开始从数据
14:27
from the data.
350
867260
2000
里自动生成.
14:29
The data is all stored in a Google Doc,
351
869260
2000
所有数据都保存在Google doc里面,
14:31
and it's literally generating itself from that data.
352
871260
3000
它实际上是在自己利用数据完成视觉化.
14:34
So the data is now alive; this is a living image,
353
874260
2000
现在数据变成活的了,它现在是一个动态的图像了,
14:36
and I can update it in a second.
354
876260
2000
而且我能一下就完成更新.
14:38
New evidence comes out. I just change a row on a spreadsheet.
355
878260
2000
如果新的证据出现了--我只需要在表格里改一行.
14:40
Doosh! Again, the image recreates itself.
356
880260
4000
哇!又一次的,图像自己重新生成了自己.
14:44
So it's cool.
357
884260
2000
这很酷.
14:46
It's kind of living.
358
886260
3000
有点有生命的感觉.
14:49
But it can go beyond data,
359
889260
2000
但它其实能比数据走得更远,
14:51
and it can go beyond numbers.
360
891260
2000
而且也不仅仅限于数字.
14:53
I like to apply information visualization
361
893260
2000
我喜欢用信息视觉化
14:55
to ideas and concepts.
362
895260
3000
的方法来处理思想和概念.
14:58
This is a visualization
363
898260
2000
这是一个对政治观点
15:00
of the political spectrum,
364
900260
2000
视觉化的图像,
15:02
an attempt for me to try
365
902260
2000
是我在试图理解
15:04
and understand how it works
366
904260
2000
政治是如果工作的.
15:06
and how the ideas percolate down
367
906260
2000
理解想法是怎么向下渗透的
15:08
from government into society and culture,
368
908260
2000
从政府开始到社会和文化,
15:10
into families, into individuals, into their beliefs
369
910260
3000
再传递到家庭,各人以及他们的信念中
15:13
and back around again in a cycle.
370
913260
3000
而且又如何再反过来影响上层,形成一个循环的.
15:16
What I love about this image
371
916260
2000
关于这幅图我喜欢的是
15:18
is it's made up of concepts,
372
918260
2000
它是由概念组成的,
15:20
it explores our worldviews
373
920260
2000
它开拓了我们的世界观
15:22
and it helps us -- it helps me anyway --
374
922260
2000
并且帮助我们--至少是帮助我--
15:24
to see what others think,
375
924260
2000
看清别人是怎么想的,
15:26
to see where they're coming from.
376
926260
2000
了解他们的想法是从哪里来的.
15:28
And it feels just incredibly cool to do that.
377
928260
3000
做这件事感觉真是太棒了.
15:31
What was most exciting for me
378
931260
3000
最让我兴奋的是
15:34
designing this
379
934260
2000
设计
15:36
was that, when I was designing this image,
380
936260
2000
当我在设计这些图的时候,
15:38
I desperately wanted this side, the left side,
381
938260
3000
我极其希望这一边,左边
15:41
to be better than the right side --
382
941260
2000
能比右边好--
15:43
being a journalist, a Left-leaning person --
383
943260
3000
作为一个记者,我是个有点左倾的人--
15:46
but I couldn't, because I would have created
384
946260
2000
但我不能,因为如果这样我全造出一个
15:48
a lopsided, biased diagram.
385
948260
3000
不平衡的,带着偏见的图.
15:51
So, in order to really create a full image,
386
951260
3000
所以,为了真正的创造一个完整的图,
15:54
I had to honor the perspectives on the right-hand side
387
954260
3000
我得尊重右手边的这些观点
15:57
and at the same time, uncomfortably recognize
388
957260
3000
同时,有点不太情愿地认识到
16:00
how many of those qualities were actually in me,
389
960260
3000
我其实也有这其中很多的品质,
16:03
which was very, very annoying and uncomfortable.
390
963260
2000
对我而言,这是特别特别地讨厌和不舒服的.
16:05
(Laughter)
391
965260
4000
(笑声)
16:09
But not too uncomfortable,
392
969260
2000
但也不是特别不舒服,
16:11
because there's something unthreatening
393
971260
3000
因为看到一些政治观点,
16:14
about seeing a political perspective,
394
974260
2000
总比被告知或者被强迫听着这些观点,
16:16
versus being told or forced to listen to one.
395
976260
3000
不让人那么有压迫和威胁感.
16:19
You're capable of holding conflicting viewpoints
396
979260
3000
事实上,在你看到它们的时候
16:22
joyously when you can see them.
397
982260
2000
你能快乐的承受矛盾的观点.
16:24
It's even fun to engage with them
398
984260
2000
甚至和它们打交道是有趣的
16:26
because it's visual.
399
986260
2000
因为这是视觉的.
16:28
So that's what's exciting to me,
400
988260
2000
这就是它让我着迷的地方,
16:30
seeing how data can change my perspective
401
990260
2000
看数据是如何让我改变观点
16:32
and change my mind midstream --
402
992260
2000
并且改变我思维的主流--
16:34
beautiful, lovely data.
403
994260
3000
漂亮的,可爱的数据.
16:38
So, just to wrap up,
404
998260
2000
简单总结一下,
16:40
I wanted to say
405
1000260
2000
我想说
16:42
that it feels to me that design is about solving problems
406
1002260
2000
我感觉设计其实是在解决问题
16:44
and providing elegant solutions,
407
1004260
3000
并且给出一个精美的解决办法.
16:47
and information design is about
408
1007260
2000
信息设计是在
16:49
solving information problems.
409
1009260
2000
解决信息上的问题.
16:51
It feels like we have a lot of information problems
410
1011260
2000
现在我们社会里,好像有很多
16:53
in our society at the moment,
411
1013260
2000
信息方面的问题,
16:55
from the overload and the saturation
412
1015260
2000
来自于过多的饱和的数据,
16:57
to the breakdown of trust and reliability
413
1017260
2000
到信任和可靠性的缺失
16:59
and runaway skepticism and lack of transparency,
414
1019260
2000
以及逃避的怀疑和缺乏透明度,
17:01
or even just interestingness.
415
1021260
2000
甚至是失去兴趣.
17:03
I mean, I find information just too interesting.
416
1023260
2000
我想说,我觉得信息真是特别有趣.
17:05
It has a magnetic quality that draws me in.
417
1025260
3000
它有一种磁力吸引着我.
17:09
So, visualizing information
418
1029260
2000
所以,视觉化信息
17:11
can give us a very quick solution to those kinds of problems.
419
1031260
3000
能给我们一种快速的方法来解决这些问题.
17:14
Even when the information is terrible,
420
1034260
2000
即使是在信息很糟的情况下,
17:16
the visual can be quite beautiful.
421
1036260
3000
视觉上仍会很漂亮.
17:19
Often we can get clarity
422
1039260
3000
经常地我们能很快的
17:22
or the answer to a simple question very quickly,
423
1042260
2000
理清条理,或者找到一个简单问题的答案.
17:24
like this one,
424
1044260
2000
就像这个,
17:26
the recent Icelandic volcano.
425
1046260
3000
关于冰岛的火山的最近的讨论.
17:29
Which was emitting the most CO2?
426
1049260
2000
哪一个喷发出了最多的二氧化碳?
17:31
Was it the planes or the volcano,
427
1051260
2000
是那些飞机还是火山,
17:33
the grounded planes or the volcano?
428
1053260
2000
是那些严禁起飞的飞机还是火山?
17:35
So we can have a look.
429
1055260
2000
让我们来看一下.
17:37
We look at the data and we see:
430
1057260
2000
看着这些数据
17:39
Yep, the volcano emitted 150,000 tons;
431
1059260
2000
我们发现,火山喷发出了十五万吨;
17:41
the grounded planes would have emitted
432
1061260
2000
那些禁止起飞的飞机如果飞着的话
17:43
345,000 if they were in the sky.
433
1063260
3000
将喷发出三十四万五千吨.
17:46
So essentially, we had our first carbon-neutral volcano.
434
1066260
3000
所以实际上,我们有了第一个碳中性的火山.
17:49
(Laughter)
435
1069260
2000
(笑声)
17:51
(Applause)
436
1071260
9000
(掌声)
18:00
And that is beautiful. Thank you.
437
1080260
3000
而且这很漂亮!谢谢
18:03
(Applause)
438
1083260
8000
(掌声)
关于本网站

这个网站将向你介绍对学习英语有用的YouTube视频。你将看到来自世界各地的一流教师教授的英语课程。双击每个视频页面上显示的英文字幕,即可从那里播放视频。字幕会随着视频的播放而同步滚动。如果你有任何意见或要求,请使用此联系表与我们联系。

https://forms.gle/WvT1wiN1qDtmnspy7


This website was created in October 2020 and last updated on June 12, 2025.

It is now archived and preserved as an English learning resource.

Some information may be out of date.

隐私政策

eng.lish.video

Developer's Blog