The beauty of data visualization | David McCandless

390,731 views ・ 2010-08-23

TED


请双击下面的英文字幕来播放视频。

翻译人员: Juyao Dong 校对人员: Zhu Jie
00:15
It feels like we're all suffering
0
15260
2000
我们好像总是被超负荷的信息
00:17
from information overload or data glut.
1
17260
3000
和过多的数据折磨.
00:20
And the good news is there might be an easy solution to that,
2
20260
2000
好消息是也许我们能有种办法解决这个问题,
00:22
and that's using our eyes more.
3
22260
2000
那就是更多的用我们的眼睛.
00:24
So, visualizing information, so that we can see
4
24260
2000
把信息视觉化,我们就能看出
00:26
the patterns and connections that matter
5
26260
3000
其中的规律和联系
00:29
and then designing that information so it makes more sense,
6
29260
3000
然后再设计这些信息,让它更能有意义,
00:32
or it tells a story,
7
32260
2000
或者让它来说明一个故事,
00:34
or allows us to focus only on the information that's important.
8
34260
3000
或者让我们能只关注那些重要的信息.
00:38
Failing that, visualized information can just look really cool.
9
38260
3000
如果不能做到这一点,至少视觉化的信息看起来特别酷.
00:41
So, let's see.
10
41260
2000
让我们来看看.
00:45
This is the $Billion Dollar o-Gram,
11
45260
2000
这是十亿美元图,
00:47
and this image arose
12
47260
2000
我在看到媒体的十亿美元的报道时
00:49
out of frustration I had
13
49260
2000
很沮丧,
00:51
with the reporting of billion-dollar amounts in the press.
14
51260
2000
于是我就画了这幅图.
00:53
That is, they're meaningless without context:
15
53260
3000
如果没有文字辅助,它是没意义的.
00:56
500 billion for this pipeline,
16
56260
2000
五千亿用于运输管道的建设
00:58
20 billion for this war.
17
58260
2000
二百亿用于战争.
01:00
It doesn't make any sense, so the only way to understand it
18
60260
2000
这样并不能说明问题,所以唯一能理解它的方法
01:02
is visually and relatively.
19
62260
2000
就是视觉化,相关化.
01:04
So I scraped a load of reported figures
20
64260
2000
所以我从不同的信息渠道
01:06
from various news outlets
21
66260
2000
收集了很多报道的数据
01:08
and then scaled the boxes according to those amounts.
22
68260
3000
然后根据数量画出了这些方框.
01:11
And the colors here represent the motivation behind the money.
23
71260
3000
颜色代表了这部分钱的用途.
01:14
So purple is "fighting,"
24
74260
3000
紫色代表战乱
01:17
and red is "giving money away," and green is "profiteering."
25
77260
3000
红色是把钱捐赠出去,绿色代表牟取暴利.
01:20
And what you can see straight away
26
80260
2000
你能直接看出来的是
01:22
is you start to have a different relationship to the numbers.
27
82260
2000
你开始和这些数字有了不同的关系.
01:24
You can literally see them.
28
84260
2000
你能真正的看见它们了.
01:26
But more importantly, you start to see
29
86260
2000
但更重要的是,你能开始看到
01:28
patterns and connections between numbers
30
88260
2000
这些数字中的规律和联系了
01:30
that would otherwise be scattered across multiple news reports.
31
90260
3000
如果不通过画图,这些信息都将只能散落在不同的新闻报道中.
01:33
Let me point out some that I really like.
32
93260
3000
让我来讲讲我很喜欢的一个例子.
01:36
This is OPEC's revenue, this green box here --
33
96260
2000
这是石油输出国组织的财政收益,绿色的方格这里
01:38
780 billion a year.
34
98260
2000
是7800亿每年.
01:40
And this little pixel in the corner -- three billion --
35
100260
3000
这个角落里的小像素是三十个亿
01:43
that's their climate change fund.
36
103260
3000
这是他们的气候变化基金.
01:46
Americans, incredibly generous people --
37
106260
2000
美国,这个特别慷慨的国家
01:48
over 300 billion a year, donated to charity every year,
38
108260
3000
每年捐赠多于3000个亿
01:51
compared with the amount of foreign aid
39
111260
2000
相比于其它17个工业化最发达的国家
01:53
given by the top 17 industrialized nations
40
113260
2000
一共对外捐出的
01:55
at 120 billion.
41
115260
2000
1200亿.
01:57
Then of course,
42
117260
2000
当然,
01:59
the Iraq War, predicted to cost just 60 billion
43
119260
2000
在2003年的预算中
02:01
back in 2003.
44
121260
3000
伊拉克战争只要花掉600亿
02:04
And it mushroomed slightly. Afghanistan and Iraq mushroomed now
45
124260
3000
这项实际的花销比预算稍多了一点, 但阿富汗战争的投入
02:07
to 3,000 billion.
46
127260
3000
迅速增加到了3万亿.
02:10
So now it's great
47
130260
2000
现在就很方便了,
02:12
because now we have this texture, and we can add numbers to it as well.
48
132260
2000
因为我们有了这个结构,我们还能往里面加数字.
02:14
So we could say, well, a new figure comes out ... let's see African debt.
49
134260
3000
我们能说一个新的图形出现了.....让我们来看下非洲的债务.
02:17
How much of this diagram do you think might be taken up
50
137260
2000
你们认为在这张图里非洲
02:19
by the debt that Africa owes to the West?
51
139260
2000
欠西方国家的债务能占多大比例呢?
02:21
Let's take a look.
52
141260
2000
我们来看一看.
02:23
So there it is:
53
143260
2000
这就是了.
02:25
227 billion is what Africa owes.
54
145260
2000
非洲的债务一共是二千二百七十亿.
02:27
And the recent financial crisis,
55
147260
2000
最近的这场金融危机--
02:29
how much of this diagram might that figure take up?
56
149260
2000
在这张图里能占多大的比例呢?
02:31
What has that cost the world? Let's take a look at that.
57
151260
3000
这场危机又给世界带来了多大损失呢?让我们来看这里.
02:34
Dooosh -- Which I think is the appropriate sound effect
58
154260
3000
Doosh(拟声词,相似于"哇"),我想应该是配合这么多钱
02:37
for that much money:
59
157260
2000
的恰当的声音效果.
02:39
11,900 billion.
60
159260
4000
十一万九千个亿.
02:45
So, by visualizing this information,
61
165260
2000
所以,通过视觉化这些信息,
02:47
we turned it into a landscape
62
167260
2000
我们把它们转化成了一个直观景象
02:49
that you can explore with your eyes,
63
169260
2000
这样你就能只通过眼睛探索,
02:51
a kind of map really, a sort of information map.
64
171260
2000
真的类似于一个地图,一种信息的地图.
02:53
And when you're lost in information,
65
173260
2000
当你在信息里迷失的时候,
02:55
an information map is kind of useful.
66
175260
3000
这样的地图是很有帮助的.
02:58
So I want to show you another landscape now.
67
178260
2000
我现在想展示另一番景象.
03:00
We need to imagine what a landscape
68
180260
2000
我们来想象下世界的恐惧会是
03:02
of the world's fears might look like.
69
182260
3000
什么样的一番景象.
03:05
Let's take a look.
70
185260
2000
让我们来看一下.
03:07
This is Mountains Out of Molehills,
71
187260
2000
这是摩尔山之外的一座"山",
03:09
a timeline of global media panic.
72
189260
2000
是全球医疗恐惧的时间线.
03:11
(Laughter)
73
191260
2000
(笑声)
03:13
So, I'll label this for you in a second.
74
193260
2000
我等下会把这些标记出来.
03:15
But the height here, I want to point out,
75
195260
2000
但我想指出这里的高度
03:17
is the intensity of certain fears
76
197260
2000
是媒体报道里
03:19
as reported in the media.
77
199260
2000
人们的恐惧程度.
03:21
Let me point them out.
78
201260
2000
让我来指出来.
03:23
So this, swine flu -- pink.
79
203260
4000
这里,猪流感,标记成粉色.
03:27
Bird flu.
80
207260
2000
禽流感.
03:29
SARS -- brownish here. Remember that one?
81
209260
3000
非典--棕色一样的颜色.记住这一条.
03:32
The millennium bug,
82
212260
3000
千禧虫--
03:35
terrible disaster.
83
215260
2000
可怕的灾难.
03:37
These little green peaks
84
217260
2000
这些小的绿色的峰
03:39
are asteroid collisions.
85
219260
2000
是对小行星碰撞的恐惧.
03:41
(Laughter)
86
221260
2000
(笑声)
03:43
And in summer, here, killer wasps.
87
223260
2000
夏天的时候,这点是对杀人蜂的恐惧.
03:45
(Laughter)
88
225260
8000
(笑声)
03:53
So these are what our fears look like
89
233260
2000
所以这就是媒体描述里
03:55
over time in our media.
90
235260
2000
我们的恐惧"长的模样",
03:57
But what I love -- and I'm a journalist --
91
237260
2000
但我最喜欢的是--我是个记者--
03:59
and what I love is finding hidden patterns; I love being a data detective.
92
239260
3000
我最喜欢的是找出隐藏着的规律. 我喜欢做个数据侦探.
04:02
And there's a very interesting and odd pattern hidden in this data
93
242260
3000
而且这些数据里有很有趣又很奇怪的规律
04:05
that you can only see when you visualize it.
94
245260
2000
你只能通过视觉化数据才能看出这些规律.
04:07
Let me highlight it for you.
95
247260
2000
我来强调一下.
04:09
See this line, this is a landscape for violent video games.
96
249260
3000
看这条线.这是暴力视频游戏的"景象".
04:12
As you can see, there's a kind of odd, regular pattern in the data,
97
252260
3000
正如你能看到的,是个有点奇怪的长方形的样子,
04:15
twin peaks every year.
98
255260
2000
每年都有两个差不多大小的峰.
04:17
If we look closer, we see those peaks occur
99
257260
2000
如果我们仔细看看,我们能看出这些峰
04:19
at the same month every year.
100
259260
3000
每年出现的时间是一样的.
04:22
Why?
101
262260
2000
为什么呢?
04:24
Well, November, Christmas video games come out,
102
264260
2000
十一月的时候,是圣诞视频游戏发放的日子,
04:26
and there may well be an upsurge in the concern about their content.
103
266260
3000
就很可能引起关于其内容的恐慌.
04:29
But April isn't a particularly massive month
104
269260
3000
但四月对游戏而言
04:32
for video games.
105
272260
2000
并不是个很特别的月份
04:34
Why April?
106
274260
2000
为什么会是四月呢?
04:36
Well, in April 1999 was the Columbine shooting,
107
276260
3000
其实,1999年的四月发生了哥伦拜恩枪击案,
04:39
and since then, that fear
108
279260
2000
从那以后,
04:41
has been remembered by the media
109
281260
2000
人们总是能通过媒体想起这份恐惧
04:43
and echoes through the group mind gradually through the year.
110
283260
2000
在这一年里,这份恐惧也总是渐渐环绕在这个群体里.
04:45
You have retrospectives, anniversaries,
111
285260
3000
你总会回顾,纪念,
04:48
court cases, even copy-cat shootings,
112
288260
3000
也总有诉讼甚至是只单纯模仿别人的枪击案,
04:51
all pushing that fear into the agenda.
113
291260
3000
所有这些都能引起人们的恐惧,而体现在这日历里.
04:54
And there's another pattern here as well. Can you spot it?
114
294260
2000
而且这里还有另一个规律.你看出来了吗?
04:56
See that gap there? There's a gap,
115
296260
2000
看见这个间断了吗?这里有一个间断.
04:58
and it affects all the other stories.
116
298260
2000
这一点影响了其它所有部分.
05:00
Why is there a gap there?
117
300260
2000
为什么这里会有一个间断呢?
05:02
You see where it starts? September 2001,
118
302260
3000
看到它是从什么时候开始的吗?2001年9月(911发生的时间)
05:05
when we had something very real
119
305260
2000
那个月里发了一件真的
05:07
to be scared about.
120
307260
2000
很令人恐惧的事情.
05:09
So, I've been working as a data journalist for about a year,
121
309260
3000
我做数据统计记者已经有一年了,
05:12
and I keep hearing a phrase
122
312260
2000
我总是能听到这样一个说法
05:14
all the time, which is this:
123
314260
3000
说:
05:17
"Data is the new oil."
124
317260
2000
"数据是新的石油."
05:19
Data is the kind of ubiquitous resource
125
319260
3000
而且数据是一种无法不在的资源
05:22
that we can shape to provide new innovations and new insights,
126
322260
3000
我们可以用它来开创新的革命和用新眼光来洞察世界,
05:25
and it's all around us, and it can be mined very easily.
127
325260
3000
而且它就在我们身边,很容易被挖掘.
05:28
It's not a particularly great metaphor in these times,
128
328260
3000
而且在这个时代这也不是什么很特别的比喻
05:31
especially if you live around the Gulf of Mexico,
129
331260
3000
尤其是如果你生活在墨西哥湾,
05:34
but I would, perhaps, adapt this metaphor slightly,
130
334260
2000
但我想,或者可以对这个比喻稍加改动,
05:36
and I would say that data is the new soil.
131
336260
3000
我要说数据是新土壤.
05:40
Because for me, it feels like a fertile, creative medium.
132
340260
3000
因为对我而言,它更像个可以施肥,能创造出东西来的媒体.
05:43
Over the years, online,
133
343260
2000
经过这么多年,在网上
05:45
we've laid down
134
345260
3000
我们已经积攒下
05:48
a huge amount of information and data,
135
348260
2000
巨大数量的信息和数据,
05:50
and we irrigate it with networks and connectivity,
136
350260
2000
而且我们用网络和连接来浇灌它们,
05:52
and it's been worked and tilled by unpaid workers and governments.
137
352260
3000
不收取工资的人们和政府一直在努力耕作这片"土壤".
05:55
And, all right, I'm kind of milking the metaphor a little bit.
138
355260
3000
好吧,我有点夸大这个比喻了.
05:58
But it's a really fertile medium,
139
358260
3000
但它真的是一个可以繁殖的媒体,
06:01
and it feels like visualizations, infographics, data visualizations,
140
361260
3000
它想让被视觉化,把咨询图像化,数据可视化,
06:04
they feel like flowers blooming from this medium.
141
364260
3000
它们就像是从土壤里长出的花朵.
06:07
But if you look at it directly,
142
367260
2000
但如果你直接看这些数据,
06:09
it's just a lot of numbers and disconnected facts.
143
369260
2000
它们就只是一堆数字和没有关联的事件.
06:11
But if you start working with it and playing with it in a certain way,
144
371260
3000
但如果你开始分析它们,并用一种方式处理它们,
06:14
interesting things can appear and different patterns can be revealed.
145
374260
3000
有趣的事就会发生,不同的规律也会显现出来.
06:17
Let me show you this.
146
377260
2000
让我来展示下这个.
06:19
Can you guess what this data set is?
147
379260
3000
你能猜出这是关于什么的数据吗?
06:22
What rises twice a year,
148
382260
2000
有什么东西每年要升两次?
06:24
once in Easter
149
384260
2000
一次在复活节
06:26
and then two weeks before Christmas,
150
386260
2000
另一次在圣诞前的两周,
06:28
has a mini peak every Monday,
151
388260
2000
每个周一都有一个小峰
06:30
and then flattens out over the summer?
152
390260
2000
并在夏天的时候一直保持平坦.
06:32
I'll take answers.
153
392260
2000
我来听听你们的想法.
06:34
(Audience: Chocolate.) David McCandless: Chocolate.
154
394260
2000
(观众:巧克力).David:巧克力.
06:36
You might want to get some chocolate in.
155
396260
3000
你或许会想加一点巧克力进去.
06:39
Any other guesses?
156
399260
2000
还有别的猜想呢?
06:41
(Audience: Shopping.) DM: Shopping.
157
401260
2000
(观众:购物).David:购物.
06:43
Yeah, retail therapy might help.
158
403260
3000
嗯,购物疗法也许会有帮助.
06:46
(Audience: Sick leave.)
159
406260
2000
(观众:病假)
06:48
DM: Sick leave. Yeah, you'll definitely want to take some time off.
160
408260
2000
David:病假.嗯,你一定想休息几天.
06:50
Shall we see?
161
410260
2000
我们来看答案吧?
06:53
(Laughter)
162
413260
8000
(笑声)
07:01
(Applause)
163
421260
3000
(掌声)
07:04
So, the information guru Lee Byron and myself,
164
424260
3000
这里的信息是,Lee Byron和我自己
07:07
we scraped 10,000 status Facebook updates
165
427260
3000
搜集了一万个Facebook的状态
07:10
for the phrase "break-up" and "broken-up"
166
430260
2000
寻找包括"分手"和"分手了"的字样的更新
07:12
and this is the pattern we found --
167
432260
2000
这就是我们收集的结果--
07:14
people clearing out for Spring Break,
168
434260
2000
人们在春假期间"清理"关系.
07:16
(Laughter)
169
436260
5000
(笑声)
07:21
coming out of very bad weekends on a Monday,
170
441260
2000
接着的周一从过得很糟的周末中走出来
07:23
being single over the summer,
171
443260
3000
一个暑假里都保持着单身.
07:26
and then the lowest day of the year, of course: Christmas Day.
172
446260
3000
当然一年中分手最少的一天是:圣诞.
07:29
Who would do that?
173
449260
3000
谁会这么做呢?
07:32
So there's a titanic amount of data out there now,
174
452260
2000
现在外面有极多的数据,
07:34
unprecedented.
175
454260
2000
史无前例的多.
07:37
But if you ask the right kind of question,
176
457260
2000
但如果你问正确的问题,
07:39
or you work it in the right kind of way,
177
459260
2000
或者你用正确的方式处理它,
07:41
interesting things can emerge.
178
461260
3000
有趣的事就会发生.
07:44
So information is beautiful. Data is beautiful.
179
464260
3000
所以信息是漂亮的,数据也是漂亮的.
07:47
I wonder if I could make my life beautiful.
180
467260
3000
我想我能不能把我的生活也弄得很漂亮.
07:50
And here's my visual C.V.
181
470260
2000
这是我的视觉化的简历.
07:52
I'm not quite sure I've succeeded.
182
472260
2000
我还不太确定这份简历是不是成功.
07:54
Pretty blocky, the colors aren't that great.
183
474260
2000
有很多的小块,颜色也不是特别的好.
07:56
But I wanted to convey something to you.
184
476260
3000
但我想传递些东西给你们.
07:59
I started as a programmer,
185
479260
2000
我从一个程序员开始做起,
08:01
and then I worked as a writer for many years, about 20 years,
186
481260
2000
之后我做了很多年作家,大约有二十年,
08:03
in print, online and then in advertising,
187
483260
2000
在杂志上,网上之后还有广告中,
08:05
and only recently have I started designing.
188
485260
3000
只在最近这一段我才开始进行设计.
08:08
And I've never been to design school.
189
488260
2000
而且我从来没去过设计学校,
08:10
I've never studied art or anything.
190
490260
3000
我从没学校艺术或这类的东西.
08:13
I just kind of learned through doing.
191
493260
2000
我就仅仅是从做的过程中来学.
08:15
And when I started designing,
192
495260
2000
当我开始设计的时候,
08:17
I discovered an odd thing about myself.
193
497260
2000
我发现了关于我自己的一件奇怪的事.
08:19
I already knew how to design,
194
499260
2000
我已经知道怎么去设计了,
08:21
but it wasn't like I was amazingly brilliant at it,
195
501260
3000
不是说我奇迹般的对设计特别在行,
08:24
but more like I was sensitive
196
504260
2000
而更像是我对关于网格,空间
08:26
to the ideas of grids and space
197
506260
2000
以及排列和版面设计之类的
08:28
and alignment and typography.
198
508260
2000
特别的敏感.
08:30
It's almost like being exposed
199
510260
2000
就好像是暴露在媒体
08:32
to all this media over the years
200
512260
2000
这么多年之后,
08:34
had instilled a kind of dormant design literacy in me.
201
514260
3000
我被潜移默化地注入了设计的能力.
08:37
And I don't feel like I'm unique.
202
517260
2000
而且我不觉得自己是唯一一个.
08:39
I feel that everyday, all of us now
203
519260
2000
我感觉每一天,我们所有人
08:41
are being blasted by information design.
204
521260
3000
都对信息设计特别的不满.
08:44
It's being poured into our eyes through the Web,
205
524260
2000
它们从网上一股脑灌入我们眼里
08:46
and we're all visualizers now;
206
526260
2000
而且我们都是视觉型的人,
08:48
we're all demanding a visual aspect
207
528260
2000
我们都想从视觉的角度
08:50
to our information.
208
530260
3000
来看我们的信息.
08:53
There's something almost quite magical about visual information.
209
533260
3000
这就是关于视觉化信息的神奇的地方.
08:56
It's effortless, it literally pours in.
210
536260
3000
不需要努力的,它就自己进来了.
08:59
And if you're navigating a dense information jungle,
211
539260
3000
如果你在密集的信息丛林里搜寻,
09:02
coming across a beautiful graphic
212
542260
2000
突然看到一个漂亮的图画,
09:04
or a lovely data visualization,
213
544260
2000
或者是一个可爱的视觉化的数据,
09:06
it's a relief, it's like coming across a clearing in the jungle.
214
546260
3000
那将会是怎样的释然啊! 就好像是在森林里碰到了开阔的地方.
09:09
I was curious about this, so it led me
215
549260
2000
我对这很好奇,所以它引导我
09:11
to the work of a Danish physicist
216
551260
2000
关注一个丹麦的物理学家
09:13
called Tor Norretranders,
217
553260
2000
名叫Tor Norretranders,
09:15
and he converted the bandwidth of the senses into computer terms.
218
555260
3000
他把感观的带宽转化成了电脑术语.
09:19
So here we go. This is your senses,
219
559260
2000
过程是这样的.这是你的感观,
09:21
pouring into your senses every second.
220
561260
2000
周围世界每一秒都往你的感观里注入东西.
09:23
Your sense of sight is the fastest.
221
563260
3000
你的视觉是最快的.
09:26
It has the same bandwidth as a computer network.
222
566260
3000
它有和电脑网络一样的带宽.
09:29
Then you have touch, which is about the speed of a USB key.
223
569260
3000
你还有触觉,它有和USB接口一样的速度.
09:32
And then you have hearing and smell,
224
572260
2000
你的听觉和嗅觉,
09:34
which has the throughput of a hard disk.
225
574260
2000
有和硬盘一样的处理速度.
09:36
And then you have poor old taste,
226
576260
2000
你还有可怜的,古老的味觉,
09:38
which is like barely the throughput of a pocket calculator.
227
578260
3000
它仅仅只有口袋计算器那样的工作效率.
09:41
And that little square in the corner, a naught .7 percent,
228
581260
3000
那角落里小小的正方形,占百分之0.7的比例,
09:44
that's the amount we're actually aware of.
229
584260
3000
这就是我们真正能感知的部分.
09:47
So a lot of your vision --
230
587260
2000
所以你很多的想像--
09:49
the bulk of it is visual, and it's pouring in.
231
589260
2000
大部分是视觉的,而且它是自己大批进入的.
09:51
It's unconscious.
232
591260
2000
潜意识里进行的.
09:53
The eye is exquisitely sensitive
233
593260
3000
眼睛对颜色变化中的规律,
09:56
to patterns in variations in color, shape and pattern.
234
596260
3000
形状和模式是特别敏感的.
09:59
It loves them, and it calls them beautiful.
235
599260
2000
如果眼睛喜欢它们,它就把它们叫作漂亮.
10:01
It's the language of the eye.
236
601260
2000
这就是眼睛的语言.
10:03
If you combine the language of the eye with the language of the mind,
237
603260
2000
而且你把眼睛的语言和头脑的语言联系在一起,
10:05
which is about words and numbers and concepts,
238
605260
3000
关于词语,数字和概念这些,
10:08
you start speaking two languages simultaneously,
239
608260
3000
你将能同时讲两种语言,
10:11
each enhancing the other.
240
611260
3000
每一种都能加强另一种.
10:14
So, you have the eye, and then you drop in the concepts.
241
614260
3000
所以你有眼睛,然后你想到了概念,
10:17
And that whole thing -- it's two languages
242
617260
2000
这个过程,其实是两种语言,
10:19
both working at the same time.
243
619260
2000
它们同时在一起工作.
10:21
So we can use this new kind of language, if you like,
244
621260
2000
所以我们能用这种新的语言,如果你喜欢的话,
10:23
to alter our perspective or change our views.
245
623260
3000
来改变我们的视角或者我们的观点.
10:26
Let me ask you a simple question
246
626260
2000
让我来问一个简单的问题
10:28
with a really simple answer:
247
628260
2000
它有个特别简单的答案.
10:30
Who has the biggest military budget?
248
630260
2000
谁拥有最多的军队预算?
10:32
It's got to be America, right?
249
632260
2000
这一定是美国,对吧?
10:34
Massive. 609 billion in 2008 --
250
634260
2000
非常多的.在2008年有六千零九十亿
10:36
607, rather.
251
636260
2000
六千零七十亿实际上是.
10:38
So massive, in fact, that it can contain
252
638260
2000
这么多,事实上,它包含了
10:40
all the other military budgets in the world inside itself.
253
640260
3000
所有其它国家的军队预算在里面.
10:43
Gobble, gobble, gobble, gobble, gobble.
254
643260
2000
格格,格格,格格,格格,格格(火鸡的叫声)
10:45
Now, you can see Africa's total debt there
255
645260
2000
现在你能拿整个非洲的债务
10:47
and the U.K. budget deficit for reference.
256
647260
2000
和英国的预算赤字来比较一下.
10:49
So that might well chime
257
649260
2000
这很可能
10:51
with your view that America
258
651260
2000
和你印象中的美国形象一致,
10:53
is a sort of warmongering military machine,
259
653260
3000
那个好战的,有军队机器,
10:56
out to overpower the world
260
656260
2000
强大到要用它的巨大的工业军事合成体
10:58
with its huge industrial-military complex.
261
658260
3000
来压倒世界的国家.
11:01
But is it true that America has the biggest military budget?
262
661260
3000
但美国真的有最多的军事预算吗?
11:04
Because America is an incredibly rich country.
263
664260
2000
因为它是个特别富有的国家.
11:06
In fact, it's so massively rich
264
666260
2000
事实上,它是这么有富有
11:08
that it can contain the four other
265
668260
2000
它能把四个在它之后的
11:10
top industrialized nations' economies
266
670260
2000
工业国家的经济收入
11:12
inside itself, it's so vastly rich.
267
672260
3000
全包含在它之内.如此的富有.
11:15
So its military budget is bound to be enormous.
268
675260
3000
它的军队预算是和经济联系在一起的
11:18
So, to be fair and to alter our perspective,
269
678260
2000
所以,为了公平为了改变我们的视角,
11:20
we have to bring in another data set,
270
680260
2000
我们得引入另一组数据,
11:22
and that data set is GDP, or the country's earnings.
271
682260
2000
这就是国民生产总值,或者是一个国家的收入.
11:24
Who has the biggest budget as a proportion of GDP?
272
684260
2000
哪个国家的军队预算是占国民生产总值最多的呢?
11:26
Let's have a look.
273
686260
2000
让我们来看一看.
11:28
That changes the picture considerably.
274
688260
3000
这把格局改变了很多.
11:31
Other countries pop into view that you, perhaps, weren't considering,
275
691260
3000
其它你很可能没想到的国家一下子出现了,
11:34
and American drops into eighth.
276
694260
2000
美国跌入了第八位.
11:36
Now you can also do this with soldiers.
277
696260
2000
你也同样可以对士兵进行这样的运算.
11:38
Who has the most soldiers? It's got to be China.
278
698260
2000
哪个国家有最多的士兵?一定是中国.
11:40
Of course, 2.1 million.
279
700260
2000
当然,二百一十万.
11:42
Again, chiming with your view
280
702260
2000
同样的,和你对中国,
11:44
that China has a militarized regime
281
704260
2000
那个军事化的政权,准备着
11:46
ready to, you know, mobilize its enormous forces.
282
706260
2000
要发挥其强大的武装力量的印象一致.
11:48
But of course, China has an enormous population.
283
708260
3000
但不可否认,中国有巨大的人口,
11:51
So if we do the same,
284
711260
2000
如果我们做同样的处理,
11:53
we see a radically different picture.
285
713260
2000
我们看到一个彻底不同的景象.
11:55
China drops to 124th.
286
715260
2000
中国落到了124位.
11:57
It actually has a tiny army
287
717260
2000
它其实只有很小的一支部队,
11:59
when you take other data into consideration.
288
719260
3000
如果你把人口考虑进去的话.
12:02
So, absolute figures, like the military budget,
289
722260
2000
所以,绝对的数字,就像是军队预算,
12:04
in a connected world,
290
724260
2000
在一个紧密联系的世界里
12:06
don't give you the whole picture.
291
726260
2000
好像不能给你一个完整的格局.
12:08
They're not as true as they could be.
292
728260
2000
它们并不是它们能达到的最真实的状态.
12:10
We need relative figures that are connected to other data
293
730260
3000
我们需要相对的数据,这些数据和其它数据相联系
12:13
so that we can see a fuller picture,
294
733260
2000
我们才能看到一个更全面的世界.
12:15
and then that can lead to us changing our perspective.
295
735260
2000
而这一点能改变我们的视角.
12:17
As Hans Rosling, the master,
296
737260
2000
就像Hans Rosling, 校长
12:19
my master, said,
297
739260
3000
我的校长说的那样,
12:22
"Let the dataset change your mindset."
298
742260
3000
"让数据模式改变你的心理模式."
12:26
And if it can do that, maybe it can also change your behavior.
299
746260
3000
如果它能做到这一点,也许它能同时改变你的行为.
12:29
Take a look at this one.
300
749260
2000
来看一下这个.
12:31
I'm a bit of a health nut.
301
751260
2000
我有一点健康迷.
12:33
I love taking supplements and being fit,
302
753260
3000
我喜欢吃一些保养品来保持健康
12:36
but I can never understand what's going on in terms of evidence.
303
756260
3000
但我从来不能用证据证明这些东西是不是真的有用.
12:39
There's always conflicting evidence.
304
759260
2000
总是会有相当矛盾的证据.
12:41
Should I take vitamin C? Should I be taking wheatgrass?
305
761260
2000
我是应该吃维他命C?还是吃小麦草素?
12:43
This is a visualization of all the evidence
306
763260
2000
这里把所有这些关于
12:45
for nutritional supplements.
307
765260
2000
营养补品的证据都视觉化了.
12:47
This kind of diagram is called a balloon race.
308
767260
3000
这样的图表叫作热气球竞赛.
12:50
So the higher up the image,
309
770260
2000
图上越高的地方,
12:52
the more evidence there is for each supplement.
310
772260
3000
有越多的关于这些补品的证据.
12:55
And the bubbles correspond to popularity as regards to Google hits.
311
775260
3000
这些气泡的大小对应于GOOGLE上这样补品的受欢迎程度.
12:58
So you can immediately apprehend
312
778260
3000
这样你能马上就领会效力
13:01
the relationship between efficacy and popularity,
313
781260
3000
和受欢迎程度间的关系.
13:04
but you can also, if you grade the evidence,
314
784260
3000
而且同时,如果你把证据分级别的话,
13:07
do a "worth it" line.
315
787260
2000
画出一条"值得"的线.
13:09
So supplements above this line are worth investigating,
316
789260
3000
所以在这条线以上的补品是那些值得投资的,
13:12
but only for the conditions listed below,
317
792260
3000
但只是针对下面列出的这些症状.
13:15
and then the supplements below the line
318
795260
3000
对于这条线以下的补品,
13:18
are perhaps not worth investigating.
319
798260
2000
就很可能不太值得投资了.
13:20
Now this image constitutes a huge amount of work.
320
800260
3000
这幅图凝结了我们大量的工作.
13:23
We scraped like 1,000 studies from PubMed,
321
803260
3000
我们挖掘了Pubmed上一千个左右的研究,
13:26
the biomedical database,
322
806260
2000
Pubmed是一个生物医学的数据库,
13:28
and we compiled them and graded them all.
323
808260
3000
我们收集了这些研究并把他们分了级别.
13:31
And it was incredibly frustrating for me
324
811260
2000
这让我相当沮丧
13:33
because I had a book of 250 visualizations to do for my book,
325
813260
3000
因为我的书需要250个可视化图像
13:36
and I spent a month doing this,
326
816260
2000
我用了一个月做这幅图,
13:38
and I only filled two pages.
327
818260
2000
却只能填充两页纸.
13:40
But what it points to
328
820260
2000
但重点是
13:42
is that visualizing information like this
329
822260
2000
像这样把信息视觉化的过程
13:44
is a form of knowledge compression.
330
824260
2000
是把知识压缩的过程.
13:46
It's a way of squeezing an enormous amount
331
826260
2000
这是一种从巨大数量的信息
13:48
of information and understanding
332
828260
2000
中压榨出精华
13:50
into a small space.
333
830260
2000
并理解的过程.
13:52
And once you've curated that data, and once you've cleaned that data,
334
832260
2000
一旦你整理了这些数据,或者说清理了这些数据,
13:54
and once it's there,
335
834260
2000
一旦它在那了
13:56
you can do cool stuff like this.
336
836260
2000
你就能做像这样很酷的事了.
13:58
So I converted this into an interactive app,
337
838260
2000
所以我用这个原理开发出了一个互动的应用程序,
14:00
so I can now generate this application online --
338
840260
2000
我现在能在网上生成这个程序--
14:02
this is the visualization online --
339
842260
2000
并在网上进行视觉化--
14:04
and I can say, "Yeah, brilliant."
340
844260
2000
我能称赞它"真好,太棒了"
14:06
So it spawns itself.
341
846260
2000
它自动开始工作.
14:08
And then I can say, "Well, just show me the stuff
342
848260
2000
然后我就能说"嗯,给我看下那些
14:10
that affects heart health."
343
850260
2000
影响心脏健康的东西"
14:12
So let's filter that out.
344
852260
2000
它就开始筛选
14:14
So heart is filtered out, so I can see if I'm curious about that.
345
854260
3000
心脏被选出去了,我对这很好奇.
14:17
I think, "No, no. I don't want to take any synthetics,
346
857260
2000
我想"不,不,我不想吃任何合成的药品
14:19
I just want to see plants and --
347
859260
3000
我只想用植物--
14:22
just show me herbs and plants. I've got all the natural ingredients."
348
862260
3000
就只显示那些草本和植物.我有所有自然的营养成分."
14:25
Now this app is spawning itself
349
865260
2000
现在这个程序又开始从数据
14:27
from the data.
350
867260
2000
里自动生成.
14:29
The data is all stored in a Google Doc,
351
869260
2000
所有数据都保存在Google doc里面,
14:31
and it's literally generating itself from that data.
352
871260
3000
它实际上是在自己利用数据完成视觉化.
14:34
So the data is now alive; this is a living image,
353
874260
2000
现在数据变成活的了,它现在是一个动态的图像了,
14:36
and I can update it in a second.
354
876260
2000
而且我能一下就完成更新.
14:38
New evidence comes out. I just change a row on a spreadsheet.
355
878260
2000
如果新的证据出现了--我只需要在表格里改一行.
14:40
Doosh! Again, the image recreates itself.
356
880260
4000
哇!又一次的,图像自己重新生成了自己.
14:44
So it's cool.
357
884260
2000
这很酷.
14:46
It's kind of living.
358
886260
3000
有点有生命的感觉.
14:49
But it can go beyond data,
359
889260
2000
但它其实能比数据走得更远,
14:51
and it can go beyond numbers.
360
891260
2000
而且也不仅仅限于数字.
14:53
I like to apply information visualization
361
893260
2000
我喜欢用信息视觉化
14:55
to ideas and concepts.
362
895260
3000
的方法来处理思想和概念.
14:58
This is a visualization
363
898260
2000
这是一个对政治观点
15:00
of the political spectrum,
364
900260
2000
视觉化的图像,
15:02
an attempt for me to try
365
902260
2000
是我在试图理解
15:04
and understand how it works
366
904260
2000
政治是如果工作的.
15:06
and how the ideas percolate down
367
906260
2000
理解想法是怎么向下渗透的
15:08
from government into society and culture,
368
908260
2000
从政府开始到社会和文化,
15:10
into families, into individuals, into their beliefs
369
910260
3000
再传递到家庭,各人以及他们的信念中
15:13
and back around again in a cycle.
370
913260
3000
而且又如何再反过来影响上层,形成一个循环的.
15:16
What I love about this image
371
916260
2000
关于这幅图我喜欢的是
15:18
is it's made up of concepts,
372
918260
2000
它是由概念组成的,
15:20
it explores our worldviews
373
920260
2000
它开拓了我们的世界观
15:22
and it helps us -- it helps me anyway --
374
922260
2000
并且帮助我们--至少是帮助我--
15:24
to see what others think,
375
924260
2000
看清别人是怎么想的,
15:26
to see where they're coming from.
376
926260
2000
了解他们的想法是从哪里来的.
15:28
And it feels just incredibly cool to do that.
377
928260
3000
做这件事感觉真是太棒了.
15:31
What was most exciting for me
378
931260
3000
最让我兴奋的是
15:34
designing this
379
934260
2000
设计
15:36
was that, when I was designing this image,
380
936260
2000
当我在设计这些图的时候,
15:38
I desperately wanted this side, the left side,
381
938260
3000
我极其希望这一边,左边
15:41
to be better than the right side --
382
941260
2000
能比右边好--
15:43
being a journalist, a Left-leaning person --
383
943260
3000
作为一个记者,我是个有点左倾的人--
15:46
but I couldn't, because I would have created
384
946260
2000
但我不能,因为如果这样我全造出一个
15:48
a lopsided, biased diagram.
385
948260
3000
不平衡的,带着偏见的图.
15:51
So, in order to really create a full image,
386
951260
3000
所以,为了真正的创造一个完整的图,
15:54
I had to honor the perspectives on the right-hand side
387
954260
3000
我得尊重右手边的这些观点
15:57
and at the same time, uncomfortably recognize
388
957260
3000
同时,有点不太情愿地认识到
16:00
how many of those qualities were actually in me,
389
960260
3000
我其实也有这其中很多的品质,
16:03
which was very, very annoying and uncomfortable.
390
963260
2000
对我而言,这是特别特别地讨厌和不舒服的.
16:05
(Laughter)
391
965260
4000
(笑声)
16:09
But not too uncomfortable,
392
969260
2000
但也不是特别不舒服,
16:11
because there's something unthreatening
393
971260
3000
因为看到一些政治观点,
16:14
about seeing a political perspective,
394
974260
2000
总比被告知或者被强迫听着这些观点,
16:16
versus being told or forced to listen to one.
395
976260
3000
不让人那么有压迫和威胁感.
16:19
You're capable of holding conflicting viewpoints
396
979260
3000
事实上,在你看到它们的时候
16:22
joyously when you can see them.
397
982260
2000
你能快乐的承受矛盾的观点.
16:24
It's even fun to engage with them
398
984260
2000
甚至和它们打交道是有趣的
16:26
because it's visual.
399
986260
2000
因为这是视觉的.
16:28
So that's what's exciting to me,
400
988260
2000
这就是它让我着迷的地方,
16:30
seeing how data can change my perspective
401
990260
2000
看数据是如何让我改变观点
16:32
and change my mind midstream --
402
992260
2000
并且改变我思维的主流--
16:34
beautiful, lovely data.
403
994260
3000
漂亮的,可爱的数据.
16:38
So, just to wrap up,
404
998260
2000
简单总结一下,
16:40
I wanted to say
405
1000260
2000
我想说
16:42
that it feels to me that design is about solving problems
406
1002260
2000
我感觉设计其实是在解决问题
16:44
and providing elegant solutions,
407
1004260
3000
并且给出一个精美的解决办法.
16:47
and information design is about
408
1007260
2000
信息设计是在
16:49
solving information problems.
409
1009260
2000
解决信息上的问题.
16:51
It feels like we have a lot of information problems
410
1011260
2000
现在我们社会里,好像有很多
16:53
in our society at the moment,
411
1013260
2000
信息方面的问题,
16:55
from the overload and the saturation
412
1015260
2000
来自于过多的饱和的数据,
16:57
to the breakdown of trust and reliability
413
1017260
2000
到信任和可靠性的缺失
16:59
and runaway skepticism and lack of transparency,
414
1019260
2000
以及逃避的怀疑和缺乏透明度,
17:01
or even just interestingness.
415
1021260
2000
甚至是失去兴趣.
17:03
I mean, I find information just too interesting.
416
1023260
2000
我想说,我觉得信息真是特别有趣.
17:05
It has a magnetic quality that draws me in.
417
1025260
3000
它有一种磁力吸引着我.
17:09
So, visualizing information
418
1029260
2000
所以,视觉化信息
17:11
can give us a very quick solution to those kinds of problems.
419
1031260
3000
能给我们一种快速的方法来解决这些问题.
17:14
Even when the information is terrible,
420
1034260
2000
即使是在信息很糟的情况下,
17:16
the visual can be quite beautiful.
421
1036260
3000
视觉上仍会很漂亮.
17:19
Often we can get clarity
422
1039260
3000
经常地我们能很快的
17:22
or the answer to a simple question very quickly,
423
1042260
2000
理清条理,或者找到一个简单问题的答案.
17:24
like this one,
424
1044260
2000
就像这个,
17:26
the recent Icelandic volcano.
425
1046260
3000
关于冰岛的火山的最近的讨论.
17:29
Which was emitting the most CO2?
426
1049260
2000
哪一个喷发出了最多的二氧化碳?
17:31
Was it the planes or the volcano,
427
1051260
2000
是那些飞机还是火山,
17:33
the grounded planes or the volcano?
428
1053260
2000
是那些严禁起飞的飞机还是火山?
17:35
So we can have a look.
429
1055260
2000
让我们来看一下.
17:37
We look at the data and we see:
430
1057260
2000
看着这些数据
17:39
Yep, the volcano emitted 150,000 tons;
431
1059260
2000
我们发现,火山喷发出了十五万吨;
17:41
the grounded planes would have emitted
432
1061260
2000
那些禁止起飞的飞机如果飞着的话
17:43
345,000 if they were in the sky.
433
1063260
3000
将喷发出三十四万五千吨.
17:46
So essentially, we had our first carbon-neutral volcano.
434
1066260
3000
所以实际上,我们有了第一个碳中性的火山.
17:49
(Laughter)
435
1069260
2000
(笑声)
17:51
(Applause)
436
1071260
9000
(掌声)
18:00
And that is beautiful. Thank you.
437
1080260
3000
而且这很漂亮!谢谢
18:03
(Applause)
438
1083260
8000
(掌声)
关于本网站

这个网站将向你介绍对学习英语有用的YouTube视频。你将看到来自世界各地的一流教师教授的英语课程。双击每个视频页面上显示的英文字幕,即可从那里播放视频。字幕会随着视频的播放而同步滚动。如果你有任何意见或要求,请使用此联系表与我们联系。

https://forms.gle/WvT1wiN1qDtmnspy7