How we can find ourselves in data | Giorgia Lupi

112,335 views ・ 2017-05-04

TED


请双击下面的英文字幕来播放视频。

翻译人员: Yan Gao 校对人员: Ellen Tung
00:12
This is what my last week looked like.
0
12580
2880
上周我的生活是这样的。
00:16
What I did,
1
16500
1696
我做了什么,
00:18
who I was with,
2
18220
1816
我跟谁在一起,
00:20
the main sensations I had for every waking hour ...
3
20060
3120
清醒时的每个小时我的 主要感受,等等。
00:23
If the feeling came as I thought of my dad
4
23900
2496
我是否在想起刚去世的爸爸时
00:26
who recently passed away,
5
26420
1776
产生了这感觉,
00:28
or if I could have just definitely avoided the worries and anxieties.
6
28220
4056
或者我能否绝对避免担忧和焦虑。
00:32
And if you think I'm a little obsessive,
7
32300
1936
如果你认为我有点着魔,
00:34
you're probably right.
8
34260
1856
你应该是对的。
00:36
But clearly, from this visualization,
9
36140
2256
但是很明显,从这个画面中,
00:38
you can learn much more about me than from this other one,
10
38420
3296
你对我的了解会比 从另外的途径了解得多很多,
00:41
which are images you're probably more familiar with
11
41740
2976
你可能对这些图片更熟悉,
00:44
and which you possibly even have on your phone right now.
12
44740
2816
甚至你的手机里现在就有。
00:47
Bar charts for the steps you walked,
13
47580
2216
你走路步数的条形图,
00:49
pie charts for the quality of your sleep --
14
49820
2376
你睡眠质量的饼图,
00:52
the path of your morning runs.
15
52220
1720
你晨跑的路线。
00:55
In my day job, I work with data.
16
55100
2296
我的工作是与数据打交道。
00:57
I run a data visualization design company,
17
57420
2496
我运营一家数据可视化设计公司,
00:59
and we design and develop ways to make information accessible
18
59940
3336
我们设计和开发一些方法, 意图通过视觉表现
01:03
through visual representations.
19
63300
2096
使信息容易理解。
01:05
What my job has taught me over the years
20
65420
3016
多年来我的工作教给我的是,
01:08
is that to really understand data and their true potential,
21
68460
4216
要真正了解数据及其真实的潜力,
01:12
sometimes we actually have to forget about them
22
72700
3096
实际上有时我们必须忘掉它们,
01:15
and see through them instead.
23
75820
1760
反而才能识破它们。
01:18
Because data are always just a tool we use to represent reality.
24
78260
3576
因为数据永远只是我们 用来表达现实的工具。
01:21
They're always used as a placeholder for something else,
25
81860
2856
数据总被用作其他东西的占位符,
01:24
but they are never the real thing.
26
84740
2176
但它们永远不是真实的事物。
01:26
But let me step back for a moment
27
86940
1936
但是让我先回溯一下,
01:28
to when I first understood this personally.
28
88900
2480
回到我个人第一次 明白这道理的时候。
01:32
In 1994, I was 13 years old.
29
92100
3336
1994年,我13岁,
01:35
I was a teenager in Italy.
30
95460
1936
生活在意大利。
01:37
I was too young to be interested in politics,
31
97420
2616
我太年轻了,对政治不感兴趣,
01:40
but I knew that a businessman, Silvio Berlusconi,
32
100060
2496
但是我知道有个商人, 西尔维奥·贝卢斯科尼,
01:42
was running for president for the moderate right.
33
102580
2560
正在代表右翼温和派竞选总统。
01:45
We lived in a very liberal town,
34
105940
2016
我们的镇上非常偏向自由党,
01:47
and my father was a politician for the Democratic Party.
35
107980
3296
而且我父亲是民主党的政客。
01:51
And I remember that no one thought that Berlusconi could get elected --
36
111300
4336
我记得没有人认为 贝卢斯科尼可以当选——
01:55
that was totally not an option.
37
115660
1760
他完全不可能当选。
01:58
But it happened.
38
118380
1216
然而事实相反,他当选了。
01:59
And I remember the feeling very vividly.
39
119620
2536
我非常清晰地记得那种感觉。
02:02
It was a complete surprise,
40
122180
1856
那是个巨大的意外,
02:04
as my dad promised that in my town he knew nobody who voted for him.
41
124060
5040
因为我爸爸信誓旦旦地说, 他知道我们镇上没人投票给他。
02:10
This was the first time
42
130540
1496
这是第一次
02:12
when the data I had gave me a completely distorted image of reality.
43
132060
4560
我手里的数据反映出的现实 是完全错误的。
02:17
My data sample was actually pretty limited and skewed,
44
137100
3296
我的数据样本实际上 很有限且有偏向性,
02:20
so probably it was because of that, I thought, I lived in a bubble,
45
140420
3736
可能是由于我以为 自己生活在一个气泡里,
02:24
and I didn't have enough chances to see outside of it.
46
144180
2600
没有足够的机会看到外面的世界。
02:27
Now, fast-forward to November 8, 2016
47
147900
3776
现在,快进到2016年11月8日,
02:31
in the United States.
48
151700
1200
在美国。
02:33
The internet polls,
49
153780
1296
互联网民意调查、
02:35
statistical models,
50
155100
1376
统计模型、
02:36
all the pundits agreeing on a possible outcome for the presidential election.
51
156500
4816
所有专家对总统选举的 预测结果意见一致。
02:41
It looked like we had enough information this time,
52
161340
2616
好像这次我们的信息很充足,
02:43
and many more chances to see outside the closed circle we lived in --
53
163980
4096
而且有更多机会看到 我们所在的封闭圈以外的世界,
02:48
but we clearly didn't.
54
168100
1320
但是很显然,事实并非如此。
02:49
The feeling felt very familiar.
55
169860
2096
那感觉太熟悉了。
02:51
I had been there before.
56
171980
1480
我亲身经历过。
02:54
I think it's fair to say the data failed us this time --
57
174180
2856
我认为可以说这次是 数据让我们失望了,
02:57
and pretty spectacularly.
58
177060
1856
而且非常严重。
02:58
We believed in data,
59
178940
1696
我们相信了数据,
03:00
but what happened,
60
180660
1416
但真正发生的是,
03:02
even with the most respected newspaper,
61
182100
2696
即便是最受尊敬的报纸也只是
03:04
is that the obsession to reduce everything to two simple percentage numbers
62
184820
4696
痴迷于将所有事情缩减成 两个简单的百分比数字,
03:09
to make a powerful headline
63
189540
1976
用来制作震撼的标题,
03:11
made us focus on these two digits
64
191540
2056
让我们聚焦在这两个数字上,
03:13
and them alone.
65
193620
1200
并且只看到这两个数字。
03:15
In an effort to simplify the message
66
195380
2056
为了简化信息,
03:17
and draw a beautiful, inevitable red and blue map,
67
197460
3416
画出漂亮的、无法抵御的红蓝地图,
03:20
we lost the point completely.
68
200900
1880
我们完全失去了重点。
03:23
We somehow forgot that there were stories --
69
203260
2136
我们莫名其妙地忘记了还有故事——
03:25
stories of human beings behind these numbers.
70
205420
2360
这些数字背后的人类的故事。
03:29
In a different context,
71
209060
1576
还有一件背景不同
03:30
but to a very similar point,
72
210660
1656
但情况很相似的事件,
03:32
a peculiar challenge was presented to my team by this woman.
73
212340
3896
这位女士给我的团队 提出了一个特殊的挑战。
03:36
She came to us with a lot of data,
74
216260
2376
她带着很多数据来找我们,
03:38
but ultimately she wanted to tell one of the most humane stories possible.
75
218660
4416
但最终她想要的是 讲一个可能最有人性的故事。
03:43
She's Samantha Cristoforetti.
76
223100
1696
她是萨曼莎·克里斯托维蒂,
03:44
She has been the first Italian woman astronaut,
77
224820
2576
意大利第一位女性宇航员,
03:47
and she contacted us before being launched
78
227420
2496
在出发去国际空间站进行
03:49
on a six-month-long expedition to the International Space Station.
79
229940
3896
为期六个月的远征之前, 她联系到我们。
03:53
She told us, "I'm going to space,
80
233860
2216
她说:“我要去太空了,
03:56
and I want to do something meaningful with the data of my mission
81
236100
3096
我想用我的任务数据 做些有意义的事,
03:59
to reach out to people."
82
239220
1240
去联络人们。”
04:01
A mission to the International Space Station
83
241420
2536
去国际空间站的任务
04:03
comes with terabytes of data
84
243980
2096
带着兆兆字节(太字节)的数据,
04:06
about anything you can possibly imagine --
85
246100
2376
涉及你能想到的任何事——
04:08
the orbits around Earth,
86
248500
1496
绕地轨道,
04:10
the speed and position of the ISS
87
250020
2096
ISS的速度和位置,
04:12
and all of the other thousands of live streams from its sensors.
88
252140
3680
还有另外数千个 来自其传感器的直播流。
04:16
We had all of the hard data we could think of --
89
256660
2896
我们拥有所有可以想到的硬数据——
04:19
just like the pundits before the election --
90
259580
2416
就像那次选举前的专家一样——
04:22
but what is the point of all these numbers?
91
262020
2976
但是这些数字是什么意思?
04:25
People are not interested in data for the sake of it,
92
265020
2736
人们对数据本身不感兴趣,
04:27
because numbers are never the point.
93
267780
1855
因为数字永远不是重点。
04:29
They're always the means to an end.
94
269659
1961
它们总是用来结束的手段。
04:32
The story we needed to tell
95
272659
1777
我们要讲的故事是,
04:34
is that there is a human being in a teeny box
96
274460
2496
小箱子里有一个人
04:36
flying in space above your head,
97
276980
2256
正在你头上的太空中飞行
04:39
and that you can actually see her with your naked eye on a clear night.
98
279260
4096
你在晴朗的夜晚能用肉眼看到她。
04:43
So we decided to use data to create a connection
99
283380
3096
所以我们决定用数据在萨曼莎和
04:46
between Samantha and all of the people looking at her from below.
100
286500
4056
正从地面看着她的 所有人之间建立一个连接。
04:50
We designed and developed what we called "Friends in Space,"
101
290580
3176
我们设计和开发了“太空中的朋友”,
04:53
a web application that simply lets you say "hello" to Samantha
102
293780
4656
这是一个网络应用程序, 简单地让你从你的位置
04:58
from where you are,
103
298460
1256
对萨曼莎说“你好”,
04:59
and "hello" to all the people who are online at the same time
104
299740
3536
对世界各地的所有同时在线的人
05:03
from all over the world.
105
303300
1520
说“你好”。
05:05
And all of these "hellos" left visible marks on the map
106
305460
3456
所有这些“你好”都能在 萨曼莎飞过的地图上
05:08
as Samantha was flying by
107
308940
2016
留下可见的痕迹,
05:10
and as she was actually waving back every day at us
108
310980
3376
而且她每天都在使用推特
05:14
using Twitter from the ISS.
109
314380
1680
从国际空间站向我们问候。
05:16
This made people see the mission's data from a very different perspective.
110
316700
4976
这使人们看待任务数据的 角度大有不同。
05:21
It all suddenly became much more about our human nature and our curiosity,
111
321700
4696
这一切突然变得 更加关乎人性和好奇心,
05:26
rather than technology.
112
326420
1656
而不是技术。
05:28
So data powered the experience,
113
328100
2336
所以虽然数据丰富了经历,
05:30
but stories of human beings were the drive.
114
330460
2400
但人类的故事才是背后的驱动力。
05:34
The very positive response of its thousands of users
115
334660
3336
数千用户的积极回应
05:38
taught me a very important lesson --
116
338020
1936
教给我非常重要的一点——
05:39
that working with data means designing ways
117
339980
2856
处理数据意味着设计各种方法,
05:42
to transform the abstract and the uncountable
118
342860
2736
将抽象和无法量化的信息转化成
05:45
into something that can be seen, felt and directly reconnected
119
345620
4016
可以看到、感觉到并 与我们的生活和行为
05:49
to our lives and to our behaviors,
120
349660
2296
直接重新连接的东西,
05:51
something that is hard to achieve
121
351980
1856
而如果我们让对数字的痴迷和
05:53
if we let the obsession for the numbers and the technology around them
122
353860
3896
围绕数字的技术 在这个过程中引领我们,
05:57
lead us in the process.
123
357780
1280
则很难实现这一点。
06:00
But we can do even more to connect data to the stories they represent.
124
360420
4896
但是,我们还能更进一步 将数据与它们所代表的故事连接起来。
06:05
We can remove technology completely.
125
365340
2656
我们可以完全去掉技术。
06:08
A few years ago, I met this other woman,
126
368020
2256
几年前,我遇到另一位女士,
06:10
Stefanie Posavec --
127
370300
1376
斯蒂芬妮·波萨维奇——
06:11
a London-based designer who shares with me the passion and obsession about data.
128
371700
5816
一位伦敦的设计师, 与我一样对数据热爱和痴迷。
06:17
We didn't know each other,
129
377540
1336
我们不认识对方,
06:18
but we decided to run a very radical experiment,
130
378900
3256
但我们决定进行 一个非常激进的实验,
06:22
starting a communication using only data,
131
382180
2536
开始一场只使用数据的交流,
06:24
no other language,
132
384740
1336
不使用任何其他语言,
06:26
and we opted for using no technology whatsoever to share our data.
133
386100
4616
我们选择不使用任何科技 来分享我们的数据。
06:30
In fact, our only means of communication
134
390740
2896
事实上,我们唯一的沟通方式
06:33
would be through the old-fashioned post office.
135
393660
2856
是通过老式邮局。
06:36
For "Dear Data," every week for one year,
136
396540
2456
为了“亲爱的数据”, 一年中的每个星期,
06:39
we used our personal data to get to know each other --
137
399020
3456
我们用自己的 个人数据来了解彼此——
06:42
personal data around weekly shared mundane topics,
138
402500
3656
个人信息包括每周分享的平凡话题,
06:46
from our feelings
139
406180
1216
从我们自己的感受
06:47
to the interactions with our partners,
140
407420
1856
到我们与爱人之间的互动,
06:49
from the compliments we received to the sounds of our surroundings.
141
409300
3160
从我们收到的赞美到周围的声音。
06:53
Personal information that we would then manually hand draw
142
413300
3536
然后我们把这些个人信息
06:56
on a postcard-size sheet of paper
143
416860
2496
手绘在一张明信片大小的纸上,
06:59
that we would every week send from London to New York,
144
419380
2936
每周从伦敦寄到我所在的
07:02
where I live,
145
422340
1256
纽约,
07:03
and from New York to London, where she lives.
146
423620
2200
以及从纽约寄到她所在的伦敦。
07:06
The front of the postcard is the data drawing,
147
426300
3696
明信片的正面是数据图,
07:10
and the back of the card
148
430020
1296
卡片背面当然包括
07:11
contains the address of the other person, of course,
149
431340
2429
对方的地址,
07:13
and the legend for how to interpret our drawing.
150
433793
2640
还有如何破译数据图的方法。
07:17
The very first week into the project,
151
437460
2016
在开始的第一个星期,
07:19
we actually chose a pretty cold and impersonal topic.
152
439500
3056
我们实际上选择了一个 相当冷门和非私人化的话题。
07:22
How many times do we check the time in a week?
153
442580
3200
一周内看了多少次时间?
07:26
So here is the front of my card,
154
446540
1936
这里是我的卡片的正面,
07:28
and you can see that every little symbol
155
448500
1976
你可以看到,每一个小符号
07:30
represents all of the times that I checked the time,
156
450500
3416
代表着我每次看时间,
07:33
positioned for days and different hours chronologically --
157
453940
3376
位置按顺序代表日期和小时——
07:37
nothing really complicated here.
158
457340
2040
没有什么复杂的。
07:40
But then you see in the legend
159
460020
1576
但是再看看这破译说明,
07:41
how I added anecdotal details about these moments.
160
461620
3456
我是如何把这些时刻的 各种细节加进去的。
07:45
In fact, the different types of symbols indicate why I was checking the time --
161
465100
4576
实际上,不同类型的符号 代表着我为什么要看时间——
07:49
what was I doing?
162
469700
1216
当时我在做什么?
07:50
Was I bored? Was I hungry?
163
470940
1696
我无聊吗?我饿吗?
07:52
Was I late?
164
472660
1216
我迟到了吗?
07:53
Did I check it on purpose or just casually glance at the clock?
165
473900
3216
我是有意看表 还是随意瞥一眼时钟?
07:57
And this is the key part --
166
477140
2256
但关键是——
07:59
representing the details of my days and my personality
167
479420
3696
我的数据收集代表了我的
08:03
through my data collection.
168
483140
1936
生活细节和个性。
08:05
Using data as a lens or a filter to discover and reveal, for example,
169
485100
4696
用数据作为镜头或过滤器 来发现和揭示,例如
08:09
my never-ending anxiety for being late,
170
489820
2176
我对迟到无休止的焦虑,
08:12
even though I'm absolutely always on time.
171
492020
2200
即使我绝对每次都准时。
08:16
Stefanie and I spent one year collecting our data manually
172
496020
4096
斯蒂芬妮和我花了一年时间 手动收集我们的数据,
08:20
to force us to focus on the nuances that computers cannot gather --
173
500140
4496
迫使我们专注于 计算机无法收集——
08:24
or at least not yet --
174
504660
1536
至少现在还无法收集的细节,
08:26
using data also to explore our minds and the words we use,
175
506220
3496
使用数据来探索我们的思想 和我们使用的词语,
08:29
and not only our activities.
176
509740
1936
而不仅是我们的活动。
08:31
Like at week number three,
177
511700
1416
比如在第三周,
08:33
where we tracked the "thank yous" we said and were received,
178
513140
3816
我们记录了我们所说的 以及收到的“感谢”,
08:36
and when I realized that I thank mostly people that I don't know.
179
516980
4656
它让我意识到,我多数时间 在感谢我不认识的人。
08:41
Apparently I'm a compulsive thanker to waitresses and waiters,
180
521660
4336
显然我对感谢男女服务生有强迫症,
08:46
but I definitely don't thank enough the people who are close to me.
181
526020
3160
但绝对没有对身边的人 表达足够的感谢。
08:50
Over one year,
182
530820
1256
在这一年多里,
08:52
the process of actively noticing and counting these types of actions
183
532100
4496
对这些类型的行为 积极留意和计数的过程
08:56
became a ritual.
184
536620
1296
成为了一种仪式。
08:57
It actually changed ourselves.
185
537940
2056
它真的改变了我们自己。
09:00
We became much more in tune with ourselves,
186
540020
2696
我们变得更加贴近真实的自己,
09:02
much more aware of our behaviors and our surroundings.
187
542740
3120
更加了解我们的行为和周围环境。
09:06
Over one year, Stefanie and I connected at a very deep level
188
546500
2976
一年多的时间,斯蒂芬妮和我 通过共享数据日记
09:09
through our shared data diary,
189
549500
2016
建立了非常深层的联系,
09:11
but we could do this only because we put ourselves in these numbers,
190
551540
4296
但是我们能做到这样, 只因为我们用这些数字表达了自己,
09:15
adding the contexts of our very personal stories to them.
191
555860
3976
并加入了我们的个人故事背景。
09:19
It was the only way to make them truly meaningful
192
559860
2456
这是使它们真正有意义
09:22
and representative of ourselves.
193
562340
2200
并代表了我们自己的唯一途径。
09:26
I am not asking you to start drawing your personal data,
194
566300
3096
我不是要你开始画你的个人数据,
09:29
or to find a pen pal across the ocean.
195
569420
2856
也不是要你找个跨洋笔友。
09:32
But I'm asking you to consider data --
196
572300
2576
但是我请你把数据——
09:34
all kind of data --
197
574900
1456
各种数据——
09:36
as the beginning of the conversation
198
576380
1776
看成交谈的开始,
09:38
and not the end.
199
578180
1200
而不是终止。
09:39
Because data alone will never give us a solution.
200
579900
3176
因为数据本身永远不会给我们答案。
09:43
And this is why data failed us so badly --
201
583100
2696
这就是为什么数据 让我们败得这么惨——
09:45
because we failed to include the right amount of context
202
585820
3376
因为我们没有考虑到 要用适量的背景信息
09:49
to represent reality --
203
589220
1456
来展示现实——
09:50
a nuanced, complicated and intricate reality.
204
590700
3200
微妙的、错综复杂的现实。
09:54
We kept looking at these two numbers,
205
594780
2456
我们一直盯着这两个数字,
09:57
obsessing with them
206
597260
1496
痴迷于这两个数字,
09:58
and pretending that our world could be reduced
207
598780
2496
假装我们的世界
可以缩减成两个数字和一场赛马,
10:01
to a couple digits and a horse race,
208
601300
2336
10:03
while the real stories,
209
603660
1256
而真实的故事,
10:04
the ones that really mattered,
210
604940
1456
真正重要的故事
10:06
were somewhere else.
211
606420
1416
在别处。
10:07
What we missed looking at these stories only through models and algorithms
212
607860
4416
如果只用模型和算法 来看待这些故事,我们错过的是
10:12
is what I call "data humanism."
213
612300
2520
我所说的“数据人文主义”。
10:15
In the Renaissance humanism,
214
615380
2016
在文艺复兴时期的人文主义中,
10:17
European intellectuals
215
617420
1616
欧洲的智者们
10:19
placed the human nature instead of God at the center of their view of the world.
216
619060
4920
在他们世界观的中心位置 摆放的是人类本性,而不是上帝。
10:24
I believe something similar needs to happen
217
624620
2216
我相信在数据的世界,
10:26
with the universe of data.
218
626860
1776
也需要类似的事情。
10:28
Now data are apparently treated like a God --
219
628660
2976
现在我们显然把数据 当成了一个神——
10:31
keeper of infallible truth for our present and our future.
220
631660
3280
我们现在和未来的永恒真理持有者。
10:35
The experiences that I shared with you today
221
635660
2896
我今天分享的经验告诉我,
10:38
taught me that to make data faithfully representative of our human nature
222
638580
5016
为了使数据忠实地代表我们的人性,
10:43
and to make sure they will not mislead us anymore,
223
643620
3416
并确保数据不再误导我们,
10:47
we need to start designing ways to include empathy, imperfection
224
647060
3696
我们需要开始设计方法, 在收集、处理、分析和
10:50
and human qualities
225
650780
1576
演示数据时,
10:52
in how we collect, process, analyze and display them.
226
652380
3720
纳入同情、不完美和人文素质。
10:57
I do see a place where, ultimately,
227
657100
2976
我能预见,终将有个地方,
11:00
instead of using data only to become more efficient,
228
660100
3336
数据不会被单纯用来提高效率,
11:03
we will all use data to become more humane.
229
663460
2800
我们都会用数据来变得更人性化。
11:06
Thank you.
230
666700
1216
谢谢。
11:07
(Applause)
231
667940
4441
(掌声)
关于本网站

这个网站将向你介绍对学习英语有用的YouTube视频。你将看到来自世界各地的一流教师教授的英语课程。双击每个视频页面上显示的英文字幕,即可从那里播放视频。字幕会随着视频的播放而同步滚动。如果你有任何意见或要求,请使用此联系表与我们联系。

https://forms.gle/WvT1wiN1qDtmnspy7