3 ways to spot a bad statistic | Mona Chalabi

244,278 views ・ 2017-04-17

TED


请双击下面的英文字幕来播放视频。

翻译人员: Wilde Luo 校对人员: Lipeng Chen
00:12
I'm going to be talking about statistics today.
0
12704
2763
今天我想讲一下统计学。
00:15
If that makes you immediately feel a little bit wary, that's OK,
1
15491
3138
如果你马上有所警觉,没关系,
00:18
that doesn't make you some kind of crazy conspiracy theorist,
2
18653
2859
我不会让你变成那种 疯狂的阴谋论者,
00:21
it makes you skeptical.
3
21536
1296
只会让你学会质疑。
00:22
And when it comes to numbers, especially now, you should be skeptical.
4
22856
3886
尤其是现在,说到数字、数据, 你应该时刻保持怀疑态度。
00:26
But you should also be able to tell which numbers are reliable
5
26766
3011
但你需要能够辨别哪些数字是可信的,
00:29
and which ones aren't.
6
29801
1160
哪些存在问题。
00:30
So today I want to try to give you some tools to be able to do that.
7
30985
3206
因此为了让你们具有这种甄别能力, 我会试着提供一些方法。
00:34
But before I do,
8
34215
1169
但在我开始之前,
00:35
I just want to clarify which numbers I'm talking about here.
9
35408
2839
我想说明我所说的统计 数据是什么类型的。
我讨论的并不是这种广告:
00:38
I'm not talking about claims like,
10
38271
1635
00:39
"9 out of 10 women recommend this anti-aging cream."
11
39930
2449
“10个女人中有9个 会推荐这款抗衰老乳霜。”
00:42
I think a lot of us always roll our eyes at numbers like that.
12
42403
2972
我想我们中很多人对 这样的数字不以为然。
00:45
What's different now is people are questioning statistics like,
13
45399
2984
但不同的是,现在人们对这样的 数字也开始抱有怀疑:
“美国的失业率是 5%。”
00:48
"The US unemployment rate is five percent."
14
48407
2014
00:50
What makes this claim different is it doesn't come from a private company,
15
50445
3516
这个数字不是出于一个私人公司,
00:53
it comes from the government.
16
53985
1388
而是来自政府。
事实上,如今10个美国人中差不多有4个 不相信政府公布的
00:55
About 4 out of 10 Americans distrust the economic data
17
55397
3336
00:58
that gets reported by government.
18
58757
1573
那些经济数据。
01:00
Among supporters of President Trump it's even higher;
19
60354
2491
在特朗普总统的支持者中,情况更糟,
01:02
it's about 7 out of 10.
20
62869
1633
10人中有7人不相信那些数据。
01:04
I don't need to tell anyone here
21
64526
1804
无须我告诉大家,
01:06
that there are a lot of dividing lines in our society right now,
22
66354
3011
当今我们的社会中 划分了许多的分界线,
01:09
and a lot of them start to make sense,
23
69389
1825
一旦你明白“人”和那些“政府数据”
之间的关系,这些分界线中的 大多数就不言自明了。
01:11
once you understand people's relationships with these government numbers.
24
71238
3687
01:14
On the one hand, there are those who say these statistics are crucial,
25
74949
3336
一方面,有些人说, 这些统计数据很重要,
01:18
that we need them to make sense of society as a whole
26
78309
2630
我们需要它们来理解整个社会,
01:20
in order to move beyond emotional anecdotes
27
80963
2164
进而脱离情感喜好、偏见等,
01:23
and measure progress in an [objective] way.
28
83151
2410
以一种客观的方式衡量发展程度。
01:25
And then there are the others,
29
85585
1467
正相反,另一方面,有一些人说
01:27
who say that these statistics are elitist,
30
87076
2156
这些数据隶属于精英统治,
01:29
maybe even rigged;
31
89256
1208
它们甚至有可能是被操纵的;
01:30
they don't make sense and they don't really reflect
32
90488
2394
它们没什么意义,也不能真正反映
01:32
what's happening in people's everyday lives.
33
92906
2296
人们的日常生活中发生了什么。
01:35
It kind of feels like that second group is winning the argument right now.
34
95226
3487
现在看来,似乎是第二种人 在这两个阵营的针锋相对中获得了胜利。
01:38
We're living in a world of alternative facts,
35
98737
2108
我们所生活的世界里, 胡说八道已成常态,
01:40
where people don't find statistics this kind of common ground,
36
100869
2935
人们对统计数据不能达成共识,
01:43
this starting point for debate.
37
103828
1636
也不会把数据作为争论时的出发点。
01:45
This is a problem.
38
105488
1286
这是一个问题。
01:46
There are actually moves in the US right now
39
106798
2067
如今在美国,人们有一个举措是
01:48
to get rid of some government statistics altogether.
40
108889
2861
完全丢弃政府的某项统计数据。
01:51
Right now there's a bill in congress about measuring racial inequality.
41
111774
3387
现在,国会有一项关于 衡量种族不平等程度的议案。
01:55
The draft law says that government money should not be used
42
115185
2801
草案规定,政府资金不能被用在
搜集关于种族隔离的数据上。
01:58
to collect data on racial segregation.
43
118010
1902
01:59
This is a total disaster.
44
119936
1885
这完全是一场灾难。
02:01
If we don't have this data,
45
121845
1748
如果我们没有相关数据,
02:03
how can we observe discrimination,
46
123617
1778
我们怎样观察种族歧视现象,
02:05
let alone fix it?
47
125419
1278
更不要说解决它了?
02:06
In other words:
48
126721
1188
换句话说:
02:07
How can a government create fair policies
49
127933
2059
如果政府都不能衡量 如今的种族不平等程度,
那他们又如何制定合理的政策呢?
02:10
if they can't measure current levels of unfairness?
50
130016
2771
02:12
This isn't just about discrimination,
51
132811
1794
不只是在种族歧视方面,
02:14
it's everything -- think about it.
52
134629
1670
还包括其他方面—— 想一想吧,
如果我们没有关于医疗健康
02:16
How can we legislate on health care
53
136323
1690
以及贫穷问题的可靠数据, 要如何为医疗保健问题立法?
02:18
if we don't have good data on health or poverty?
54
138037
2271
如果我们甚至对进入或离开 这个国家的人数都无法达成共识,
02:20
How can we have public debate about immigration
55
140332
2198
02:22
if we can't at least agree
56
142554
1250
02:23
on how many people are entering and leaving the country?
57
143828
2643
我们又如何
关于移民问题进行公众讨论?
02:26
Statistics come from the state; that's where they got their name.
58
146495
3058
“统计”(statistics)来自于 “国家事务”(state),是它的词源所在。
它的重点在于更准确地衡量人口
02:29
The point was to better measure the population
59
149577
2157
来更好地造福于人民。
02:31
in order to better serve it.
60
151758
1357
02:33
So we need these government numbers,
61
153139
1725
所以我们需要这些政府数据,
02:34
but we also have to move beyond either blindly accepting
62
154888
2647
但我们也不能盲目接受
02:37
or blindly rejecting them.
63
157559
1268
或者盲目排斥它们。
02:38
We need to learn the skills to be able to spot bad statistics.
64
158851
2997
我们需要学会甄别不可信的数据。
02:41
I started to learn some of these
65
161872
1528
当我在统计部门工作时——
02:43
when I was working in a statistical department
66
163424
2166
它是联合国的一部分,
我开始学会了一些甄别的技巧。
02:45
that's part of the United Nations.
67
165614
1643
02:47
Our job was to find out how many Iraqis had been forced from their homes
68
167281
3406
我们的工作是弄清楚 有多少伊拉克人是因为战争
02:50
as a result of the war,
69
170711
1158
而被迫离家,
02:51
and what they needed.
70
171893
1158
以及他们需要什么。
这的确是一项非常重要的工作, 但其难度也让人难以置信。
02:53
It was really important work, but it was also incredibly difficult.
71
173075
3178
02:56
Every single day, we were making decisions
72
176277
2018
每一天,我们都在做出
02:58
that affected the accuracy of our numbers --
73
178319
2157
会影响我们统计数据精确度的决定——
03:00
decisions like which parts of the country we should go to,
74
180500
2744
决定我们应该去 这个国家的哪一块区域调查,
03:03
who we should speak to,
75
183268
1156
和哪种人交谈,
03:04
which questions we should ask.
76
184448
1568
我们应该询问他们什么问题。
03:06
And I started to feel really disillusioned with our work,
77
186040
2680
我开始对我们的工作感到失望,
03:08
because we thought we were doing a really good job,
78
188744
2518
因为我们以为我们的 工作非常棒、很有意义,
03:11
but the one group of people who could really tell us were the Iraqis,
79
191286
3278
但是真正能告诉我们 实情的是那些伊拉克人,
03:14
and they rarely got the chance to find our analysis, let alone question it.
80
194588
3540
他们却很难有机会看到我们的 分析结果,更别提质疑它了。
03:18
So I started to feel really determined
81
198152
1831
所以我开始坚信,
能让数据更加精确可信的一种方法,
03:20
that the one way to make numbers more accurate
82
200007
2311
03:22
is to have as many people as possible be able to question them.
83
202342
3053
就是让尽可能多的人能够去质疑它。
03:25
So I became a data journalist.
84
205419
1434
所以我成为了一名“数据记者”。
03:26
My job is finding these data sets and sharing them with the public.
85
206877
3904
我的工作是找到这些数据集, 并且向公众发布。
03:30
Anyone can do this, you don't have to be a geek or a nerd.
86
210805
3173
任何人都可以做到这件事; 你无须是一个极客或者老手。
你可以忽视那些术语; 那些想要表现得很聪明
03:34
You can ignore those words; they're used by people
87
214002
2355
又假装谦虚的人才喜欢这些专业词汇。
03:36
trying to say they're smart while pretending they're humble.
88
216381
2822
毫无疑问任何人都能做到这些事。
03:39
Absolutely anyone can do this.
89
219227
1589
03:40
I want to give you guys three questions
90
220840
2067
我想向你们提出3个问题,
03:42
that will help you be able to spot some bad statistics.
91
222931
3005
会帮助你甄别那些不可靠的数据。
03:45
So, question number one is: Can you see uncertainty?
92
225960
3507
那么第一个问题是: 你能看到不确定性吗?
03:49
One of things that's really changed people's relationship with numbers,
93
229491
3364
有一件事真正改变了 人们和数据的关系,
03:52
and even their trust in the media,
94
232879
1641
甚至改变了人们对媒体的信任,
03:54
has been the use of political polls.
95
234544
2258
那就是政治上的民意调查的滥用。
03:56
I personally have a lot of issues with political polls
96
236826
2538
我个人对政治民意调查颇有微词,
03:59
because I think the role of journalists is actually to report the facts
97
239388
3376
因为我认为,记者的任务就是 原原本本地报道事实,
04:02
and not attempt to predict them,
98
242788
1553
而不应该尝试去预测什么,
04:04
especially when those predictions can actually damage democracy
99
244365
2996
尤其是当这些“预测” 实际上会有损民主的时候,
它向人们示意: 不用费心给那个家伙投票了,
04:07
by signaling to people: don't bother to vote for that guy,
100
247385
2732
他是没机会入选的。
04:10
he doesn't have a chance.
101
250141
1205
我们把这事先搁到一边, 来谈谈这种做法的精确度如何。
04:11
Let's set that aside for now and talk about the accuracy of this endeavor.
102
251370
3654
04:15
Based on national elections in the UK, Italy, Israel
103
255048
4608
根据英国、意大利、 以色列的国家选举情况,
04:19
and of course, the most recent US presidential election,
104
259680
2764
当然,还有近期的 美国总统选举情况,
04:22
using polls to predict electoral outcomes
105
262468
2137
它们都表明:使用民意 调查来预测选举结果
04:24
is about as accurate as using the moon to predict hospital admissions.
106
264629
3812
无异于夜观天象来预测是否 住院——都是不可靠的。
04:28
No, seriously, I used actual data from an academic study to draw this.
107
268465
4200
说真的,我用一份学术研究中的 真实数据画出了这幅图。
04:32
There are a lot of reasons why polling has become so inaccurate.
108
272689
3727
民意调查变得如此不准确 是有很多原因的。
04:36
Our societies have become really diverse,
109
276440
1970
我们的社会已经变得相当多元化,
04:38
which makes it difficult for pollsters to get a really nice representative sample
110
278434
3821
这使得民意调查者很难在 人群中为他们的调查得到一份
不错的、有代表性的样本。
04:42
of the population for their polls.
111
282279
1627
04:43
People are really reluctant to answer their phones to pollsters,
112
283930
3006
人们不是很情愿地 回答民意调查电话,
04:46
and also, shockingly enough, people might lie.
113
286960
2276
另外,令人震惊的是, 人们可能会说谎。
04:49
But you wouldn't necessarily know that to look at the media.
114
289260
2811
但在媒体的报道中,你不一定会 看到并了解这些过程。
04:52
For one thing, the probability of a Hillary Clinton win
115
292095
2761
例如,希拉里 · 克林顿 赢得选举的可能性
04:54
was communicated with decimal places.
116
294880
2791
在媒体的报道中精确到了小数点。
04:57
We don't use decimal places to describe the temperature.
117
297695
2621
我们描述气温都不会这么精确。
05:00
How on earth can predicting the behavior of 230 million voters in this country
118
300340
4228
所以,对这个国家里的 2亿3千万选民的行为的预测,
05:04
be that precise?
119
304592
1829
怎么可能会如此精确?
05:06
And then there were those sleek charts.
120
306445
2002
另外,还有一些美观的图表。
05:08
See, a lot of data visualizations will overstate certainty, and it works --
121
308471
3973
知道吗,很多数据可视化的方法会 夸大数据的准确性,而且很有效——
05:12
these charts can numb our brains to criticism.
122
312468
2620
这些图表会麻痹我们的大脑, 让我们无法对其准确性生疑。
05:15
When you hear a statistic, you might feel skeptical.
123
315112
2558
当你听到一项统计数据, 你或许会怀疑。
05:17
As soon as it's buried in a chart,
124
317694
1635
但当它整合进了图表中,
它看起来仿佛就成了客观的科学,
05:19
it feels like some kind of objective science,
125
319353
2129
05:21
and it's not.
126
321506
1249
但实际上正好相反。
05:22
So I was trying to find ways to better communicate this to people,
127
322779
3103
所以我试着找出一些方法 来告诉人们这些数据背后的事实,
05:25
to show people the uncertainty in our numbers.
128
325906
2504
为人们展示这些数据的不确定性。
05:28
What I did was I started taking real data sets,
129
328434
2246
我所做的,就是先采集数据,
05:30
and turning them into hand-drawn visualizations,
130
330704
2652
然后把它们转换为手绘的示意图,
05:33
so that people can see how imprecise the data is;
131
333380
2672
所以人们可以看到 这些数据的不精确性;
05:36
so people can see that a human did this,
132
336076
1996
可以看到这份图表是人画出来的,
有人搜集数据并把它可视化了。
05:38
a human found the data and visualized it.
133
338096
1972
05:40
For example, instead of finding out the probability
134
340092
2672
例如,我们不去计算出每个月
05:42
of getting the flu in any given month,
135
342788
2126
可能患流感的概率,
05:44
you can see the rough distribution of flu season.
136
344938
2792
但我们可以得到流感 在每个季节的大致分布。
05:47
This is --
137
347754
1167
这就是——
05:48
(Laughter)
138
348945
1018
(笑声)
现在就是二月,展示 这张图不是很合适。
05:49
a bad shot to show in February.
139
349987
1486
05:51
But it's also more responsible data visualization,
140
351497
2455
但这是更加负责任的数据可视图表,
05:53
because if you were to show the exact probabilities,
141
353976
2455
因为,你如果在这张图中 展示了精确的概率,
05:56
maybe that would encourage people to get their flu jabs
142
356455
2592
那么它就有可能鼓励人们 在不合适的时间段
接种流感疫苗。
05:59
at the wrong time.
143
359071
1456
06:00
The point of these shaky lines
144
360983
1693
这些摇摇晃晃的线条
06:02
is so that people remember these imprecisions,
145
362700
2911
是想向人们说明 这些数据的不精确性,
06:05
but also so they don't necessarily walk away with a specific number,
146
365635
3227
它也不会让人们简单地 得到一个鸡肋的具体数字,
06:08
but they can remember important facts.
147
368886
1866
而是让他们记住重要的事实。
06:10
Facts like injustice and inequality leave a huge mark on our lives.
148
370776
4024
不公正、不平等这样的事实 给我们的生活带来了巨大的影响。
06:14
Facts like Black Americans and Native Americans have shorter life expectancies
149
374824
4189
“美国黑人以及原住民的预期寿命 比其他种族的人短一些”,
06:19
than those of other races,
150
379037
1400
这样的事实,
06:20
and that isn't changing anytime soon.
151
380461
2138
短时间内也难以改变。
06:22
Facts like prisoners in the US can be kept in solitary confinement cells
152
382623
3901
还包括“美国监狱中被单独监禁的 囚犯的活动空间
06:26
that are smaller than the size of an average parking space.
153
386548
3342
比起一般的停车位面积还要小” 这样的事实。
06:30
The point of these visualizations is also to remind people
154
390355
3335
这些数据可视化的 意义也在于提醒人们
06:33
of some really important statistical concepts,
155
393714
2350
一些极其重要的统计概念,
06:36
concepts like averages.
156
396088
1636
例如“平均”的概念。
06:37
So let's say you hear a claim like,
157
397748
1668
比方说你听到了这样一句话:
06:39
"The average swimming pool in the US contains 6.23 fecal accidents."
158
399440
4434
“在美国,一般大小的游泳池 平均含有6.23次大便。”
06:43
That doesn't mean every single swimming pool in the country
159
403898
2797
这并不意味着这个 国家里每一座游泳池
06:46
contains exactly 6.23 turds.
160
406719
2194
刚好有6.23次大便。
06:48
So in order to show that,
161
408937
1417
所以为了展示这一点,
06:50
I went back to the original data, which comes from the CDC,
162
410378
2841
我找到疾病预防控制 中心的原始数据,
06:53
who surveyed 47 swimming facilities.
163
413243
2065
他们调查了47座游泳池。
06:55
And I just spent one evening redistributing poop.
164
415332
2391
我花了一个晚上 “重新分配这些大便”。
06:57
So you can kind of see how misleading averages can be.
165
417747
2682
所以你可以看到, “平均”是多么误导人。
07:00
(Laughter)
166
420453
1282
(笑声)
07:01
OK, so the second question that you guys should be asking yourselves
167
421759
3901
好,第二个问题, 你们在甄别数据时
07:05
to spot bad numbers is:
168
425684
1501
应当反问自己的是:
07:07
Can I see myself in the data?
169
427209
1967
我能在这些数据中 看到自己的身影吗?
07:09
This question is also about averages in a way,
170
429200
2913
这个问题在某种程度上 也是关于“平均”的,
07:12
because part of the reason why people are so frustrated
171
432137
2605
因为,人们对政府 公布的数据感到失望的
07:14
with these national statistics,
172
434766
1495
一部分原因就是,
07:16
is they don't really tell the story of who's winning and who's losing
173
436285
3273
他们无法分清,在国家政策下
07:19
from national policy.
174
439582
1156
谁损失、谁获益。
07:20
It's easy to understand why people are frustrated with global averages
175
440762
3318
很容易理解为什么当人们发现 自己的个人经验与全球平均数据
不相符时,他们会感到非常失望。
07:24
when they don't match up with their personal experiences.
176
444104
2679
07:26
I wanted to show people the way data relates to their everyday lives.
177
446807
3263
我想为人们展示:数据是如何 与他们的日常生活相关联的。
我开设了一个咨询专栏, 叫做《亲爱的Mona》,
07:30
I started this advice column called "Dear Mona,"
178
450094
2246
人们会写信询问 他们所关心的事情,
07:32
where people would write to me with questions and concerns
179
452364
2726
而我会尝试借着数据回答他们。
07:35
and I'd try to answer them with data.
180
455114
1784
这些问题包罗万象,
07:36
People asked me anything.
181
456922
1200
例如,“和老婆分床睡是正常的吗?”
07:38
questions like, "Is it normal to sleep in a separate bed to my wife?"
182
458146
3261
07:41
"Do people regret their tattoos?"
183
461431
1591
“人们会为身上的刺青感到后悔吗?”
“什么是‘自然死亡’?”
07:43
"What does it mean to die of natural causes?"
184
463046
2164
07:45
All of these questions are great, because they make you think
185
465234
2966
这些问题都很棒, 因为它们会让你思考,
如何去寻找并转达这些数据。
07:48
about ways to find and communicate these numbers.
186
468224
2336
07:50
If someone asks you, "How much pee is a lot of pee?"
187
470584
2503
如果有人问你: “ 排多少尿才算 ‘尿量多’ ?”
这是我曾被问及的一个问题,
07:53
which is a question that I got asked,
188
473111
2458
07:55
you really want to make sure that the visualization makes sense
189
475593
2980
你真的会确信:将数据图示化表达
07:58
to as many people as possible.
190
478597
1747
能让更多的人容易理解。
08:00
These numbers aren't unavailable.
191
480368
1575
这些统计数据并不难以得到。
08:01
Sometimes they're just buried in the appendix of an academic study.
192
481967
3507
有时它们只是被淹没在 学术研究的附录里。
08:05
And they're certainly not inscrutable;
193
485498
1839
它们也绝非高深莫测;
如果你真的想要检验 这些有关尿量的数据,
08:07
if you really wanted to test these numbers on urination volume,
194
487361
2975
抓一个瓶子自己试试吧。
08:10
you could grab a bottle and try it for yourself.
195
490360
2257
08:12
(Laughter)
196
492641
1008
(笑声)
08:13
The point of this isn't necessarily
197
493673
1694
重点在于,每一份数据集
不一定要跟你有专门的关联。
08:15
that every single data set has to relate specifically to you.
198
495391
2877
我对“法国有多少女人 因为戴面纱或头巾而被罚款”
08:18
I'm interested in how many women were issued fines in France
199
498292
2880
这样的问题很感兴趣,
08:21
for wearing the face veil, or the niqab,
200
501196
1959
即使我不住在法国也不戴面纱。
08:23
even if I don't live in France or wear the face veil.
201
503179
2618
08:25
The point of asking where you fit in is to get as much context as possible.
202
505821
3835
问自己是否符合数据当中的情况, 是为了更好地了解其环境背景。
08:29
So it's about zooming out from one data point,
203
509680
2191
所以我们从一小块 数据出发来综观全体,
08:31
like the unemployment rate is five percent,
204
511895
2104
就像“失业率是5%”这样的数据,
观察它随着时间如何变化,
08:34
and seeing how it changes over time,
205
514023
1757
08:35
or seeing how it changes by educational status --
206
515804
2650
或者观察它随着教育程度的 差异而如何改变——
08:38
this is why your parents always wanted you to go to college --
207
518478
3104
这就是你的父母总是 希望你上大学的原因——
08:41
or seeing how it varies by gender.
208
521606
2032
或者观察它随着 性别差异而如何变化。
08:43
Nowadays, male unemployment rate is higher
209
523662
2127
如今,男性的失业率
08:45
than the female unemployment rate.
210
525813
1700
比女性的失业率高。
08:47
Up until the early '80s, it was the other way around.
211
527537
2695
但是在80年代初期之前, 情况与此相反。
08:50
This is a story of one of the biggest changes
212
530256
2117
这是美国社会至今为止
08:52
that's happened in American society,
213
532397
1720
发生的最大的改变,
一旦你不再局限于“平均”, 就会发现这些信息都隐含在在图表中。
08:54
and it's all there in that chart, once you look beyond the averages.
214
534141
3276
08:57
The axes are everything;
215
537441
1165
轴线能代表一切;
08:58
once you change the scale, you can change the story.
216
538630
2669
当你改变观察的尺度, 你就得到了新的信息。
09:01
OK, so the third and final question that I want you guys to think about
217
541323
3380
好,第三个也是最后 一个问题,我希望
09:04
when you're looking at statistics is:
218
544727
1819
你们在观察数据时思考的是:
09:06
How was the data collected?
219
546570
1873
这些数据是如何被收集的?
09:09
So far, I've only talked about the way data is communicated,
220
549487
2939
目前为止,我只谈到 数据被传达的方式,
但收集这些数据的 方式也同样重要。
09:12
but the way it's collected matters just as much.
221
552450
2276
我知道这很困难,
09:14
I know this is tough,
222
554750
1167
因为收集数据的方法一般是 不透明而且有些无聊的。
09:15
because methodologies can be opaque and actually kind of boring,
223
555941
3081
但你能使用一些简单的 步骤来检视数据。
09:19
but there are some simple steps you can take to check this.
224
559046
2873
09:21
I'll use one last example here.
225
561943
1839
这里我要举最后一个例子。
09:24
One poll found that 41 percent of Muslims in this country support jihad,
226
564129
3887
一份民意调查指出,这个国家内 41%的穆斯林支持伊斯兰教的“圣战”,
09:28
which is obviously pretty scary,
227
568040
1525
这听起来相当吓人,
09:29
and it was reported everywhere in 2015.
228
569589
2642
这个数字在2015年被大肆报道。
09:32
When I want to check a number like that,
229
572255
2615
当我想要检查一下这个数据时,
09:34
I'll start off by finding the original questionnaire.
230
574894
2501
我会首先寻找原始的调查问卷。
09:37
It turns out that journalists who reported on that statistic
231
577419
2926
结果表明, 那些报道此事的记者
09:40
ignored a question lower down on the survey
232
580369
2231
忽视了调查问卷后面的一个问题,
09:42
that asked respondents how they defined "jihad."
233
582624
2346
关于受调查者如何定义“圣战”。
09:44
And most of them defined it as,
234
584994
1981
大多数人是这样定义的:
09:46
"Muslims' personal, peaceful struggle to be more religious."
235
586999
3942
“为了变得更加虔诚, 穆斯林们个人的、和平的奋斗。”
09:50
Only 16 percent defined it as, "violent holy war against unbelievers."
236
590965
4194
只有16%的人如此认为: “反对不信教者的暴力的、神圣的战争。”
09:55
This is the really important point:
237
595183
2430
这是很重要的一点:
09:57
based on those numbers, it's totally possible
238
597637
2155
根据这些数据, 完全有可能发生的是:
09:59
that no one in the survey who defined it as violent holy war
239
599816
3105
在调查中那些把“圣战”解释为 “暴力的、神圣的战争”的人
10:02
also said they support it.
240
602945
1332
根本就不支持“圣战”。
10:04
Those two groups might not overlap at all.
241
604301
2208
这两组人可能根本就不重叠。
10:06
It's also worth asking how the survey was carried out.
242
606942
2637
调查进行的方式也是 一个值得探讨的问题。
10:09
This was something called an opt-in poll,
243
609603
1998
这份调查使用了一种 “自愿参与”的调查方式,
10:11
which means anyone could have found it on the internet and completed it.
244
611625
3402
意味着任何一个人都可以在 网络上找到并完成这份调查。
我们甚至无从知道这些 被调查者是否是穆斯林。
10:15
There's no way of knowing if those people even identified as Muslim.
245
615051
3339
10:18
And finally, there were 600 respondents in that poll.
246
618414
2612
最后,有600人参与了这份调查。
根据皮尤研究中心统计,这个国家里
10:21
There are roughly three million Muslims in this country,
247
621050
2654
10:23
according to Pew Research Center.
248
623728
1607
差不多有3百万穆斯林。
这意味着,在这个国家中, 每5000个穆斯林中只有一个
10:25
That means the poll spoke to roughly one in every 5,000 Muslims
249
625359
2993
参与了这项调查。
10:28
in this country.
250
628376
1168
10:29
This is one of the reasons
251
629568
1266
这也是为什么政府的统计数据
10:30
why government statistics are often better than private statistics.
252
630858
3607
通常比私人机构的调查 数据更为准确的原因之一。
10:34
A poll might speak to a couple hundred people, maybe a thousand,
253
634489
3035
一份民意调查可能 有数百人参与,甚至一千人,
10:37
or if you're L'Oreal, trying to sell skin care products in 2005,
254
637548
3058
或者你在欧莱雅公司, 想要在2005年销售护肤产品,
10:40
then you spoke to 48 women to claim that they work.
255
640630
2417
那么你只需要调查48名 认为此产品有效的女性就行了。
10:43
(Laughter)
256
643071
1026
(笑声)
很显然,私人公司对于 追求数据的准确性兴趣不大,
10:44
Private companies don't have a huge interest in getting the numbers right,
257
644121
3556
10:47
they just need the right numbers.
258
647701
1755
他们只想得到想要的 数字来进行宣传。
10:49
Government statisticians aren't like that.
259
649480
2020
政府的统计学家就不是如此了。
10:51
In theory, at least, they're totally impartial,
260
651524
2447
理论上,他们是完全公正的,
10:53
not least because most of them do their jobs regardless of who's in power.
261
653995
3501
主要是因为他们的工作 不会受掌权者所影响。
10:57
They're civil servants.
262
657520
1162
他们是公务员。
10:58
And to do their jobs properly,
263
658706
1964
为了做好本职工作,
11:00
they don't just speak to a couple hundred people.
264
660694
2363
他们不能只调查几百人而已。
11:03
Those unemployment numbers I keep on referencing
265
663081
2318
我一直提到的这些失业率数字,
11:05
come from the Bureau of Labor Statistics,
266
665423
2004
来自于美国劳工统计局,
11:07
and to make their estimates,
267
667451
1335
为了进行这项估计,
11:08
they speak to over 140,000 businesses in this country.
268
668810
3489
他们调查了全国14万家以上的企业。
11:12
I get it, it's frustrating.
269
672323
1725
我知道,这令人沮丧。
11:14
If you want to test a statistic that comes from a private company,
270
674072
3115
如果你想检验来自 一家私人企业的统计数据,
你可以为自己和朋友们 购买一款面霜拿来使用,
11:17
you can buy the face cream for you and a bunch of friends, test it out,
271
677211
3361
如果它没有效果, 你可以说那些统计数字有误。
11:20
if it doesn't work, you can say the numbers were wrong.
272
680596
2591
但是你如何质疑 政府的统计数据呢?
11:23
But how do you question government statistics?
273
683211
2146
你需要把一切都检查一遍。
11:25
You just keep checking everything.
274
685381
1630
了解他们是如何收集数据的。
11:27
Find out how they collected the numbers.
275
687035
1913
观察一下,你是否在图表中 看到了你需要了解的一切信息。
11:28
Find out if you're seeing everything on the chart you need to see.
276
688972
3125
但是也不要完全不相信这些数据,
11:32
But don't give up on the numbers altogether, because if you do,
277
692121
2965
否则我们只能在一无所知中 制定公共政策,
11:35
we'll be making public policy decisions in the dark,
278
695110
2439
引导我们的除了私人利益,别无他物。
11:37
using nothing but private interests to guide us.
279
697573
2262
11:39
Thank you.
280
699859
1166
谢谢。
(掌声)
11:41
(Applause)
281
701049
2461
关于本网站

这个网站将向你介绍对学习英语有用的YouTube视频。你将看到来自世界各地的一流教师教授的英语课程。双击每个视频页面上显示的英文字幕,即可从那里播放视频。字幕会随着视频的播放而同步滚动。如果你有任何意见或要求,请使用此联系表与我们联系。

https://forms.gle/WvT1wiN1qDtmnspy7