3 ways to spot a bad statistic | Mona Chalabi

254,009 views ・ 2017-04-17

TED


请双击下面的英文字幕来播放视频。

翻译人员: Wilde Luo 校对人员: Lipeng Chen
00:12
I'm going to be talking about statistics today.
0
12704
2763
今天我想讲一下统计学。
00:15
If that makes you immediately feel a little bit wary, that's OK,
1
15491
3138
如果你马上有所警觉,没关系,
00:18
that doesn't make you some kind of crazy conspiracy theorist,
2
18653
2859
我不会让你变成那种 疯狂的阴谋论者,
00:21
it makes you skeptical.
3
21536
1296
只会让你学会质疑。
00:22
And when it comes to numbers, especially now, you should be skeptical.
4
22856
3886
尤其是现在,说到数字、数据, 你应该时刻保持怀疑态度。
00:26
But you should also be able to tell which numbers are reliable
5
26766
3011
但你需要能够辨别哪些数字是可信的,
00:29
and which ones aren't.
6
29801
1160
哪些存在问题。
00:30
So today I want to try to give you some tools to be able to do that.
7
30985
3206
因此为了让你们具有这种甄别能力, 我会试着提供一些方法。
00:34
But before I do,
8
34215
1169
但在我开始之前,
00:35
I just want to clarify which numbers I'm talking about here.
9
35408
2839
我想说明我所说的统计 数据是什么类型的。
我讨论的并不是这种广告:
00:38
I'm not talking about claims like,
10
38271
1635
00:39
"9 out of 10 women recommend this anti-aging cream."
11
39930
2449
“10个女人中有9个 会推荐这款抗衰老乳霜。”
00:42
I think a lot of us always roll our eyes at numbers like that.
12
42403
2972
我想我们中很多人对 这样的数字不以为然。
00:45
What's different now is people are questioning statistics like,
13
45399
2984
但不同的是,现在人们对这样的 数字也开始抱有怀疑:
“美国的失业率是 5%。”
00:48
"The US unemployment rate is five percent."
14
48407
2014
00:50
What makes this claim different is it doesn't come from a private company,
15
50445
3516
这个数字不是出于一个私人公司,
00:53
it comes from the government.
16
53985
1388
而是来自政府。
事实上,如今10个美国人中差不多有4个 不相信政府公布的
00:55
About 4 out of 10 Americans distrust the economic data
17
55397
3336
00:58
that gets reported by government.
18
58757
1573
那些经济数据。
01:00
Among supporters of President Trump it's even higher;
19
60354
2491
在特朗普总统的支持者中,情况更糟,
01:02
it's about 7 out of 10.
20
62869
1633
10人中有7人不相信那些数据。
01:04
I don't need to tell anyone here
21
64526
1804
无须我告诉大家,
01:06
that there are a lot of dividing lines in our society right now,
22
66354
3011
当今我们的社会中 划分了许多的分界线,
01:09
and a lot of them start to make sense,
23
69389
1825
一旦你明白“人”和那些“政府数据”
之间的关系,这些分界线中的 大多数就不言自明了。
01:11
once you understand people's relationships with these government numbers.
24
71238
3687
01:14
On the one hand, there are those who say these statistics are crucial,
25
74949
3336
一方面,有些人说, 这些统计数据很重要,
01:18
that we need them to make sense of society as a whole
26
78309
2630
我们需要它们来理解整个社会,
01:20
in order to move beyond emotional anecdotes
27
80963
2164
进而脱离情感喜好、偏见等,
01:23
and measure progress in an [objective] way.
28
83151
2410
以一种客观的方式衡量发展程度。
01:25
And then there are the others,
29
85585
1467
正相反,另一方面,有一些人说
01:27
who say that these statistics are elitist,
30
87076
2156
这些数据隶属于精英统治,
01:29
maybe even rigged;
31
89256
1208
它们甚至有可能是被操纵的;
01:30
they don't make sense and they don't really reflect
32
90488
2394
它们没什么意义,也不能真正反映
01:32
what's happening in people's everyday lives.
33
92906
2296
人们的日常生活中发生了什么。
01:35
It kind of feels like that second group is winning the argument right now.
34
95226
3487
现在看来,似乎是第二种人 在这两个阵营的针锋相对中获得了胜利。
01:38
We're living in a world of alternative facts,
35
98737
2108
我们所生活的世界里, 胡说八道已成常态,
01:40
where people don't find statistics this kind of common ground,
36
100869
2935
人们对统计数据不能达成共识,
01:43
this starting point for debate.
37
103828
1636
也不会把数据作为争论时的出发点。
01:45
This is a problem.
38
105488
1286
这是一个问题。
01:46
There are actually moves in the US right now
39
106798
2067
如今在美国,人们有一个举措是
01:48
to get rid of some government statistics altogether.
40
108889
2861
完全丢弃政府的某项统计数据。
01:51
Right now there's a bill in congress about measuring racial inequality.
41
111774
3387
现在,国会有一项关于 衡量种族不平等程度的议案。
01:55
The draft law says that government money should not be used
42
115185
2801
草案规定,政府资金不能被用在
搜集关于种族隔离的数据上。
01:58
to collect data on racial segregation.
43
118010
1902
01:59
This is a total disaster.
44
119936
1885
这完全是一场灾难。
02:01
If we don't have this data,
45
121845
1748
如果我们没有相关数据,
02:03
how can we observe discrimination,
46
123617
1778
我们怎样观察种族歧视现象,
02:05
let alone fix it?
47
125419
1278
更不要说解决它了?
02:06
In other words:
48
126721
1188
换句话说:
02:07
How can a government create fair policies
49
127933
2059
如果政府都不能衡量 如今的种族不平等程度,
那他们又如何制定合理的政策呢?
02:10
if they can't measure current levels of unfairness?
50
130016
2771
02:12
This isn't just about discrimination,
51
132811
1794
不只是在种族歧视方面,
02:14
it's everything -- think about it.
52
134629
1670
还包括其他方面—— 想一想吧,
如果我们没有关于医疗健康
02:16
How can we legislate on health care
53
136323
1690
以及贫穷问题的可靠数据, 要如何为医疗保健问题立法?
02:18
if we don't have good data on health or poverty?
54
138037
2271
如果我们甚至对进入或离开 这个国家的人数都无法达成共识,
02:20
How can we have public debate about immigration
55
140332
2198
02:22
if we can't at least agree
56
142554
1250
02:23
on how many people are entering and leaving the country?
57
143828
2643
我们又如何
关于移民问题进行公众讨论?
02:26
Statistics come from the state; that's where they got their name.
58
146495
3058
“统计”(statistics)来自于 “国家事务”(state),是它的词源所在。
它的重点在于更准确地衡量人口
02:29
The point was to better measure the population
59
149577
2157
来更好地造福于人民。
02:31
in order to better serve it.
60
151758
1357
02:33
So we need these government numbers,
61
153139
1725
所以我们需要这些政府数据,
02:34
but we also have to move beyond either blindly accepting
62
154888
2647
但我们也不能盲目接受
02:37
or blindly rejecting them.
63
157559
1268
或者盲目排斥它们。
02:38
We need to learn the skills to be able to spot bad statistics.
64
158851
2997
我们需要学会甄别不可信的数据。
02:41
I started to learn some of these
65
161872
1528
当我在统计部门工作时——
02:43
when I was working in a statistical department
66
163424
2166
它是联合国的一部分,
我开始学会了一些甄别的技巧。
02:45
that's part of the United Nations.
67
165614
1643
02:47
Our job was to find out how many Iraqis had been forced from their homes
68
167281
3406
我们的工作是弄清楚 有多少伊拉克人是因为战争
02:50
as a result of the war,
69
170711
1158
而被迫离家,
02:51
and what they needed.
70
171893
1158
以及他们需要什么。
这的确是一项非常重要的工作, 但其难度也让人难以置信。
02:53
It was really important work, but it was also incredibly difficult.
71
173075
3178
02:56
Every single day, we were making decisions
72
176277
2018
每一天,我们都在做出
02:58
that affected the accuracy of our numbers --
73
178319
2157
会影响我们统计数据精确度的决定——
03:00
decisions like which parts of the country we should go to,
74
180500
2744
决定我们应该去 这个国家的哪一块区域调查,
03:03
who we should speak to,
75
183268
1156
和哪种人交谈,
03:04
which questions we should ask.
76
184448
1568
我们应该询问他们什么问题。
03:06
And I started to feel really disillusioned with our work,
77
186040
2680
我开始对我们的工作感到失望,
03:08
because we thought we were doing a really good job,
78
188744
2518
因为我们以为我们的 工作非常棒、很有意义,
03:11
but the one group of people who could really tell us were the Iraqis,
79
191286
3278
但是真正能告诉我们 实情的是那些伊拉克人,
03:14
and they rarely got the chance to find our analysis, let alone question it.
80
194588
3540
他们却很难有机会看到我们的 分析结果,更别提质疑它了。
03:18
So I started to feel really determined
81
198152
1831
所以我开始坚信,
能让数据更加精确可信的一种方法,
03:20
that the one way to make numbers more accurate
82
200007
2311
03:22
is to have as many people as possible be able to question them.
83
202342
3053
就是让尽可能多的人能够去质疑它。
03:25
So I became a data journalist.
84
205419
1434
所以我成为了一名“数据记者”。
03:26
My job is finding these data sets and sharing them with the public.
85
206877
3904
我的工作是找到这些数据集, 并且向公众发布。
03:30
Anyone can do this, you don't have to be a geek or a nerd.
86
210805
3173
任何人都可以做到这件事; 你无须是一个极客或者老手。
你可以忽视那些术语; 那些想要表现得很聪明
03:34
You can ignore those words; they're used by people
87
214002
2355
又假装谦虚的人才喜欢这些专业词汇。
03:36
trying to say they're smart while pretending they're humble.
88
216381
2822
毫无疑问任何人都能做到这些事。
03:39
Absolutely anyone can do this.
89
219227
1589
03:40
I want to give you guys three questions
90
220840
2067
我想向你们提出3个问题,
03:42
that will help you be able to spot some bad statistics.
91
222931
3005
会帮助你甄别那些不可靠的数据。
03:45
So, question number one is: Can you see uncertainty?
92
225960
3507
那么第一个问题是: 你能看到不确定性吗?
03:49
One of things that's really changed people's relationship with numbers,
93
229491
3364
有一件事真正改变了 人们和数据的关系,
03:52
and even their trust in the media,
94
232879
1641
甚至改变了人们对媒体的信任,
03:54
has been the use of political polls.
95
234544
2258
那就是政治上的民意调查的滥用。
03:56
I personally have a lot of issues with political polls
96
236826
2538
我个人对政治民意调查颇有微词,
03:59
because I think the role of journalists is actually to report the facts
97
239388
3376
因为我认为,记者的任务就是 原原本本地报道事实,
04:02
and not attempt to predict them,
98
242788
1553
而不应该尝试去预测什么,
04:04
especially when those predictions can actually damage democracy
99
244365
2996
尤其是当这些“预测” 实际上会有损民主的时候,
它向人们示意: 不用费心给那个家伙投票了,
04:07
by signaling to people: don't bother to vote for that guy,
100
247385
2732
他是没机会入选的。
04:10
he doesn't have a chance.
101
250141
1205
我们把这事先搁到一边, 来谈谈这种做法的精确度如何。
04:11
Let's set that aside for now and talk about the accuracy of this endeavor.
102
251370
3654
04:15
Based on national elections in the UK, Italy, Israel
103
255048
4608
根据英国、意大利、 以色列的国家选举情况,
04:19
and of course, the most recent US presidential election,
104
259680
2764
当然,还有近期的 美国总统选举情况,
04:22
using polls to predict electoral outcomes
105
262468
2137
它们都表明:使用民意 调查来预测选举结果
04:24
is about as accurate as using the moon to predict hospital admissions.
106
264629
3812
无异于夜观天象来预测是否 住院——都是不可靠的。
04:28
No, seriously, I used actual data from an academic study to draw this.
107
268465
4200
说真的,我用一份学术研究中的 真实数据画出了这幅图。
04:32
There are a lot of reasons why polling has become so inaccurate.
108
272689
3727
民意调查变得如此不准确 是有很多原因的。
04:36
Our societies have become really diverse,
109
276440
1970
我们的社会已经变得相当多元化,
04:38
which makes it difficult for pollsters to get a really nice representative sample
110
278434
3821
这使得民意调查者很难在 人群中为他们的调查得到一份
不错的、有代表性的样本。
04:42
of the population for their polls.
111
282279
1627
04:43
People are really reluctant to answer their phones to pollsters,
112
283930
3006
人们不是很情愿地 回答民意调查电话,
04:46
and also, shockingly enough, people might lie.
113
286960
2276
另外,令人震惊的是, 人们可能会说谎。
04:49
But you wouldn't necessarily know that to look at the media.
114
289260
2811
但在媒体的报道中,你不一定会 看到并了解这些过程。
04:52
For one thing, the probability of a Hillary Clinton win
115
292095
2761
例如,希拉里 · 克林顿 赢得选举的可能性
04:54
was communicated with decimal places.
116
294880
2791
在媒体的报道中精确到了小数点。
04:57
We don't use decimal places to describe the temperature.
117
297695
2621
我们描述气温都不会这么精确。
05:00
How on earth can predicting the behavior of 230 million voters in this country
118
300340
4228
所以,对这个国家里的 2亿3千万选民的行为的预测,
05:04
be that precise?
119
304592
1829
怎么可能会如此精确?
05:06
And then there were those sleek charts.
120
306445
2002
另外,还有一些美观的图表。
05:08
See, a lot of data visualizations will overstate certainty, and it works --
121
308471
3973
知道吗,很多数据可视化的方法会 夸大数据的准确性,而且很有效——
05:12
these charts can numb our brains to criticism.
122
312468
2620
这些图表会麻痹我们的大脑, 让我们无法对其准确性生疑。
05:15
When you hear a statistic, you might feel skeptical.
123
315112
2558
当你听到一项统计数据, 你或许会怀疑。
05:17
As soon as it's buried in a chart,
124
317694
1635
但当它整合进了图表中,
它看起来仿佛就成了客观的科学,
05:19
it feels like some kind of objective science,
125
319353
2129
05:21
and it's not.
126
321506
1249
但实际上正好相反。
05:22
So I was trying to find ways to better communicate this to people,
127
322779
3103
所以我试着找出一些方法 来告诉人们这些数据背后的事实,
05:25
to show people the uncertainty in our numbers.
128
325906
2504
为人们展示这些数据的不确定性。
05:28
What I did was I started taking real data sets,
129
328434
2246
我所做的,就是先采集数据,
05:30
and turning them into hand-drawn visualizations,
130
330704
2652
然后把它们转换为手绘的示意图,
05:33
so that people can see how imprecise the data is;
131
333380
2672
所以人们可以看到 这些数据的不精确性;
05:36
so people can see that a human did this,
132
336076
1996
可以看到这份图表是人画出来的,
有人搜集数据并把它可视化了。
05:38
a human found the data and visualized it.
133
338096
1972
05:40
For example, instead of finding out the probability
134
340092
2672
例如,我们不去计算出每个月
05:42
of getting the flu in any given month,
135
342788
2126
可能患流感的概率,
05:44
you can see the rough distribution of flu season.
136
344938
2792
但我们可以得到流感 在每个季节的大致分布。
05:47
This is --
137
347754
1167
这就是——
05:48
(Laughter)
138
348945
1018
(笑声)
现在就是二月,展示 这张图不是很合适。
05:49
a bad shot to show in February.
139
349987
1486
05:51
But it's also more responsible data visualization,
140
351497
2455
但这是更加负责任的数据可视图表,
05:53
because if you were to show the exact probabilities,
141
353976
2455
因为,你如果在这张图中 展示了精确的概率,
05:56
maybe that would encourage people to get their flu jabs
142
356455
2592
那么它就有可能鼓励人们 在不合适的时间段
接种流感疫苗。
05:59
at the wrong time.
143
359071
1456
06:00
The point of these shaky lines
144
360983
1693
这些摇摇晃晃的线条
06:02
is so that people remember these imprecisions,
145
362700
2911
是想向人们说明 这些数据的不精确性,
06:05
but also so they don't necessarily walk away with a specific number,
146
365635
3227
它也不会让人们简单地 得到一个鸡肋的具体数字,
06:08
but they can remember important facts.
147
368886
1866
而是让他们记住重要的事实。
06:10
Facts like injustice and inequality leave a huge mark on our lives.
148
370776
4024
不公正、不平等这样的事实 给我们的生活带来了巨大的影响。
06:14
Facts like Black Americans and Native Americans have shorter life expectancies
149
374824
4189
“美国黑人以及原住民的预期寿命 比其他种族的人短一些”,
06:19
than those of other races,
150
379037
1400
这样的事实,
06:20
and that isn't changing anytime soon.
151
380461
2138
短时间内也难以改变。
06:22
Facts like prisoners in the US can be kept in solitary confinement cells
152
382623
3901
还包括“美国监狱中被单独监禁的 囚犯的活动空间
06:26
that are smaller than the size of an average parking space.
153
386548
3342
比起一般的停车位面积还要小” 这样的事实。
06:30
The point of these visualizations is also to remind people
154
390355
3335
这些数据可视化的 意义也在于提醒人们
06:33
of some really important statistical concepts,
155
393714
2350
一些极其重要的统计概念,
06:36
concepts like averages.
156
396088
1636
例如“平均”的概念。
06:37
So let's say you hear a claim like,
157
397748
1668
比方说你听到了这样一句话:
06:39
"The average swimming pool in the US contains 6.23 fecal accidents."
158
399440
4434
“在美国,一般大小的游泳池 平均含有6.23次大便。”
06:43
That doesn't mean every single swimming pool in the country
159
403898
2797
这并不意味着这个 国家里每一座游泳池
06:46
contains exactly 6.23 turds.
160
406719
2194
刚好有6.23次大便。
06:48
So in order to show that,
161
408937
1417
所以为了展示这一点,
06:50
I went back to the original data, which comes from the CDC,
162
410378
2841
我找到疾病预防控制 中心的原始数据,
06:53
who surveyed 47 swimming facilities.
163
413243
2065
他们调查了47座游泳池。
06:55
And I just spent one evening redistributing poop.
164
415332
2391
我花了一个晚上 “重新分配这些大便”。
06:57
So you can kind of see how misleading averages can be.
165
417747
2682
所以你可以看到, “平均”是多么误导人。
07:00
(Laughter)
166
420453
1282
(笑声)
07:01
OK, so the second question that you guys should be asking yourselves
167
421759
3901
好,第二个问题, 你们在甄别数据时
07:05
to spot bad numbers is:
168
425684
1501
应当反问自己的是:
07:07
Can I see myself in the data?
169
427209
1967
我能在这些数据中 看到自己的身影吗?
07:09
This question is also about averages in a way,
170
429200
2913
这个问题在某种程度上 也是关于“平均”的,
07:12
because part of the reason why people are so frustrated
171
432137
2605
因为,人们对政府 公布的数据感到失望的
07:14
with these national statistics,
172
434766
1495
一部分原因就是,
07:16
is they don't really tell the story of who's winning and who's losing
173
436285
3273
他们无法分清,在国家政策下
07:19
from national policy.
174
439582
1156
谁损失、谁获益。
07:20
It's easy to understand why people are frustrated with global averages
175
440762
3318
很容易理解为什么当人们发现 自己的个人经验与全球平均数据
不相符时,他们会感到非常失望。
07:24
when they don't match up with their personal experiences.
176
444104
2679
07:26
I wanted to show people the way data relates to their everyday lives.
177
446807
3263
我想为人们展示:数据是如何 与他们的日常生活相关联的。
我开设了一个咨询专栏, 叫做《亲爱的Mona》,
07:30
I started this advice column called "Dear Mona,"
178
450094
2246
人们会写信询问 他们所关心的事情,
07:32
where people would write to me with questions and concerns
179
452364
2726
而我会尝试借着数据回答他们。
07:35
and I'd try to answer them with data.
180
455114
1784
这些问题包罗万象,
07:36
People asked me anything.
181
456922
1200
例如,“和老婆分床睡是正常的吗?”
07:38
questions like, "Is it normal to sleep in a separate bed to my wife?"
182
458146
3261
07:41
"Do people regret their tattoos?"
183
461431
1591
“人们会为身上的刺青感到后悔吗?”
“什么是‘自然死亡’?”
07:43
"What does it mean to die of natural causes?"
184
463046
2164
07:45
All of these questions are great, because they make you think
185
465234
2966
这些问题都很棒, 因为它们会让你思考,
如何去寻找并转达这些数据。
07:48
about ways to find and communicate these numbers.
186
468224
2336
07:50
If someone asks you, "How much pee is a lot of pee?"
187
470584
2503
如果有人问你: “ 排多少尿才算 ‘尿量多’ ?”
这是我曾被问及的一个问题,
07:53
which is a question that I got asked,
188
473111
2458
07:55
you really want to make sure that the visualization makes sense
189
475593
2980
你真的会确信:将数据图示化表达
07:58
to as many people as possible.
190
478597
1747
能让更多的人容易理解。
08:00
These numbers aren't unavailable.
191
480368
1575
这些统计数据并不难以得到。
08:01
Sometimes they're just buried in the appendix of an academic study.
192
481967
3507
有时它们只是被淹没在 学术研究的附录里。
08:05
And they're certainly not inscrutable;
193
485498
1839
它们也绝非高深莫测;
如果你真的想要检验 这些有关尿量的数据,
08:07
if you really wanted to test these numbers on urination volume,
194
487361
2975
抓一个瓶子自己试试吧。
08:10
you could grab a bottle and try it for yourself.
195
490360
2257
08:12
(Laughter)
196
492641
1008
(笑声)
08:13
The point of this isn't necessarily
197
493673
1694
重点在于,每一份数据集
不一定要跟你有专门的关联。
08:15
that every single data set has to relate specifically to you.
198
495391
2877
我对“法国有多少女人 因为戴面纱或头巾而被罚款”
08:18
I'm interested in how many women were issued fines in France
199
498292
2880
这样的问题很感兴趣,
08:21
for wearing the face veil, or the niqab,
200
501196
1959
即使我不住在法国也不戴面纱。
08:23
even if I don't live in France or wear the face veil.
201
503179
2618
08:25
The point of asking where you fit in is to get as much context as possible.
202
505821
3835
问自己是否符合数据当中的情况, 是为了更好地了解其环境背景。
08:29
So it's about zooming out from one data point,
203
509680
2191
所以我们从一小块 数据出发来综观全体,
08:31
like the unemployment rate is five percent,
204
511895
2104
就像“失业率是5%”这样的数据,
观察它随着时间如何变化,
08:34
and seeing how it changes over time,
205
514023
1757
08:35
or seeing how it changes by educational status --
206
515804
2650
或者观察它随着教育程度的 差异而如何改变——
08:38
this is why your parents always wanted you to go to college --
207
518478
3104
这就是你的父母总是 希望你上大学的原因——
08:41
or seeing how it varies by gender.
208
521606
2032
或者观察它随着 性别差异而如何变化。
08:43
Nowadays, male unemployment rate is higher
209
523662
2127
如今,男性的失业率
08:45
than the female unemployment rate.
210
525813
1700
比女性的失业率高。
08:47
Up until the early '80s, it was the other way around.
211
527537
2695
但是在80年代初期之前, 情况与此相反。
08:50
This is a story of one of the biggest changes
212
530256
2117
这是美国社会至今为止
08:52
that's happened in American society,
213
532397
1720
发生的最大的改变,
一旦你不再局限于“平均”, 就会发现这些信息都隐含在在图表中。
08:54
and it's all there in that chart, once you look beyond the averages.
214
534141
3276
08:57
The axes are everything;
215
537441
1165
轴线能代表一切;
08:58
once you change the scale, you can change the story.
216
538630
2669
当你改变观察的尺度, 你就得到了新的信息。
09:01
OK, so the third and final question that I want you guys to think about
217
541323
3380
好,第三个也是最后 一个问题,我希望
09:04
when you're looking at statistics is:
218
544727
1819
你们在观察数据时思考的是:
09:06
How was the data collected?
219
546570
1873
这些数据是如何被收集的?
09:09
So far, I've only talked about the way data is communicated,
220
549487
2939
目前为止,我只谈到 数据被传达的方式,
但收集这些数据的 方式也同样重要。
09:12
but the way it's collected matters just as much.
221
552450
2276
我知道这很困难,
09:14
I know this is tough,
222
554750
1167
因为收集数据的方法一般是 不透明而且有些无聊的。
09:15
because methodologies can be opaque and actually kind of boring,
223
555941
3081
但你能使用一些简单的 步骤来检视数据。
09:19
but there are some simple steps you can take to check this.
224
559046
2873
09:21
I'll use one last example here.
225
561943
1839
这里我要举最后一个例子。
09:24
One poll found that 41 percent of Muslims in this country support jihad,
226
564129
3887
一份民意调查指出,这个国家内 41%的穆斯林支持伊斯兰教的“圣战”,
09:28
which is obviously pretty scary,
227
568040
1525
这听起来相当吓人,
09:29
and it was reported everywhere in 2015.
228
569589
2642
这个数字在2015年被大肆报道。
09:32
When I want to check a number like that,
229
572255
2615
当我想要检查一下这个数据时,
09:34
I'll start off by finding the original questionnaire.
230
574894
2501
我会首先寻找原始的调查问卷。
09:37
It turns out that journalists who reported on that statistic
231
577419
2926
结果表明, 那些报道此事的记者
09:40
ignored a question lower down on the survey
232
580369
2231
忽视了调查问卷后面的一个问题,
09:42
that asked respondents how they defined "jihad."
233
582624
2346
关于受调查者如何定义“圣战”。
09:44
And most of them defined it as,
234
584994
1981
大多数人是这样定义的:
09:46
"Muslims' personal, peaceful struggle to be more religious."
235
586999
3942
“为了变得更加虔诚, 穆斯林们个人的、和平的奋斗。”
09:50
Only 16 percent defined it as, "violent holy war against unbelievers."
236
590965
4194
只有16%的人如此认为: “反对不信教者的暴力的、神圣的战争。”
09:55
This is the really important point:
237
595183
2430
这是很重要的一点:
09:57
based on those numbers, it's totally possible
238
597637
2155
根据这些数据, 完全有可能发生的是:
09:59
that no one in the survey who defined it as violent holy war
239
599816
3105
在调查中那些把“圣战”解释为 “暴力的、神圣的战争”的人
10:02
also said they support it.
240
602945
1332
根本就不支持“圣战”。
10:04
Those two groups might not overlap at all.
241
604301
2208
这两组人可能根本就不重叠。
10:06
It's also worth asking how the survey was carried out.
242
606942
2637
调查进行的方式也是 一个值得探讨的问题。
10:09
This was something called an opt-in poll,
243
609603
1998
这份调查使用了一种 “自愿参与”的调查方式,
10:11
which means anyone could have found it on the internet and completed it.
244
611625
3402
意味着任何一个人都可以在 网络上找到并完成这份调查。
我们甚至无从知道这些 被调查者是否是穆斯林。
10:15
There's no way of knowing if those people even identified as Muslim.
245
615051
3339
10:18
And finally, there were 600 respondents in that poll.
246
618414
2612
最后,有600人参与了这份调查。
根据皮尤研究中心统计,这个国家里
10:21
There are roughly three million Muslims in this country,
247
621050
2654
10:23
according to Pew Research Center.
248
623728
1607
差不多有3百万穆斯林。
这意味着,在这个国家中, 每5000个穆斯林中只有一个
10:25
That means the poll spoke to roughly one in every 5,000 Muslims
249
625359
2993
参与了这项调查。
10:28
in this country.
250
628376
1168
10:29
This is one of the reasons
251
629568
1266
这也是为什么政府的统计数据
10:30
why government statistics are often better than private statistics.
252
630858
3607
通常比私人机构的调查 数据更为准确的原因之一。
10:34
A poll might speak to a couple hundred people, maybe a thousand,
253
634489
3035
一份民意调查可能 有数百人参与,甚至一千人,
10:37
or if you're L'Oreal, trying to sell skin care products in 2005,
254
637548
3058
或者你在欧莱雅公司, 想要在2005年销售护肤产品,
10:40
then you spoke to 48 women to claim that they work.
255
640630
2417
那么你只需要调查48名 认为此产品有效的女性就行了。
10:43
(Laughter)
256
643071
1026
(笑声)
很显然,私人公司对于 追求数据的准确性兴趣不大,
10:44
Private companies don't have a huge interest in getting the numbers right,
257
644121
3556
10:47
they just need the right numbers.
258
647701
1755
他们只想得到想要的 数字来进行宣传。
10:49
Government statisticians aren't like that.
259
649480
2020
政府的统计学家就不是如此了。
10:51
In theory, at least, they're totally impartial,
260
651524
2447
理论上,他们是完全公正的,
10:53
not least because most of them do their jobs regardless of who's in power.
261
653995
3501
主要是因为他们的工作 不会受掌权者所影响。
10:57
They're civil servants.
262
657520
1162
他们是公务员。
10:58
And to do their jobs properly,
263
658706
1964
为了做好本职工作,
11:00
they don't just speak to a couple hundred people.
264
660694
2363
他们不能只调查几百人而已。
11:03
Those unemployment numbers I keep on referencing
265
663081
2318
我一直提到的这些失业率数字,
11:05
come from the Bureau of Labor Statistics,
266
665423
2004
来自于美国劳工统计局,
11:07
and to make their estimates,
267
667451
1335
为了进行这项估计,
11:08
they speak to over 140,000 businesses in this country.
268
668810
3489
他们调查了全国14万家以上的企业。
11:12
I get it, it's frustrating.
269
672323
1725
我知道,这令人沮丧。
11:14
If you want to test a statistic that comes from a private company,
270
674072
3115
如果你想检验来自 一家私人企业的统计数据,
你可以为自己和朋友们 购买一款面霜拿来使用,
11:17
you can buy the face cream for you and a bunch of friends, test it out,
271
677211
3361
如果它没有效果, 你可以说那些统计数字有误。
11:20
if it doesn't work, you can say the numbers were wrong.
272
680596
2591
但是你如何质疑 政府的统计数据呢?
11:23
But how do you question government statistics?
273
683211
2146
你需要把一切都检查一遍。
11:25
You just keep checking everything.
274
685381
1630
了解他们是如何收集数据的。
11:27
Find out how they collected the numbers.
275
687035
1913
观察一下,你是否在图表中 看到了你需要了解的一切信息。
11:28
Find out if you're seeing everything on the chart you need to see.
276
688972
3125
但是也不要完全不相信这些数据,
11:32
But don't give up on the numbers altogether, because if you do,
277
692121
2965
否则我们只能在一无所知中 制定公共政策,
11:35
we'll be making public policy decisions in the dark,
278
695110
2439
引导我们的除了私人利益,别无他物。
11:37
using nothing but private interests to guide us.
279
697573
2262
11:39
Thank you.
280
699859
1166
谢谢。
(掌声)
11:41
(Applause)
281
701049
2461
关于本网站

这个网站将向你介绍对学习英语有用的YouTube视频。你将看到来自世界各地的一流教师教授的英语课程。双击每个视频页面上显示的英文字幕,即可从那里播放视频。字幕会随着视频的播放而同步滚动。如果你有任何意见或要求,请使用此联系表与我们联系。

https://forms.gle/WvT1wiN1qDtmnspy7


This website was created in October 2020 and last updated on June 12, 2025.

It is now archived and preserved as an English learning resource.

Some information may be out of date.

隐私政策

eng.lish.video

Developer's Blog