The method that can "prove" almost anything - James A. Smith

834,995 views ・ 2021-08-05

TED-Ed


Please double-click on the English subtitles below to play the video.

00:06
In 2011, a group of researchers conducted a scientific study
0
6371
4167
00:10
to find an impossible result:
1
10538
2125
00:12
that listening to certain songs can make you younger.
2
12663
3500
00:16
Their study involved real people, truthfully reported data,
3
16663
3625
00:20
and commonplace statistical analyses.
4
20288
3000
00:23
So how did they do it?
5
23288
1416
00:24
The answer lies in a statistical method scientists often use
6
24704
4125
00:28
to try to figure out whether their results mean something or if they’re random noise.
7
28829
4875
00:33
In fact, the whole point of the music study
8
33704
2625
00:36
was to point out ways this method can be misused.
9
36329
3917
00:40
A famous thought experiment explains the method:
10
40746
2791
00:43
there are eight cups of tea,
11
43746
1750
00:45
four with the milk added first, and four with the tea added first.
12
45496
4416
00:50
A participant must determine which are which according to taste.
13
50162
3625
00:53
There are 70 different ways the cups can be sorted into two groups of four,
14
53871
4583
00:58
and only one is correct.
15
58454
2000
01:00
So, can she taste the difference?
16
60662
2584
01:03
That’s our research question.
17
63246
1625
01:04
To analyze her choices, we define what’s called a null hypothesis:
18
64871
4625
01:09
that she can’t distinguish the teas.
19
69496
2167
01:11
If she can’t distinguish the teas,
20
71871
2042
01:13
she’ll still get the right answer 1 in 70 times by chance.
21
73913
5166
01:19
1 in 70 is roughly .014.
22
79079
3334
01:22
That single number is called a p-value.
23
82746
3292
01:26
In many fields, a p-value of .05 or below is considered statistically significant,
24
86038
6916
01:32
meaning there’s enough evidence to reject the null hypothesis.
25
92954
3792
01:36
Based on a p-value of .014,
26
96996
3375
01:40
they’d rule out the null hypothesis that she can’t distinguish the teas.
27
100371
4125
01:44
Though p-values are commonly used by both researchers and journals
28
104913
3916
01:48
to evaluate scientific results,
29
108829
2084
01:50
they’re really confusing, even for many scientists.
30
110913
2958
01:54
That’s partly because all a p-value actually tells us
31
114329
4042
01:58
is the probability of getting a certain result,
32
118371
3000
02:01
assuming the null hypothesis is true.
33
121371
2917
02:04
So if she correctly sorts the teas,
34
124663
2791
02:07
the p-value is the probability of her doing so
35
127454
3417
02:10
assuming she can’t tell the difference.
36
130871
2458
02:13
But the reverse isn’t true:
37
133329
2459
02:15
the p-value doesn’t tell us the probability
38
135788
2416
02:18
that she can taste the difference,
39
138204
1625
02:19
which is what we’re trying to find out.
40
139829
2084
02:22
So if a p-value doesn’t answer the research question,
41
142329
3250
02:25
why does the scientific community use it?
42
145579
2292
02:28
Well, because even though a p-value doesn’t directly state the probability
43
148329
4709
02:33
that the results are due to random chance,
44
153038
2500
02:35
it usually gives a pretty reliable indication.
45
155538
3333
02:39
At least, it does when used correctly.
46
159204
2792
02:41
And that’s where many researchers, and even whole fields,
47
161996
3917
02:45
have run into trouble.
48
165913
1458
02:47
Most real studies are more complex than the tea experiment.
49
167538
3458
02:51
Scientists can test their research question in multiple ways,
50
171288
3375
02:54
and some of these tests might produce a statistically significant result,
51
174663
4375
02:59
while others don’t.
52
179038
1208
03:00
It might seem like a good idea to test every possibility.
53
180454
3167
03:03
But it’s not, because with each additional test,
54
183913
3083
03:07
the chance of a false positive increases.
55
187163
3208
03:10
Searching for a low p-value, and then presenting only that analysis,
56
190996
4500
03:15
is often called p-hacking.
57
195496
2750
03:18
It’s like throwing darts until you hit a bullseye
58
198246
2750
03:20
and then saying you only threw the dart that hit the bull’s eye.
59
200996
3333
03:24
This is exactly what the music researchers did.
60
204746
3208
03:28
They played three groups of participants each a different song
61
208079
3709
03:31
and collected lots of information about them.
62
211788
2500
03:34
The analysis they published included only two out of the three groups.
63
214288
4250
03:38
Of all the information they collected,
64
218538
2208
03:40
their analysis only used participants’ fathers’ age—
65
220746
3542
03:44
to “control for variation in baseline age across participants.”
66
224288
4541
03:49
They also paused their experiment after every ten participants,
67
229246
4208
03:53
and continued if the p-value was above .05,
68
233454
4459
03:57
but stopped when it dipped below .05.
69
237913
3291
04:01
They found that participants who heard one song were 1.5 years younger
70
241746
5208
04:06
than those who heard the other song, with a p-value of .04.
71
246954
4375
04:12
Usually it’s much tougher to spot p-hacking,
72
252163
2833
04:14
because we don’t know the results are impossible:
73
254996
2667
04:17
the whole point of doing experiments is to learn something new.
74
257663
3416
04:21
Fortunately, there’s a simple way to make p-values more reliable:
75
261329
4209
04:25
pre-registering a detailed plan for the experiment and analysis
76
265913
4708
04:30
beforehand that others can check,
77
270621
2458
04:33
so researchers can’t keep trying different analyses
78
273079
3417
04:36
until they find a significant result.
79
276496
2125
04:38
And, in the true spirit of scientific inquiry,
80
278788
2458
04:41
there’s even a new field that’s basically science doing science on itself:
81
281246
5375
04:46
studying scientific practices in order to improve them.
82
286621
3667
About this website

This site will introduce you to YouTube videos that are useful for learning English. You will see English lessons taught by top-notch teachers from around the world. Double-click on the English subtitles displayed on each video page to play the video from there. The subtitles scroll in sync with the video playback. If you have any comments or requests, please contact us using this contact form.

https://forms.gle/WvT1wiN1qDtmnspy7