How AI Is Decoding Ancient Scrolls | Julian Schilliger and Youssef Nader | TED

9,554 views ・ 2025-01-30

TED


Please double-click on the English subtitles below to play the video.

00:04
We always think about the potential of AI changing the future.
0
4020
3880
00:08
But what about the potential of AI changing the past?
1
8900
2760
00:12
My name is Youssef Nader.
2
12860
1560
00:14
I'm an Egyptian AI researcher
3
14460
1960
00:16
and a PhD student at the Free University in Berlin,
4
16460
3120
00:19
and last year, I led the Vesuvius Grand Prize winning team
5
19620
4960
00:24
on exploring this very question.
6
24620
2240
00:27
You see, the story starts almost 2,000 years ago.
7
27900
3320
00:32
A Greek philosopher that we believe was Philodemus of Gadara
8
32180
4040
00:36
sat in one of the many rooms of the Villa dei Papiri.
9
36260
2880
00:39
He talked about music, he talked about pleasure,
10
39980
3280
00:43
he talked about what makes things enjoyable,
11
43260
2560
00:45
questions that still plague us until today.
12
45860
2280
00:49
One of his scribes wrote down his thoughts on sheets of papyrus.
13
49300
4800
00:54
The sheets were rolled and stowed away for later generations.
14
54660
4320
01:00
Fast-forward 150 years, ...
15
60180
3120
01:04
Mount Vesuvius erupts,
16
64380
2160
01:06
burying Herculaneum, the villa and the words of the philosopher
17
66540
4800
01:11
under a sea of hot mud and ashes.
18
71380
2840
01:15
Now fast-forward again, to the 17th century.
19
75460
2400
01:18
People are excavating around the area.
20
78420
2160
01:21
They found beautiful statues, breathtaking frescoes
21
81180
4360
01:25
and some weird-looking pieces of charcoal,
22
85580
3640
01:29
like you see in this picture.
23
89260
1720
01:31
This is when the first scrolls were discovered,
24
91700
2880
01:34
and people were racing to excavate more of these.
25
94620
3000
01:38
What knowledge is included that is not known to us now?
26
98300
4320
01:42
What things should we know about these scrolls?
27
102660
2920
01:48
My name is Julian, and I am a digital archaeologist.
28
108860
5040
01:55
When the pyroclastic flow hit the scrolls, it had a destructive effect.
29
115660
6920
02:03
It tore into them, shredded off pieces, and it charred them badly.
30
123460
6200
02:09
Even the deformation that you can see happened at that point.
31
129700
5120
02:14
People, 250-something years ago,
32
134860
3920
02:18
were curious what's lying inside those scrolls,
33
138780
3240
02:22
hidden and not accessible anymore.
34
142060
3200
02:26
Because of a lack of technology,
35
146020
1920
02:27
they had to resort to physically unrolling
36
147980
3920
02:31
and thereby destroying most of the scrolls.
37
151900
3680
02:36
To this day,
38
156340
1320
02:37
only the most damaged and deformed scrolls
39
157700
3000
02:40
remain in their initial, rolled-up configuration.
40
160740
4200
02:45
Fast-forwarding a little bit, the computer age arrives.
41
165780
4320
02:50
Youssef and I are born.
42
170140
2960
02:53
We are going on and getting our education --
43
173100
2840
02:55
(Laughter)
44
175940
1760
02:57
and at the same time, Brent Seales, a researcher and professor,
45
177700
6080
03:03
had the idea to use CT scan technology to actually digitize the scrolls,
46
183820
6560
03:10
with the hope of, one day, digitally unrolling them.
47
190420
5640
03:16
Behind me, you can see a video of such a CT scan,
48
196100
3520
03:19
and it goes through the CT scan 3D volume,
49
199660
3400
03:23
layer by layer.
50
203100
1200
03:25
The papyrus is visible as a spiral,
51
205020
4080
03:29
and you can see it's tightly wound-up,
52
209100
2080
03:31
sometimes touching each other, flaying off.
53
211220
2720
03:33
It's a difficult question, how to unroll this digitally.
54
213980
3640
03:38
Nat Friedman, a Silicon Valley investor,
55
218740
3440
03:42
also saw this research, and he wanted to help.
56
222220
3640
03:46
That was in 2022.
57
226420
2360
03:48
He reached out, and together with Brent Seales,
58
228820
3080
03:51
they created the Vesuvius Challenge,
59
231940
2760
03:54
with the goal to motivate nerds all over the world
60
234740
3720
03:58
to solve this problem.
61
238460
1560
04:00
(Laughter)
62
240020
1920
04:01
They created a grand prize,
63
241980
1800
04:03
promising eternal glory and monetary incentives
64
243820
3560
04:07
to anyone who could do that.
65
247420
1560
04:09
(Laughter)
66
249020
1800
04:10
I myself saw that on the internet
67
250860
2280
04:13
while writing my master's thesis at ETH Zurich, in robotics,
68
253180
4040
04:17
and I was instantly happy to solve it --
69
257220
3280
04:20
or at least try, why not, you know?
70
260540
2320
04:22
And I went on, joined the Discord community
71
262900
3880
04:26
where all the people that were also contestants
72
266820
2720
04:29
and playing with the scroll data
73
269540
1800
04:31
were exchanging ideas,
74
271340
2080
04:33
and I joined there and started working on it.
75
273420
2840
04:37
Also there, on Discord, I met Youssef and Luke [Farritor],
76
277820
3240
04:41
who would become my teammates,
77
281100
1440
04:42
and with whom I would actually win the grand prize.
78
282580
3360
04:45
Surprisingly, it went on, and made global headline news.
79
285980
3800
04:50
It even got into the British tabloids.
80
290460
2400
04:52
(Laughter)
81
292900
3040
04:55
So when we started,
82
295940
1560
04:57
there were two main problems still remaining.
83
297500
3400
05:00
One, you had to unroll the scroll.
84
300940
2840
05:03
And two, you then had to make the ink visible.
85
303780
3560
05:07
Youssef will tell you more about that part.
86
307380
2280
05:10
For me, the most exciting thing
87
310660
2120
05:12
was the computer-vision problem of unrolling those scrolls virtually.
88
312780
4280
05:17
I decided to iterate on a tool
89
317060
2240
05:19
that was created by the Kentucky researchers,
90
319340
2800
05:22
and make it faster, less prone to errors
91
322180
3640
05:25
and just iterate on it and make it better.
92
325820
2800
05:29
The Vesuvius Challenge team saw that
93
329380
1840
05:31
and also implemented a team of 10 people that would use my tool.
94
331220
4680
05:35
They would annotate scroll data, like you see in this video,
95
335940
3840
05:39
where they created a red line where the surface would lie.
96
339820
3440
05:43
The algorithm then would take it into 3D space,
97
343300
3120
05:46
creating a three-dimensional representation of the surface.
98
346460
4040
05:50
Computer algorithms
99
350860
1240
05:52
would then flatten it and create a segment.
100
352140
2360
05:55
This all would be called “segmentation”
101
355220
2680
05:57
in the space of the scrolling and unrolling community.
102
357940
6240
06:04
(Laughter)
103
364940
5200
06:10
So I created open-source commits to this tool
104
370140
5760
06:15
and implemented new algorithms from my studies, like Optical Flow,
105
375900
3840
06:19
to better track the sheets through the volume,
106
379780
3560
06:23
and we end up with something like what you see behind me.
107
383380
3840
06:27
First off, those were really small segments,
108
387260
2920
06:30
and I added improvement,
109
390180
1480
06:31
made the code faster
110
391700
1440
06:33
and had lots of feedback from the community.
111
393180
2880
06:36
They were really happy, and I was happy getting lots of feedback.
112
396100
3400
06:39
It was a really positive environment.
113
399540
2120
06:43
So in the end,
114
403300
1800
06:45
I could track the performance of the algorithms,
115
405140
2720
06:47
how the segmentation team performed,
116
407900
2160
06:50
and I could see that my improvements, from start to finish,
117
410100
3560
06:53
would be around a 10,000-fold improvement over the initial version.
118
413660
5560
07:00
This algorithm was then also used to unroll all the area
119
420580
4320
07:04
that you can see in our submission.
120
424940
2040
07:07
All the sheets were generated with these methods.
121
427020
3240
07:11
In December, I was looking for teammates.
122
431620
3280
07:14
I made a blog post,
123
434940
1280
07:16
and I showcased my newest algorithms,
124
436260
3280
07:19
reaching out to anyone that was willing to team up.
125
439580
3240
07:23
Youssef and Luke got into contact with me.
126
443780
3160
07:27
They were happy to team up, and I was happy as well.
127
447740
3280
07:31
(Laughter)
128
451740
1400
07:33
So after the virtual unwrapping, the words still are not visible.
129
453820
4720
07:39
The main problem is that the ink that was used at the time
130
459100
2920
07:42
was a carbon-based ink,
131
462020
1840
07:43
and carbon-based ink on carbon-based papyrus in a CT scan
132
463900
3440
07:47
isn’t visible, or at least [not] to the naked eye.
133
467380
2800
07:50
So the same team at the University of Kentucky
134
470860
2200
07:53
decided to test whether the ink was present at all
135
473100
2600
07:55
in the CT scans.
136
475700
1360
07:57
For this, they took some of the pieces that people broke off the scrolls,
137
477100
5160
08:02
and they fed them into the same pipeline of the X-ray CT scanning,
138
482260
5040
08:07
and this gives us the 3D data that we were working with.
139
487300
3040
08:10
Because you can see the ink and it’s an exposed surface,
140
490380
3840
08:14
you can even improve it with infrared imaging.
141
494220
2600
08:16
And this gives you a ground truth
142
496820
1640
08:18
of what letters you're actually trying to find.
143
498460
2480
08:21
And then from there,
144
501620
1280
08:22
you can train a machine-learning model to try to find these letters.
145
502940
3640
08:27
The way this works
146
507260
1440
08:28
is that the model looks at very small cubes at a single time
147
508740
3440
08:32
and tries to decide whether there is ink present in this area or not.
148
512220
4080
08:36
And then, when you keep moving this cube all around,
149
516300
3360
08:39
the model gets to see different data samples
150
519700
3040
08:42
and then tries to understand what ink actually is.
151
522780
3560
08:47
So this is how it looks while the model is training.
152
527020
2520
08:49
It's not perfect, but you can see that, especially around the middle,
153
529580
4640
08:54
the model is starting to see the letters perfectly.
154
534260
2880
08:57
So the data is there.
155
537140
2400
08:59
The ink is there.
156
539580
1240
09:00
But it’s just very hard to find and see.
157
540820
1960
09:04
Looking at the CT scan raw data on the left here,
158
544020
4960
09:09
you can see the fibers,
159
549020
2280
09:11
you can see the structure of the papyrus,
160
551340
2960
09:14
but the letters are very, very faint.
161
554340
2640
09:17
The letters from the right image are very, very faint in the CT scans.
162
557020
4400
09:21
And they're actually, in this special case,
163
561460
3080
09:24
characterized by a difference of contrast
164
564580
2480
09:27
and some speckles, freckles, features that are very hard to see.
165
567100
4520
09:32
So what happens if we try to take a look
166
572340
3360
09:35
at the segment that Julian was just showing?
167
575740
2720
09:40
So this is the data that we were working with.
168
580500
2720
09:43
And I'm going to give you 10 seconds to try to find the letters yourself.
169
583260
4040
09:47
(Laughter)
170
587340
1120
09:48
And as a hint,
171
588500
1640
09:50
I'll tell you that there are three letters in this image.
172
590180
2720
09:52
Believe me.
173
592900
2080
09:55
Try to find some pattern, some crackle patterns,
174
595980
5160
10:01
some cracks in there.
175
601140
1520
10:03
If you were able to identify this pattern of these three letters --
176
603420
4160
10:07
(Laughter)
177
607620
2720
10:10
then congratulations.
178
610380
1200
10:11
One year ago, you may have won 40,000 dollars.
179
611620
2360
10:14
(Laughter)
180
614020
2000
10:16
However, if you're like me, and you couldn't make sense of this,
181
616060
3720
10:19
there's a different way that you can find this ink --
182
619780
2680
10:22
one that actually scales very, very well.
183
622460
2440
10:26
So this is where my journey begins with the Vesuvius Challenge.
184
626460
3680
10:30
There is this neat idea in computer-vision literature
185
630180
2880
10:33
where if you don't actually have labels,
186
633060
2120
10:35
if you don't have the goal that you want your AI model to reach,
187
635220
4080
10:39
you can pick an intermediary goal along the way.
188
639300
2480
10:42
So, looking at these two pairs of images,
189
642500
4880
10:47
our eyes can identify that these are the same images,
190
647420
3040
10:50
just flipped.
191
650500
1600
10:52
And we can do that because we understand
192
652100
2680
10:54
the structures that are present in the images.
193
654820
2240
10:57
We can see this little triangle, and it's flipped,
194
657060
3040
11:00
so we know this is the same triangle.
195
660140
2240
11:02
Our eyes already have this feature,
196
662420
2720
11:05
but neural networks don't.
197
665180
2720
11:07
When they see these images,
198
667940
1320
11:09
they can't [tell] that these are the same image.
199
669260
2320
11:11
So one idea, just to let it know about the structures
200
671580
3160
11:14
and familiarize it with the data,
201
674780
1880
11:16
is to show it different views of the same image
202
676700
3560
11:20
and tell it that these are the same images.
203
680260
2000
11:22
And after that, you take this model
204
682900
2800
11:25
and you train it like the previous models that the University of Kentucky did.
205
685700
4440
11:30
And while the approach doesn't fully work, it also doesn't fully not work.
206
690180
4360
11:34
And this was the first image that was produced by the model.
207
694580
3840
11:38
And there was some very faint signal in there.
208
698460
3360
11:41
It seemed like the model was catching on something,
209
701860
2440
11:44
but it wasn't clear, exactly, what the model was catching on.
210
704340
3240
11:48
So I decided to take these predictions and create a new ground truth,
211
708180
6440
11:54
asking the model, "Hey, I think these might be letters.
212
714660
2760
11:57
I think there's something in there. Try to find more of this."
213
717420
3360
12:00
And my ground truth, actually,
214
720820
2000
12:02
has four correct letters and four other delusions.
215
722820
3200
12:06
But that was OK.
216
726060
1680
12:07
So training a new model with this data,
217
727780
2320
12:10
the model started to find more ink, find more letters,
218
730140
2760
12:12
and the lines even looked complete.
219
732940
2520
12:16
So I thought, "What are the chances that if I do this again,
220
736140
3920
12:20
the models keep improving?"
221
740100
1480
12:22
And this was the core behind our grand prize-winning solution.
222
742220
4400
12:27
Repeating this process over and over, the models kept improving.
223
747380
3480
12:31
The main trick was you needed to prevent the models from memorizing
224
751740
4440
12:36
what the previous models have learned.
225
756220
2000
12:38
You're essentially asking the model to learn
226
758260
2080
12:40
what the other model has learned.
227
760340
1880
12:42
So overfitting was a serious problem that required a lot of experiments.
228
762260
4760
12:47
But in the end, getting the recipe right,
229
767020
3440
12:50
we were able to predict all of these letters
230
770460
3320
12:53
without the models ever seeing them.
231
773780
2080
12:55
These were the first 10 letters.
232
775860
1640
12:57
There are, like, 20 in there,
233
777540
1600
12:59
but this was the first coherent word read from an unopened papyrus sheet.
234
779180
5640
13:05
From there, scaling the process,
235
785860
1800
13:07
within weeks, we had, now, columns of text,
236
787700
3440
13:11
even special characters
237
791180
1320
13:12
that papyrologists found very interesting that the model was able to find.
238
792540
3840
13:17
The approach was open-sourced,
239
797660
1720
13:19
and the data and the code were out there,
240
799420
2520
13:21
and the race for the grand prize was on.
241
801980
2120
13:24
Recovering four paragraphs at an 85-percent clarity.
242
804900
4160
13:29
And the key to our success was perfecting the data and the model
243
809100
3760
13:32
with so many iterations and so many experiments.
244
812900
2440
13:36
In the end, we were able to recover
245
816020
2080
13:38
more than 14 columns of text, and 2,000 letters.
246
818140
4000
13:44
(Applause)
247
824260
7000
13:51
2,000 characters safely stored away two millennia ago.
248
831820
5240
13:57
In just nine months, we discovered them again.
249
837940
4080
14:02
AI helped us, in large portions,
250
842460
2600
14:05
writing better code and even being part in our algorithms.
251
845060
4280
14:10
It opened a window into the past.
252
850060
2320
14:12
What's next?
253
852980
1520
14:14
Let's open this window more.
254
854500
2000
14:16
AI will help us access information that was so far safely locked away.
255
856540
5960
14:24
In the words of the author,
256
864180
1800
14:27
"We do not refrain from questioning nor understanding,
257
867660
5960
14:35
and may it be evident to say true things as they appear."
258
875380
6680
14:43
(Applause)
259
883220
1960
About this website

This site will introduce you to YouTube videos that are useful for learning English. You will see English lessons taught by top-notch teachers from around the world. Double-click on the English subtitles displayed on each video page to play the video from there. The subtitles scroll in sync with the video playback. If you have any comments or requests, please contact us using this contact form.

https://forms.gle/WvT1wiN1qDtmnspy7