The Next Computer? Your Glasses | Shahram Izadi | TED

492,079 views ・ 2025-04-18

TED


Please double-click on the English subtitles below to play the video.

00:04
For our entire lives, we’ve been living through a computing revolution.
0
4035
3770
00:08
Many of you here in this room have contributed to it,
1
8305
3104
00:11
with leaps forward in compute,
2
11442
2102
00:13
connectivity, mobile technologies,
3
13577
2736
00:16
and now AI.
4
16347
1702
00:18
For my part, I've dedicated my entire career to augmented reality,
5
18516
4471
00:23
fusing the real world with computing experiences.
6
23020
2803
00:26
I started this journey about 25 years ago for my PhD.
7
26624
4304
00:31
It might sound groundbreaking, but check out these early prototypes.
8
31362
3804
00:35
The technology was really primitive back then,
9
35633
2702
00:38
but the potential felt limitless
10
38369
1735
00:40
when we were experimenting in that university lab.
11
40137
3137
00:44
What I didn't know at that time
12
44141
1769
00:45
was many of the fundamental innovations for my work
13
45943
3470
00:49
would come from AI researchers in a different lab.
14
49447
3570
00:54
While I was helping computers see the world,
15
54051
2870
00:56
these AI researchers were helping computers reason about the world.
16
56921
3737
01:01
Since then, large language models and multimodal reasoning
17
61358
4138
01:05
have enabled richer language and image understanding.
18
65529
4772
01:10
These models are now fast enough for live conversations
19
70801
3437
01:14
where the AI can act on your behalf
20
74238
2436
01:16
and, most importantly, with your permission.
21
76707
2803
01:20
And augmented reality and virtual reality has moved computing
22
80544
4505
01:25
from the rectangular screen to the 360 immersive display
23
85049
4337
01:29
to now even the world itself becoming the display.
24
89420
3170
01:33
We now refer to this broad collection of experiences as extended reality or XR.
25
93224
5772
01:39
Until now,
26
99997
1568
01:41
these innovations have occurred separately and in silos.
27
101599
4237
01:46
Well here comes act two of the computing revolution.
28
106270
3704
01:50
AI and XR are converging,
29
110007
2703
01:52
unlocking radical new ways to interact with technology on your terms.
30
112743
4838
01:58
Computers will become more lightweight and personal.
31
118249
3436
02:02
They will share your vantage point,
32
122219
2303
02:04
understand your real-world context
33
124555
2269
02:06
and have a natural interface that's both simple and conversational.
34
126857
5039
02:12
Countless people and companies are innovating in this space,
35
132763
4238
02:17
including many on the TED stage this week.
36
137034
2402
02:20
We're excited to contribute to this momentum with Android XR.
37
140337
4171
02:25
It's an operating system we're building with Samsung
38
145309
3003
02:28
that brings XR hardware together with Gemini, our AI assistant,
39
148312
5138
02:33
to augment human intelligence.
40
153484
2436
02:36
It can support a broad range of devices,
41
156620
2436
02:39
from headsets to glasses
42
159089
2336
02:41
to form factors we haven't even dreamed of yet.
43
161458
2670
02:44
OK, let me show you where we’re heading
44
164528
2803
02:47
by inviting my colleague Nishtha to the stage.
45
167331
3403
02:51
Welcome, Nishtha.
46
171402
1168
02:52
(Applause)
47
172603
1668
02:54
Nishtha Bhatia: Hi.
48
174305
1401
02:56
Shahram Izadi: These ordinary-looking glasses are packed full of technology.
49
176240
3737
02:59
A miniaturized camera and microphones
50
179977
2269
03:02
give the AI the ability to see and hear the world.
51
182279
3504
03:05
Speakers let you listen to the AI and play music or even take calls.
52
185783
4538
03:10
And these glasses also have a tiny,
53
190788
2436
03:13
high-resolution in-lens display
54
193257
2936
03:16
that's full color
55
196193
1435
03:17
that I'm holding in my hand.
56
197661
1869
03:19
It's very, very small.
57
199530
1435
03:22
These glasses work with your phone streaming back and forth,
58
202333
3703
03:26
allowing the glasses to be very lightweight
59
206070
2702
03:28
and access all your phone apps.
60
208806
2002
03:30
And if you're wondering, I'm actually wearing the glasses too.
61
210841
2903
03:33
They're actually helping me see all of you in the audience
62
213777
2736
03:36
because they have prescription lenses inside them.
63
216547
2402
03:38
But they’re also displaying my speaker notes for me as well.
64
218983
2836
03:41
(Laughter)
65
221819
1234
03:43
For this demo,
66
223087
1601
03:44
you'll see what Nishtha is seeing on the screen behind her.
67
224688
2937
03:47
And this is the very first time we're showing these glasses in public.
68
227658
3837
03:51
So this is going to be a live demo of conceptual hardware, early software.
69
231528
4872
03:56
What could go wrong?
70
236433
1669
03:58
Nishtha, best of luck.
71
238135
1902
04:00
No pressure.
72
240070
1168
04:01
NB: Amazing.
73
241705
1402
04:03
Alright, let’s just make sure that these glasses are shown
74
243107
2869
04:06
on the screen behind us.
75
246010
1534
04:08
OK.
76
248712
1235
04:09
[Glasses screen off]
77
249980
1602
04:13
Awesome.
78
253250
1201
04:14
(Applause)
79
254451
2002
04:16
Now I'm going to launch Gemini.
80
256487
2068
04:19
Gemini: Hello there.
81
259657
1368
04:21
NB: Hi, Gemini.
82
261058
1401
04:22
Can you start us off with something fun and creative,
83
262493
3203
04:25
and write a haiku for what I'm seeing right now?
84
265729
3170
04:30
G: Sure.
85
270100
1168
04:31
Faces all aglow.
86
271268
1168
04:32
Eager minds await the words.
87
272469
1368
04:33
Sparks of thought ignite.
88
273871
1768
04:36
SI: Some anxious but happy faces as well, yeah.
89
276540
3670
04:40
As you can see, the AI sees what Nishtha sees,
90
280210
2770
04:42
hears what she hears
91
282980
1401
04:44
and is responding in real time.
92
284381
1702
04:46
But that’s just table stakes.
93
286417
1434
04:47
I think everyone in this audience has seen demos like this before.
94
287851
3204
04:51
Let's actually step it up a notch with something we call Memory.
95
291088
3637
04:54
For a rolling contextual window,
96
294725
2002
04:56
the AI remembers what you see
97
296760
1969
04:58
without having to be told what to keep track of.
98
298762
3003
05:02
NB: So you guys may have noticed
99
302199
1802
05:04
I snuck a peek back at the shelf a moment ago.
100
304034
3304
05:07
I wasn't paying attention, but let's see if Gemini was.
101
307371
3203
05:12
Hey, did you happen to catch the title of the white book
102
312643
4838
05:17
that was on the shelf behind me?
103
317514
1769
05:20
G: The white book is “Atomic Habits” by James Clear.
104
320617
2903
05:24
NB: That is absolutely right.
105
324421
2169
05:26
So let's try something harder.
106
326623
1936
05:28
I keep losing my hotel key card.
107
328992
2403
05:31
Do you know where I last left the card?
108
331428
2303
05:35
G: The hotel key card is to the right of the music record.
109
335265
3204
05:39
NB: Great.
110
339436
1368
05:41
SI: For someone as forgetful as me, that's a killer app.
111
341572
2769
05:45
OK, this AI is inherently multimodal.
112
345075
2970
05:48
So it's using visual data and natural language
113
348078
2703
05:50
to digest complex queries in real time
114
350814
3070
05:53
while remembering your past context.
115
353917
2336
05:56
Nishtha, let's see some more.
116
356286
1535
05:57
NB: Yeah, let’s bring this book back.
117
357821
2303
06:00
So I don't really have time to read it right now.
118
360157
3136
06:03
But I wonder if Gemini might be able to help me summarize what's going on.
119
363327
5138
06:08
Like, I don't know.
120
368499
2002
06:10
What does this diagram mean?
121
370534
1668
06:12
It looks complex.
122
372236
1368
06:14
G: This diagram is labeled “The Habit Line”
123
374805
2436
06:17
and illustrates the process of habit formation.
124
377241
2802
06:20
It shows how automaticity increases with repetitions
125
380077
3303
06:23
until a habit is formed.
126
383380
1869
06:25
NB: OK.
127
385616
1134
06:26
SI: Now imagine being able to communicate and understand any language.
128
386784
4137
06:30
The AI can shift its output to speak different languages
129
390954
3003
06:33
with dialects and accents that feel human and natural.
130
393991
3336
06:37
NB: Yeah, let’s go to this sign over here.
131
397961
3237
06:41
Can you translate this to English for me?
132
401231
2970
06:45
G: The sign states: “Private property, no trespassing.”
133
405569
3737
06:49
SI: OK, let's prove this is a live demo.
134
409339
1969
06:51
Does anyone in the audience have a suggestion
135
411308
2102
06:53
for a different language to translate to?
136
413444
1968
06:55
Audience: Farsi.
137
415746
1134
06:56
SI: Farsi.
138
416914
1334
06:58
We tried Farsi, it failed last time, but let's try it again.
139
418282
3103
07:01
NB: Do you want to try Farsi?
140
421385
1902
07:03
SI: Because I do speak Farsi.
141
423287
1401
07:04
It's my mother tongue, so thank you.
142
424721
2636
07:07
NB: Gemini, let’s just give this a shot.
143
427357
2169
07:09
Can you translate this sign to Farsi for us?
144
429560
3236
07:14
G: (Speaking Farsi)
145
434031
3069
07:17
SI: Great, awesome.
146
437568
1902
07:19
It speaks Farsi.
147
439503
1468
07:21
NB: That’s amazing.
148
441004
1268
07:22
So, as Shahram mentioned,
149
442806
1835
07:24
you all may have seen translation demos like this before,
150
444675
3336
07:28
but what's new now is that in addition to just saying things
151
448045
4037
07:32
in a different language,
152
452115
1202
07:33
I can also speak to Gemini in another language.
153
453317
2969
07:36
I know Hindi, so let's give this a shot.
154
456620
2302
07:41
(Speaks Hindi)
155
461525
4604
07:49
G: (Speaks Hindi)
156
469099
2503
07:52
NB: So Gemini said you all look focused and excited,
157
472703
3203
07:55
and it has a better accent than I do.
158
475906
1902
07:57
(Laughter)
159
477841
1235
07:59
SI: Alright, now let's see how the AI can connect the physical world
160
479776
3537
08:03
with your digital content and take action.
161
483313
2770
08:06
NB: Yeah, let’s get some music going in here.
162
486116
3070
08:09
OK, Gemini, why don't you play us a track from this record?
163
489920
5339
08:16
GB: Sure, here’s “Bad Dreams” by Teddy Swims.
164
496860
3370
08:20
(Music)
165
500230
3003
08:23
SI: Perfect.
166
503267
1167
08:24
In a few seconds, the AI recognized the album,
167
504468
2569
08:27
looked up the tracklist
168
507070
1268
08:28
and took action using the phone apps.
169
508372
2435
08:30
OK, Nishtha, it does look like the demo gods are with us.
170
510807
2703
08:33
Maybe with you more than me.
171
513510
1635
08:35
Let's do one last demo
172
515846
1268
08:37
I know you're keen to give.
173
517114
1301
08:38
NB: Yes, this is my first time in Vancouver,
174
518448
3070
08:41
and I love going on walks.
175
521518
1735
08:43
So why don't you navigate me to a park nearby with views of the ocean?
176
523287
6106
08:51
G: OK, I am starting navigation to Lighthouse Park,
177
531562
3370
08:54
which has magnificent views of the Pacific Ocean.
178
534932
2736
08:58
Is there anything else I can assist you with?
179
538302
2302
09:00
NB: Honestly, with these directions and a 3D map,
180
540637
3370
09:04
I should be all set, and hopefully I won’t look like a tourist.
181
544041
2969
09:07
Thank you all.
182
547010
1335
09:08
(Applause)
183
548378
6373
09:15
SI: Thank you, Nishtha, that was awesome.
184
555252
2436
09:17
OK, we've seen glasses.
185
557721
1635
09:19
Now let's turn our attention to the other side of the spectrum: headsets.
186
559389
3871
09:23
You've seen these types of devices before,
187
563827
2302
09:26
but when we first brought AI to a headset,
188
566163
2269
09:28
it completely caught me by surprise.
189
568465
2236
09:31
For this demo, we're going to use the Project Moohan headset
190
571101
2836
09:33
that Samsung is launching later this year.
191
573971
2636
09:36
Compared to glasses, headsets give you an infinite display
192
576640
3403
09:40
for getting work done or immersing yourself in a movie,
193
580077
3003
09:43
or maybe a TED Talk at some point.
194
583113
1869
09:45
Let me bring up my colleague Max to show us even more.
195
585549
3003
09:48
Hey, Max.
196
588585
1168
09:49
Max Spear: Hello.
197
589786
1135
09:50
(Applause)
198
590954
1001
09:51
SI: And the same thing is going to go here.
199
591989
2035
09:54
You'll see exactly what Max is seeing behind on the screen.
200
594024
3036
09:58
Go for it, Max.
201
598128
1168
09:59
MS: Let’s do it.
202
599296
1368
10:01
You'll notice we start grounded in the real world.
203
601231
2369
10:03
And I'm going to control the entire system with my eyes, hands and voice.
204
603634
4738
10:08
But where things get really interesting
205
608905
1869
10:10
is when we invite Gemini in as this conversational companion
206
610807
3404
10:14
that can come with us anywhere.
207
614244
1668
10:17
G: Hello.
208
617914
1202
10:19
MS: Hey, Gemini, can you bring up my trip planner for me, please?
209
619149
4104
10:23
G: Of course. Opening up your trip planner.
210
623286
2603
10:25
MS: Cool, but I left these windows really disorganized.
211
625922
2803
10:28
Can you help with that?
212
628759
1401
10:31
G: Of course I can help with that.
213
631695
2102
10:34
SI: No clicks, no keyboards.
214
634898
1935
10:36
It's just a conversation.
215
636867
1601
10:38
And the AI is taking action.
216
638502
2536
10:41
OK, some more audience participation.
217
641438
2169
10:43
Someone shout out a name of a place you want to visit.
218
643640
2603
10:46
Audience: Melbourne.
219
646643
1368
10:48
Audience: Cape Town.
220
648011
1669
10:49
SI: Let's go to Cape Town.
221
649680
1267
10:50
Max: OK, sounds fun.
222
650981
1268
10:52
Can you please take me to Cape Town?
223
652582
2403
10:57
G: Certainly.
224
657087
1168
10:58
Let me help with organizing the windows.
225
658288
2202
11:00
SI (Laughs)
226
660490
1035
11:01
MS: Awesome.
227
661525
1168
11:02
And can you also take me to Cape Town?
228
662726
1835
11:04
(Laughter)
229
664561
1201
11:06
G: I can certainly do that.
230
666096
1602
11:07
Let me take you to Cape Town.
231
667698
1401
11:09
MS: And we’re very organized as we go there, perfect.
232
669099
2569
11:12
SI: As you can see, the AI is taking Max's requests,
233
672803
2936
11:15
figuring out how best to answer it,
234
675772
1936
11:17
opening up the Maps app.
235
677708
1568
11:19
And from there, he can actually explore anywhere in the world in this 3D view.
236
679309
4438
11:24
MS: OK, this is pretty incredible.
237
684481
1935
11:26
Viewing the world from this angle, I can even zoom in to city levels.
238
686416
3270
11:29
But what's really interesting is having an AI here
239
689686
2603
11:32
who can see what I see.
240
692289
1435
11:33
Can you tell me more about the significance of this place?
241
693724
3470
11:39
G: I can indeed provide information about Table Mountain.
242
699329
3303
11:43
Table Mountain holds a profound significance,
243
703066
2503
11:45
deeply rooted in both its natural splendor
244
705602
2102
11:47
and its rich cultural history.
245
707738
1768
11:49
It has long been a spiritual and cultural symbol
246
709506
2569
11:52
for the Indigenous Khoekhoe and San people.
247
712109
2435
11:54
MS: OK, very good, Gemini.
248
714578
1434
11:56
I'll come back to this later.
249
716546
1402
11:57
It also does look like a table, that's handy.
250
717981
2102
12:00
SI: OK, Max, let's bring this closer to home.
251
720117
3036
12:03
How about exploring winter sports, seeing that we're in Vancouver.
252
723186
3137
12:06
MS: OK, let’s get some inspiration for Shahram.
253
726323
2402
12:08
Can you please show me 360 videos of this?
254
728759
3103
12:11
[Powder snowboarding]
255
731895
1668
12:14
G: Sure, here are some 360-degree powder snowboarding videos.
256
734297
3938
12:18
MS: OK, I’m kind of new to snowboarding,
257
738602
2435
12:21
so hopefully this video will help me learn.
258
741037
2136
12:23
Although this looks already way more intense
259
743173
2536
12:25
than I think I'm ready for.
260
745742
1468
12:28
What was the name of that trick?
261
748111
1802
12:32
G: That’s a backside 540 with a grab.
262
752449
2436
12:36
SI: The AI knows exactly what Max is looking at,
263
756386
2503
12:38
even in rich video content,
264
758922
1869
12:40
and can answer any question in real-time.
265
760824
2402
12:43
Max, are you sure this is Whistler?
266
763894
1968
12:46
MS: You know, Shahram, I actually have no idea.
267
766429
2269
12:48
I've never been here before, but let's find out.
268
768732
3537
12:52
What run do you think this is?
269
772302
1802
12:57
G: It’s probably one of the ungroomed runs in summit bowl
270
777307
3070
13:00
or the top section of West Ridge on Mount Bachelor,
271
780410
2403
13:02
as they are close to the summit and face north
272
782813
2168
13:05
towards Three Sisters and Broken Top,
273
785015
1768
13:06
the prominent mountains visible in the background.
274
786817
2402
13:09
MS: OK, Mount Bachelor, Oregon, got it.
275
789252
2202
13:11
SI: OK, let's have a little bit of fun now.
276
791488
2736
13:14
Let's have AI narrate this video.
277
794224
2069
13:16
Somebody in the audience, shout out a fun narration style
278
796326
3170
13:19
like, nature documentary or something like that.
279
799529
3070
13:22
Audience: Morgan Freeman.
280
802632
1235
13:24
SI: We can't do actors, but we'll get to that at some point.
281
804634
3470
13:28
Any other suggestions?
282
808839
1401
13:30
Audience: Movie trailer.
283
810273
1268
13:31
Audience: Horror movie.
284
811908
1302
13:33
SI: Horror movie.
285
813243
1301
13:34
That's going to be -- it might be gory, but let's go for it.
286
814544
2970
13:37
MS: Slightly dark, but let’s do it.
287
817514
2536
13:40
Can you please describe what you're seeing
288
820083
2102
13:42
as if you're an overly enthusiastic character in a horror movie?
289
822219
4437
13:49
G: Oh, darling, this is simply marvelous.
290
829025
2269
13:51
A desolate mountain scape.
291
831328
1501
13:52
How delightfully ominous.
292
832863
1634
13:54
Prepare for a chilling dance with the unknown.
293
834531
2636
13:57
Where every gust of wind whispers tales of icy terror and lurking shadows.
294
837200
4371
14:01
MS: OK, Gemini, that’s great.
295
841571
1735
14:03
Return to more positive things another day.
296
843306
2970
14:06
SI: OK, Max, I know there's one last demo,
297
846309
2570
14:08
and you've been apparently practicing your farming skills?
298
848912
3437
14:12
MS: Yes, Shahram, I’ve been excited to get back to my rural roots
299
852382
3136
14:15
and also embrace one of my favorite things,
300
855552
2602
14:18
which is playing games.
301
858188
1601
14:19
But it becomes a whole new, next-level experience
302
859789
2303
14:22
when you have an AI by your side,
303
862125
1702
14:23
both to help keep you company as you play,
304
863860
2069
14:25
but also to help you learn how to play these kind of games in the first place.
305
865962
3938
14:29
So, for example, what do you think I should do next here?
306
869933
3470
14:36
G: It looks like this is early in the game in “Stardew Valley.”
307
876673
3337
14:40
You need to earn money, so try planting some seeds.
308
880043
2402
14:42
First, use your hoe to till the soil, and then plant your seeds.
309
882445
3337
14:45
It looks like you have some parsnips.
310
885815
1802
14:47
Since it isn't raining,
311
887651
1167
14:48
you'll also need to water them with your watering can.
312
888852
2702
14:52
MS: Seems like a lot, and maybe robots will do this for us one day,
313
892055
3170
14:55
but for now, I'm going to have to return back to my farm later.
314
895258
2970
14:58
For one last thing, can you please give a playful two-sentence summary
315
898261
3704
15:01
of all the ground we covered today?
316
901965
1802
15:07
G: We soared from organizing a digital realm
317
907203
2470
15:09
to traversing snowy mountains,
318
909706
1668
15:11
and even found time to farm some virtual parsnips.
319
911408
2936
15:14
Quite the adventure today.
320
914377
1702
15:16
MS: Quite the adventure, indeed.
321
916112
1569
15:17
Thanks, Gemini.
322
917681
1201
15:18
(Applause)
323
918915
5372
15:24
SI: Fantastic, thank you Max.
324
924321
1668
15:26
We're entering an exciting new phase of the computing revolution.
325
926523
4171
15:30
Headsets and glasses are just the beginning.
326
930727
2769
15:33
All this points to a single vision of the future,
327
933530
2803
15:36
a world where helpful AI will converge with lightweight XR.
328
936366
4705
15:41
XR devices will become increasingly more wearable,
329
941471
3103
15:44
giving us instant access to information.
330
944607
2703
15:47
While AI is going to become more contextually aware, more conversational,
331
947344
4337
15:51
more personalized,
332
951715
1334
15:53
working with us on our terms and in our language.
333
953083
4004
15:57
We're no longer augmenting our reality,
334
957120
2469
15:59
but rather augmenting our intelligence.
335
959622
2703
16:02
Thank you so much.
336
962726
1434
16:04
(Applause)
337
964194
2536
About this website

This site will introduce you to YouTube videos that are useful for learning English. You will see English lessons taught by top-notch teachers from around the world. Double-click on the English subtitles displayed on each video page to play the video from there. The subtitles scroll in sync with the video playback. If you have any comments or requests, please contact us using this contact form.

https://forms.gle/WvT1wiN1qDtmnspy7