The Inside Story of ChatGPT’s Astonishing Potential | Greg Brockman | TED

1,843,019 views ・ 2023-04-20

TED

Please double-click on the English subtitles below to play the video.

00:03

We started OpenAI seven years ago

3875

2503

00:06

because we felt like something really interesting was happening in AI

6378

3712

00:10

and we wanted to help steer it in a positive direction.

10131

3170

00:15

It's honestly just really amazing to see

15220

2085

00:17

how far this whole field has come since then.

17347

3086

00:20

And it's really gratifying to hear from people like Raymond

20433

3629

00:24

who are using the technology we are building, and others,

24104

2836

00:26

for so many wonderful things.

26982

2127

00:29

We hear from people who are excited,

29150

2503

00:31

we hear from people who are concerned,

31653

1835

00:33

we hear from people who feel both those emotions at once.

33530

2961

00:36

And honestly, that's how we feel.

36533

2252

00:40

Above all, it feels like we're entering an historic period right now

40245

4087

00:44

where we as a world are going to define a technology

44374

4421

00:48

that will be so important for our society going forward.

48795

3086

00:52

And I believe that we can manage this for good.

52924

2628

00:56

So today, I want to show you the current state of that technology

56845

4171

01:01

and some of the underlying design principles that we hold dear.

61016

3086

01:09

So the first thing I'm going to show you

69983

1918

01:11

is what it's like to build a tool for an AI

71943

2086

01:14

rather than building it for a human.

74029

1876

01:17

So we have a new DALL-E model, which generates images,

77574

3545

01:21

and we are exposing it as an app for ChatGPT to use on your behalf.

81161

4045

01:25

And you can do things like ask, you know,

85248

2461

01:27

suggest a nice post-TED meal and draw a picture of it.

87751

6631

01:35

(Laughter)

95216

1419

01:38

Now you get all of the, sort of, ideation and creative back-and-forth

98303

4671

01:43

and taking care of the details for you that you get out of ChatGPT.

103016

4004

01:47

And here we go, it's not just the idea for the meal,

107062

2669

01:49

but a very, very detailed spread.

109773

3587

01:54

So let's see what we're going to get.

114110

2044

01:56

But ChatGPT doesn't just generate images in this case --

116154

3795

01:59

sorry, it doesn't generate text, it also generates an image.

119991

2836

02:02

And that is something that really expands the power

122827

2419

02:05

of what it can do on your behalf in terms of carrying out your intent.

125246

3504

02:08

And I'll point out, this is all a live demo.

128750

2085

02:10

This is all generated by the AI as we speak.

130835

2169

02:13

So I actually don't even know what we're going to see.

133046

2544

02:16

This looks wonderful.

136216

2294

02:18

(Applause)

138510

3712

02:22

I'm getting hungry just looking at it.

142514

1877

02:24

Now we've extended ChatGPT with other tools too,

144724

2753

02:27

for example, memory.

147519

1168

02:28

You can say "save this for later."

148728

2795

02:33

And the interesting thing about these tools

153233

2043

02:35

is they're very inspectable.

155318

1377

02:36

So you get this little pop up here that says "use the DALL-E app."

156695

3128

02:39

And by the way, this is coming to you, all ChatGPT users, over upcoming months.

159823

3712

02:43

And you can look under the hood and see that what it actually did

163535

3086

02:46

was write a prompt just like a human could.

166621

2169

02:48

And so you sort of have this ability to inspect

168790

2628

02:51

how the machine is using these tools,

171459

2086

02:53

which allows us to provide feedback to them.

173586

2086

02:55

Now it's saved for later,

175714

1209

02:56

and let me show you what it's like to use that information

176965

2878

02:59

and to integrate with other applications too.

179884

2503

03:02

You can say,

182387

2210

03:04

“Now make a shopping list for the tasty thing

184639

5506

03:10

I was suggesting earlier.”

190186

1835

03:12

And make it a little tricky for the AI.

192021

2128

03:16

"And tweet it out for all the TED viewers out there."

196276

4337

03:20

(Laughter)

200655

2252

03:22

So if you do make this wonderful, wonderful meal,

202949

2461

03:25

I definitely want to know how it tastes.

205410

2044

03:28

But you can see that ChatGPT is selecting all these different tools

208496

3504

03:32

without me having to tell it explicitly which ones to use in any situation.

212000

4379

03:37

And this, I think, shows a new way of thinking about the user interface.

217088

3879

03:40

Like, we are so used to thinking of, well, we have these apps,

220967

3796

03:44

we click between them, we copy/paste between them,

224763

2335

03:47

and usually it's a great experience within an app

227098

2294

03:49

as long as you kind of know the menus and know all the options.

229434

2961

03:52

Yes, I would like you to.

232395

1293

03:53

Yes, please.

233730

1126

03:54

Always good to be polite.

234898

1251

03:56

(Laughter)

236149

2628

04:00

And by having this unified language interface on top of tools,

240361

5464

04:05

the AI is able to sort of take away all those details from you.

245867

4630

04:10

So you don't have to be the one

250538

1543

04:12

who spells out every single sort of little piece

252123

2294

04:14

of what's supposed to happen.

254459

1543

04:16

And as I said, this is a live demo,

256419

1710

04:18

so sometimes the unexpected will happen to us.

258129

3379

04:21

But let's take a look at the Instacart shopping list while we're at it.

261549

3420

04:25

And you can see we sent a list of ingredients to Instacart.

265386

3254

04:29

Here's everything you need.

269349

1543

04:30

And the thing that's really interesting

270892

1877

04:32

is that the traditional UI is still very valuable, right?

272811

2919

04:35

If you look at this,

275772

1877

04:37

you still can click through it and sort of modify the actual quantities.

277690

4296

04:41

And that's something that I think shows

281986

1877

04:43

that they're not going away, traditional UIs.

283863

3253

04:47

It's just we have a new, augmented way to build them.

287158

2795

04:49

And now we have a tweet that's been drafted for our review,

289994

2920

04:52

which is also a very important thing.

292956

1793

04:54

We can click “run,” and there we are, we’re the manager, we’re able to inspect,

294749

3712

04:58

we're able to change the work of the AI if we want to.

298461

2836

05:02

And so after this talk, you will be able to access this yourself.

302924

5964

05:17

And there we go.

317647

1710

05:19

Cool.

319816

1126

05:22

Thank you, everyone.

322485

1168

05:23

(Applause)

100

323653

3003

05:29

So we’ll cut back to the slides.

101

329367

1627

05:32

Now, the important thing about how we build this,

102

332954

3587

05:36

it's not just about building these tools.

103

336583

2210

05:38

It's about teaching the AI how to use them.

104

338793

2252

05:41

Like, what do we even want it to do

105

341087

1710

05:42

when we ask these very high-level questions?

106

342839

2419

05:45

And to do this, we use an old idea.

107

345258

2669

05:48

If you go back to Alan Turing's 1950 paper on the Turing test, he says,

108

348261

3337

05:51

you'll never program an answer to this.

109

351598

2043

05:53

Instead, you can learn it.

110

353683

1627

05:55

You could build a machine, like a human child,

111

355351

2169

05:57

and then teach it through feedback.

112

357520

2127

05:59

Have a human teacher who provides rewards and punishments

113

359689

2711

06:02

as it tries things out and does things that are either good or bad.

114

362400

3212

06:06

And this is exactly how we train ChatGPT.

115

366237

2002

06:08

It's a two-step process.

116

368239

1168

06:09

First, we produce what Turing would have called a child machine

117

369449

3086

06:12

through an unsupervised learning process.

118

372535

1960

06:14

We just show it the whole world, the whole internet

119

374495

2461

06:16

and say, “Predict what comes next in text you’ve never seen before.”

120

376956

3212

06:20

And this process imbues it with all sorts of wonderful skills.

121

380168

3044

06:23

For example, if you're shown a math problem,

122

383212

2086

06:25

the only way to actually complete that math problem,

123

385298

2544

06:27

to say what comes next,

124

387884

1334

06:29

that green nine up there,

125

389218

1293

06:30

is to actually solve the math problem.

126

390511

2294

06:34

But we actually have to do a second step, too,

127

394432

2169

06:36

which is to teach the AI what to do with those skills.

128

396601

2544

06:39

And for this, we provide feedback.

129

399187

1668

06:40

We have the AI try out multiple things, give us multiple suggestions,

130

400855

3253

06:44

and then a human rates them, says “This one’s better than that one.”

131

404150

3212

06:47

And this reinforces not just the specific thing that the AI said,

132

407362

3086

06:50

but very importantly, the whole process that the AI used to produce that answer.

133

410448

3795

06:54

And this allows it to generalize.

134

414243

1585

06:55

It allows it to teach, to sort of infer your intent

135

415828

2419

06:58

and apply it in scenarios that it hasn't seen before,

136

418247

2503

07:00

that it hasn't received feedback.

137

420750

1585

07:02

Now, sometimes the things we have to teach the AI

138

422669

2460

07:05

are not what you'd expect.

139

425171

1543

07:06

For example, when we first showed GPT-4 to Khan Academy,

140

426756

3086

07:09

they said, "Wow, this is so great,

141

429884

1627

07:11

We're going to be able to teach students wonderful things.

142

431552

2753

07:14

Only one problem, it doesn't double-check students' math.

143

434347

3462

07:17

If there's some bad math in there,

144

437809

1626

07:19

it will happily pretend that one plus one equals three and run with it."

145

439477

3462

07:23

So we had to collect some feedback data.

146

443523

2294

07:25

Sal Khan himself was very kind

147

445858

1544

07:27

and offered 20 hours of his own time to provide feedback to the machine

148

447443

3337

07:30

alongside our team.

149

450780

1501

07:32

And over the course of a couple of months we were able to teach the AI that,

150

452323

3587

07:35

"Hey, you really should push back on humans

151

455910

2044

07:37

in this specific kind of scenario."

152

457954

2044

07:41

And we've actually made lots and lots of improvements to the models this way.

153

461416

4921

07:46

And when you push that thumbs down in ChatGPT,

154

466379

2544

07:48

that actually is kind of like sending up a bat signal to our team to say,

155

468965

3462

07:52

“Here’s an area of weakness where you should gather feedback.”

156

472427

2919

07:55

And so when you do that,

157

475388

1168

07:56

that's one way that we really listen to our users

158

476597

2294

07:58

and make sure we're building something that's more useful for everyone.

159

478933

3378

08:02

Now, providing high-quality feedback is a hard thing.

160

482895

3754

08:07

If you think about asking a kid to clean their room,

161

487025

2460

08:09

if all you're doing is inspecting the floor,

162

489485

2711

08:12

you don't know if you're just teaching them to stuff all the toys in the closet.

163

492196

3796

08:15

This is a nice DALL-E-generated image, by the way.

164

495992

2627

08:19

And the same sort of reasoning applies to AI.

165

499912

4713

08:24

As we move to harder tasks,

166

504667

1794

08:26

we will have to scale our ability to provide high-quality feedback.

167

506502

3796

08:30

But for this, the AI itself is happy to help.

168

510882

3879

08:34

It's happy to help us provide even better feedback

169

514761

2335

08:37

and to scale our ability to supervise the machine as time goes on.

170

517138

3587

08:40

And let me show you what I mean.

171

520767

1543

08:42

For example, you can ask GPT-4 a question like this,

172

522810

4546

08:47

of how much time passed between these two foundational blogs

173

527356

3295

08:50

on unsupervised learning

174

530693

1668

08:52

and learning from human feedback.

175

532403

1794

08:54

And the model says two months passed.

176

534197

2460

08:57

But is it true?

177

537075

1167

08:58

Like, these models are not 100-percent reliable,

178

538284

2252

09:00

although they’re getting better every time we provide some feedback.

179

540536

3921

09:04

But we can actually use the AI to fact-check.

180

544457

3086

09:07

And it can actually check its own work.

181

547543

1877

09:09

You can say, fact-check this for me.

182

549462

1877

09:12

Now, in this case, I've actually given the AI a new tool.

183

552757

3670

09:16

This one is a browsing tool

184

556427

1710

09:18

where the model can issue search queries and click into web pages.

185

558137

3879

09:22

And it actually writes out its whole chain of thought as it does it.

186

562016

3253

09:25

It says, I’m just going to search for this and it actually does the search.

187

565269

3587

09:28

It then it finds the publication date and the search results.

188

568856

3128

09:32

It then is issuing another search query.

189

572026

1919

09:33

It's going to click into the blog post.

190

573945

1877

09:35

And all of this you could do, but it’s a very tedious task.

191

575822

2877

09:38

It's not a thing that humans really want to do.

192

578741

2211

09:40

It's much more fun to be in the driver's seat,

193

580952

2169

09:43

to be in this manager's position where you can, if you want,

194

583162

2836

09:45

triple-check the work.

195

585998

1210

09:47

And out come citations

196

587208

1501

09:48

so you can actually go

197

588709

1168

09:49

and very easily verify any piece of this whole chain of reasoning.

198

589919

3754

09:53

And it actually turns out two months was wrong.

199

593673

2210

09:55

Two months and one week,

200

595883

2169

09:58

that was correct.

201

598094

1251

10:00

(Applause)

202

600888

3837

10:07

And we'll cut back to the side.

203

607645

1502

10:09

And so thing that's so interesting to me about this whole process

204

609147

3920

10:13

is that it’s this many-step collaboration between a human and an AI.

205

613067

3962

10:17

Because a human, using this fact-checking tool

206

617029

2461

10:19

is doing it in order to produce data

207

619532

2210

10:21

for another AI to become more useful to a human.

208

621742

3170

10:25

And I think this really shows the shape of something

209

625454

2545

10:28

that we should expect to be much more common in the future,

210

628040

3087

10:31

where we have humans and machines kind of very carefully

211

631127

2711

10:33

and delicately designed in how they fit into a problem

212

633880

3503

10:37

and how we want to solve that problem.

213

637425

1918

10:39

We make sure that the humans are providing the management, the oversight,

214

639385

3462

10:42

the feedback,

215

642847

1168

10:44

and the machines are operating in a way that's inspectable

216

644015

2752

10:46

and trustworthy.

217

646809

1126

10:47

And together we're able to actually create even more trustworthy machines.

218

647977

3503

10:51

And I think that over time, if we get this process right,

219

651522

2711

10:54

we will be able to solve impossible problems.

220

654233

2127

10:56

And to give you a sense of just how impossible I'm talking,

221

656360

3963

11:00

I think we're going to be able to rethink almost every aspect

222

660323

2878

11:03

of how we interact with computers.

223

663242

2378

11:05

For example, think about spreadsheets.

224

665620

2502

11:08

They've been around in some form since, we'll say, 40 years ago with VisiCalc.

225

668122

4379

11:12

I don't think they've really changed that much in that time.

226

672543

2878

11:16

And here is a specific spreadsheet of all the AI papers on the arXiv

227

676214

5922

11:22

for the past 30 years.

228

682178

1168

11:23

There's about 167,000 of them.

229

683346

1960

11:25

And you can see there the data right here.

230

685348

2878

11:28

But let me show you the ChatGPT take on how to analyze a data set like this.

231

688267

3837

11:37

So we can give ChatGPT access to yet another tool,

232

697318

3837

11:41

this one a Python interpreter,

233

701197

1460

11:42

so it’s able to run code, just like a data scientist would.

234

702657

4004

11:46

And so you can just literally upload a file

235

706661

2335

11:48

and ask questions about it.

236

708996

1335

11:50

And very helpfully, you know, it knows the name of the file and it's like,

237

710373

3545

11:53

"Oh, this is CSV," comma-separated value file,

238

713960

2419

11:56

"I'll parse it for you."

239

716420

1335

11:57

The only information here is the name of the file,

240

717755

2794

12:00

the column names like you saw and then the actual data.

241

720591

3671

12:04

And from that it's able to infer what these columns actually mean.

242

724262

4504

12:08

Like, that semantic information wasn't in there.

243

728766

2294

12:11

It has to sort of, put together its world knowledge of knowing that,

244

731102

3211

12:14

“Oh yeah, arXiv is a site that people submit papers

245

734355

2502

12:16

and therefore that's what these things are and that these are integer values

246

736857

3587

12:20

and so therefore it's a number of authors in the paper,"

247

740486

2628

12:23

like all of that, that’s work for a human to do,

248

743114

2252

12:25

and the AI is happy to help with it.

249

745408

1751

12:27

Now I don't even know what I want to ask.

250

747159

2002

12:29

So fortunately, you can ask the machine,

251

749203

3003

12:32

"Can you make some exploratory graphs?"

252

752248

1877

12:37

And once again, this is a super high-level instruction with lots of intent behind it.

253

757461

4004

12:41

But I don't even know what I want.

254

761507

1668

12:43

And the AI kind of has to infer what I might be interested in.

255

763175

2920

12:46

And so it comes up with some good ideas, I think.

256

766137

2294

12:48

So a histogram of the number of authors per paper,

257

768472

2336

12:50

time series of papers per year, word cloud of the paper titles.

258

770850

2961

12:53

All of that, I think, will be pretty interesting to see.

259

773853

2627

12:56

And the great thing is, it can actually do it.

260

776522

2169

12:58

Here we go, a nice bell curve.

261

778691

1460

13:00

You see that three is kind of the most common.

262

780151

2169

13:02

It's going to then make this nice plot of the papers per year.

263

782320

5797

13:08

Something crazy is happening in 2023, though.

264

788117

2294

13:10

Looks like we were on an exponential and it dropped off the cliff.

265

790411

3128

13:13

What could be going on there?

266

793539

1460

13:14

By the way, all this is Python code, you can inspect.

267

794999

2753

13:17

And then we'll see word cloud.

268

797752

1459

13:19

So you can see all these wonderful things that appear in these titles.

269

799253

3378

13:23

But I'm pretty unhappy about this 2023 thing.

270

803090

2127

13:25

It makes this year look really bad.

271

805634

1711

13:27

Of course, the problem is that the year is not over.

272

807345

2877

13:30

So I'm going to push back on the machine.

273

810222

2878

13:33

[Waitttt that's not fair!!!

274

813142

1585

13:34

2023 isn't over.

275

814727

1293

13:38

What percentage of papers in 2022 were even posted by April 13?]

276

818481

5088

13:44

So April 13 was the cut-off date I believe.

277

824695

2294

13:47

Can you use that to make a fair projection?

278

827656

4922

13:54

So we'll see, this is the kind of ambitious one.

279

834747

2294

13:57

(Laughter)

280

837083

1126

13:59

So you know,

281

839877

1251

14:01

again, I feel like there was more I wanted out of the machine here.

282

841128

3921

14:05

I really wanted it to notice this thing,

283

845049

2502

14:07

maybe it's a little bit of an overreach for it

284

847593

3128

14:10

to have sort of, inferred magically that this is what I wanted.

285

850763

3378

14:14

But I inject my intent,

286

854183

1627

14:15

I provide this additional piece of, you know, guidance.

287

855810

4754

14:20

And under the hood,

288

860564

1168

14:21

the AI is just writing code again, so if you want to inspect what it's doing,

289

861774

3629

14:25

it's very possible.

290

865403

1251

14:26

And now, it does the correct projection.

291

866654

3628

14:30

(Applause)

292

870282

5005

14:35

If you noticed, it even updates the title.

293

875287

2169

14:37

I didn't ask for that, but it know what I want.

294

877498

2336

14:41

Now we'll cut back to the slide again.

295

881794

2544

14:45

This slide shows a parable of how I think we ...

296

885714

4880

14:51

A vision of how we may end up using this technology in the future.

297

891220

3212

14:54

A person brought his very sick dog to the vet,

298

894849

3712

14:58

and the veterinarian made a bad call to say, “Let’s just wait and see.”

299

898561

3336

15:01

And the dog would not be here today had he listened.

300

901897

2795

15:05

In the meanwhile, he provided the blood test,

301

905401

2252

15:07

like, the full medical records, to GPT-4,

302

907653

2586

15:10

which said, "I am not a vet, you need to talk to a professional,

303

910281

3170

15:13

here are some hypotheses."

304

913492

1710

15:15

He brought that information to a second vet

305

915786

2127

15:17

who used it to save the dog's life.

306

917913

1835

15:21

Now, these systems, they're not perfect.

307

921292

2252

15:23

You cannot overly rely on them.

308

923544

2336

15:25

But this story, I think, shows

309

925880

3712

15:29

that a human with a medical professional

310

929592

3044

15:32

and with ChatGPT as a brainstorming partner

311

932678

2461

15:35

was able to achieve an outcome that would not have happened otherwise.

312

935181

3295

15:38

I think this is something we should all reflect on,

313

938476

2419

15:40

think about as we consider how to integrate these systems

314

940895

2711

15:43

into our world.

315

943606

1167

15:44

And one thing I believe really deeply,

316

944815

1835

15:46

is that getting AI right is going to require participation from everyone.

317

946650

4338

15:50

And that's for deciding how we want it to slot in,

318

950988

2377

15:53

that's for setting the rules of the road,

319

953365

1961

15:55

for what an AI will and won't do.

320

955367

2044

15:57

And if there's one thing to take away from this talk,

321

957453

2502

15:59

it's that this technology just looks different.

322

959997

2211

16:02

Just different from anything people had anticipated.

323

962208

2502

16:04

And so we all have to become literate.

324

964710

1835

16:06

And that's, honestly, one of the reasons we released ChatGPT.

325

966587

2961

16:09

Together, I believe that we can achieve the OpenAI mission

326

969548

3128

16:12

of ensuring that artificial general intelligence

327

972718

2252

16:14

benefits all of humanity.

328

974970

1877

16:16

Thank you.

329

976847

1168

16:18

(Applause)

330

978057

6965

16:33

(Applause ends)

331

993322

1168

16:34

Chris Anderson: Greg.

332

994532

1334

16:36

Wow.

333

996242

1167

16:37

I mean ...

334

997868

1126

16:39

I suspect that within every mind out here

335

999662

3753

16:43

there's a feeling of reeling.

336

1003457

2503

16:46

Like, I suspect that a very large number of people viewing this,

337

1006001

3379

16:49

you look at that and you think, “Oh my goodness,

338

1009421

2419

16:51

pretty much every single thing about the way I work, I need to rethink."

339

1011882

3462

16:55

Like, there's just new possibilities there.

340

1015386

2002

16:57

Am I right?

341

1017388

1168

16:58

Who thinks that they're having to rethink the way that we do things?

342

1018597

3337

17:01

Yeah, I mean, it's amazing,

343

1021976

1543

17:03

but it's also really scary.

344

1023561

2002

17:05

So let's talk, Greg, let's talk.

345

1025604

1585

17:08

I mean, I guess my first question actually is just

346

1028524

2377

17:10

how the hell have you done this?

347

1030901

1585

17:12

(Laughter)

348

1032486

1251

17:13

OpenAI has a few hundred employees.

349

1033737

2962

17:16

Google has thousands of employees working on artificial intelligence.

350

1036740

4755

17:21

Why is it you who's come up with this technology

351

1041996

3503

17:25

that shocked the world?

352

1045541

1168

17:26

Greg Brockman: I mean, the truth is,

353

1046709

1751

17:28

we're all building on shoulders of giants, right, there's no question.

354

1048502

3295

17:31

If you look at the compute progress,

355

1051797

1752

17:33

the algorithmic progress, the data progress,

356

1053549

2085

17:35

all of those are really industry-wide.

357

1055634

1835

17:37

But I think within OpenAI,

358

1057469

1252

17:38

we made a lot of very deliberate choices from the early days.

359

1058762

2878

17:41

And the first one was just to confront reality as it lays.

360

1061640

2711

17:44

And that we just thought really hard about like:

361

1064393

2294

17:46

What is it going to take to make progress here?

362

1066687

2210

17:48

We tried a lot of things that didn't work, so you only see the things that did.

363

1068939

3754

17:52

And I think that the most important thing has been to get teams of people

364

1072693

3462

17:56

who are very different from each other to work together harmoniously.

365

1076196

3254

17:59

CA: Can we have the water, by the way, just brought here?

366

1079450

2711

18:02

I think we're going to need it, it's a dry-mouth topic.

367

1082202

3170

18:06

But isn't there something also just about the fact

368

1086665

2795

18:09

that you saw something in these language models

369

1089501

4755

18:14

that meant that if you continue to invest in them and grow them,

370

1094256

3921

18:18

that something at some point might emerge?

371

1098218

3129

18:21

GB: Yes.

372

1101847

1126

18:23

And I think that, I mean, honestly,

373

1103015

2836

18:25

I think the story there is pretty illustrative, right?

374

1105893

2544

18:28

I think that high level, deep learning,

375

1108437

2002

18:30

like we always knew that was what we wanted to be,

376

1110481

2335

18:32

was a deep learning lab, and exactly how to do it?

377

1112858

2419

18:35

I think that in the early days, we didn't know.

378

1115277

2211

18:37

We tried a lot of things,

379

1117529

1210

18:38

and one person was working on training a model

380

1118739

2336

18:41

to predict the next character in Amazon reviews,

381

1121075

2877

18:43

and he got a result where -- this is a syntactic process,

382

1123994

4755

18:48

you expect, you know, the model will predict where the commas go,

383

1128749

3086

18:51

where the nouns and verbs are.

384

1131835

1627

18:53

But he actually got a state-of-the-art sentiment analysis classifier out of it.

385

1133504

4337

18:57

This model could tell you if a review was positive or negative.

386

1137883

2961

19:00

I mean, today we are just like, come on, anyone can do that.

387

1140886

3378

19:04

But this was the first time that you saw this emergence,

388

1144306

3087

19:07

this sort of semantics that emerged from this underlying syntactic process.

389

1147434

5005

19:12

And there we knew, you've got to scale this thing,

390

1152481

2336

19:14

you've got to see where it goes.

391

1154858

1544

19:16

CA: So I think this helps explain

392

1156402

1626

19:18

the riddle that baffles everyone looking at this,

393

1158028

2544

19:20

because these things are described as prediction machines.

394

1160572

2753

19:23

And yet, what we're seeing out of them feels ...

395

1163367

2669

19:26

it just feels impossible that that could come from a prediction machine.

396

1166036

3420

19:29

Just the stuff you showed us just now.

397

1169456

2378

19:31

And the key idea of emergence is that when you get more of a thing,

398

1171875

3838

19:35

suddenly different things emerge.

399

1175754

1585

19:37

It happens all the time, ant colonies, single ants run around,

400

1177339

3045

19:40

when you bring enough of them together,

401

1180384

1877

19:42

you get these ant colonies that show completely emergent, different behavior.

402

1182302

3629

19:45

Or a city where a few houses together, it's just houses together.

403

1185973

3086

19:49

But as you grow the number of houses,

404

1189059

1794

19:50

things emerge, like suburbs and cultural centers and traffic jams.

405

1190894

4588

19:57

Give me one moment for you when you saw just something pop

406

1197276

3211

20:00

that just blew your mind

407

1200529

1668

20:02

that you just did not see coming.

408

1202197

1627

20:03

GB: Yeah, well,

409

1203824

1209

20:05

so you can try this in ChatGPT, if you add 40-digit numbers --

410

1205075

3462

20:08

CA: 40-digit?

411

1208537

1168

20:09

GB: 40-digit numbers, the model will do it,

412

1209705

2169

20:11

which means it's really learned an internal circuit for how to do it.

413

1211915

3254

20:15

And the really interesting thing is actually,

414

1215210

2127

20:17

if you have it add like a 40-digit number plus a 35-digit number,

415

1217337

3212

20:20

it'll often get it wrong.

416

1220591

1710

20:22

And so you can see that it's really learning the process,

417

1222676

2795

20:25

but it hasn't fully generalized, right?

418

1225471

1876

20:27

It's like you can't memorize the 40-digit addition table,

419

1227389

2711

20:30

that's more atoms than there are in the universe.

420

1230100

2294

20:32

So it had to have learned something general,

421

1232394

2086

20:34

but that it hasn't really fully yet learned that,

422

1234480

2377

20:36

Oh, I can sort of generalize this to adding arbitrary numbers

423

1236899

2961

20:39

of arbitrary lengths.

424

1239902

1167

20:41

CA: So what's happened here

425

1241111

1335

20:42

is that you've allowed it to scale up

426

1242488

1793

20:44

and look at an incredible number of pieces of text.

427

1244281

2419

20:46

And it is learning things

428

1246742

1209

20:47

that you didn't know that it was going to be capable of learning.

429

1247951

3379

20:51

GB Well, yeah, and it’s more nuanced, too.

430

1251371

2002

20:53

So one science that we’re starting to really get good at

431

1253415

2878

20:56

is predicting some of these emergent capabilities.

432

1256335

2586

20:58

And to do that actually,

433

1258962

1335

21:00

one of the things I think is very undersung in this field

434

1260339

2711

21:03

is sort of engineering quality.

435

1263050

1501

21:04

Like, we had to rebuild our entire stack.

436

1264551

2044

21:06

When you think about building a rocket,

437

1266637

1877

21:08

every tolerance has to be incredibly tiny.

438

1268555

2211

21:10

Same is true in machine learning.

439

1270766

1626

21:12

You have to get every single piece of the stack engineered properly,

440

1272434

3212

21:15

and then you can start doing these predictions.

441

1275646

2210

21:17

There are all these incredibly smooth scaling curves.

442

1277856

2503

21:20

They tell you something deeply fundamental about intelligence.

443

1280359

2919

21:23

If you look at our GPT-4 blog post,

444

1283320

1710

21:25

you can see all of these curves in there.

445

1285030

1960

21:26

And now we're starting to be able to predict.

446

1286990

2127

21:29

So we were able to predict, for example, the performance on coding problems.

447

1289117

3713

21:32

We basically look at some models

448

1292871

1585

21:34

that are 10,000 times or 1,000 times smaller.

449

1294456

2461

21:36

And so there's something about this that is actually smooth scaling,

450

1296959

3211

21:40

even though it's still early days.

451

1300170

2044

21:42

CA: So here is, one of the big fears then,

452

1302756

2544

21:45

that arises from this.

453

1305300

1252

21:46

If it’s fundamental to what’s happening here,

454

1306593

2127

21:48

that as you scale up,

455

1308720

1210

21:49

things emerge that

456

1309930

2419

21:52

you can maybe predict in some level of confidence,

457

1312349

4171

21:56

but it's capable of surprising you.

458

1316562

2544

22:00

Why isn't there just a huge risk of something truly terrible emerging?

459

1320816

4463

22:05

GB: Well, I think all of these are questions of degree

460

1325320

2545

22:07

and scale and timing.

461

1327865

1209

22:09

And I think one thing people miss, too,

462

1329116

1877

22:10

is sort of the integration with the world is also this incredibly emergent,

463

1330993

3587

22:14

sort of, very powerful thing too.

464

1334621

1585

22:16

And so that's one of the reasons that we think it's so important

465

1336248

3045

22:19

to deploy incrementally.

466

1339293

1167

22:20

And so I think that what we kind of see right now, if you look at this talk,

467

1340502

3629

22:24

a lot of what I focus on is providing really high-quality feedback.

468

1344131

3170

22:27

Today, the tasks that we do, you can inspect them, right?

469

1347301

2711

22:30

It's very easy to look at that math problem and be like, no, no, no,

470

1350012

3211

22:33

machine, seven was the correct answer.

471

1353265

1835

22:35

But even summarizing a book, like, that's a hard thing to supervise.

472

1355100

3212

22:38

Like, how do you know if this book summary is any good?

473

1358312

2586

22:40

You have to read the whole book.

474

1360939

1543

22:42

No one wants to do that.

475

1362482

1168

22:43

(Laughter)

476

1363692

1293

22:44

And so I think that the important thing will be that we take this step by step.

477

1364985

4296

22:49

And that we say, OK, as we move on to book summaries,

478

1369323

2544

22:51

we have to supervise this task properly.

479

1371867

1960

22:53

We have to build up a track record with these machines

480

1373827

2586

22:56

that they're able to actually carry out our intent.

481

1376413

2586

22:59

And I think we're going to have to produce even better, more efficient,

482

1379041

3336

23:02

more reliable ways of scaling this,

483

1382419

1710

23:04

sort of like making the machine be aligned with you.

484

1384129

2878

23:07

CA: So we're going to hear later in this session,

485

1387049

2294

23:09

there are critics who say that,

486

1389343

1543

23:10

you know, there's no real understanding inside,

487

1390928

4587

23:15

the system is going to always --

488

1395557

1627

23:17

we're never going to know that it's not generating errors,

489

1397225

3212

23:20

that it doesn't have common sense and so forth.

490

1400479

2210

23:22

Is it your belief, Greg, that it is true at any one moment,

491

1402689

4088

23:26

but that the expansion of the scale and the human feedback

492

1406818

3629

23:30

that you talked about is basically going to take it on that journey

493

1410489

4963

23:35

of actually getting to things like truth and wisdom and so forth,

494

1415494

3837

23:39

with a high degree of confidence.

495

1419331

1627

23:40

Can you be sure of that?

496

1420999

1335

23:42

GB: Yeah, well, I think that the OpenAI, I mean, the short answer is yes,

497

1422334

3462

23:45

I believe that is where we're headed.

498

1425796

1793

23:47

And I think that the OpenAI approach here has always been just like,

499

1427631

3211

23:50

let reality hit you in the face, right?

500

1430842

1877

23:52

It's like this field is the field of broken promises,

501

1432719

2503

23:55

of all these experts saying X is going to happen, Y is how it works.

502

1435263

3212

23:58

People have been saying neural nets aren't going to work for 70 years.

503

1438475

3337

24:01

They haven't been right yet.

504

1441812

1376

24:03

They might be right maybe 70 years plus one

505

1443188

2044

24:05

or something like that is what you need.

506

1445232

1918

24:07

But I think that our approach has always been,

507

1447192

2169

24:09

you've got to push to the limits of this technology

508

1449361

2419

24:11

to really see it in action,

509

1451822

1293

24:13

because that tells you then, oh, here's how we can move on to a new paradigm.

510

1453115

3670

24:16

And we just haven't exhausted the fruit here.

511

1456785

2127

24:18

CA: I mean, it's quite a controversial stance you've taken,

512

1458954

2794

24:21

that the right way to do this is to put it out there in public

513

1461748

2920

24:24

and then harness all this, you know,

514

1464710

1751

24:26

instead of just your team giving feedback,

515

1466461

2002

24:28

the world is now giving feedback.

516

1468463

2461

24:30

But ...

517

1470924

1168

24:33

If, you know, bad things are going to emerge,

518

1473135

3753

24:36

it is out there.

519

1476930

1168

24:38

So, you know, the original story that I heard on OpenAI

520

1478140

2919

24:41

when you were founded as a nonprofit,

521

1481101

1793

24:42

well you were there as the great sort of check on the big companies

522

1482894

4463

24:47

doing their unknown, possibly evil thing with AI.

523

1487399

3837

24:51

And you were going to build models that sort of, you know,

524

1491278

4755

24:56

somehow held them accountable

525

1496033

1418

24:57

and was capable of slowing the field down, if need be.

526

1497492

4380

25:01

Or at least that's kind of what I heard.

527

1501872

1960

25:03

And yet, what's happened, arguably, is the opposite.

528

1503832

2461

25:06

That your release of GPT, especially ChatGPT,

529

1506334

5673

25:12

sent such shockwaves through the tech world

530

1512049

2002

25:14

that now Google and Meta and so forth are all scrambling to catch up.

531

1514051

3795

25:17

And some of their criticisms have been,

532

1517888

2085

25:20

you are forcing us to put this out here without proper guardrails or we die.

533

1520015

4963

25:25

You know, how do you, like,

534

1525020

2794

25:27

make the case that what you have done is responsible here and not reckless.

535

1527814

3754

25:31

GB: Yeah, we think about these questions all the time.

536

1531568

3128

25:34

Like, seriously all the time.

537

1534738

1418

25:36

And I don't think we're always going to get it right.

538

1536198

2711

25:38

But one thing I think has been incredibly important,

539

1538909

2460

25:41

from the very beginning, when we were thinking

540

1541411

2169

25:43

about how to build artificial general intelligence,

541

1543580

2419

25:45

actually have it benefit all of humanity,

542

1545999

2002

25:48

like, how are you supposed to do that, right?

543

1548001

2127

25:50

And that default plan of being, well, you build in secret,

544

1550170

2711

25:52

you get this super powerful thing,

545

1552923

1626

25:54

and then you figure out the safety of it and then you push “go,”

546

1554549

3003

25:57

and you hope you got it right.

547

1557552

1460

25:59

I don't know how to execute that plan.

548

1559012

1835

26:00

Maybe someone else does.

549

1560889

1168

26:02

But for me, that was always terrifying, it didn't feel right.

550

1562099

2877

26:04

And so I think that this alternative approach

551

1564976

2128

26:07

is the only other path that I see,

552

1567104

2043

26:09

which is that you do let reality hit you in the face.

553

1569147

2503

26:11

And I think you do give people time to give input.

554

1571691

2336

26:14

You do have, before these machines are perfect,

555

1574027

2211

26:16

before they are super powerful, that you actually have the ability

556

1576279

3128

26:19

to see them in action.

557

1579407

1168

26:20

And we've seen it from GPT-3, right?

558

1580617

1752

26:22

GPT-3, we really were afraid

559

1582369

1376

26:23

that the number one thing people were going to do with it

560

1583745

2711

26:26

was generate misinformation, try to tip elections.

561

1586456

2336

26:28

Instead, the number one thing was generating Viagra spam.

562

1588834

2711

26:31

(Laughter)

563

1591545

3169

26:36

CA: So Viagra spam is bad, but there are things that are much worse.

564

1596007

3212

26:39

Here's a thought experiment for you.

565

1599219

1752

26:40

Suppose you're sitting in a room,

566

1600971

1710

26:42

there's a box on the table.

567

1602681

1668

26:44

You believe that in that box is something that,

568

1604349

3003

26:47

there's a very strong chance it's something absolutely glorious

569

1607394

2961

26:50

that's going to give beautiful gifts to your family and to everyone.

570

1610397

3920

26:54

But there's actually also a one percent thing in the small print there

571

1614359

3629

26:58

that says: “Pandora.”

572

1618029

1877

26:59

And there's a chance

573

1619906

1669

27:01

that this actually could unleash unimaginable evils on the world.

574

1621616

4088

27:06

Do you open that box?

575

1626538

1543

27:08

GB: Well, so, absolutely not.

576

1628123

1460

27:09

I think you don't do it that way.

577

1629624

1919

27:12

And honestly, like, I'll tell you a story that I haven't actually told before,

578

1632210

3796

27:16

which is that shortly after we started OpenAI,

579

1636006

2586

27:18

I remember I was in Puerto Rico for an AI conference.

580

1638592

2711

27:21

I'm sitting in the hotel room just looking out over this wonderful water,

581

1641344

3462

27:24

all these people having a good time.

582

1644806

1752

27:26

And you think about it for a moment,

583

1646558

1752

27:28

if you could choose for basically that Pandora’s box

584

1648310

4504

27:32

to be five years away

585

1652814

2711

27:35

or 500 years away,

586

1655567

1585

27:37

which would you pick, right?

587

1657194

1501

27:38

On the one hand you're like, well, maybe for you personally,

588

1658737

2836

27:41

it's better to have it be five years away.

589

1661573

2002

27:43

But if it gets to be 500 years away and people get more time to get it right,

590

1663617

3628

27:47

which do you pick?

591

1667287

1168

27:48

And you know, I just really felt it in the moment.

592

1668496

2336

27:50

I was like, of course you do the 500 years.

593

1670874

2002

27:53

My brother was in the military at the time

594

1673293

2002

27:55

and like, he puts his life on the line in a much more real way

595

1675295

2961

27:58

than any of us typing things in computers

596

1678256

2628

28:00

and developing this technology at the time.

597

1680926

2585

28:03

And so, yeah, I'm really sold on the you've got to approach this right.

598

1683511

4547

28:08

But I don't think that's quite playing the field as it truly lies.

599

1688058

3628

28:11

Like, if you look at the whole history of computing,

600

1691686

2670

28:14

I really mean it when I say that this is an industry-wide

601

1694397

4463

28:18

or even just almost like

602

1698902

1543

28:20

a human-development- of-technology-wide shift.

603

1700487

3336

28:23

And the more that you sort of, don't put together the pieces

604

1703865

4088

28:27

that are there, right,

605

1707994

1293

28:29

we're still making faster computers,

606

1709329

1752

28:31

we're still improving the algorithms, all of these things, they are happening.

607

1711081

3670

28:34

And if you don't put them together, you get an overhang,

608

1714793

2627

28:37

which means that if someone does,

609

1717420

1627

28:39

or the moment that someone does manage to connect to the circuit,

610

1719089

3086

28:42

then you suddenly have this very powerful thing,

611

1722175

2252

28:44

no one's had any time to adjust,

612

1724427

1544

28:46

who knows what kind of safety precautions you get.

613

1726012

2336

28:48

And so I think that one thing I take away

614

1728390

1918

28:50

is like, even you think about development of other sort of technologies,

615

1730308

3837

28:54

think about nuclear weapons,

616

1734187

1376

28:55

people talk about being like a zero to one,

617

1735563

2002

28:57

sort of, change in what humans could do.

618

1737565

2628

29:00

But I actually think that if you look at capability,

619

1740235

2461

29:02

it's been quite smooth over time.

620

1742696

1585

29:04

And so the history, I think, of every technology we've developed

621

1744281

3670

29:07

has been, you've got to do it incrementally

622

1747993

2002

29:10

and you've got to figure out how to manage it

623

1750036

2127

29:12

for each moment that you're increasing it.

624

1752163

2461

29:14

CA: So what I'm hearing is that you ...

625

1754666

2252

29:16

the model you want us to have

626

1756918

1668

29:18

is that we have birthed this extraordinary child

627

1758628

2795

29:21

that may have superpowers

628

1761423

2544

29:24

that take humanity to a whole new place.

629

1764009

2544

29:26

It is our collective responsibility to provide the guardrails

630

1766594

5005

29:31

for this child

631

1771641

1210

29:32

to collectively teach it to be wise and not to tear us all down.

632

1772892

5047

29:37

Is that basically the model?

633

1777939

1377

29:39

GB: I think it's true.

634

1779357

1168

29:40

And I think it's also important to say this may shift, right?

635

1780567

2878

29:43

We've got to take each step as we encounter it.

636

1783445

3253

29:46

And I think it's incredibly important today

637

1786740

2002

29:48

that we all do get literate in this technology,

638

1788783

2878

29:51

figure out how to provide the feedback,

639

1791661

1919

29:53

decide what we want from it.

640

1793621

1377

29:54

And my hope is that that will continue to be the best path,

641

1794998

3128

29:58

but it's so good we're honestly having this debate

642

1798168

2377

30:00

because we wouldn't otherwise if it weren't out there.

643

1800545

2628

30:03

CA: Greg Brockman, thank you so much for coming to TED and blowing our minds.

644

1803631

3629

30:07

(Applause)

645

1807302

1626

New videos

06:27

How do drugs make you hallucinate? - Anees Bahji

06:51

The Rise of China's Homegrown Brands — and Why ...

06:16

How important is politeness? ⏲️ 6 Minute English

07:44

North Korea’s secrets revealed by phone: Study:...

17:30

Advanced English Learning: Speaking Practice

03:48

What can you do? Easy English Conversations 💬 ...

08:33

Can AI Help with the Chaos of Family Life? | Av...

12:13

Speak English Confidently: Daily Tricks & Tips 🧠

Original video on YouTube.com

The Inside Story of ChatGPT’s Astonishing Potential | Greg Brockman | TED - YouTube

About this website

This site will introduce you to YouTube videos that are useful for learning English. You will see English lessons taught by top-notch teachers from around the world. Double-click on the English subtitles displayed on each video page to play the video from there. The subtitles scroll in sync with the video playback. If you have any comments or requests, please contact us using this contact form.

https://forms.gle/WvT1wiN1qDtmnspy7

Playback speed

Subtitle font size

The Inside Story of ChatGPT’s Astonishing Potential | Greg Brockman | TED

New videos

The Inside Story of ChatGPT’s Astonishing Potential | Greg Brockman | TED

New videos

Original video on YouTube.com