HomeVideos

Uri Gellar, James Randi and a ChatGPT Thought Experiment

Now Playing

Uri Gellar, James Randi and a ChatGPT Thought Experiment

Transcript

139 segments

0:00

So when I was a kid, there was this guy called

0:02

Uri Geller. He claimed to be a psychic. He was

0:05

a big deal. He was on the John Carson show...

0:07

Johnny Carson... So like, imagine a show like Jimmy

0:10

Fallon

0:10

and Jimmy Kimmel and Colbert's audiences all

0:13

roll up into one. Uri was, of course, a fraud

0:15

because psychic powers aren't a thing, but he

0:18

fooled the CIA, professors, scientists,

0:20

parapsychologists, physicists, et cetera, et cetera.

0:22

He was exposed by a guy named James Randi,

0:26

who wrote this video's featured book. Randi

0:31

wasn't a scientist. He was a stage magician.

0:35

Turns out that the secret to spotting a fraud

0:36

isn't asking the experts in the field in

0:38

question.

0:39

It's asking someone he knows how to reproduce

0:40

the result. So let's do a thought experiment

0:44

about ChatGPT. So imagine for a moment that

0:47

you wanted to build an online service that mimics

0:50

what ChatGPT does. For now, just worry about

0:52

the text part. Let's not worry about Dall-E or

0:54

Sora

0:55

or any of that stuff, but we want to do it

0:57

without using any neural networks or anything

0:59

that people

1:00

might normally consider to be AI. This has to

1:01

remain a thought experiment for reasons I'll

1:03

talk

1:03

about later, but let's go through it together.

1:05

It'll be an interesting exercise. So you want

1:08

to

1:08

build a conventional internet software startup

1:10

and you've got a handful of years to get a

1:12

product to

1:13

market. You don't have to care about revenue

1:15

and you have, I'm just going to pick a number

1:16

out of

1:17

the air, say $63 billion, $920 million of

1:20

investment money to spend on it. Do you think

1:23

that could be

1:23

done? Because I do. And I think there's a

1:25

decent chance there'd be enough money left over

1:27

to feed

1:27

all of the hungry people in the US for a decent

1:29

fraction of a year. So you start off by

1:32

building a

1:33

giant index of the web and, what the hell, while

1:35

you're at it throw in a few terabytes of pirated

1:37

books and journals. Now, we're not talking

1:39

about training a model or anything like that.

1:42

This is

1:42

straightforward text indexing the way Google

1:44

used to work back before they killed it. Now

1:46

you take

1:46

the query text, we're not going to call these

1:48

prompts although there would be a lot of

1:50

similarity.

1:50

And you use the standard text search algorithms

1:52

to find the top N similar matches. Might be

1:55

100

1:55

might be 1000, who knows. We would run experiment

1:57

later to see what number gives us the best

1:59

results.

1:59

Then we take the text that matched and we dump

2:01

it into some series of algorithms to extract

2:04

the

2:04

text, break it up in its parts of speech, find

2:06

permutations of it, move words around, replace

2:08

words

2:08

with synonyms, et cetera, and then decide what

2:10

word we ought to output next. There's a lot of

2:13

prior

2:14

art we could pull from. The act of outputting

2:16

similar text goes back at least to Eliza in the

2:18

1960s,

2:19

then there was a chatbot made by a few people

2:21

for a few thousand dollars. It supposedly

2:23

succeeded

2:23

in passing a Turing test in a competition in

2:27

2014. Siri reportedly cost Apple 200

2:30

million dollars

2:31

and we've got, what, 300 times that much money?

2:33

IBM supposedly spent four billion dollars on

2:35

its

2:35

Jeopardy winning Watson bot. And that wasn't

2:37

just to compete on the show. That also includes

2:39

all the

2:40

money they spent trying and failing to cure

2:42

cancer with it after it won Jeopardy. We have

2:44

16 times

2:45

more money than that. And unlike Watson or Siri

2:48

or the Turing test bot, we don't have to care

2:50

if it

2:51

answers the questions correctly. We'll just

2:53

stick a disclaimer at the bottom of the output

2:55

page that

2:55

says the bot hallucinates and the user is

2:57

responsible for verifying everything. How close

3:00

do you think

3:00

we could get to what ChatGPT could do? It's

3:02

hard to say because it could never ever happen,

3:04

both because no one would spend that much money

3:06

making a chatbot without being convinced that

3:08

it

3:08

might magically become an equivalent of a PhD

3:10

in every subject. And because we get sued into

3:13

homelessness by all of the people whose content

3:15

we were plagiarizing because we wouldn't have

3:17

the

3:17

defense of claiming that we "grew" it instead of

3:19

building it, and we can't be held responsible

3:21

because

3:22

no one can understand how it works. But I've

3:24

done a lot of data searching projects. And

3:26

given that

3:27

amount of money, I don't have a difficult time

3:29

believing we could get there. And I bet we

3:31

could

3:31

make it even easier by taking what has been

3:32

learned about the attention mechanism since the

3:35

2017

3:35

"attention is all you need" paper. And then retrofitting

3:38

that knowledge back onto non-neural

3:40

network vector encoding techniques like latent

3:42

semantic analysis or hyperspace analogs to

3:44

language.

3:45

And what would make it easiest of all is that

3:48

the hardest part of any such project,

3:50

making sure the answer is correct, no longer

3:52

applies, because we can just stick our fingers

3:54

in our ears and chant "It's just a hallucination!!!"

3:56

over and over anytime somebody complains.

3:58

At the very least, I think it'll be much more

4:00

technically possible that ChatGPT being PhD

4:02

level

4:02

smart in every subject. Just something to think

4:05

about. Thanks for watching. Let's be careful

4:08

out there.

Interactive Summary

The video discusses the skepticism surrounding modern AI by drawing a parallel to Uri Geller, a fraud who claimed to be psychic but was exposed by a magician rather than a scientist. The speaker proposes a thought experiment: building a service identical to ChatGPT using traditional search indexing and natural language processing techniques instead of neural networks. With a massive budget of $63 billion, he argues that such a system could be built by simply indexing the web and using a 'hallucination' disclaimer to avoid the responsibility of being correct, suggesting that the perceived 'intelligence' of AI might just be sophisticated text reproduction.

Suggested questions

4 ready-made prompts