HomeVideos

What's really going on with AI, Expert weighs in | TheStandup

Now Playing

What's really going on with AI, Expert weighs in | TheStandup

Transcript

1190 segments

0:00

Welcome everybody to the standup. We

0:01

have an extremely special episode of the

0:03

standup. Casey has started his own

0:06

podcast, a competing podcast in which he

0:08

is how could you

0:14

uh anyways sorry he and Dmitri are

0:16

talking about and having a real

0:18

conversation about what is the actual

0:20

effects of AI and all that. And you're

0:21

probably asking, well, why would they do

0:23

that? Well, Dimmitri turns out to be a

0:24

legend when it comes to AI. He knows a

0:26

lot about it, been working in it for a

0:28

very, very long time. and just like

0:30

Casey is significantly more competent

0:33

than TJ and I combined. And so

0:35

therefore, their conversations are

0:36

actually useful and good to listen to.

0:38

And we thought we'd bring them on and

0:40

ask some of our own very own questions.

0:42

Kind of a podcast melding.

0:44

>> The podcast was actually Dimmitri's

0:45

idea. It wasn't even my idea. I It

0:48

should have been my idea because I agree

0:50

that I should ditch you guys and go with

0:52

some like some higher quality co-hosts

0:54

here. I mean, that's obviously clear,

0:56

but but it wasn't. Uh cuz Dimmitri, you

1:00

know, you were actually the one who was

1:01

kind of like, I want to talk about AI

1:03

because I'm just basically you weren't

1:04

happy. You were like, I I don't like

1:07

that. I don't like what's being said. It

1:09

feels like

1:09

>> Did you go on Twitter by accident? Is

1:11

that what happened? You went on Twitter.

1:14

>> But wait a second, we need a podcast

1:15

immediately.

1:16

>> Let me add a little bit to what Casey

1:19

said there.

1:21

>> So, actually, I did not set out to uh to

1:24

have a podcast. uh I set out so I I get

1:28

so I've been working this a long time

1:29

like 20 plus years uh I have lots of

1:31

friends who are not programmers or any

1:33

kind of like you know lawyers,

1:34

accountants, doctors whatever and

1:36

they're frequently asking me you know

1:37

first of all can I use this in my job

1:38

and other things like how is this going

1:40

to affect like I have kids in college

1:42

should I stop sending them to college

1:44

right you know all of those kinds of

1:46

questions so uh at some point I thought

1:48

I should just try to put this stuff

1:49

together somehow and first I started

1:51

writing stuff but that felt awkward then

1:53

I thought maybe I'll record something,

1:55

but I'm not really a recording

1:57

personality. Uh, and then Casey and I

1:59

have been talking about AI on and off

2:01

for uh for years. And my suggestion was,

2:04

what if we record like one session? Uh,

2:07

and I had a list of things that we could

2:08

talk about. And Casey said, well,

2:10

there's so many things here that we

2:12

could just turn this into a podcast. So,

2:13

that's how we ended up uh ended up here.

2:16

Uh, but let me add one one more thing to

2:17

that, which is uh so you know, I know

2:19

that there are a lot of people out there

2:20

who are, you know, I see Casec

2:23

personalities. I am that kind of

2:24

personality as well. So part of uh part

2:27

of what I want to do here is to um

2:32

facilitate having more Casey out there

2:35

uh on stuff where I can be the wingman,

2:38

right? Uh that Casey understand. So one

2:41

of the things that I appreciate about

2:43

appreciate about him quite a bit is that

2:45

um he wants to know what he's talking

2:46

about before he says something which you

2:49

know puts him in the top top 20% of

2:51

podcasters at least.

2:53

At least

2:54

>> at least

2:55

>> at least top 50% of this podcast.

3:01

>> Wait a second.

3:03

>> Hey, is that HTTP? Get that out of here.

3:06

That's not how we order coffee. We order

3:08

coffee via ssh terminal.shop. Yeah, you

3:11

want a real experience. You want real

3:13

coffee. You want awesome subscriptions

3:15

so you never have to remember again. Oh,

3:17

you want exclusive blends with exclusive

3:19

coffee and exclusive content? Then check

3:22

out CRON. You don't know what SSH is?

3:26

>> Well, maybe the coffee is not for you.

3:28

terminal coffee

3:31

in hand.

3:36

What I hope comes uh comes out of this

3:39

uh in total is that uh we get kind of uh

3:43

Casey culture aligned commentary on uh

3:46

on AI and I mean Casey can I don't want

3:49

to put words in his mouth but uh like we

3:52

see software in a very similar way,

3:54

right? I'm similarly skeptical about the

3:56

quality of anything coming out of big

3:58

tech these days about like questionable

4:00

ethical business practices and so on.

4:02

So, uh on on general software, not AI

4:06

software, Casey and I see eye on on most

4:08

things. Uh and so what I'm hoping is

4:10

that I can be the the wingman and uh

4:13

like give a platform for more Casey

4:15

culture commentary specifically on this

4:17

crazy AI thing that's taking over

4:18

everyone's life, including my own,

4:20

right? As I you know in our inaugural

4:22

episode I mentioned that uh my life used

4:24

to be much more quiet and nobody

4:26

believed AI would work and we were just

4:27

quietly doing fun stuff and uh and uh in

4:31

a way things were better. I mean it's

4:33

good that uh that things are working now

4:35

or sort of working. Uh and the sort of

4:37

working actually is a big part of uh of

4:39

what I care about uh trying to make it

4:41

better than sort of work. Um so I don't

4:44

know. I've gone on long enough. Maybe

4:45

Casey you should uh put a bow on it.

4:48

>> I think you're I think you're being way

4:49

too generous to me. Yeah, I'm I'm the

4:50

wingman on our podcast because I don't

4:52

really like I just don't work with AI

4:54

stuff. So to me it was just like a good

4:56

opportunity to like have somebody on who

4:58

like I trusted to give solid perspective

5:01

on AI because like it's really hard to

5:04

you know the only real way I thought

5:06

otherwise that I would be able to give

5:08

any commentary is I'd have to go spend a

5:09

ton of time with it, right? And I just

5:11

didn't really want to do that. So so

5:13

it's been great. Uh and I' I've really

5:15

enjoyed it. Obviously, Demetri and I

5:17

have recorded a couple things that, you

5:18

know, we haven't posted yet. So, but I

5:20

Yeah, it's it's been really great having

5:22

you on. So, and I I should kick it back

5:25

to to Prime and Te. So, like we don't

5:27

want to just talk about our podcast

5:29

here. So, like you guys had some AI

5:31

questions you wanted to talk about. Uh

5:32

or I don't know,

5:34

>> take take it wherever you guys want to

5:35

take it. We are here. We are here at

5:37

your disposal.

5:38

>> I do want to start with some things

5:39

which is Demetri, can you please uh we

5:40

didn't give you really like an actual

5:42

qualifying intro other than you're

5:44

legendary. Yeah, I was just going to say

5:45

I I appreciate the kind words like I I

5:47

consider myself relatively competent,

5:49

but I mean there are like big names in

5:51

the industry who are uh either friends

5:53

or friends of friends and it's uh useful

5:55

for me to keep uh keep myself calibrated

5:58

relative uh you know uh relative to

6:01

people of uh much more at least public

6:03

uh uh public renowned. Um so yeah I I

6:06

don't know I guess uh in terms of

6:08

introduction we uh I've been doing um

6:11

broadly AI related uh research for 20

6:14

plus years. I shipped uh shipped to many

6:18

people my very first custom design AI in

6:21

2025 uh at Google. It was one of their

6:23

first

6:24

>> uh 25

6:25

>> uh sorry 20 2005.

6:27

>> Yeah I was going to say like that's like

6:28

20 years old.

6:29

>> That's like banana nano banana if you're

6:32

shipping that one. Okay. Okay.

6:33

Yeah, exactly.

6:35

>> Uh so anyway, um I've been doing this

6:38

for a long time and seen uh lots of the

6:41

ups and downs in the business and also I

6:42

I study the history of the business as

6:44

well. So I know about um you know the

6:47

ups and downs for like the preceding 50

6:49

years um like going back to you know

6:51

going back to even the the 60s. Uh and

6:53

so um one of the things that really

6:55

interests me is how this is going to

6:57

affect other people um like people that

7:00

I work with for example who are now I

7:02

mean uh in the first episode we just

7:04

talked about the impact on junior

7:05

engineers and separately

7:07

>> probably should talk about the impact on

7:08

senior engineers right because there's

7:09

they face a different problem which is

7:13

if you're like let's say you're whatever

7:16

uh 45 years old now uh you've been doing

7:18

this a long time you have a good stable

7:20

career making good money uh and now

7:22

you're wondering ing what happens if

7:25

they like crack the nut and in 5 years

7:29

I'm useless, right? And I know I know

7:31

people who are wondering this myself.

7:33

>> Mhm.

7:33

>> Um and that's an especially bad time to

7:36

be useless, right? If you're like late

7:38

midcareer, what do I do if I'm 50 years

7:40

old and I don't know like what do I do

7:43

next? Right? uh and there there it's not

7:45

anything like the the problems that the

7:47

junior engineers face where it's I mean

7:49

again big problem for the junior

7:51

engineers but they have their whole

7:52

lives ahead of them and they can't they

7:53

have time to try to do something else uh

7:56

but if you're 50 years old and uh don't

7:58

have you know 15 years of uh building up

8:02

a new career ahead of you what do you do

8:04

uh so that's that's something that I see

8:06

people worrying about a lot as well um

8:10

yeah I don't I guess maybe let me not

8:13

talk talk too much about myself.

8:15

>> You can't talk too much. We're used to

8:16

hanging out with Casey. All right, it's

8:17

fine. You know,

8:20

>> we like it when people talk. That's why

8:21

we bring smart guests on.

8:24

>> Uh yeah. So I guess another thing I I

8:26

will add and this this connects back to

8:28

kind of having a Casey style perspective

8:30

on the industry is that

8:32

>> um I've seen a lot of how the sausage

8:34

get made uh both in terms of like the

8:37

the technical side but also uh also the

8:39

business side and this is something that

8:41

uh we we talk about in later episodes

8:44

about uh how how much can you take

8:46

certain claims at face value and how to

8:49

evaluate which claims can be taken at at

8:51

face value and so on. Um so what I'm uh

8:56

in the same way that um like you can't

8:59

take whatever Microsoft claims at face

9:02

value about improvements in Windows

9:03

performance. Um right you can same

9:06

>> this is going to be the best Windows yet

9:08

that's going to be perfect and flawless

9:11

will be safe.

9:12

>> It was going to be the last Windows ever

9:13

as well.

9:16

>> Well I did I they are still working on

9:19

making it the last Windows ever.

9:21

>> That's a good point.

9:24

the Linux desktop is what TJ's trying to

9:26

say.

9:26

>> They they put that into the AI and it

9:29

gave them a plan back of how to make it

9:31

the last Windows ever. They're executing

9:34

on that flawlessly.

9:35

>> Yeah. They're like, "We got this thing

9:37

called Windows 11 and it will ensure

9:39

that Windows 10 is the very last version

9:41

of Windows."

9:45

>> All right. And if that doesn't work, we

9:46

have Windows 12.

9:48

>> Yeah.

9:50

I um so I I've just got a few topics

9:52

that I thought would be fun. We can just

9:54

see where uh like they lead us and what

9:57

happens. I think there's similarly to on

10:00

on the episode at least that I've

10:02

listened to. Um you start on a topic and

10:05

there's like 95 different branches that

10:07

we can go off of. So I I'm not if we

10:09

don't get to all of them, it's whatever.

10:11

I I think it uh it'll be fun. But um one

10:14

of the things I'm interested in and like

10:16

it's just hard to get anybody's like

10:18

unbiased

10:20

everyone's got a biased whatever just

10:22

like a more rational take on like this

10:24

current state of token cost and what

10:28

that's looking like going forward. I

10:30

don't know if you have any thoughts

10:31

Dimmitri about like is it going to be

10:34

that Sam Alman was right and we're going

10:36

to get 10x cheaper every year or

10:38

whatever. I don't know. That's what

10:38

Twitch chat always tells.

10:40

>> That's what Sam Sam has it on the

10:41

record. He said it twice now. He said it

10:43

he said it twice and made projections

10:44

into the future saying it's 100x

10:46

cheaper. GP2 5.2 high will be uh 100x

10:49

cheaper in uh two years.

10:52

>> Okay.

10:53

>> Uh so a lot of that is infrastructure

10:56

development and algorithm development

10:58

that is trade secrets that

11:00

>> like I can't evaluate uh and even like

11:03

even an open AI insider it would be

11:05

illegal for them to to evaluate in

11:07

public right. Um I I don't know what

11:11

they're cooking up there. I mean they

11:13

have an extremely talented team. Uh 100x

11:16

cheaper seems hard to believe, but

11:19

uh I guess wait and see. So I mean one

11:22

of the thing one of the problems with

11:23

these um with evaluating claims in this

11:26

business is that um the timing matters

11:29

as much as the content of the claim. So

11:32

like uh like Musk is a good example just

11:34

to take Sam out of this for a moment

11:37

>> where uh MK was Musk was saying you know

11:40

we'll have uh I mean we'll have uh you

11:43

know reusable rockets in I think he

11:45

first made the claim in 2005 or

11:47

something like that and it took many

11:48

years before that actually worked later

11:50

you know when was the first promise of

11:52

uh Tesla full self-driving that was like

11:54

2016 that he was saying

11:56

>> and then every six months after that

11:58

>> right um so So, I would um I found it

12:03

useful to try to separate out the

12:06

content of the claim from the timing of

12:07

the claim.

12:08

>> So, I I would not be surprised um I

12:11

would not be surprised if eventually we

12:12

can get uh 100x cheaper token cost.

12:16

Whether or not they can do it now, I

12:18

that's beyond my knowledge because

12:20

that's that's uh partly that's

12:23

infrastructure, partly that's uh like

12:26

custom hardware, partly that's can you

12:28

get cheaper electricity, part like there

12:30

so many things that go into the into the

12:32

business that it's hard uh hard for me

12:35

to to claim one way or another. Uh what

12:37

I can tell you is that it has been as

12:38

I'm sure you've seen uh it has been a um

12:42

land rush mentality, right? So,

12:43

everyone's building as quickly as they

12:45

can, whatever they can. Put it out. Does

12:47

it work? Does it barely work? Okay, move

12:49

on to the next thing.

12:50

>> Um, and uh some of that is

12:54

>> uh some of that is maybe um

12:57

recklessness, but some of that is just

12:58

market forces, right? Because the um

13:02

right now there are multiple really big

13:03

players and probably there won't be as

13:06

many big players in 10 years. And so

13:08

they like given how much money they've

13:10

all put into this already, they really

13:12

don't want to be the ones caught out,

13:13

right? And so the like the the arms race

13:16

is uh rational from a business

13:20

perspective whether or not it's it's

13:22

producing um you know completely

13:25

rational engineering artifacts. So uh

13:27

all of that is to say uh I'm sure that

13:29

there is very substantial um opportunity

13:33

for uh for improving efficiency in the

13:36

current stock and I don't know if they

13:39

so my biggest question would be um is

13:42

the is the internal stock stable enough

13:45

that you can optimize it now right

13:47

because like say you spend a year

13:48

optimizing it to get the token cost down

13:50

and then there's

13:52

>> whatever the next archite architectural

13:54

innovation is does that invalidate your

13:56

optimization or not. So I I don't know,

13:58

right? Um and so you you will hear me

14:01

say I don't know perhaps more than most

14:03

uh people talking about AI just because

14:06

it's uh there's a lot that's unknown

14:08

even to people who are deep like uh deep

14:11

inside these companies about what's

14:12

what's going to happen in like two or

14:14

three years, right? Um

14:16

>> yeah, I I would add one thing to that

14:19

that you know um it's not not adding

14:23

something as in information, adding

14:24

something as in sort of like a thing to

14:26

think about and that is that uh it's

14:29

worth noting that I if the costs were

14:35

really going to get that low that fast,

14:38

I think I would have expected Google's

14:41

costs to be very low already. And I know

14:43

it's a weird thing to say, but

14:46

>> if you assume that there's any hardware

14:47

component, like if you think that that

14:49

all 100x is going to come from software,

14:51

then that seems okay, maybe that could

14:53

happen. But if we're counting on a

14:55

substantial portion of that 100x coming

14:57

from infrastructure builds, it feels to

15:00

me like Google has kind of been building

15:02

AI centric like stuff for quite some

15:06

time and has gotten their thing that

15:08

like their TPUs are very much just like

15:11

we built machines whose only purpose is

15:13

to do this job as opposed to like say

15:16

for example Nvidia who's not doing that

15:18

like the Nvidia cards are kind of like

15:20

still in a weird like hybrid state where

15:22

only the

15:24

only the very latest Nvidas could be

15:26

said to be focusing on AI really and

15:28

even those are still a little bit hybrid

15:30

feeling to me. So I could imagine if

15:34

someone said, well, you know, if all we

15:36

had was Nvidia to look at, maybe you

15:39

could believe that like, you know,

15:40

there's a very long lead time on

15:42

hardware. So, you know, we don't know

15:44

what they, you know, when they shifted

15:46

probably four or five years ago to going

15:48

like, "Oh my god, AI is going to be the

15:49

biggest thing. We need to completely

15:50

redesign everything to just focus on

15:52

that." That stuff will only be, you

15:55

know, we'll only be seeing the end of

15:56

that pipeline, you know, now or

15:59

something like that, right? So, I could

16:02

see that, but I feel like Google's kind

16:03

of I mean, am I wrong about this speech?

16:05

I feel like Google's been doing this for

16:06

a long time. If there was a huge if

16:08

there was huge gains to be had by doing

16:10

just AI hardware, I would have thought

16:12

they'd have them already. Is that am I

16:14

way off?

16:15

>> I'm not sure they don't. Right. So, the

16:17

first the first time I heard about

16:19

>> uh TPU

16:22

uh I think the first time I heard that

16:24

they were starting to work on it was

16:26

like 2014 or something like So, that's

16:28

just when I heard about it, right? like

16:30

>> I am not a Google insider so I'm sure

16:33

like uh knowing how Google works I'm

16:35

sure that they were talking about that

16:36

like 3 years before I heard about it in

16:38

public right but when I heard when I

16:39

first heard about TPU development was

16:41

somewhere around 2014

16:43

>> so I guess I I have not been studying

16:46

the uh like whatever internal economics

16:49

reports uh Google puts out it's possible

16:53

that they are already substantially

16:54

cheaper than um

16:56

>> I would think

16:57

>> on on operating cost substantially

16:58

cheaper than open I don't So uh and but

17:00

I mean this is a something that's a

17:02

common uh folk belief in the industry

17:05

which is um like Google is sort of the

17:08

quiet monster in the business because

17:11

like they

17:13

if you compare them for a moment to say

17:15

open AI uh they have had they have been

17:17

infrastructure leaders forever. They

17:19

have been AI leaders forever. Uh unlike

17:21

open AI they have uh they have u I'm

17:25

trying not to say the word surveillance

17:27

but I'm just going to say surveillance.

17:30

on the entire internet, right? So, they

17:32

have just a constant flow of

17:34

>> uh potentially trainable information and

17:37

you know, XAI has um uh has that flow

17:41

that uh that is their own proprietary

17:43

flow of data.

17:44

>> Um so, uh many people have have this

17:48

feeling of Google was kind of slowed on

17:50

the on the product side. I emphasize on

17:53

the product side uh with uh like chat uh

17:57

chat LLM products. Uh but there is many

18:00

people have this feeling of um possibly

18:03

Google is just going to quiet quietly

18:05

leverage all those advantages that uh

18:07

nobody else has. Uh and like XAI is the

18:10

only one that's sort of sort of close in

18:13

um in having that capability. It's

18:16

interesting to note that uh as far as

18:18

I've seen in public XAI,

18:20

they're merging with uh Starlink.

18:23

They're talking about sorry with uh

18:24

>> SpaceX. Yeah,

18:25

>> SpaceX. Uh and they're talking now about

18:27

orbital data centers. And something that

18:29

I I thought was interesting there um was

18:33

uh that the designs that they've talked

18:35

about are GPU based, not TPU based. And

18:38

I don't know if that's because

18:40

>> they don't think they can build it up as

18:42

well as Google did or they don't think

18:44

they can license it. So I don't I don't

18:45

know, right? But the thing I will point

18:47

out the the designs I've seen uh I mean

18:49

I guess I haven't looked looked in like

18:51

3 months but the designs I've seen were

18:54

uh estimates based on uh you know Nvidia

18:56

style GPUs and not TPUs. So they must

19:00

know something because this is this is

19:02

the kind of thing that they do, right?

19:04

Like the big um

19:08

the biggest competence advantage that I

19:10

would put in the kind of Musk category

19:12

of companies is uh the relentless

19:15

execution on on uh building and

19:17

optimizing and streamlining physical

19:20

stuff. That's something that they've

19:21

been doing really well for a very long

19:22

time. So if um anyway that caught my eye

19:25

that the designs I saw were based on

19:27

GPUs and not TPUs.

19:29

>> Um and I I do know that they're working

19:31

on their own um uh their own AI chips. I

19:35

don't I don't know that much about how

19:36

far they are there, but it's interesting

19:38

that that when they were talking about

19:40

the orbital data center designs, they

19:41

were talking about u get conventional

19:43

GPUs as we understand them.

19:47

>> I got to say orbital data center is like

19:48

the coolest sounding thing to build.

19:51

>> Yes. Like I it seems really not

19:53

practical. I get like I get there

19:55

saying, "Oh, we're going to have robots

19:57

do it and it'll be outer space and it's

19:59

really cool." Um but I got I mean you I

20:02

feel like everyone's got to admit

20:04

orbital data center is probably the

20:06

coolest thing to say you could work at

20:07

in software development. So

20:10

>> I think like yeah based on what I've

20:12

seen for orbital data centers it sounds

20:14

like it's it is either much cooler or

20:18

much less cool than what you're thinking

20:19

about depending depending on your

20:21

perspective. Right? So,

20:25

excuse me.

20:27

If you're imagining like a giant

20:31

building in space, like

20:34

>> like it it sounds like uh the kind of

20:37

more leading designs that are like more

20:40

plausible. They're more like Starlink.

20:42

They're more like satellite clusters

20:45

where you just have lots of little

20:48

things that talk to each other over

20:50

light links basically for high-speed

20:52

communication. Um,

20:54

>> well, and giant giant radiators, right?

20:56

>> And huge.

20:57

>> Yep.

20:58

>> It's almost all radiator. And then

21:00

there's a little like nub in the center

21:02

contains that contains the GPUs, right?

21:05

>> Yeah.

21:06

>> That doesn't sound as cool as what I had

21:08

in my head of a big skyscraper in outer

21:11

space where we've got racks of GPUs and

21:13

you can walk around in it and it's like

21:14

really cool.

21:16

>> Um, but that's okay because when you

21:19

tell your parents that you work at an

21:20

orbital data center, they won't know. So

21:22

it's fine.

21:23

>> That's what they imagine is that

21:25

>> Yeah. Yeah. I say it's exactly like Star

21:27

Wars mom and dad. Trust me. Trust me.

21:31

>> Unfortunately, like heat is just very

21:34

>> I guess the as the the kids say say

21:36

problematic in space. Like you don't

21:39

have anywhere it

21:41

>> like it's very hard to get rid of heat

21:43

in space. So this is a problem. As

21:45

Demetri was saying, you you end up

21:47

having to have a lot of surface area per

21:50

chip, like per per heat generating

21:52

Nvidia, like GB200 or whatever. I mean,

21:55

it won't be a GB200 by the time they get

21:57

these things launched, but the uh you

21:59

know,

22:01

>> that's the thing I didn't I didn't get.

22:03

I don't

22:04

>> What is the advantage then? Because in

22:06

in like I get that it feels like if you

22:09

talk to like a regular person about

22:11

putting it in outer space, they're like,

22:12

"Well, it's cold in outer space, so

22:14

that's going to be really good for

22:16

GPUs." But it's the opposite, I think.

22:17

Right. I mean, I So, I don't get

22:20

>> It's not that it's not cold in outer

22:21

space. Cold is the lack of available,

22:23

you know, like not having energy.

22:25

There's just there's no density, so you

22:27

have nothing you can

22:28

>> Yeah. I meant like more like they said

22:30

it cooling would be super easy. I feel

22:32

like that's like the normie thing if you

22:34

say put something out cuz like it'll be

22:36

cold. Sick. It's like a huge

22:37

refrigerator, bro. But that's not how

22:40

heat works. Like so I get that part. So

22:42

then what is the advantage? Why this is

22:44

we also don't have to go way down this

22:45

route, but I have been wondering this

22:47

myself. So what like why do they want to

22:49

put it in outer space?

22:50

>> Incredible investment TJ. Like the

22:52

amount of investments that are coming in

22:54

will be

22:54

>> the TAM the TAM for outer space is big.

22:57

Think about how many data centers we can

22:58

put in outer space. Well, it's so it's

23:01

free power and difficult but free

23:03

cooling, right? So, if you if you can

23:05

make the physics work and I don't I

23:07

don't know, right? Like they have really

23:08

really good like thermal people at you

23:11

can imagine they have very good thermal

23:12

people at uh

23:13

>> uh at SpaceX. So, uh I I assume that

23:17

they have done simulations that that

23:19

suggest to them that this can be made

23:21

viable. Um but yeah, so it's free power

23:24

and also like more free power than you

23:26

would get for the same panel on Earth

23:28

because there's no right

23:29

>> atmosphere eating up your atmosphere or

23:31

weather for that matter.

23:32

>> Yeah. No clouds,

23:33

>> right? So free power and uh tricky but

23:37

free cooling. Um as

23:40

>> so it's like when you build a really

23:42

complicated thing, but then it's passive

23:43

in a video game. I understand. This is

23:45

this mechanic makes sense to me. And one

23:48

thing I would add uh this so this is

23:50

entirely business strategy speculation.

23:53

Um so whatever like take like poor big

23:55

>> it's financial advice. Got it.

23:57

>> Exactly. Right.

23:58

>> Investing right now. If it works, if it

24:02

works and you are

24:05

uh Musk's AI team and you just get to

24:09

launch your AI into space and everyone

24:12

everyone else is like fighting local

24:14

governments and can I build a hydro

24:16

plant here? Oh, how much do I have to

24:18

pay you to? Right. And you'll just be

24:19

like, we're going to space, right? Like

24:21

you losers can hang out here. We're

24:22

going to space. And by the way, if you

24:24

want to go to space, you have to come

24:25

through me, right?

24:26

>> Yeah. Yeah.

24:26

>> Yeah. He pretty much owns the the

24:28

passage to space. So it's

24:30

>> it's pretty

24:31

>> they can't ask Katy Perry. She's the

24:33

nautace.

24:34

>> Well and also like it's pretty trivial

24:36

for like when you look at the

24:38

projections for these sorts of things.

24:40

It's all about launch cost. Like whether

24:42

or not it makes sense to put a data

24:43

center in space, a quote unquote data

24:45

center in space. It's like here is the

24:48

launch cost. like at this launch cost we

24:51

think we could do it because that and

24:52

that's based on like the failure rate of

24:54

the thing how you know long it's

24:55

expected to be able to be up there you

24:57

know all those sorts of of things

24:59

>> and so at the end of the day if you know

25:02

if you look at somewhere like SpaceX and

25:05

they're like well for us we could get

25:08

our launch cost down to $50 a kilogram

25:11

or whatever it you know the magic number

25:12

is

25:13

>> we just charge everyone else $75 a

25:16

kilogram and then it's it's not worth it

25:18

for them to put it in space or theirs

25:20

will always be worse than ours, right?

25:21

It's a very good position to be in and a

25:24

and a strangely profitable bet for

25:25

SpaceX if it turns out that these are

25:28

actually useful in the future. So,

25:31

>> so I think we should probably attempt to

25:32

get a little bit more on uh on a more

25:35

practical approach to AI. I do think

25:37

orbital data centers does sound amazing.

25:39

>> You don't want the title of this video

25:40

to be orbital data centers. THERE'S

25:42

GOING TO BE AI in space. That's going to

25:45

How many clicks is that going to

25:46

generate? Come on, man. space. Go ahead.

25:49

>> That's my That's my rock opera I'm

25:51

writing. That's crazy. How did you know?

25:53

Case

25:54

>> cuz I am also writing a rock opera

25:56

called Ali in Space as it turns out.

25:58

>> Oh, we could collab.

26:00

>> Yep.

26:01

>> That's not going to happen. Uh he's just

26:02

he's just going to take it all from you.

26:04

Okay. So, I guess we should

26:08

we should probably uh talk. I think the

26:10

number I get a lot of concerns uh from

26:13

people and one of the most frequent

26:15

questions I've been getting lately is is

26:17

my future just reviewing code.

26:21

>> Yeah. So, quite possible. Um let me let

26:24

me try to be measured in what I like how

26:28

likely I think that is. Um so look one

26:30

one thing that I've been um I I have

26:33

discovered in my own work and that I've

26:34

been saying publicly for a while now is

26:37

uh so I I try to work on um like

26:40

repeatable and reproducible AI results.

26:42

So like the bucket term I use for that

26:44

is has been reliable AI but it turns out

26:46

that there's a company called reliable

26:47

AI and so anyway so

26:51

uh so anyway uh the thing that I I I I

26:54

think you can do right now like

26:55

completely handsoff and not have to

26:56

review anything right so this is what I

26:58

mean by reliable right that like if I

27:00

ask you th to whatever go implement uh

27:04

like download this thing from this HTTP

27:05

endpoint and like save it into the

27:07

database and give me a dashboard that

27:10

whatever calculates the averages of some

27:11

things, right?

27:12

>> I can just tell you that and expect a

27:14

result and it will work and I don't have

27:15

to talk to you about it again, right? So

27:17

that's what I mean uh in this completely

27:18

hands-off style. Uh right now I think

27:20

the limit of that is somewhere around a

27:22

couple thousand lines of like relatively

27:24

standard junior level code. Uh and I

27:26

want to emphasize because some people

27:27

some people have uh justifiably pushed

27:30

back on this a bit. I'm not saying you

27:31

can't do more by having oversight and

27:34

doing multiple tries and having

27:35

algorithmic test suites. I'm not saying

27:37

you can't do more right now. What I'm

27:38

talking about is what's the limit of

27:39

what you can do hands-off and you can

27:40

mostly just trust that it happens. Mhm.

27:43

>> Um

27:44

I think I think at a at a minimum we are

27:48

going to go through a

27:50

um reviewheavy phase because the the

27:55

businesses are as as I'm sure you know

27:57

um the businesses are trying to shove

27:59

this into everyone's workflow whether

28:00

they like it or not. Right. And I I get

28:02

lots of people lots of people even in

28:04

the AI business who tell me like I hate

28:06

how much they're trying to push us to

28:07

use these things, right? I just

28:09

>> I just want to do my job. Uh but like

28:11

they're monitoring my token usage,

28:12

right? I mean this this

28:14

>> anecdotally too not just like literally

28:15

like I have friends who are like we have

28:17

to use this tool

28:19

>> part of my KPI is like we use we have to

28:22

use clawed code

28:24

>> only. That's and my manager has a

28:28

dashboard with my token count for the

28:30

month and it is a thing we talk about

28:32

and review. Not even like oh you can use

28:35

whatever one you want. We just want to

28:36

see you guys experimenting. They're like

28:37

you have to use cloud code. You have to

28:39

do this. Yeah, it's crazy.

28:41

>> Yeah. Think about Amazon Curo. That's a

28:43

lot of people are talking about that

28:44

right now with the whole Amazon

28:45

accidentally having several SE ones.

28:48

>> Yeah. They're like, you have to use our

28:49

stuff to take ourselves down. If

28:51

anyone's going to bring it down, it's

28:52

going to be me. Okay. That's

28:53

>> We want to make sure we our but we want

28:55

to make sure our site goes down and our

28:57

AI looks bad at the same time. We don't

29:00

want like someone else's AI to look bad.

29:02

>> No, that would be mean. Okay. If it's

29:04

going to if it's going to go down

29:05

because of AI, it's our AI. They're

29:07

putting the eye in AI. Okay.

29:08

>> There you go. Very good. Yes.

29:11

>> So yeah, Prime on on that point I think

29:13

it's very likely that we will go through

29:15

a u reviewheavy phase. Uh I think I

29:21

think that at some point um

29:24

uh like the workers will object to this

29:27

enough that we're going to have to find

29:29

something something more acceptable,

29:30

right? because I don't know if you have

29:31

tried it, but if you're just like

29:33

reviewing stuff like you know claw dumps

29:36

another thing on you every you know five

29:37

minutes you're like okay now I have to

29:38

review this and approve and merge

29:40

whatever or or I mean you can just yolo

29:42

right and but but we've seen what

29:43

happens if you just yolo um so uh I do

29:48

think we will at a minimum go through a

29:51

like reviewing mostly reviewing phase

29:54

and I think people are not going to like

29:56

it

29:57

that's my expectation I don't So I don't

30:00

know how uh I don't see what the

30:02

alternative is right now because like e

30:04

so the either

30:07

>> the two possibilities I see right now

30:08

are uh like they're going to be

30:10

monitoring your token count and so you

30:12

need to be consuming and generating

30:14

tokens and then that has to go that has

30:17

to get uh linked which uh you know in

30:19

some places they actually link the like

30:21

the tokens to PRs right

30:23

>> uh and so

30:25

>> you need to be generating lots of PRs

30:26

you need to be consuming lots of tokens

30:29

How can you do that other than

30:32

just generating many many PRs with AI

30:34

and reviewing and like hoping that

30:37

you're not taking down, you know, taking

30:40

down fraud by by missing something in

30:42

your review, right? So, I I don't know

30:44

how we get out of that other than uh at

30:46

some point the business side says,

30:48

"Okay, like we can cool it, right? we're

30:50

going to um we're going to stop

30:52

monitoring your token use, which I mean

30:54

I I personally I feel like that's not

30:56

that different from a desktop monitor or

30:58

like a keyboard monitor or something. It

30:59

feels very invasive to me. Like, hey,

31:01

you know, I understand you're asking me

31:03

>> I think it's fair to ask me, hey, can

31:04

you use this tool? Can you try to do

31:06

better work? I think that that's fair,

31:08

right? Like I'm ultimately you're like

31:10

you're being paid to do something for

31:11

the business. I think it's fair for them

31:13

to ask, hey, we'd we'd like to be using

31:14

these tools and we think you can do

31:16

whatever like 20% more work if you can

31:19

if you use this. I think that's fair. I

31:21

think

31:22

>> the micro micro tracking feels worse

31:26

than the past. And um

31:30

>> it's really easy to game too, right? You

31:32

just like sequentially look at every

31:34

single file and it's just going to be

31:35

like dang dang dang like

31:37

crushing the tokens. I know people who

31:40

are like at big tech who are working on

31:42

stuff and they have to do what exactly

31:44

what you're saying and they're saying I

31:46

I like I do I do this just to keep up

31:49

appearances for my AI use and you know

31:53

by the end like my output changes maybe

31:56

10%. Right.

31:57

I do like how it's literally just

31:59

setting money on fire and they're like

32:00

this makes me look fast because so that

32:03

actually before we got distracted on the

32:04

orbital data centers which I'll repeat

32:07

coolest name ever

32:08

>> super cool command

32:10

>> I uh was I was interested in you know

32:14

some of your thoughts about sort of the

32:15

tokconomics side of things because right

32:18

now it feels like people are burning a

32:20

bunch of money trying stuff out and

32:21

exploring and willing to just be like oh

32:24

yeah you just spent $5,000 this month.

32:27

Whereas before, like you couldn't get

32:29

like a $45 like online course approved

32:32

and they're like, "Oh, but yeah, yeah,

32:34

$5,000 of tokens for nothing. Yes, we're

32:36

totally down for that. Do that every

32:37

month." because I I was interested to if

32:39

you know

32:41

uh like if we're going to see a big you

32:44

know or your thoughts on if we're going

32:45

to see a big push on like generating

32:47

less or like more focus on like we're

32:50

going to use more tools to check what's

32:51

being gener you know because at some

32:53

point you know Claude Claude just

32:55

released their thing about code review

32:57

and they're like yeah usually it's like

32:58

15 or 25 bucks a review

33:01

>> and you're like well that adds up kind

33:03

of fast especially at the rate people

33:06

are saying I'm shipping 10,000 lines of

33:09

code today or I made a 30 300,000 line

33:12

Ruby on Rails application for my blog.

33:14

That's Gary Tan by the way Casey in case

33:16

you didn't see that tweet.

33:18

>> Okay. Um, hey, he's boiling a lake and

33:21

then he's going to boil the ocean. All

33:22

right. That's what he told us.

33:24

>> Yep.

33:24

>> I side note, I cannot tell if he's the

33:26

most genius rage baiter of all time, but

33:30

we can circle back to that on a

33:32

different episode.

33:32

>> Yeah.

33:33

>> Um,

33:34

>> yeah. So, because I'm I'm interested in

33:36

in sort of that angle because I do find

33:38

like some tasks I can get done like a

33:40

lot faster with letting agents work on

33:42

it or like agents spin on something like

33:45

forever. It has a really clear outcome.

33:46

There's some stuff I already have a

33:48

bunch of patterns in my codebase spin.

33:50

But I'm like, is is everyone just going

33:52

to be chill forever with me spending

33:55

$500 of tokens overnight on this? That

33:57

doesn't seem like what a business would

34:00

like. I don't think we're going to get

34:01

$500 of value back from this feature.

34:04

So, I don't know if you have any

34:05

thoughts like in that vertical or

34:07

>> um Yeah.

34:08

>> So, this is speculation.

34:10

>> Yeah. Speculation. Uh a lot of this is

34:14

>> um like principal agent problems for the

34:17

business. It's like

34:19

>> the So there are for sure uh like CEOs

34:23

and like one step down from CEOs who one

34:25

way or another um their uh their

34:29

personal benefit is tied to claiming

34:32

that they did a lot with AI.

34:34

>> Yes, it's true.

34:35

>> Yeah.

34:36

>> Um and as you know uh like in any big

34:39

organization you never just evaluate

34:41

something for whether it's good or not.

34:42

It has to be evaluated through some KPI,

34:44

right? Uh, so

34:46

>> yes,

34:46

>> there are people I'm sure like I haven't

34:48

seen this personally, but I'm sure that

34:49

something close to this exists that um

34:52

there is someone with many millions of

34:54

dollars worth of bonuses tied up to uh

34:57

can I get our token usage to double this

34:59

year and can I get our PRs to increase

35:02

by 20% as attributed to that token

35:05

increase? Right. Right. Like extremely

35:07

bureaucratic thinking about using this

35:10

new technology. Right. So I'm sure that

35:11

there are many people like that. Um, and

35:13

so, uh, those to those people, they're

35:17

not they're not spending their own

35:18

money, right? They're spending the

35:19

company's money. They're tell they're

35:20

telling you to spend the company's money

35:21

to do something that makes the metric go

35:23

up that gets them a bonus, right?

35:24

>> Yeah.

35:25

>> Um, so I don't know how long that's

35:27

going to last, but there's certainly

35:29

right now

35:30

>> um, a very large premium at multiple

35:32

levels of the kind of the business stack

35:34

where people are like, you know, why why

35:36

isn't your token usage double, right?

35:38

And

35:39

>> yes,

35:39

>> some people some people that sounds like

35:41

a joke. I have friends who literally

35:42

they say my boss came and asked me why

35:44

hasn't my token use usage doubled right

35:46

but you only used $300 of cloud last uh

35:49

last month

35:50

>> honey you barely touched your cloud last

35:52

month what's going on

35:54

>> is something wrong sweetheart

35:56

>> okay so I do actually want to follow up

35:57

on that because this is the you know

35:59

this was one of the big things that has

36:01

been going around is this Amazon hero

36:03

thing taking down everything and now

36:04

they're saying hey we're going to make

36:05

it so that juniors and mid-level people

36:07

who use generative AI must have a senior

36:10

sign off on this.

36:11

>> Can I do a quick question on that as

36:13

well, Prime? Which is, was there a

36:15

policy before seniors didn't have to

36:17

review junior's code just in general?

36:19

>> I assume I assume I assume it's like a

36:20

lot of companies, which is that a junior

36:22

could have a mid-level person review the

36:24

code and say, "Hey, this is not really

36:25

good." Like, not every change needs to

36:27

have, you know, the same cuz you you

36:29

effectively will exhaust and lose your

36:31

senior population if they have to review

36:32

every single PR. And so, kind of

36:34

>> that makes sense. So excluding the

36:37

personnel problem that will likely

36:38

develop from this with Amazon, is this

36:40

like the first sign of people realizing

36:41

that token measurement isn't necessarily

36:44

the best metric? Because I assume that's

36:46

what's hurting Amazon is that they

36:48

really push super hard as token metric

36:50

is the the greatest thing that has ever

36:52

existed and you must only push on this

36:54

one thing and now they're feeling some

36:55

of the effects of maybe moving too fast.

36:57

Is there a world that's going to exist

36:59

where people are going, okay, instead of

37:02

trying to double metrics, maybe we

37:03

should double some other thing. Say, I

37:06

don't know, uptime. Maybe like uptime

37:09

could be the thing that we're like, you

37:10

know, that we're valuing and then that

37:12

can be that could be it. Like is is this

37:14

an actual

37:15

>> like path forward or are we

37:17

>> I don't from your perspective, are we

37:19

just going to see continued push on

37:21

double your token usage, double your

37:22

token usage? because they both sound

37:24

super appealing I guess from a purely

37:26

theoretical point of view of like oh

37:27

more features we could know we could get

37:29

all the features or no let's be stable.

37:33

>> Uh again I think they so as we as we

37:35

discussed with Casey and Casey you

37:37

should you should jump in on on this

37:39

part. Um like we've seen these cycles of

37:43

like everything has to be you know like

37:46

25 years ago everything had to be online

37:48

and then everything had to be uh web 2.0

37:52

And then everything had to be uh like

37:54

social local mobile app and then

37:56

everything had to be crypto right and

37:57

that right so um we are in that phase

38:00

absolutely right now where separate from

38:03

any utility that the technology is

38:05

offering there's just this social mania

38:07

of everything has to be this right now

38:09

uh

38:10

>> I don't know when that fever will break

38:12

um I think certainly events like the

38:14

Amazon um Amazon event will will help

38:18

with that and I I do think that at some

38:20

point people someone is going to say,

38:22

"Okay, look, like um we just can't be

38:24

burning this kind of money all the time

38:26

and also like setting our infrastructure

38:28

on fire and driving our engineers

38:30

crazy."

38:31

At some point, at some point, we should

38:34

actually get a benefit from this instead

38:35

of hurting ourselves.

38:36

>> You can only pick two points on that

38:38

triangle. Drive engineers crazy, burn a

38:40

lot of money, uptime. You get to pick

38:42

two of those points.

38:43

>> Yeah, it's true.

38:44

>> Um but, uh I think I think this could

38:48

last for years. And the reason I say it

38:51

could last for years, I know people will

38:53

not like that. Uh I think the reason

38:55

this can last for years is uh it's easy

38:57

to underestimate

38:58

um like on like you know you guys or

39:01

like people who watch this uh you know

39:03

watch the stream how far how far up the

39:06

the like power law curve of uh early

39:09

adopter you are. And I

39:13

>> just meet lots of people all the time

39:15

who

39:16

>> still conceptualize AI as like it's a

39:18

better search engine, right? And I I

39:21

mean it's uh it's easy to forget that

39:24

actually the vast majority of people

39:26

don't know much about this and aren't

39:27

really trying to use it. And um so all

39:31

those people somehow will have to be

39:33

carried through the uh the fever swamp.

39:36

Um and I don't know like I think that

39:38

could take years. Um,

39:40

>> I would agree with that uh for and add

39:43

the sort of the chaser which I think we

39:46

did talk about on that first episode of

39:47

the podcast which was I don't see it as

39:50

mattering whether it takes down proud

39:52

all the time either is the problem

39:54

because like at least my experience for

39:56

the past 20 years has been it doesn't

39:59

matter how bad the practice is. If it's

40:02

just something that is in the zeitgeist,

40:04

then people do it and you can show them

40:07

>> clearly like you can literally

40:08

demonstrate, look, if you had done it

40:10

this way, this is how much better it

40:11

would be. They don't care because the

40:13

best practices is that you do it this

40:15

other way. The best practice is you use

40:17

Python, then we're using Python or

40:19

whatever it is. And you're like, look,

40:20

if you had just written it this way,

40:21

it's a hundred times faster. And you're

40:23

like, we don't care. That's how this

40:26

industry has operated. And so it would

40:28

be very weird to think that in the

40:30

special case of AI that anyone would

40:33

care one way or another whether it even

40:35

was better. Right? So you can almost

40:37

take your take your guesses about how uh

40:41

good AI will be in practice about

40:43

generating code in the first place. You

40:45

could even put that aside. I think the

40:47

fact that it has this much momentum is

40:49

enough to know that this will be here

40:51

for uh a long time. the current way

40:53

we're doing things. Even if it never

40:55

gets any better than it is right now,

40:56

even if it stayed exactly as good as it

40:59

is right now, no improvements literally

41:01

at all,

41:02

>> I I don't think that would change the

41:04

outcome. Honestly, I really don't.

41:06

>> Well, and there's like a trillion

41:08

dollars invested in it, too, right? So,

41:09

they're like, well, even if even and

41:12

it's I mean,

41:13

>> it's not really even Sunk fallacy. It's

41:15

just like literally sunk cough truthy

41:18

which is like we really we really need

41:20

this to work guys or we're really

41:22

underwater right it's not like

41:25

>> so it will become the workflow I think

41:27

that's like inevitable uh and

41:32

>> like I wish that it I wish that we could

41:34

say like well if the AIS don't improve

41:37

uh substantially then people would like

41:39

rethink the I don't I don't think that's

41:40

true like I think whether or not they're

41:43

able to push the quality up. I mean,

41:45

hopefully they are cuz, you know,

41:47

everyone is putting a lot of money into

41:49

pushing the quality of the ads. So,

41:50

hopefully they do get better. But even

41:52

if they don't, I don't think it's going

41:53

to matter, personally. Hey guys, if you

41:55

like this episode, you can watch the

41:57

rest of it on the Spotify. And don't

41:59

forget to like and subscribe. Woo! See

42:03

you later.

42:03

>> Boot up today.

42:06

Five errors on my screen.

42:11

Terminal coffee

42:14

and

42:15

living the dream.

Interactive Summary

This video features a discussion about the current state and future of AI, including a debate on the projected decrease in token costs, the potential impact of AI on different career stages, and the controversial topic of orbital data centers. The conversation also touches on the business strategies behind AI adoption, the increasing reliance on AI tools in software development workflows, and the potential for AI to automate coding tasks. A significant portion of the discussion revolves around the economic implications of AI, such as token costs and investment, as well as the practical challenges and user experiences with AI-generated code and the evolving role of human oversight in AI-assisted development.

Suggested questions

10 ready-made prompts