HomeVideos

The Supply and Demand of AI Tokens | Dylan Patel Interview

Now Playing

The Supply and Demand of AI Tokens | Dylan Patel Interview

Transcript

1425 segments

0:00

What used to matter a lot was execution

0:02

was very very [ __ ] difficult and

0:04

ideas were cheap. Now ideas are cheap

0:06

and plentiful but execution is very

0:08

easy. So really only the good ideas are

0:11

the ones that can justify the spend on

0:13

super cheap implementation.

0:30

You told me this incredible story about

0:32

how your own team's use of tokens has

0:35

changed dramatically this year. Yeah.

0:37

Retell that story and what it is

0:39

teaching you about what's going on in

0:40

the world.

0:41

>> Last year we thought we were heavy users

0:42

of AI. Everyone's using chat GPT.

0:45

Everyone's using cloud. Everyone's got

0:47

you know I'm providing whatever

0:48

subscriptions anyone wants on the order

0:50

of spend of like tens of thousands of

0:52

dollars for our firm. This year the

0:54

spend has just skyrocketed and and it

0:56

really started in late December with

0:59

Opus that included Doug who's president

1:01

uh Doug Olaflin. He's very much like

1:03

leading the charge in the sense of like

1:04

non-technical people using uh AI for

1:08

coding. Um and so he's basically pled

1:11

the whole firm slowly over time. I think

1:13

he's been the the leader in doing that.

1:14

Obviously the engineers were using it

1:16

anyways but spend in January just

1:18

started to inflect and rocket and rocket

1:20

and rocket and rocket. Um, we signed,

1:23

you know, an enterprise contract with

1:24

Anthropic and it's gone to the point

1:26

where now, um, I think when I last

1:28

talked to you it was 5 million spend

1:30

rate. It's actually 7 million spend

1:31

right now.

1:32

>> That was last week, by the way.

1:33

>> A lot of that is just the usage, right?

1:35

What's what's really, you know, people

1:37

people who are have never coded before

1:39

are using cloud code and spending

1:41

thousands of dollars sometimes a day,

1:44

but across a firm, we're spending $7

1:46

million a year now on cloud code at the

1:49

current rate. um versus our salary

1:52

expense being in the neighborhood of $25

1:54

million. So, you know, we're north of

1:56

25% of spend on cloud code as a

1:59

percentage of salary. And if this

2:00

trajectory continues, then you know,

2:02

we'll spend more than 100% by the end of

2:03

the year. Uh which is a bit terrifying.

2:06

Thankfully, I don't have to decide

2:08

between people and AI because our

2:10

company's growing so fast. It's, you

2:11

know, more so like, okay, well, I don't

2:13

have to hire nearly as fast and I can

2:14

spend a lot more on AI and it works and

2:16

we just grow faster. But I think other

2:18

folks will start to reckon with the fact

2:20

that, huh, if this person can do the

2:23

work of five to 10 to 15 people uh using

2:26

cloud code, then all of a sudden I

2:29

should probably cut people. But right

2:30

now, I think the use cases are so broad.

2:33

For example, one thing is we have a

2:34

reverse engineering lab in Oregon that

2:36

we've been building for a year and a

2:37

half. We have a bunch of, you know,

2:39

fancy microscopes, scanning electron

2:41

microscopes. The whole purpose of this

2:42

is you reverse engineer chips. You get

2:44

uh the architecture out of it. you get

2:46

the materials that they're using to

2:46

manufacture and this is some of the data

2:48

we sell. This is a very slow process of

2:50

analyzing that data. Instead, um one

2:53

person on the team, they've been able to

2:54

spend with a couple thousand dollars of

2:56

cloud tokens. They've been able to

2:57

create this application that is GPU

2:59

accelerated runs on a server that we

3:02

have at Coreweave and anytime we send it

3:04

an image, it's able to take the picture

3:05

of the chip and overlay where every

3:07

single material is. Oh, this part is

3:09

copper. Oh, this part of the gate is uh

3:11

tantelum. This part of the gate is

3:12

germanium. This part of the gate is

3:14

cobalt. And so you can do a finite

3:15

element analysis of the entire stackup

3:17

of the chip very very quickly visual

3:20

with a dashboard guey it's everything

3:22

few thousand dollars would took claude

3:23

the person previously worked at Intel

3:25

and he said that was an entire team's

3:27

job to build that and maintain that now

3:29

rack that up across you know the entire

3:31

firm it's it's insane another example

3:33

that I think is super fun is Malcolm

3:36

who's an economist at a major bank

3:38

before um their economist department was

3:41

like 100 or 200 people what he built was

3:44

the most incredible thing ever. He piped

3:46

all of this different data, you know,

3:48

FRED data and all these other data,

3:49

right? Employment reports and all these

3:51

other things from various APIs. We

3:53

signed a couple contracts with folks to

3:54

get API access to data. Pulled it all

3:56

in, started running regression, started

3:58

looking at the impact of various

4:00

economic revolutions on the economy um

4:03

from a deflationary inflationary

4:05

perspective. The BLS has this entire um

4:08

Bureau of Labor Statistics has this

4:10

entire like set of like 2,000 tasks. And

4:12

so he did that with AI, which ones can

4:14

be done by AI, which ones cannot, and

4:15

grading them across a rubric. You know,

4:17

about 3% are doable now with AI. Um, and

4:20

so he's created this like metric so that

4:22

you can measure things that can be done

4:24

by AI, what what the massive

4:25

deflationary uh, you know, what the cost

4:28

of being able to do those with AI and

4:29

therefore the deflationary aspect of it.

4:31

You know, output can go up. It's called

4:32

phantom GDP is what he called it.

4:33

Phantom GDP. Output can go up, but

4:35

because cost falls so much, actually GDP

4:37

theoretically shrinks. So he created

4:39

this whole analysis and a brand new

4:40

benchmark of uh language models um a set

4:44

of evals across 2,000 different evals.

4:46

Right.

4:46

>> This all by himself.

4:47

>> This is all by himself. Yeah. And he's

4:48

like dude this would have taken the team

4:50

of 200 economist a year. He's just like

4:53

he's like completely cracked out on

4:54

claude. He's like everything has

4:55

changed.

4:56

>> How do you think about as a business

4:57

owner going from close to zero to 25%

5:01

accelerating towards whatever percent of

5:03

total spend? Like at what point are you

5:04

like, whoa, I need to put the brakes on

5:06

this and be careful how much we're

5:08

spending. Maybe we don't need to spend

5:10

on the most cutting it on Opus 4.7,

5:12

which came out today. Maybe I can

5:13

throttle it back to something that's a

5:15

little bit cheaper.

5:15

>> Ultimately, like I'm in the information

5:17

business, right? That that is, you know,

5:18

we sell analysis, we sell, we do

5:20

consulting, we create data sets. I don't

5:22

see why this wouldn't be completely

5:24

commoditized on a pretty rapid basis if

5:27

I'm not constantly improving. my first

5:29

product that I was selling as a data set

5:31

actually it is you know like there's

5:33

more people trying to do it now we've

5:34

made it constantly better and better and

5:35

better and more detailed and so

5:37

therefore it sells a market but the way

5:39

we were doing it in 2023 is not terribly

5:42

different than you know is it's it's

5:44

basically what everyone else is doing

5:45

now if I don't move up the bar then I

5:48

will be commoditized if I don't move

5:50

fast enough I will also lose my edge so

5:52

the question is yes AI commoditizes

5:55

things just like it commoditizes

5:56

software those who can move fast and

5:59

keep control of their customers and keep

6:01

providing them an awesome service and

6:03

keep improving the service won't shrink.

6:05

They'll grow. They'll grow faster. Those

6:07

who are incumbent and not doing

6:08

anything, they're going to lose. And so,

6:10

it's a bit of an existential like if I

6:13

don't adopt AI, someone else will and

6:14

they will beat me. Uh, another easy

6:16

example is the energy space. So, we've

6:18

had a few energy analysts for a couple

6:20

for like a year now. We've been trying

6:21

to build out this energy model. It's

6:23

very complex. Energy's data services

6:25

market is something like $900 million.

6:27

So obviously a huge market for me to try

6:29

and break into but it has you know we

6:30

really hadn't broken into the energy

6:32

data services business despite a year of

6:34

having multiple people on the team. Um

6:35

then cloud code psychosis hits one of

6:38

the people who leads the data center

6:39

energy and industrial sort of business

6:41

at semi analysis uh Jeremy hits him and

6:44

now all of a sudden in 3 weeks um he

6:47

spent a lot he was spending like $6,000

6:49

a day. It was an insane amount but he

6:51

scraped every single power plant in the

6:53

US every single transmission line above

6:55

a certain voltage. um and created this

6:57

entire mapping of the entire US grid as

7:00

well as a lot of demand sources all from

7:02

various public sources of data. Um and

7:04

we've shown it to and and we built and

7:06

it's got like this dashboard where you

7:08

can view and check you can see all the

7:09

micro regions of the US where there's

7:11

power deficits and surpluses. Um all of

7:14

these details built in a handful of

7:16

weeks we started showing some of our

7:17

customers who buy our data center data

7:19

set but are energy like traders. We

7:21

showed some of them and they're like wow

7:23

how long did this take you? This is

7:25

really good. this is better than XYZ

7:26

company and then we like get dig deeper.

7:29

XYZ company has 100 people and have been

7:31

working on this for a decade now.

7:32

Obviously our thing is not fully robust

7:34

as robust but in some ways it is better.

7:36

I'm going to commoditize these energy

7:37

services companies, data services

7:39

company. Who's going to come commoditize

7:40

me if I don't move faster? And so the

7:42

question from a business owner's

7:44

perspective is yeah I'm spending a lot

7:46

but what does that spend getting me? Is

7:47

it getting more revenue? Yeah.

7:50

>> Most software companies try to maximize

7:52

your time on their app to juice

7:53

engagement. RAMP does the exact

7:55

opposite. RAMP understands that no one

7:57

wants to spend hours filing expense

7:59

reports, reviewing expense reports, and

8:01

checking for policy violations. So, they

8:03

built their tools to give that time

8:04

back, using AI to automate 85% of

8:07

expense reviews with 99% accuracy. And

8:10

since RAMP saves companies 5%, it's no

8:12

wonder that Shopify runs on RAM, Stripe

8:14

runs on RAM, and my business does, too.

8:16

To see what happens when you eliminate

8:18

the busy work, check out

8:19

ramp.com/invest.

8:22

OpenAI, Cursor, Enthropic, Perplexity,

8:24

and Verscell all have something in

8:26

common. They all use Work OS. And here's

8:28

why. To achieve enterprise adoption at

8:30

scale, you have to deliver on core

8:32

capabilities like SSO, SKIM, Arbback,

8:35

and audit logs. That's where work OS

8:37

comes in. Instead of spending months

8:38

building these missionritical

8:39

capabilities yourself, you can just use

8:41

work OS APIs to gain all of them on day

8:44

zero. That's why so many of the top AI

8:46

teams you hear about already run on work

8:48

OS. Work OS is the fastest way to become

8:51

enterprise ready and stay focused on

8:52

what matters most, your product. Visit

8:54

works.com to get started. Felix by Rogo

8:58

is a personal finance agent that turns a

9:00

single prompt into finished clientready

9:02

work using your firm's own templates,

9:04

context, and standards. Send Felix an

9:06

email like, "Take these comments and

9:08

turn them for me, or update my tracker

9:10

with the context of these emails, or run

9:12

the ability to pay math on this buyer."

9:14

And Felix sends back finished PowerPoint

9:16

decks, Excel models, and sourced

9:18

research. Felix works the way your team

9:20

already does, delivering work quickly

9:21

and accurately around the clock. Learn

9:24

more at rogo.ai/felix.

9:26

Are you worried that in the limit the

9:29

people that control capital and

9:30

investing capital who are often hiring

9:32

you for for what you do will just say,

9:35

"Well, we have analysts too who are

9:36

really smart about this. Like, we'll

9:37

just build this ourselves." Like if it's

9:39

getting that easy, at what point does it

9:41

just all pull into the investment firms

9:44

that stand to gain the most because they

9:45

have the most leverage on top of the

9:47

data or the insights that that they

9:48

glean?

9:49

>> First of all, any information services

9:51

business, obviously I don't generate as

9:54

much value as my customer does from such

9:55

information. Uh because if I sell you

9:57

information for a dollar, you're only

9:59

buying it for a dollar because you know

10:00

that information helps you make a

10:02

decision that lets you make more than

10:04

$1. And so therefore, you have you have

10:06

arbit you you have made more money off

10:08

of me than I did from the information

10:09

myself. An investment fund, these

10:11

investment funds all have their own

10:12

information services, you know,

10:14

especially like the super like the Jane

10:16

Streets of the world and the Citadels.

10:17

They're they're really detailed on their

10:19

data. And yet, um, these sort of folks

10:21

also purchase data from us and continue

10:23

to do so and continue to grow with us

10:24

because I think there's just some some

10:27

it factor, right? We move faster, we're

10:29

more nimble, we're a smaller team that's

10:32

focused on just one specific thing. uh

10:34

AI infrastructure and and the huge

10:36

revolution that causes in AI um and

10:39

tokconomics and all these things and and

10:41

we sort of really see where it's headed

10:43

and so we're moving faster and building

10:45

faster. Um I think investment

10:48

professionals just would you know yes

10:50

they'll try and build some of the stuff

10:51

we do and um more likely they'll just

10:54

buy the data from us and it's cheaper

10:56

for them to buy the data from us and

10:57

then to build and then build on top of

10:59

it than it is to build it themselves.

11:01

But ultimately some may try. I feel like

11:02

every conversation I have with you, what

11:04

I'm always getting at is just supply and

11:06

demand of tokens like that's the thing

11:07

that's interesting to me in the world

11:08

right now. What has this experience

11:10

taught you about the demand? Has it

11:13

changed your view on the demand side of

11:14

that equation? Just feeling it

11:16

viscerally yourself.

11:17

>> If we take a step back and look at the

11:18

macro lens, right? Enthropic has gone

11:20

from 9 billion revenue to what they're

11:22

at 3540 billion now. Probably by the

11:24

time this airs 40 45 billion, who does

11:26

ARR? Their compute has not grown to the

11:30

same degree. Um, and if you do the

11:32

calculations and you assume they didn't

11:33

decrease their research and development

11:35

compute, they clearly didn't. Their

11:36

release, they have Mythos, they have up

11:38

is 4.7. So they clearly didn't decrease

11:40

their research compute spend. Um, so

11:43

ultimately what they've done, even if

11:44

you assume all incremental compute

11:46

they've gotten has gone towards

11:47

inference, their margins are at a floor

11:49

of 72%. In reality, some of that

11:51

incremental compute they've got probably

11:53

went to research and development. It may

11:54

be higher than 72% gross margins. To be

11:57

clear, at the start of the year, they

11:58

started uh there was um there was a leak

12:00

by someone from their funding some some

12:02

of their funding round docs. Someone

12:03

leaked it 30 something% gross margins.

12:06

Where on earth does a business like this

12:08

grow margins like that? And it's in

12:10

principle, right? Their demand is so

12:11

high. They're able to cut back on usage

12:13

limits, rate limits, all these things.

12:16

Um, what really matters is having an

12:18

anthropic rep and having an enterprise

12:19

contract with them and getting the rate

12:21

limit increases that you need because

12:22

otherwise tokens are ultimately super

12:25

super in demand. Whoever whoever can pay

12:28

for them anthropic has the same problem,

12:29

right? Like I mean not problem, it's

12:31

it's just the reality of how capitalism

12:32

works. Yes, people are spending sending

12:35

them $40 billion AR in tokens and but

12:38

those tokens are generating way more

12:39

than $40 billion in value. Various

12:42

businesses will have different value

12:44

generation per token. But as we get more

12:46

and more intelligent, what really

12:48

matters is access to these most

12:49

intelligent tokens and leveraging them

12:51

at things. You as a person deciding what

12:54

is the best way to leverage these tokens

12:56

to grow business and generate value

12:58

because a lot of folks will want tokens

13:00

and generate tokens. Uh but the shitty

13:02

SAS startup and and and and SF who is

13:06

using Claude to generate, you know,

13:08

their software product is not

13:09

necessarily actually creating a ton of

13:11

value and therefore they're going to get

13:12

priced out of tokens uh soon enough.

13:15

>> Are you at all surprised that I I had

13:16

this experience just today where on the

13:18

flight here I got rate limited out on

13:21

something I saw 4.7 came out and what I

13:24

immediately wanted was like to be on 4.7

13:26

that second and I was it just I couldn't

13:29

think about using 4.6 anymore. or not.

13:31

This 47 is out. I was perfectly happy

13:33

with 4.6 for the last many weeks. It's

13:35

amazing. Are you surprised that people

13:37

are so insistent on going to the most

13:40

expensive leading edge thing to the

13:42

degree they are?

13:42

>> Without a doubt. One of my funniest

13:44

memories in the past month and a half is

13:46

myself and a buddy of mine, Leopold,

13:50

being on our knees in front of an

13:53

anthropic co-founder begging him for

13:54

access to Methos and then pretending it

13:57

doesn't exist cuz we knew it existed.

14:00

were like, "Please give us access." And

14:02

he's like, "I don't know what you're

14:03

talking about."

14:04

>> What was your reaction to that rate card

14:06

or that eval card coming out?

14:08

>> It was rumored in the Bay Area.

14:09

Everyone, you know, we sort of like knew

14:10

it was supposed to be really good, but

14:12

um if you just look at the benchmarks

14:14

and obviously benchmarks change over

14:15

time, Mythos is potentially the biggest

14:18

step up in model capabilities in like 2

14:21

years. I think that's really really an

14:23

an important detail that you know it

14:25

it's so good that they're like don't

14:27

want to release it even though they're

14:28

they they already announced the price to

14:31

their people that they did a selective

14:33

release for cyber for and it's like five

14:34

or 10x the token cost. They just don't

14:36

want to release it um because they're

14:38

worried about the like impact on the

14:39

world and they're releasing a shitty

14:42

worse version of open 47 to us and they

14:45

explicitly said in the model card hey we

14:47

actually preferentially made it worse at

14:49

cyber. I don't know if you read that.

14:50

whoever you are, if you have enough

14:52

capital, you should get a freaking

14:54

enterprise cloud uh enterprise anthropic

14:56

subscription where you pay per token,

14:58

not with these like subscriptions

14:59

because then you won't get rate limited

15:01

much. And then you must you need to

15:02

figure out how to leverage those tokens

15:03

to the highest value task um and make

15:05

money off of it because ultimately what

15:07

you're doing maybe maybe like a year

15:09

from now or two years from now the

15:10

business is actually just arbitrageing

15:11

tokens, right? The tokens are amazing,

15:13

but let's figure out what direction to

15:14

point them in and then three or four

15:16

years from now the model will know, you

15:17

know, what to do with the tokens and how

15:18

to make the most value. You know, you

15:20

can look at this retroactively. Pick any

15:21

benchmark. The cost to hit a certain

15:24

capability tier used to cost X and now

15:27

it cost 1/100th or 1/ 1,000th of that.

15:30

Deepseek, for example, on GPD4 was

15:33

1/600th the cost. And since then, the

15:36

costs have fallen further for GPD4 class

15:38

models. Of course, no one gives a crap

15:41

about GP4 class models. They want the

15:43

frontier because the frontier lets them

15:44

create the economically valuable things.

15:46

But GP4 class models can still be used

15:48

in like stuff and so people are using

15:50

them in some like tiny use cases. It's

15:52

just the cost have fallen so fast. It's

15:54

it's not really what's driving the

15:55

demand. What's driving the demand is is

15:57

all these new use cases. Yeah. Current

16:00

4.6 opus or 4.7 opus tier models a year

16:04

from now my spend for the same exact

16:07

quality of the model would probably be

16:10

like 70k. I bet you it'll be 100 times

16:13

cheaper. irrelevant because I'm going to

16:15

be using a way way way better model

16:16

which can do way way better things.

16:18

Enthropic mythos is more expensive as a

16:20

model but it spends a lot less tokens to

16:22

do the thing and therefore it is

16:24

actually cheaper in most tasks than 46

16:26

opus because it's just way more

16:28

efficient even though each individual

16:29

token is smarter.

16:30

>> When I last saw you Methos had just come

16:32

out maybe the day before or something or

16:34

the the card had just come out and you

16:36

said something like uh it actually made

16:38

you feel like a little scared it was so

16:40

good. What did you mean by that?

16:41

Anthropic's whole like goal in 2025 was

16:46

and and even a lot of 2024 they're like

16:48

hey by the end of 2025 we need an L4

16:51

software engineer uh in our model and

16:54

and they by and large achieved that with

16:55

46 Opus. What they didn't say is that

16:57

you know and if you look at Mythos and

16:59

if you compare like the benchmarks it's

17:01

like an L6 engineer. So L4 is like

17:04

pretty new. L6 is like quite well

17:06

experienced. I think Anthropic said that

17:08

the model internally was available in

17:10

February. So in two months they've gone

17:13

from L4 engineer to L6 engineer. Uh

17:16

what's next? Um you know when when you

17:19

think about the model progress it's only

17:21

accelerated. Enthropic release cadence

17:23

has compressed. Open's release cadence

17:25

has compressed. Why? Because these

17:27

models generally to make a better model

17:28

you need a few things right. You need

17:30

amazing compute. Compute is very

17:31

expensive and it has a time scale that

17:33

we you know we track and it's like you

17:34

know it's growing but like you know it's

17:35

it's sort of set in stone for the next

17:38

you know short short term. it's like

17:39

kind of set in stone what you've already

17:40

signed. Um there will be delays and

17:42

shifts and some somehow you can find a

17:43

little more but it's generally pretty

17:44

set in stone. There's amazing

17:46

researchers that people are paying tens

17:47

of millions of dollars for. And then

17:49

lastly there's implementation.

17:51

Historically has been very difficult. If

17:52

I have an idea now I have to implement

17:54

it. Implementing is hard. Now ideas are

17:57

there. Implementation is very easy. It's

18:00

expensive but it's very easy. So how do

18:02

you how does one decide what ideas to

18:04

implement? And it turns out if your

18:06

implementation is just so much easier

18:08

now you can just implement more ideas

18:10

and move on the treadmill faster and

18:12

faster and faster. Whether that is AI

18:14

model research and so now your model

18:15

release cadence is shrunk to down to 2

18:17

months from where it was 6 months before

18:19

or hey I want to I want to take every

18:21

power plant in the US and every

18:22

transmission line and model it and run

18:24

regressions and see the micro supply and

18:26

demand. I can also do that. The idea is

18:28

cheap. You know which idea makes sense?

18:30

which idea is worth the capital that you

18:32

have to spend on the tokens because the

18:34

implementation is there. It's it's

18:35

that's the I think the key learning and

18:39

if implementation costs continue to tank

18:42

which they are um we don't even have

18:44

mythos yet. It's only been you know a

18:46

handful of hours since Opus 47 launched

18:48

but you know my team is pretty excited

18:50

about it internally. What now comes to

18:52

the world uh it's a complete reordering

18:54

of how like economies work. What used to

18:57

matter a lot was execution was very very

18:59

[ __ ] difficult and ideas were cheap.

19:02

Now ideas are cheap and plentiful but

19:05

execution is very easy. So really only

19:07

the good ideas are worth are the ones

19:09

that can justify the spend on super

19:11

cheap implementation.

19:12

>> So are you actually scared or are you

19:14

just is it just does it just introduce

19:15

an uncertainty that's hard to grapple

19:18

with?

19:18

>> Uncertainty is there. Um but I do I do

19:22

think that causes some fear in terms of

19:26

how does society reform itself? How does

19:29

one

19:30

exist in a world where actually any you

19:33

know your ability to implement something

19:34

is not actually that important. Your

19:36

ability to choose the correct idea for

19:39

AI to implement and then your ability to

19:41

sell that idea or sell what the AI has

19:43

implemented is what matters. Your

19:45

ability to garner capital towards that

19:47

is what matters. And going back to the

19:49

point of like it's very important to

19:50

have the newest model always. Who's

19:52

going to have access to the newest

19:53

model? Anthropics project. I know it's

19:55

not called earwig, but I troll anthropic

19:56

people by calling it earwig. Um,

19:59

glasswig anthropic earwig, you know,

20:01

where they only release mythos to

20:03

certain companies for cyber. That's just

20:05

going to be something that continues.

20:07

Models will have less broad and less

20:09

broad deployment. I know I know Open AI

20:12

and Enthropic and all these people are

20:13

like, we want to have great AI for

20:15

everyone. AI is very [ __ ] expensive.

20:18

Who's going to pay for the trillion

20:19

dollars of infrastructure? People who

20:20

have money and can can build useful

20:22

things with AI. And then you don't want

20:24

people to distill your models. So you

20:25

don't release them broadly. Uh you

20:27

release them to a fewer and fewer set of

20:29

customers. Those customers are also now

20:31

wrestling over the tokens unless

20:33

anthropic jacks them. You know, they

20:34

could double their pricing on Opus and I

20:36

would continue to pay and I bet most

20:37

users would continue to pay.

20:38

>> I bet that wouldn't solve their

20:40

humongous capacity problem that they

20:42

have. So then the question becomes where

20:44

does this cycle end where you know token

20:47

usage and therefore the benefits of

20:49

those tokens the additional value

20:51

generated on top of those tokens

20:52

aggregates among fewer and fewer and

20:54

fewer companies. I don't have mythos.

20:56

You know who has mythos? Top freaking

20:58

banks. Um now they're only using it for

21:00

cyber security. But at some point I can

21:02

envision a world where hey maybe I

21:04

because I have an enterprise enthropic

21:05

contract and because enthropic people

21:07

kind of like me they're willing to give

21:09

us like slightly earlier access or

21:11

slightly higher rate limits or something

21:13

for a model. I hope that's what happens.

21:15

And then my competitor whoever that is

21:18

doesn't have that and I'm able to

21:19

[ __ ] crush them. There are people who

21:21

are like Ken Griffin of Citadel is like

21:23

super well-connected and super rich and

21:25

he's like he he just signs, you know,

21:27

who knows? He goes and signs a deal with

21:28

Open Arenthropic that's like, "Yeah, I'm

21:30

going to get access to your models. Um,

21:32

and I'll buy the first $10 billion worth

21:35

of tokens each year. So, whenever you

21:36

release the model, you know, I'll spend

21:38

the first 10 billion tokens and then

21:39

everyone else can get the model after

21:40

that." And it's like, okay, well, now

21:42

what does that do? Well, now he's going

21:43

to crush everyone in the market. That's

21:44

just an example. Could be cyber like

21:46

Anthropic is worried about, oh, now I

21:47

can hack people. could be information

21:49

services business like myself where I

21:50

crush someone else. I think you know it

21:52

it's it's such a broad base. We don't

21:54

know what these models can do. Anthropic

21:55

doesn't know what these models can do.

21:56

No one knows what these models can do.

21:57

It's up to the end user to figure out

21:59

where they can leverage the tokens to

22:00

see what they can build and imagine

22:02

which is tremendously productive and

22:04

uplifting for humanity. But then what

22:06

happens to the concentration of

22:07

resources and usage of it?

22:08

>> Presumably right now robotics or robots

22:11

consume relatively zero tokens versus

22:14

everything else. Do you see what's your

22:16

view of that? If that's like a second

22:18

demand curve that could start to

22:19

ratchet, there's a new startup every

22:21

single day, you know, within a mile of

22:23

here trying to build something

22:24

interesting in robotics.

22:25

>> So there's this concept of software only

22:27

singularity, which is that the world

22:29

has, you know, AI singularity, but only

22:31

in software. And now what about the rest

22:33

of the world? Vast majority of the world

22:35

is physical. You can see the world

22:38

orient around hardware, not software.

22:40

That's actually why I think software

22:42

only singularity is like just a blip and

22:44

not like a you know we we do get

22:45

everything else because once software is

22:47

super easy what makes robots really hard

22:49

it's like programming microcontrollers

22:51

and actuators and controlling all this

22:52

stuff is very difficult right now the

22:55

interesting thing about models AI models

22:57

is they're actually really inefficient

22:59

in learning it's just we're able to give

23:01

them so much data that they're able to

23:03

learn and surpass us in certain ways

23:05

robots currently the robot models um

23:07

VA's uh vision language action models

23:10

which is very popular right now is

23:12

probably not going to be the thing that

23:14

ultimately scales beyond. They are

23:16

inefficient in data um and we can't

23:18

scale the data for them fast enough.

23:20

There is going to be some way to large

23:22

scale pre-train robot models where just

23:24

like humans see all this data throughout

23:26

their lives. And what's interesting is

23:28

humans the reason why we're so good is

23:30

we're sample efficient. One example, two

23:32

example, we're good. And so applying

23:33

that to robotics. So once you once you

23:35

have this software only singularity

23:37

implementation is super cheap. anyone

23:38

can start to build these mo people can

23:40

start to build models that now robots

23:43

are actually useful and so I think in

23:45

the next six to 18 months we'll start

23:46

seeing real breakthroughs in robotics

23:49

that enable few shot learning i.e.

23:52

there's a pre-trained robot model and

23:54

now there's a robot that you have hired

23:56

or bought or whatever. You show it a few

23:58

examples and it's able to do it. You

23:59

tell it to stack these two things or you

24:01

tell it, hey, this can can actually like

24:03

balance perfectly, you know, and and it

24:05

starts doing these things.

24:06

>> Nicely done.

24:07

>> One shot.

24:09

>> No, trust me, I've spilled many of

24:11

times.

24:12

>> So, I think I think robots will get fot

24:15

learning right now. Now, you know,

24:16

there's a lot of companies doing robots

24:18

for like, you know, advertisement or

24:19

robots for like simple stuff like that,

24:21

but it'll be like, oh, folding clothes,

24:23

but it's going to get really niche like

24:24

robots just for cleaning chalkboards.

24:26

Um, and it's a rental service or, you

24:28

know, it'll be it'll be a model package

24:30

that you download onto your standard

24:31

robot that then does that, right? And

24:33

and you pay for that. And anyways, there

24:35

will be a huge explosion in physical

24:36

good acceleration and and deflationary

24:39

effects there. But and and so that's

24:41

that's ultimately going to keep token

24:43

demand going crazy. I I don't think

24:45

token demand slows down personally.

24:46

>> Did you learn anything else about the

24:48

world based on Mythos's results and how

24:51

it was built? My way of asking like the

24:53

you know if you break down the the

24:54

components of the scaling laws like the

24:56

>> So Methos is a materially larger model

24:58

than prior models and so yes it is a

25:01

much larger model. Now whether or not

25:03

it's it's what chip it's trained on is

25:05

not really relevant. It's the scale and

25:07

obviously you know to a 100,000 black

25:09

wells is equivalent to hundreds of

25:11

thousands of prior generation chips.

25:12

TPUs and tranium have their different

25:14

release cadence. So it's not exactly

25:15

like mirrored one to one. Um but

25:17

ultimately yes mythos is a significantly

25:19

larger model. It's proof that the

25:20

scaling laws still work. Um everything

25:22

about it shows the trend line continues

25:24

of models. More compute into model makes

25:26

model better. And along the whole way

25:28

it's not just more compute into model

25:29

makes model better. along the whole way

25:31

we're also getting these compute

25:32

efficiency wins which are you know as as

25:35

all this research compute that the labs

25:37

are spending is actually turning into if

25:39

I want x capability tier model every 6

25:42

months that cost or every two months

25:43

that cost is dramatically decreasing but

25:45

then if I scale it up massively I get a

25:47

humongous capability jump as well and so

25:50

yes it's it's proof that this is still

25:52

happening Google and anthropic are not

25:53

heavy heavy users of GPUs on the

25:55

training side but openai they'll they'll

25:58

start having their new class of models I

26:00

think they're taking a more sensible

26:01

principled approach to scaling uh in

26:04

small steps. Enthropic really went for a

26:06

huge jump. We'll see better and better

26:07

models throughout the year and the

26:08

release cadence is only going to get

26:10

faster.

26:10

>> We've gone a long way in the

26:11

conversation with saying almost nothing

26:13

about OpenAI which would have been so

26:14

strange.

26:15

>> So, so this is this is the interesting

26:16

thing. Everyone's like, okay, so

26:18

Anthropics just won, right? You know,

26:19

they had Methos in February. They never

26:21

even released it cuz they didn't feel

26:22

the need to. They're already sold out.

26:23

Their revenue is already adding $10

26:25

billion a month. Um and then you've got

26:27

Opus 47 today all before open eyes you

26:31

know um alleged Spud release which you

26:34

know media such as the information and

26:36

others have have posted about. So

26:38

clearly Anthropic is in the lead right

26:40

and OpenAI is cooked. What's interesting

26:42

is because Anthropic has such bounds on

26:45

compute and they can only grow it so

26:48

fast and sort of to the point of you

26:49

know you know Daria Daria used to gloat

26:51

about how OpenAI was being too

26:54

aggressive on compute and Anthropic was

26:56

more sensible in their scaling and now

26:58

Enthropic is like [ __ ] we should have I

27:00

wish we had a lot more compute. OpenAI

27:01

is able to pay the bills perfectly fine.

27:04

In fact, they've raised a ton of money

27:05

to get incremental compute in addition

27:08

to the irresponsible levels of compute

27:10

that they were buying from Oracle and

27:11

Core and SoftBank and all these people

27:13

and Microsoft uh you know such as

27:15

Tranium. Now they're getting tranium as

27:16

well from Amazon. Um so so they've done

27:19

this like insane thing on compute and

27:21

they need know they also know they need

27:22

more. But what's interesting is if you

27:24

were to say Opus 46, you know, let's

27:27

ignore models getting better over time.

27:29

Let's just take diffusion of this

27:31

technology. You and I may get jump on

27:33

the model immediately day one, but other

27:35

businesses take time and it takes time

27:37

for people to learn and the spark of oh

27:40

[ __ ] claude psychosis moment doesn't hit

27:42

everyone at the same time. And so by the

27:45

end of the year, let's say a 46 opus

27:46

tier model the economy would spend

27:48

$und00 billion on. I don't think that's

27:50

unreasonable. It's spending $40 billion

27:51

right now.

27:52

>> That's like a linear extrapolation.

27:54

>> It's a linear extrapolation, not a not

27:55

an exponential. To get the exponential,

27:57

you need the better models. Enthropic

27:59

won't have enough compute to do that.

28:00

And so and and presumably OpenAI and

28:03

Google will hit that tier soon enough.

28:05

Whoever hits that tier next, sure,

28:07

Enthropic may get to charge 70 plus%

28:09

gross margins, but if OpenAI hits it

28:11

next, they charge 50% gross margins.

28:14

They still get all of this incremental

28:15

demand. And probably they also won't

28:17

have enough compute to serve all the

28:18

users. And so, sure, maybe Mythos is a

28:22

model where if the world had enough

28:23

compute, it'd be $500 billion of revenue

28:26

or something crazy. There is such demand

28:28

for these tokens and such limitations on

28:30

compute, you know, and we see this with

28:32

H100 prices skyrocketing and the useful

28:34

life of these GPUs continue to extend.

28:36

It's pretty clear even the tier 2 lab is

28:38

going to be sold out of tokens, let

28:39

alone the tier one lab. The tier one lab

28:41

will have better margins, but the tier

28:43

two lab will be sold out and probably

28:45

the tier three lab will also be close to

28:46

sold out. Economic value that the best

28:48

model can deliver is growing faster than

28:50

our ability to actually serve those

28:52

tokens to people via the infrastructure.

28:54

And so this gap will continue to grow

28:55

and the model labs will continue to have

28:57

expanding margins until people in the

28:59

hardware supply chain infrastructure

29:00

supply chain are like wait no why don't

29:01

I just jack up my margins. So suffice to

29:03

say I think the assessment today or your

29:05

assessment of the demand side is

29:07

completely explosive in your own

29:08

particular example here at semi analysis

29:10

but just more broadly that as people

29:12

fall in you call it AI psychosis as

29:14

people fall into this experience of what

29:16

they can do the implementation

29:18

difficulty going completely away I I've

29:20

certainly felt that you know my own

29:22

token spend is just through the absolute

29:23

roof just in the matter of weeks so that

29:26

that feels like a pretty good assessment

29:27

anything we're missing on the demand

29:28

side

29:29

>> if you don't use more tokens you'll

29:30

never escape the permanent underclass

29:32

just expand on that.

29:33

>> So either either you use more tokens and

29:35

you generate economic value outsized

29:37

economic value for the use of those

29:39

tokens. Um a lot of people are doing it

29:40

the boring lazy way. Oh, I guess I'll

29:42

just work one hour a day instead of

29:43

eight hours a day and I'll have AI do

29:45

most of my job. That's the boring way.

29:47

The cool way is I'll still work eight

29:49

hours a day and I'll I'll do 8x the work

29:51

and maybe I'll make 5x the money. Um

29:54

maybe not you can't do this with a job

29:55

obviously. There's people who have

29:57

multiple jobs. Um there's people who

29:58

like start companies and start selling

30:00

stuff. get that economic value on on

30:02

this AI before everyone is using it and

30:04

it's table stakes. Uh because it's still

30:06

not table stakes if you don't use more

30:08

tokens and generate the value from them

30:10

and capture that value. These there's

30:11

three different problems here. Using

30:12

more tokens, generating value from those

30:14

tokens and capturing value from those

30:16

tok uh from the value that you created

30:17

from the tokens. Uh if you don't do

30:19

these three things, you'll never escape

30:20

the permanent underclass i.e. as models

30:23

continue to skyrocket in capability and

30:25

the concentration of resources

30:26

potentially happens.

30:28

>> Okay, let's talk about supply. what is

30:29

going on like how would you describe the

30:31

frontier of what's changing or what is

30:33

changing at the frontier of supplying

30:35

the the entire stack that's required to

30:37

serve all these tokens as the demand

30:39

curve explodes

30:40

>> as demand skyrockets prices are going up

30:42

for everything on the supply side um

30:45

whether it be the NGPUs

30:47

uh their prices are going up in addition

30:50

their useful life is extending

30:51

>> H100 prices look like this

30:53

>> yeah exactly there's people who have

30:54

argued GPU's full lives are less than 5

30:56

years complete nonsense

30:58

Um there are clusters now resigning

31:01

three or foury old hopper clusters

31:02

resigning for 3 or four more years. Um

31:05

there's A100 clusters that are resigning

31:07

for another couple years. So the useful

31:08

life is clearly not 5 years. It's maybe

31:10

even seven or eight years. Um arguably

31:12

we we don't know yet. We'll see. We'll

31:14

see when Hopper gets there, but it it's

31:16

clearly not 5 years. So the useful life

31:17

is extending and the prices are going up

31:19

on that renewal. So in effect the gross

31:22

margin was not 35% on a cluster, it's

31:25

beyond that. Um so margins are expanding

31:27

in the in the cloud layer. Margins are

31:30

um extremely healthy on the hardware

31:33

layer with you know Nvidia still

31:34

charging 75 or whatever percent gross

31:36

margin as we move down the stack. Memory

31:38

obviously margins have skyrocketed

31:40

there. Places like optics and logic

31:44

there are large prepayments um and

31:46

margins are growing slowly um more so

31:49

the companies that are making chips like

31:50

Nvidia are paying huge prepayments. So

31:53

in effect the cast of capital or timing

31:55

of cash flow return on invested capital

31:57

is going up even if the gross margin

31:58

isn't. And you see this across the whole

32:00

supply chain. You see ASML is completely

32:02

sold out and they need Carl Zeiss to

32:04

expand faster. Everywhere along the

32:06

chain

32:07

everyone's either sold out and margins

32:09

are going up or they're getting

32:10

prepayments increases the return on

32:12

invested capital because the invested

32:13

capital is lower. And so this is like a

32:15

consistent trend across any part. It's

32:17

it's even like you know a PCB to make a

32:19

PCB requires copper foil and that copper

32:22

foil is sold out and people are making

32:23

prepayments for it. It's like anything

32:25

and everything that like has a pulse and

32:28

is like sold out. People are like

32:29

jumping to get more incremental supply

32:31

and fighting over the supply for the

32:33

years after.

32:34

>> As your business scales up, everything

32:35

gets more complex, especially your

32:37

compliance and security needs. With so

32:39

many tools offering band-aids and

32:40

patches, it's unfortunately far too easy

32:42

for something to slip through the

32:43

cracks. Fortunately, Vanta is a powerful

32:45

tool designed to simplify and automate

32:47

your security work and deliver a single

32:49

source of truth for compliance and risk.

32:51

There's a reason that Ramp, Cursor, and

32:53

Snowflake all use Vanta. It frees them

32:55

to focus on building amazing

32:56

differentiated products, knowing that

32:58

compliance and security are under

32:59

control. Learn more at vanta.com/invest.

33:03

I know firsthand how complex the tech

33:05

stack is for asset management firms. And

33:07

seemingly every new tool and data source

33:09

makes the problem even worse. Adding

33:11

more complexity, more headcount, and

33:12

more risk. Ridgeline offers a better way

33:15

forward. One unified platform that

33:16

automates away that that automates away

33:18

that complexity across portfolio

33:20

accounting, reconciliation, reporting,

33:22

trading, compliance, and more. All at

33:24

scale. Ridgeline is revolutionizing

33:26

investment management, helping ambitious

33:28

firms scale faster, operate smarter, and

33:30

stay ahead of the curve. See what

33:31

Ridgeline can unlock for your firm.

33:33

Schedule a demo at ridgeline.ai.

33:36

What do you think are the most important

33:37

bottlenecks? Like typically in economic

33:39

history when there's this kind of

33:41

demand, supply reorients and rises very

33:44

very quickly to meet the demand. It

33:47

seems like it's almost impossible for

33:48

supply right now in this moment to keep

33:50

up. You know, famous last words, every

33:51

every shortage is followed by a glut

33:53

historically. But what are the most

33:55

interesting bottlenecks to you on across

33:57

the supply side?

33:58

>> Supply chains are usually very fast to

34:00

react. Um, one unique thing is that our

34:03

supply chains now are more complex than

34:05

ever. and the things we're building are

34:06

more complex than ever and therefore the

34:07

lead times are longer. Um, and it's not

34:10

like we haven't seen 18-monthl long lead

34:12

times in other industries. It's just

34:15

building incremental supply didn't take

34:17

years. Um, and this is the case with

34:19

memory, right? Memory can only grow

34:22

capacity, you know, low double digit

34:24

percentages a year, right? 20s 30% a

34:27

year. Um, even less for NAND, a little

34:29

bit higher for DRM. Even though the

34:30

demand signal was very strong at the end

34:31

of 2025, the memory companies

34:33

immediately sort of started reacting.

34:35

None of that incremental capacity really

34:37

gets here until the second that they've

34:38

decided to do in addition to the typical

34:40

20 to 30%. You know, they can stretch a

34:43

little bit, but really the true

34:44

incremental supply doesn't come till 28,

34:46

which is a very unique thing. Even if

34:48

they wanted to build as fast as

34:49

possible, it doesn't come till 28 uh

34:51

early late 27 at best. And so the result

34:54

is memory prices have, you know, gone

34:57

through the roof. And guess what?

34:58

they're going to double and triple

34:59

again. Um, at least on DRAM especially,

35:02

people are like, "Oh, the memory storage

35:03

is overplayed. Everyone gets it." And

35:04

it's like, "No, no, no. You don't get

35:05

it." DM will double or triple from here

35:08

still because that's that's how much

35:11

capacity is required and they have to

35:13

steal capacity from somewhere else. And

35:15

the only way to steal capacity from

35:16

somewhere else in a in a capitalist

35:18

economy is demand destruction via higher

35:20

pricing. We're not like rationing stuff

35:21

here. And so ultimately, that's what's

35:23

going to happen. And so margins continue

35:24

to go up. Um, I think Logic also has

35:28

humongous uh capacity problems. TSMC

35:30

just had their earnings. Uh, they keep

35:32

upping capex. Ultimately, you know, it

35:34

takes them quite some time to build

35:35

fabs. Um, they're trying to do

35:36

everything they can to squeeze every

35:38

little output out of every fab that they

35:40

have. But ultimately, they're not

35:42

raising prices fast because they're good

35:43

people. It seems like, um, you know,

35:45

singledigit price increases instead of,

35:47

you know, tripledigit price increases

35:49

like the memory guys have had. And so

35:50

you ultimately have like this like

35:52

market where yeah TSMC is a great

35:53

company but are they are they actually

35:55

going to extract all the value? I

35:56

mentioned things like copper foil, glass

35:58

fibers for PCBs, lasers. These are

36:01

things that are like well understood and

36:02

niche supply chains but they're very

36:04

very tight. Um and ultimately upstream

36:07

the semiconductor wafer fabrication

36:09

equipment supply chain is one that like

36:10

I still think is it's gone up a lot but

36:12

it's still very underappreciated. TSMC

36:15

capex this year they say 56. Uh we've

36:17

had 57.4 4 billion since January. Um,

36:20

and we may up it slightly more just

36:22

because we see some some ways that they

36:24

can get incremental capex. But what

36:26

people aren't focusing on is what does

36:27

that mean next year and what does that

36:28

mean the year after? And it turns out 3

36:31

years from now TSMC is going to spend

36:32

hundred billion on capex. U maybe two

36:35

years from now, right? Maybe 28.

36:36

Sincerely, they may spend $und00 billion

36:38

on capex in 2028. And people like just

36:41

can't fathom that. But what does that

36:42

mean for their downstream supply chains?

36:44

um you know companies like Lamb Research

36:46

or Applied Materials or ASML or their

36:48

further downstream supply chains like

36:50

MKSI and and all these other companies

36:52

the tail whip just gets whipped harder

36:54

and harder and harder and ultimately

36:57

that's a shortage if you know TSMC wants

36:58

to spend $100 billion in 2028 which is a

37:01

real possibility I think people would

37:03

think that's insane but that's a real

37:04

real possibility

37:05

>> what about other parts of the chip

37:06

ecosystem where GPUs have been

37:08

completely dominant what about like CPUs

37:10

or AS6 or things that start to pop out

37:12

as both opportunities and bottlenecks

37:14

beyond just like Nvidia's GPU dominance.

37:17

>> Yeah, I mean AS6 are obviously taking

37:18

off, but I'll sort of pivot away from AI

37:20

chips to talk about these other things.

37:22

There's a project we did on FPGAAS and

37:24

there turns out there's 120 FPGAs per

37:26

per um next generation rack um AI rack

37:30

and then like what about all the FPGA

37:32

names CPU wise all these reinforcement

37:34

learning environments plus all the slop

37:36

code you and I are generating that is

37:37

now running on some you know Versell

37:39

instance or whatever it is um or some

37:42

AWS instant or some bucket that we've

37:43

spun up all of that requires CPU and so

37:46

CPUs are completely sold out and demand

37:48

is skyrocketing there

37:49

>> help people understand the role that CPU

37:51

plays and everything.

37:52

>> Yeah. So there's two there's two main

37:53

reasons why you need tons of CPUs. One

37:55

is when you're doing reinforcement

37:56

learning um the CPU is very critical to

38:00

that. So so before you would throw all

38:02

the internet's data into the model,

38:03

train it, spit it spits and it it spits

38:05

some stuff out. Now you train all the

38:07

world's internet you put all the

38:08

internet data into the model. Then you

38:10

put it in this environment. This

38:11

environment is like hey model try this

38:13

out and it tries stuff out. It tries a

38:15

bunch of different things and in the end

38:17

there is an environment which scores

38:20

whether or not what it tried out is

38:22

successful and it grades it. And these

38:23

environments can be anything. It can be,

38:25

hey, check if the text was outputed in

38:27

the right way, structured outputs. It

38:29

can be very simple stuff. It can be very

38:30

complex stuff. Um, and people are

38:32

starting to get into very complex

38:33

things, right? Like, hey, I want you to

38:36

open this file, change it, edit it,

38:38

update it, submit it to this website. I

38:40

want you to open up this physics

38:41

simulation from Seammens and edit this

38:43

CAD model. So the environments can get

38:45

more and more complex and those

38:46

environments run on CPUs. They don't run

38:48

on GPUs. They don't run on AS6. The AS6

38:50

run the model that takes the input data

38:53

from the environment, runs it through

38:55

the model. The model creates outputs of

38:57

various different trajectories, right?

38:59

Ways that it think it could solve it um

39:01

in different instances. those

39:04

trajectories are graded slashscored and

39:06

the ones that are successful you train

39:07

on and you update and you reiterate and

39:10

you iterate iterate iterate and so CPUs

39:12

are very useful for that one and then

39:14

once you have these great models and

39:16

you're deploying them those models are

39:17

generating code they're generating

39:19

useful output that useful output it

39:21

doesn't go from a GPU straight to the

39:23

human brain um it goes from a GPU or an

39:26

ASIC through to you know a deployed app

39:29

that you're deploying somewhere that

39:30

actually just runs on CPUs so that's

39:32

another area where there's a lot of

39:33

demand and and things are sold out um in

39:36

a large large way.

39:37

>> As you continue to assess and try to be

39:39

the world's best informed person on both

39:41

the trajectory of supply and demand,

39:43

what are things that you wish you knew

39:45

to make that understanding that you

39:47

don't know?

39:47

>> I think the hardest area for us um and

39:52

for everyone is understanding

39:54

tokconomics, economics of tokens. Um, I

39:56

think we have a really tremendously like

39:58

good insight into how much it costs to

40:00

run infrastructure, what the cost of

40:02

tokens are, what the cost of models are,

40:04

what the margins of these labs are, but

40:06

the usage and adoption is what's really

40:08

difficult to model, you know,

40:10

continuously, right? We we have these

40:12

like we had like crazy in January, we

40:13

had crazy estimates for February,

40:14

anthropic smashed them. How do we

40:16

calibrate this model? What are the data

40:17

sources for this? February, uh, we had

40:20

crazy assumptions for March and then

40:21

they smashed them. And everyone sees the

40:23

number of 10 billion and they're like

40:24

what the how do they add 10 billion in

40:26

revenue? Who is using all these tokens?

40:28

Why are they using them? What are they

40:29

building with them? And then more

40:30

importantly with what they're building

40:32

with these tokens, how is that actually

40:34

diffusing into the economy? And what

40:35

value is that generating? Because it's

40:37

not really something that you can

40:38

capture in any any GDP statistic, right?

40:41

all of the value of the tokens that I

40:43

use get transformed into better

40:44

information which I then sell at a

40:47

discount to what people used to sell

40:48

information for relatively because um

40:51

and therefore that information is now

40:53

making its way throughout the economy

40:55

and and people are making better

40:56

investment decisions or better

40:59

competitive decisions if they're a semi

41:00

company or data center company or

41:02

hyperscaler and now how how much what

41:04

what is the value of this and what has

41:05

that what has that done to the economy

41:07

it's clearly by every subjective metric

41:10

amazing Amazing. But where is the

41:12

phantom GDP? What is the phantom GDP?

41:15

How do we track the real economic?

41:17

Because because the GDP metrics are not,

41:20

you know, accurate if you were to say

41:22

what is the GDP that Dylan Patel is

41:23

making. It's tiny compared to what the

41:25

value that I think is being created. And

41:27

so ultimately, what is the value being

41:29

created by these tokens? Not on a basis

41:31

of, you know, just simple, you know,

41:33

what is the knock-on effect, right? What

41:35

is the knock-on effect of all the things

41:36

that these things are doing? I think

41:37

that's the real uh question and

41:39

challenge uh that's hard to measure. I

41:42

think we've got a tremendous, you know,

41:44

reading on the supply side of things. I

41:46

think we've got a tremendous reading on

41:47

even a lot of the demand side signals,

41:49

but it's it's what is the value these

41:50

tokens are generating. That's hard to

41:52

quantify and measure.

41:54

>> I hope we get a chance to do this like

41:55

every 3 months because this changes so

41:56

quickly. What do you think is going to

41:58

happen next? Like when I when I come

42:00

back 3 months from now and we're in San

42:01

Francisco together again, what do you

42:02

expect?

42:03

>> Large scale protests.

42:05

>> Really?

42:05

>> Yeah. Yeah, I think there will be a

42:06

large scale protest against anthropic

42:09

>> and open AI.

42:09

>> Expand on that a little more.

42:10

>> Um, people hate AI. Um, AI is less

42:14

popular than ICE, less popular than

42:17

politicians. Confused how Pew surveyed

42:19

this, but apparently AI is less popular

42:21

than politicians. You know, with

42:22

Enthropic adding so much revenue, that's

42:25

going to start causing business changes

42:26

downstream. People are going to get more

42:28

and more scared of AI. they'll start

42:30

blaming more and more of their own

42:31

problems and things that are, you know,

42:34

global, you know, have been deep-seated

42:36

problems for a long time. Those will

42:37

bubble up and be blamed on AI. Um,

42:41

probably some politician or some social

42:43

media people will start to be able to

42:44

take uh influencer will be able to start

42:46

taking and weaponizing AI against

42:49

people. You look at the comments of news

42:51

articles where Sam Alman had a Molotov

42:53

cocktail thrown in his house twice in

42:55

like two weeks. They're like, people are

42:57

cheering it on. Uh, and this is just the

43:00

beginning. So, I think I think we'll see

43:01

large scale protests against AI in three

43:03

months.

43:04

>> What is the counterwe to that? Like, how

43:06

should the AI industry head that off?

43:08

>> First of all, Sam Alman and Dario have

43:10

to stop getting on interviews. They're

43:11

so uncarismatic.

43:13

I don't know what they're doing. Every

43:15

interview they do is like, wow, normal

43:17

people are going to hate you even more.

43:19

Like, Sam being on Tucker Carlson

43:21

probably made all Republicans hate

43:22

OpenAI. And same with Dario. They just

43:24

have no charisma. I think that's first.

43:26

Two, they need to start showing

43:28

uplifting things that can be done with

43:30

AI. Um, three, they need to stop talking

43:32

about how the capabilities are going to

43:34

change the whole world constantly

43:35

because then people are going to get

43:36

fear of that capability because they

43:38

have no connection.

43:39

>> They don't know how to use it. Yeah.

43:40

>> There's no connection to it either. It's

43:41

like the average person doesn't know an

43:43

anthropic employee. The average person

43:45

doesn't know an open eye employee.

43:46

average person doesn't know who these

43:48

people are, what their goals are, and

43:49

they just view them as like this like

43:51

sneaky cobball of like 5,000 people at

43:54

this company that are going to change

43:54

the world and automate all the jobs and

43:56

and destroy society. That's what they

43:58

view it as. And and as people who are

44:00

funding the building of all these data

44:02

centers and and power plants that are

44:04

going to pollute the world, right? They

44:05

don't quite understand what's happening.

44:06

You know, they have to stop talking

44:07

about the future thing that's going to

44:09

happen and only talk about present, how

44:10

uplifting AI is. I think it's a huge

44:13

reorg and rebranding that needs to be

44:14

done.

44:15

>> I love doing this with you. Thanks for

44:16

your time.

44:16

>> Awesome. Thanks.

44:22

>> Your finance team isn't losing money on

44:24

big mistakes. It's leaking through a

44:25

thousand tiny decisions nobody's

44:27

watching. Ramp puts guard rails on

44:29

spending before it happens. Real-time

44:31

limits, automatic rules, zero

44:32

firefighting. Try it at ramp.com/invest.

44:37

As your business grows, Vant scales with

44:39

you, automating compliance, and giving

44:40

you a single source of truth for

44:42

security and risk. Learn more at

44:44

vanta.com/invest.

44:47

Ridgeline is redefining asset management

44:49

technology as a true partner, not just a

44:51

software vendor. They've helped firms 5x

44:53

and scale, enabling faster growth,

44:55

smarter operations, and a competitive

44:57

edge. Visit ridgelineapps.com

44:59

to see what they can unlock for your

45:01

firm.

45:02

Every investment firm is unique, and

45:04

generic AI doesn't understand your

45:05

process. Rogo does. It's an AI platform

45:08

built specifically for Wall Street,

45:09

connected to your data, understanding

45:11

your process, and producing real

45:12

outputs. Check them out at

45:14

rogo.ai/invest.

45:17

The best AI and software companies from

45:19

OpenAI to Cursor to Perplexity use work

45:21

OS to become enterprise ready overnight,

45:23

not in months. Visit works.com to skip

45:26

the unglamorous infrastructure work and

45:28

focus on your product.

Interactive Summary

The video features a deep dive into the rapid adoption of AI tokens within professional workflows, particularly focusing on the shift from 'ideas being cheap' to 'execution becoming easy'. The speaker discusses how AI-driven tools are dramatically increasing productivity in sectors ranging from semiconductor reverse engineering to economic research. The conversation also explores the explosive growth in token demand, the structural supply chain bottlenecks for compute infrastructure, and the potential societal backlash against the rapid advancement of AI technologies.

Suggested questions

4 ready-made prompts