Dario Amodei — “We are near the end of the exponential”
We talked three years ago. In your view, what has been the biggest update over the last three years?
What has been the biggest difference between what it felt like then versus now?
Broadly speaking, the exponential of the underlying technology has gone about as
I expected it to go. There's plus or minus
a year or two here and there. I don't know that I would've
predicted the specific direction of code. But when I look at the exponential,
it is roughly what I expected in terms of the march of the models from smart high
school student to smart college student to beginning to do PhD and professional stuff,
and in the case of code reaching beyond that. The frontier is a little bit uneven, but it's
roughly what I expected. What has been the most surprising
thing is the lack of public recognition of how close we are to the end of the exponential.
To me, it is absolutely wild that you have people — within the bubble and outside the
bubble — talking about the same tired, old hot-button political issues, when we are
near the end of the exponential. I want to understand what that
exponential looks like right now. The first question I asked you when
we recorded three years ago was, "what’s up with scaling and why does it work?"
I have a similar question now, but it feels more complicated.
At least from the public's point of view, three years ago there were well-known public trends
across many orders of magnitude of compute where you could see how the loss improves.
Now we have RL scaling and there's no publicly known scaling law for it.
It's not even clear what the story is. Is this supposed to be teaching the model skills?
Is it supposed to be teaching meta-learning? What is the scaling hypothesis at this point?
I actually have the same hypothesis I had even all the way back in 2017.
I think I talked about it last time, but I wrote a doc called "The Big Blob of Compute Hypothesis".
It wasn't about the scaling of language models in particular.
When I wrote it GPT-1 had just come out. That was one among many things.
Back in those days there was robotics. People tried to work on reasoning as
a separate thing from language models, and there was scaling of the kind of RL
that happened in AlphaGo and in Dota at OpenAI. People remember StarCraft at DeepMind, AlphaStar.
It was written as a more general document. Rich Sutton put out "The Bitter
Lesson" a couple years later. The hypothesis is basically the same.
What it says is that all the cleverness, all the techniques, all the "we need a new method to do
something", that doesn't matter very much. There are only a few things that matter.
I think I listed seven of them. One is how much raw compute you have.
The second is the quantity of data. The third is the quality and distribution of data.
It needs to be a broad distribution. The fourth is how long you train for.
The fifth is that you need an objective function that can scale to the moon.
The pre-training objective function is one such objective function.
Another is the RL objective function that says you have a goal,
you're going to go out and reach the goal. Within that, there's objective rewards like
you see in math and coding, and there's more subjective rewards like you see in
RLHF or higher-order versions of that. Then the sixth and seventh were things
around normalization or conditioning, just getting the numerical stability
so that the big blob of compute flows in this laminar way instead of running into problems.
That was the hypothesis, and it's a hypothesis I still hold.
I don't think I've seen very much that is not in line with it.
The pre-training scaling laws were one example of what we see there. Those have continued going.
Now it's been widely reported, we feel good about pre-training.
It’s continuing to give us gains. What has changed is that now we're
also seeing the same thing for RL. We're seeing a pre-training phase
and then an RL phase on top of that. With RL, it’s actually just the same.
Even other companies have published things in some of their releases that say, "We train the
model on math contests — AIME or other things — and how well the model does is log-linear in
how long we've trained it." We see that as well,
and it's not just math contests. It's a wide variety of RL tasks.
We're seeing the same scaling in RL that we saw for pre-training.
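A minimal way to write down the two relationships being described here, as a sketch with placeholder constants rather than any lab's published fit:

```latex
% Pre-training: loss falls roughly as a power law in training compute C.
L(C) \approx L_{\infty} + A \, C^{-\alpha}

% RL: benchmark score grows roughly log-linearly in RL training compute,
% up to the ceiling of the benchmark.
\mathrm{score}(C_{\mathrm{RL}}) \approx a + b \, \log C_{\mathrm{RL}}
```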
You mentioned Rich Sutton and "The Bitter Lesson". I interviewed him last year,
and he's actually very non-LLM-pilled. I don’t know if this is his perspective,
but one way to paraphrase his objection is: something that possesses the true core of human learning would not require all these billions of dollars of data and compute and these bespoke environments to learn how to use Excel, how to use PowerPoint, how to navigate a web browser.
The fact that we have to build in these skills using these RL environments hints that we are
actually lacking a core human learning algorithm. So we're scaling the wrong thing. That does raise
the question. Why are we doing all this RL scaling if we think there's something that's going to be
human-like in its ability to learn on the fly? I think this puts together several things that
should be thought of differently. There is a genuine puzzle here,
but it may not matter. In fact, I would guess it probably
doesn't matter. There is an interesting thing. Let me take the RL out of it for a second, because I
actually think it's a red herring to say that RL is any different from pre-training in this matter.
If we look at pre-training scaling, it was very interesting
back in 2017 when Alec Radford was doing GPT-1. The models before GPT-1 were trained on datasets
that didn't represent a wide distribution of text. You had very standard language
modeling benchmarks. GPT-1 itself was trained on
a bunch of fanfiction, I think actually. It was literary text, which is a very
small fraction of the text you can get. In those days it was like a billion words
or something, so small datasets representing a pretty narrow distribution of what you can
see in the world. It didn't generalize well. If you did better on some fanfiction corpus,
it wouldn't generalize that well to other tasks. We had all these measures of how well it did at predicting all these other kinds of texts.
It was only when you trained over all the tasks on the internet — when you did a general internet
scrape from something like Common Crawl or scraping links from Reddit, which is what we did for
GPT-2 — that you started to get generalization. I think we're seeing the same thing on RL.
We're starting first with simple RL tasks like training on math competitions, then moving to
broader training that involves things like code. Now we're moving to many other tasks.
I think then we're going to increasingly get generalization.
So that kind of takes out the RL vs. pre-training side of it.
But there is a puzzle either way, which is that in pre-training we use trillions of tokens.
Humans don't see trillions of words. So there is an actual sample
efficiency difference here. There is actually something different here.
The models start from scratch and they need much more training.
But we also see that once they're trained, if we give them a long context length of
a million — the only thing blocking long context is inference — they're very good at
learning and adapting within that context. So I don’t know the full answer to this.
I think there's something going on where pre-training is not like
the process of humans learning, but it's somewhere between the process of humans
learning and the process of human evolution. We get many of our priors from evolution.
Our brain isn't just a blank slate. Whole books have been written about this.
The language models are much more like blank slates.
They literally start as random weights, whereas the human brain starts with all these regions
connected to all these inputs and outputs. Maybe we should think of pre-training — and
for that matter, RL as well — as something that exists in the middle space between
human evolution and human on-the-spot learning. And we should think of the in-context learning
that the models do as something between long-term human learning and short-term human learning.
So there's this hierarchy. There’s evolution, there's long-term learning, there's short-term
learning, and there's just human reaction. The LLM phases exist along this spectrum,
but not necessarily at exactly the same points. There's no analog to some of the human modes of learning; the LLMs are falling in between the points. Does that make sense?
Yes, although some things are still a bit confusing.
For example, if the analogy is that this is like evolution so it's fine that it's
not sample efficient, then if we're going to get a super sample-efficient
agent from in-context learning, why are we bothering to build all these RL environments?
There are companies whose work seems to be teaching models how to use this API,
how to use Slack, how to use whatever. It's confusing to me why there's so much emphasis
on that if the kind of agent that can just learn on the fly is emerging or has already emerged.
I can't speak for the emphasis of anyone else. I can only talk about how we think about it.
The goal is not to teach the model every possible skill within RL,
just as we don't do that within pre-training. Within pre-training, we're not trying to expose
the model to every possible way that words could be put together.
Rather, the model trains on a lot of things and then reaches generalization across pre-training.
That was the transition from GPT-1 to GPT-2 that I saw up close. The model reaches a point. I had
these moments where I was like, "Oh yeah, you just give the model a list of numbers — this is
the cost of the house, this is the square feet of the house — and the model completes the pattern
and does linear regression." Not great, but it does it,
and it's never seen that exact thing before. So to the extent that we are building these
RL environments, the goal is very similar to what was done five or ten years ago with pre-training.
We're trying to get a whole bunch of data, not because we want to cover a specific document or a
specific skill, but because we want to generalize. I think the framework you're laying down obviously
makes sense. We're making progress toward AGI. Nobody at this point disagrees we're going to
achieve AGI this century. The crux is you say we're
hitting the end of the exponential. Somebody else looks at this and says,
"We've been making progress since 2012, and by 2035 we'll have a human-like agent."
Obviously we’re seeing in these models the kinds of things that evolution did,
or that learning within a human lifetime does. I want to understand what you’re seeing
that makes you think it's one year away and not ten years away.
There are two claims you could make here, one stronger and one weaker.
Starting with the weaker claim, when I first saw the scaling back in 2019,
I wasn’t sure. This was a 50/50 thing. I thought I saw something. My claim was that this
was much more likely than anyone thinks. Maybe there's a 50% chance this happens.
On the basic hypothesis of, as you put it, within ten years we'll get to what I call a "country of
geniuses in a data center", I'm at 90% on that. It's hard to go much higher than 90%
because the world is so unpredictable. Maybe the irreducible uncertainty puts us at 95%,
where you get to things like multiple companies having internal turmoil, Taiwan gets invaded,
all the fabs get blown up by missiles. Now you've jinxed us, Dario.
You could construct a 5% world where things get delayed for ten years.
There's another 5% which is that I'm very confident on tasks that can be verified.
With coding, except for that irreducible uncertainty,
I think we'll be there in one or two years. There's no way we will not be there in ten years
in terms of being able to do end-to-end coding. My one little bit of fundamental uncertainty,
even on long timescales, is about tasks that aren't verifiable: planning a mission to Mars;
doing some fundamental scientific discovery like CRISPR; writing a novel.
It’s hard to verify those tasks. I am almost certain we have a
reliable path to get there, but if there's a little bit of uncertainty it's there.
On the ten-year timeline I'm at 90%, which is about as certain as you can be.
I think it's crazy to say that this won't happen by 2035.
In some sane world, it would be outside the mainstream.
But the emphasis on verification hints at a lack of belief that these models generalize.
If you think about humans, we're both good at things for which we get verifiable reward
and things for which we don't. No, this is why I’m almost sure.
We already see substantial generalization from things that verify to things that
don't. We're already seeing that. But it seems like you were emphasizing this as a spectrum that will split apart the domains in which we see more progress.
That doesn't seem like how humans get better. The world in which we don't get there is the world
in which we do all the verifiable things. Many of them generalize,
but we don't fully get there. We don’t fully color in the other side
of the box. It's not a binary thing. Even if generalization is weak and you can only do
verifiable domains, it's not clear to me you could automate software engineering in such a world.
You are "a software engineer" in some sense, but part of being a software engineer for you involves
writing long memos about your grand vision. I don’t think that’s part of the job of SWE.
That's part of the job of the company, not SWE specifically.
But SWE does involve design documents and other things like that.
The models are already pretty good at writing comments.
Again, I’m making much weaker claims here than I believe, to distinguish between two things.
We're already almost there for software engineering.
By what metric? There's one metric which is how many lines of code are written by AI.
If you consider other productivity improvements in the history of software engineering,
compilers write all the lines of software. There's a difference between how many lines
are written and how big the productivity improvement is. "We’re almost there" meaning…
How big is the productivity improvement, not just how many lines are written by AI?
I actually agree with you on this. I've made a series of predictions on
code and software engineering. I think people have repeatedly misunderstood them.
Let me lay out the spectrum. About eight or nine months ago,
I said the AI model will be writing 90% of the lines of code in three to six months.
That happened, at least at some places. It happened at Anthropic, happened with
many people downstream using our models. But that's actually a very weak criterion.
People thought I was saying that we won't need 90% of the software engineers. Those things are worlds
apart. The spectrum is: 90% of code is written by the model, 100% of code is written by the model.
That's a big difference in productivity. 90% of the end-to-end SWE tasks — including
things like compiling, setting up clusters and environments, testing features,
writing memos — are done by the models. 100% of today's SWE tasks are done by the models.
Even when that happens, it doesn't mean software engineers are out of a job.
There are new higher-level things they can do, where they can manage.
Then further down the spectrum, there's 90% less demand for SWEs, which I think
will happen but this is a spectrum. I wrote about it in "The Adolescence
of Technology" where I went through this kind of spectrum with farming.
I actually totally agree with you on that.
These are very different benchmarks from each other,
but we're proceeding through them super fast. Part of your vision is that going from 90 to 100
is going to happen fast, and that it leads to huge productivity improvements.
But what I notice is that even in greenfield projects, where people start with Claude Code or something, people report starting a lot of projects… Do we see in the world out there
a renaissance of software, all these new features that wouldn't exist otherwise?
At least so far, it doesn't seem like we see that. So that does make me wonder.
Even if I never had to intervene with Claude Code, the world is complicated.
Jobs are complicated. Closing the loop on self-contained systems, whether it's just writing software or something, how much broader would the gains be just from that?
Maybe that should dilute our estimation of the "country of geniuses".
I simultaneously agree with you that it's a reason why these things don't happen instantly,
but at the same time, I think the effect is gonna be very fast.
You could have these two poles. One is that AI is not going to make
progress. It's slow. It's going to take forever to diffuse within the economy.
Economic diffusion has become one of these buzzwords that's a reason why
we're not going to make AI progress, or why AI progress doesn't matter.
The other axis is that we'll get recursive self-improvement, the whole thing.
Can't you just draw an exponential line on the curve?
We're going to have Dyson spheres around the sun so many nanoseconds after we get recursive self-improvement.
I'm completely caricaturing the view here, but there are these two extremes.
But what we've seen from the beginning, at least if you look within Anthropic, there's this bizarre
10x per year growth in revenue that we've seen. So in 2023, it was zero to $100 million.
In 2024, it was $100 million to $1 billion. In 2025, it was $1 billion to $9-10 billion.
You guys should have just bought a billion dollars of your own products so you could just…
And the first month of this year, that exponential is...
You would think it would slow down, but we added another few billion to revenue in January.
Obviously that curve can't go on forever. The GDP is only so large.
I would even guess that it bends somewhat this year, but that is a fast curve. That's a really
fast curve. I would bet it stays pretty fast even as the scale goes to the entire economy.
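To make the compounding concrete, here is a toy projection; only the 2023-2025 figures come from the conversation, and the growth rates are illustrative:

```python
# Toy projection of sustained revenue growth from ~$10B annualized entering 2026.
start_revenue = 10e9

for rate in (10, 5, 3):
    revenue = start_revenue
    print(f"\nAt {rate}x per year:")
    for year in range(2026, 2031):
        revenue *= rate
        print(f"  end of {year}: ~${revenue / 1e9:,.0f}B")
# At 10x a year the curve passes US GDP (roughly $28T) before 2030,
# which is why it has to bend at some point.
```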
So I think we should be thinking about this middle world where things are extremely fast, but not
instant, where they take time because of economic diffusion, because of the need to close the loop.
Because it's fiddly: "I have to do change management within my enterprise… I set this up,
but I have to change the security permissions on this in order to make it actually work…
I had this old piece of software that checks the model before it's compiled
and released and I have to rewrite it. Yes, the model can do that, but I have
to tell the model to do that. It has to take time to do that."
So I think everything we've seen so far is compatible with the idea that there's one fast
exponential that's the capability of the model. Then there's another fast exponential
that's downstream of that, which is the diffusion of the model into the economy.
Not instant, not slow, much faster than any previous technology, but it has its limits.
When I look inside Anthropic, when I look at our customers: fast adoption, but not infinitely fast.
Can I try a hot take on you? Yeah.
I feel like diffusion is cope that people say. When the model isn't able to do something,
they're like, "oh, but it's a diffusion issue." But then you should use the comparison to humans.
You would think that the inherent advantages that AIs have would make diffusion a much easier
problem for new AIs getting onboarded than new humans getting onboarded.
An AI can read your entire Slack and your drive in minutes.
They can share all the knowledge that the other copies of the same instance have.
You don't have this adverse selection problem when you're hiring AI, so you
can just hire copies of a vetted AI model. Hiring a human is so much more of a hassle.
People hire humans all the time. We pay humans upwards of $50 trillion
in wages because they're useful, even though in principle it would be much easier to integrate
AIs into the economy than it is to hire humans. The diffusion doesn't really explain it.
I think diffusion is very real and doesn't exclusively have
to do with limitations on the AI models. Again, there are people who use diffusion
as kind of a buzzword to say this isn't a big deal. I'm not talking about that. I'm
not talking about how AI will diffuse at the speed of previous technologies.
I think AI will diffuse much faster than previous technologies have, but not infinitely fast.
I'll just give an example of this. There's Claude Code. Claude Code is extremely easy to set up.
If you're a developer, you can just start using Claude Code.
There is no reason why a developer at a large enterprise should not be adopting
Claude Code as quickly as an individual developer or developer at a startup.
We do everything we can to promote it. We sell Claude Code to enterprises.
Big enterprises, big financial companies, big pharmaceutical companies, all of them are adopting
Claude Code much faster than enterprises typically adopt new technology. But again,
it takes time. Any given feature or any given product, like Claude Code or Cowork, will get
adopted by the individual developers who are on Twitter all the time, by the Series A startups,
many months faster than they will get adopted by a large enterprise that does food sales.
There are just a number of factors. You have to go through legal,
you have to provision it for everyone. It has to pass security and compliance.
The leaders of the company who are further away from the AI revolution are forward-looking,
but they have to say, "Oh, it makes sense for us to spend 50 million.
This is what this Claude Code thing is. This is why it helps our company.
This is why it makes us more productive." Then they have to explain
to the people two levels below. They have to say, "Okay, we have 3,000 developers.
Here's how we're going to roll it out to our developers."
We have conversations like this every day. We are doing everything we can to make
Anthropic's revenue grow 20 or 30x a year instead of 10x a year.
Again, many enterprises are just saying, "This is so productive.
We're going to take shortcuts in our usual procurement process."
They're moving much faster than when we tried to sell them just
the ordinary API, which many of them use. Claude Code is a more compelling product,
but it's not an infinitely compelling product. I don't think even AGI or powerful AI or
"country of geniuses in a data center" will be an infinitely compelling product.
It will be a product compelling enough maybe to get 3-5x, or 10x, a year of growth, even when
you're in the hundreds of billions of dollars, which is extremely hard to do and has never been
done in history before, but not infinitely fast. I buy that it would be a slight slowdown.
Maybe this is not your claim, but sometimes people talk about this like,
"Oh, the capabilities are there, but because of diffusion... otherwise we're basically at AGI".
I don't believe we're basically at AGI. I think if you had the "country
of geniuses in a data center"... If we had the "country of geniuses
in a data center", we would know it. We would know it if you had the
"country of geniuses in a data center". Everyone in this room would know it.
Everyone in Washington would know it. People in rural parts might not know it,
but we would know it. We don't have that now. That is very clear.
Coming back to concrete prediction… Because there are so many different things to disambiguate,
it can be easy to talk past each other when we're talking about capabilities.
For example, when I interviewed you three years ago, I asked you a prediction about what
we should expect three years from now. You were right. You said, "We should expect systems which,
if you talk to them for the course of an hour, it's hard to tell them apart from
a generally well-educated human." I think you were right about that.
I think spiritually I feel unsatisfied because my internal expectation was that such a system could
automate large parts of white-collar work. So it might be more productive to talk about
the actual end capabilities you want from such a system.
I will basically tell you where I think we are. Let me ask a very specific question so that
we can figure out exactly what kinds of capabilities we should think about soon.
Maybe I'll ask about it in the context of a job I understand well, not because it's the most
relevant job, but just because I can evaluate the claims about it. Take video editors. I have
video editors. Part of their job involves learning about our audience's preferences,
learning about my preferences and tastes, and the different trade-offs we have.
They’re, over the course of many months, building up this understanding of context.
The skill and ability they have six months into the job: when should we expect an AI system that can pick up that skill on the job, on the fly?
I guess what you're talking about is that we're doing this interview for three hours.
Someone's going to come in, someone's going to edit it.
They're going to be like, "Oh, I don't know, Dario scratched his head and we could edit that out."
"Magnify that." "There was this long
discussion that is less interesting to people. There's another thing that's more interesting
to people, so let's make this edit." I think the "country of geniuses in
a data center" will be able to do that. The way it will be able to do that is it will
have general control of a computer screen. You'll be able to feed this in.
It'll be able to also use the computer screen to go on the web, look at all your previous
interviews, look at what people are saying on Twitter in response to your interviews,
talk to you, ask you questions, talk to your staff, look at the history of edits
that you did, and from that, do the job. I think that's dependent on several things.
I think this is one of the things that's actually blocking deployment:
getting to the point on computer use where the models are really masters at using the computer.
We've seen this climb in benchmarks, and benchmarks are always imperfect measures.
But I think when we first released computer use a year and a quarter ago, OSWorld was at maybe 15%.
I don't remember exactly, but we've climbed from that to 65-70%.
There may be harder measures as well, but I think computer use has to pass a point of reliability.
Can I just follow up on that before you move on to the next point?
For years, I've been trying to build different internal LLM tools for myself.
Often I have these text-in, text-out tasks, which should be dead center
in the repertoire of these models. Yet I still hire humans to do them.
If it's something like, "identify what the best clips would be in this transcript",
maybe the LLMs do a seven-out-of-ten job on them. But there's not this ongoing way I can engage
with them to help them get better at the job the way I could with a human employee.
That missing ability, even if you solve computer use, would still block
my ability to offload an actual job to them. This gets back to what we were talking about
before with learning on the job. It's very interesting. I think with the coding agents,
I don't think people would say that learning on the job is what is preventing the coding agents
from doing everything end to end. They keep getting better. We have engineers
at Anthropic who don't write any code. When I look at the productivity, to your
previous question, we have folks who say, "This GPU kernel, this chip, I used to write it myself.
I just have Claude do it." There's this enormous improvement in productivity.
When I look at Claude Code, familiarity with the codebase, or a feeling that the model hasn't worked at the company for a year, is not high up on the list of complaints I see.
I think what I'm saying is that we're kind of taking a different path.
Don't you think with coding that's because there
is an external scaffold of memory which exists instantiated in the codebase?
I don't know how many other jobs have that. Coding made fast progress precisely because
it has this unique advantage that other economic activity doesn't.
But when you say that, what you're implying is that by reading the codebase into the context,
I have everything that the human needed to learn on the job.
So that would be an example of—whether it's written or not, whether it's available or
not—a case where everything you needed to know you got from the context window.
What we think of as learning—"I started this job, it's going to take me six months to understand the
code base"—the model just did it in the context. I honestly don't know how to think about
this because there are people who qualitatively report what you're saying.
I'm sure you saw last year, there was a major study where they had experienced developers try
to close pull requests in repositories that they were familiar with. Those developers reported an
uplift. They reported that they felt more productive with the use of these models.
But in fact, if you look at their output and how much was actually merged back in,
there was a 20% downlift. They were less productive
as a result of using these models. So I'm trying to square the qualitative
feeling that people feel with these models versus, 1) in a macro level,
where is this renaissance of software? And then 2) when people do these independent
evaluations, why are we not seeing the productivity benefits we would expect?
Within Anthropic, this is just really unambiguous. We're under an incredible amount of commercial
pressure, and we make it even harder for ourselves because of all the safety stuff we do, which I think is more than other companies do. The pressure to survive economically
while also keeping our values is just incredible. We're trying to keep this 10x revenue curve going.
There is zero time for bullshit. There is zero time for feeling
like we're productive when we're not. These tools make us a lot more productive.
Why do you think we're concerned about competitors using the tools?
Because we think we're ahead of the competitors. We wouldn't be going through all this trouble if
this were secretly reducing our productivity. We see the end productivity every few
months in the form of model launches. There's no kidding yourself about this.
The models make you more productive. 1) People feeling like they're productive is
qualitatively predicted by studies like this. But 2) if I just look at the end output,
obviously you guys are making fast progress. But the idea was supposed to be that with
recursive self-improvement, you make a better AI, the AI helps you build a
better next AI, et cetera, et cetera. What I see instead—if I look at you,
OpenAI, DeepMind—is that people are just shifting around the podium every few months.
Maybe you think that stops because you've won or whatever.
But why are we not seeing the person with the best coding model have this lasting advantage, if in fact there are these enormous productivity gains from the latest coding model?
I think my model of the situation is that there's an advantage that's gradually growing.
I would say right now the coding models give maybe, I don't know,
a 15-20% total factor speed up. That's my view. Six months ago, it was maybe 5%.
So it didn't matter. 5% doesn't register. It's now just getting to the point where it's
one of several factors that kind of matters. That's going to keep speeding up.
I think six months ago, there were several companies that were at roughly the same
point because this wasn't a notable factor, but I think it's starting to speed up more and more.
I would also say there are multiple companies that write models that are used for code and we're not
perfectly good at preventing some of these other companies from using our models internally.
So I think everything we're seeing is consistent with this kind of snowball model.
Again, my theme in all of this is all of this is soft takeoff, soft, smooth exponentials,
although the exponentials are relatively steep. So we're seeing this snowball gather momentum
where it's like 10%, 20%, 25%, 40%. As you go, it's Amdahl's law: you have to get all the things that are preventing you from closing the loop out of the way.
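The Amdahl's law point in symbols, with purely illustrative numbers: if a fraction p of the overall work is sped up by a factor s, the total speedup is

```latex
\text{speedup} = \frac{1}{(1 - p) + p/s},
\qquad \text{e.g. } p = 0.5,\ s = 10 \;\Rightarrow\; \frac{1}{0.5 + 0.05} \approx 1.8\times
```

Even a huge acceleration of the automated part is capped by whatever has not yet been pulled into the loop.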
But this is one of the biggest priorities within Anthropic.
Stepping back, before in the stack we were talking about when do we get this on-the-job learning?
It seems like the point you were making on the coding thing is that we actually
don't need on-the-job learning. You can have tremendous productivity
improvements, you can have potentially trillions of dollars of revenue for AI companies, without
this basic human ability to learn on the job. Maybe that's not your claim, you should clarify.
But in most domains of economic activity, people say, "I hired somebody, they weren't that useful
for the first few months, and then over time they built up the context, understanding."
It's actually hard to define what we're talking about here.
But they got something and now they're a powerhouse and they're so valuable to us.
If AI doesn't develop this ability to learn on the fly, I'm a bit skeptical that we're going to see
huge changes to the world without that ability. I think two things here. There's the state
of the technology right now. Again, we have these two stages.
We have the pre-training and RL stage where you throw a bunch of data and tasks into
the models and then they generalize. So it's like learning, but it's like
learning from more data and not learning over one human or one model's lifetime.
So again, this is situated between evolution and human learning.
But once you learn all those skills, you have them.
Just like with pre-training, where the models just know more: if I look at a pre-trained model, it knows more about the history of samurai in Japan than I do.
It knows more about baseball than I do. It knows more about low-pass filters
and electronics, all of these things. Its knowledge is way broader than mine.
So I think even just that may get us to the point where the models are better at everything.
We also have, again, just with scaling the kind of existing setup, the in-context learning.
I would describe it as kind of like human on-the-job learning,
but a little weaker and a little short term. You look at in-context learning and if you give
the model a bunch of examples it does get it. There's real learning that happens in context.
A million tokens is a lot. That can be days of human learning.
If you think about the model reading a million words, how long would it
take me to read a million? Days or weeks at least. So you have these two things.
I think these two things within the existing paradigm may just be enough to get you the
"country of geniuses in a data center". I don't know for sure, but I think
they're going to get you a large fraction of it. There may be gaps, but I certainly think that just
as things are, this is enough to generate trillions of dollars of revenue. That's one. Two,
is this idea of continual learning, this idea of a single model learning on the job.
I think we're working on that too. There's a good chance that in the next
year or two, we also solve that. Again, I think you get most
of the way there without it. The trillions of dollars a year market,
maybe all of the national security implications and the safety implications that I wrote about in
"Adolescence of Technology" can happen without it. But we, and I imagine others, are working on it.
There's a good chance that we will get there within the next year or two.
There are a bunch of ideas. I won't go into all of them in detail, but
one is just to make the context longer. There's nothing preventing
longer contexts from working. You just have to train at longer contexts
and then learn to serve them at inference. Both of those are engineering problems that
we are working on and I would assume others are working on them as well.
This context length increase, it seemed like there was a period from 2020 to 2023
where from GPT-3 to GPT-4 Turbo, there was an increase from 2,000-token context lengths to 128K.
I feel like for the two-ish years since then, we've been in the same-ish ballpark.
When context lengths get much longer than that, people report qualitative
degradation in the ability of the model to consider that full context.
So I'm curious what you're internally seeing that makes you think, "10-million-token contexts, 100-million-token contexts, to get six months of human learning and building context".
This isn't a research problem. This is an engineering and inference problem.
If you want to serve long context, you have to store your entire KV cache.
It's difficult to store all the memory in the GPUs, to juggle the memory around.
I don't even know the details. At this point, this is at a level of detail
that I'm no longer able to follow, although I knew it in the GPT-3 era. "These are the weights,
these are the activations you have to store…" But these days the whole thing is flipped
because we have MoE models and all of that. Regarding this degradation you're talking about,
without getting too specific, there's two things. There's the context length you train at and
there's a context length that you serve at. If you train at a small context length
and then try to serve at a long context length, maybe you get these degradations.
It's better than nothing, you might still offer it, but you get these degradations.
Maybe it's harder to train at a long context length.
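For a sense of why serving long context is an inference-engineering problem, here is a back-of-the-envelope KV-cache estimate with made-up transformer dimensions, not those of any Claude model:

```python
# Rough KV-cache size for one request:
#   2 (keys and values) * layers * KV heads * head_dim * tokens * bytes per element.
# All model dimensions here are illustrative.
def kv_cache_bytes(tokens, n_layers=80, n_kv_heads=8, head_dim=128, dtype_bytes=2):
    return 2 * n_layers * n_kv_heads * head_dim * tokens * dtype_bytes

for tokens in (128_000, 1_000_000, 10_000_000):
    gib = kv_cache_bytes(tokens) / 2**30
    print(f"{tokens:>12,} tokens -> ~{gib:,.0f} GiB of KV cache for one request")
# Around a million tokens, a single request's cache rivals or exceeds one
# accelerator's memory, so it has to be sharded and juggled across devices.
```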
I want to, at the same time, ask about maybe some rabbit holes.
Wouldn't you expect that if you had to train on longer context length,
that would mean that you're able to get fewer samples in for the same amount of compute?
Maybe it's not worth diving deep on that. I want to get an answer to the
bigger picture question. For me not to feel a preference for a human editor that's been working for me for six months versus an AI that's been working with me for six months, what year do you predict that will be the case?
My guess for that is there's a lot of problems where basically we can do this when we have
the "country of geniuses in a data center". My picture for that, if you made me guess, is
one to two years, maybe one to three years. It's really hard to tell. I have a strong view—99%,
95%—that all this will happen in 10 years. I think that's just a super safe bet.
I have a hunch—this is more like a 50/50 thing—that it's going to be more like
one to two, maybe more like one to three. So one to three years. Country of geniuses,
and the slightly less economically valuable task of editing videos.
It seems pretty economically valuable, let me tell you.
It's just there are a lot of use cases like that. There are a lot of similar ones.
So you're predicting that within one to three years.
And then, generally, Anthropic has predicted that by late '26 or early '27 we will have AI systems
that "have the ability to navigate interfaces available to humans doing digital work today,
intellectual capabilities matching or exceeding that of Nobel Prize winners, and the ability to
interface with the physical world". You gave an interview two months ago
with DealBook where you were emphasizing your company's more responsible compute
scaling as compared to your competitors. I'm trying to square these two views.
If you really believe that we're going to have a country of geniuses, you want as
big a data center as you can get. There's no reason to slow down.
The TAM of a Nobel Prize winner, that can actually do everything a Nobel Prize
winner can do, is trillions of dollars. So I'm trying to square this conservatism,
which seems rational if you have more moderate timelines, with your stated views about progress.
It actually all fits together. We go back to this fast, but not infinitely fast, diffusion.
Let's say that we're making progress at this rate. The technology is making progress this fast.
I have very high conviction that we're going to get there within a few years.
I have a hunch that we're going to get there within a year or two.
So there’s a little uncertainty on the technical side, but pretty strong
confidence that it won't be off by much. What I'm less certain about is, again,
the economic diffusion side. I really do believe that we could
have models that are a country of geniuses in the data center in one to two years.
One question is: How many years after that do the trillions in revenue start rolling in?
I don't think it's guaranteed that it's going to be immediate.
It could be one year, it could be two years, I could even stretch it to five
years although I'm skeptical of that. So we have this uncertainty. Even if the technology goes as
fast as I suspect that it will, we don't know exactly how fast it's going to drive revenue.
We know it's coming, but with the way you buy these data centers, if you're off by a couple
years, that can be ruinous. It is just like how I
wrote in "Machines of Loving Grace". I said I think we might get this powerful AI,
this "country of genius in the data center". That description you gave comes
from "Machines of Loving Grace". I said we'll get that in 2026, maybe 2027. Again,
that is my hunch. I wouldn't be surprised if I'm off by a year or two, but that is my hunch.
Let's say that happens. That's the starting gun. How long does it take to cure all the diseases?
That's one of the ways that drives a huge amount of economic value. You cure every disease. There's
a question of how much of that goes to the pharmaceutical company or the AI company,
but there's an enormous consumer surplus because —assuming we can get access for everyone,
which I care about greatly—we cure all of these diseases. How long does it take? You
have to do the biological discovery, you have to manufacture the new drug,
you have to go through the regulatory process. We saw this with vaccines and COVID.
We got the vaccine out to everyone, but it took a year and a half.
My question is: How long does it take to get the cure for everything—which AI is the genius
that can in theory invent—out to everyone? How long from when that AI first exists
in the lab to when diseases have actually been cured for everyone?
We've had a polio vaccine for 50 years. We're still trying to eradicate it in the
most remote corners of Africa. The Gates Foundation is trying
as hard as they can. Others are trying as hard
as they can. But that's difficult. Again, I don't expect most of the economic diffusion
to be as difficult as that. That's the most difficult case. But there's a real dilemma here.
Where I've settled on it is that it will be faster than anything we've seen in the
world, but it still has its limits. So when we go to buying data centers,
again, the curve I'm looking at is: we've had a 10x a year increase every year.
At the beginning of this year, we're looking at $10 billion in annualized revenue.
We have to decide how much compute to buy. It takes a year or two to actually build out
the data centers, to reserve the data center. Basically I'm saying, "In 2027,
how much compute do I get?" I could assume that the
revenue will continue growing 10x a year, so it'll be $100 billion at the end of
2026 and $1 trillion at the end of 2027. Actually it would be $5 trillion
of compute because it would be $1 trillion a year for five years.
I could buy $1 trillion of compute that starts at the end of 2027.
If my revenue is not $1 trillion, if it's even $800 billion, there's no force on earth,
there's no hedge on earth that could stop me from going bankrupt if I buy that much compute.
Even though a part of my brain wonders if it's going to keep growing 10x,
I can't buy $1 trillion a year of compute in 2027. If I'm just off by a year in that rate of growth,
or if the growth rate is 5x a year instead of 10x a year, then you go bankrupt.
So you end up in a world where you're supporting hundreds of billions, not trillions.
You accept some risk that there's so much demand that you can't support the revenue,
and you accept some risk that you got it wrong and it's still slow.
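A toy version of that bet, with invented numbers, showing how a one-year miss in the growth rate turns a huge commitment into a shortfall:

```python
# Committing to compute ahead of demand. All numbers invented for illustration.
start_revenue = 10e9        # ~$10B annualized at the start of 2026
committed_compute = 1e12    # suppose you commit to ~$1T/year of compute for 2027

for growth in (10, 5):
    revenue_2027 = start_revenue * growth ** 2   # two years of compounding
    gap = revenue_2027 - committed_compute
    print(f"{growth}x/year -> ~${revenue_2027 / 1e9:,.0f}B revenue, "
          f"gap vs commitment: ${gap / 1e9:,.0f}B")
# 10x/year roughly covers the bill; 5x/year leaves a ~$750B annual shortfall,
# which is the "off by a year in the growth rate" bankruptcy case.
```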
When I talked about behaving responsibly, what I meant actually was not the absolute amount.
I think it is true we're spending somewhat less than some of the other players.
It's actually the other things, like have we been thoughtful about it or are we YOLOing and saying,
"We're going to do $100 billion here or $100 billion there"?
I get the impression that some of the other companies have not written down
the spreadsheet, that they don't really understand the risks they're taking.
They're just doing stuff because it sounds cool. We've thought carefully about it. We're
an enterprise business. Therefore, we can rely more on revenue. It's less fickle than consumer.
We have better margins, which is the buffer between buying too much and buying too little.
I think we bought an amount that allows us to capture pretty strong upside worlds.
It won't capture the full 10x a year. Things would have to go pretty badly for
us to be in financial trouble. So we've thought carefully and
we've made that balance. That's what I mean when
I say that we're being responsible. So it seems like it's possible that we
actually just have different definitions of the "country of geniuses in a data center".
Because when I think of actual human geniuses, an actual country of human geniuses in a data center,
I would happily buy $5 trillion worth of compute to run an actual country of
human geniuses in a data center. Let's say JPMorgan or Moderna or
whatever doesn't want to use them. I've got a country of geniuses.
They'll start their own company. If they can't start their own company and they're bottlenecked
by clinical trials… It is worth stating that with clinical trials, most clinical trials fail because
the drug doesn't work. There's no efficacy. I make exactly that point in "Machines of Loving Grace". I say the clinical trials are going to go much faster
than we're used to, but not infinitely fast. Okay, and then suppose it takes a year for
the clinical trials to work out so that you're getting revenue from that and can make more drugs.
Okay, well, you've got a country of geniuses and you're an AI lab.
You could use many more AI researchers. You also think there are these self-reinforcing
gains from smart people working on AI tech. You can have the data center
working on AI progress. Are there substantially
more gains from buying $1 trillion a year of compute versus $300 billion a year of compute?
If your competitor is buying a trillion, yes there is.
Well, no, there's some gain, but then again, there's this chance that they go bankrupt before.
Again, if you're off by only a year, you destroy yourselves. That's the balance. We're
buying a lot. We're buying a hell of a lot. We're buying an amount that's comparable to
what the biggest players in the game are buying. But if you're asking me, "Why haven't we signed
$10 trillion of compute starting in mid-2027?"... First of all, it can't be produced.
There isn't that much in the world. But second, what if the country of
geniuses comes, but it comes in mid-2028 instead of mid-2027? You go bankrupt.
So if your projection is one to three years, it seems like you should want
$10 trillion of compute by 2029 at the latest? Even in the longest version of the timelines
you state, the compute you are ramping up to build doesn't seem in accordance.
What makes you think that? Human wages, let's say,
are on the order of $50 trillion a year— So I won't talk about Anthropic in particular,
but if you talk about the industry, the amount of compute the industry is building this year is
probably, call it, 10-15 gigawatts. It goes up by roughly 3x a year.
So next year's 30-40 gigawatts. 2028 might be 100 gigawatts. 2029 might be like 300 gigawatts.
I'm doing the math in my head, but each gigawatt costs maybe $10 billion,
on the order of $10-15 billion a year. You put that all together and you're
getting about what you described. You’re getting exactly that. You're getting multiple
trillions a year by 2028 or 2029. You're getting exactly what you predict.
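Spelling that back-of-the-envelope industry math out, with the rough gigawatt and cost figures from the conversation:

```python
# Industry build-out: ~10-15 GW this year, growing ~3x a year,
# at roughly $10-15B per gigawatt per year. Rough numbers from the conversation.
gigawatts = {2026: 12}   # midpoint of the 10-15 GW estimate
for year in range(2027, 2030):
    gigawatts[year] = gigawatts[year - 1] * 3

for year, gw in gigawatts.items():
    low, high = gw * 10e9, gw * 15e9
    print(f"{year}: ~{gw} GW -> roughly ${low / 1e12:.1f}-{high / 1e12:.1f}T per year")
# That lands at multiple trillions of dollars a year of industry spend by 2028-2029.
```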
That's for the industry. That's for the industry, that’s right.
Suppose Anthropic's compute keeps 3x-ing a year, and then by 2027-28, you have 10 gigawatts.
Multiply that by, as you say, $10 billion. So then it's like $100 billion a year.
But then you're saying the TAM by 2028 is $200 billion.
Again, I don't want to give exact numbers for Anthropic, but these numbers are too small.
Okay, interesting. You've told investors
that you plan to be profitable starting in 2028. This is the year when we're potentially getting
the country of geniuses in a data center. This is now going to unlock all this progress
in medicine and health and new technologies. Wouldn't this be exactly the time where you'd
want to reinvest in the business and build bigger "countries" so they can make more discoveries?
Profitability is this kind of weird thing in this field.
I don't think in this field profitability is actually a measure of spending down
versus investing in the business. Let's just take a model of this.
I actually think profitability happens when you underestimated the amount of demand you were going
to get and loss happens when you overestimated the amount of demand you were going to get,
because you're buying the data centers ahead of time. Think about it this way. Again,
these are stylized facts. These numbers are not exact. I'm just trying to make a toy model here.
Let's say half of your compute is for training and half of your compute is for inference.
The inference has some gross margin that's more than 50%.
So what that means is that if you were in steady-state, you build a data center and if
you knew exactly the demand you were getting, you would get a certain amount of revenue.
Let’s say you pay $100 billion a year for compute. On $50 billion a year you support
$150 billion of revenue. The other $50 billion is used for training.
Basically you’re profitable and you make $50 billion of profit.
Those are the economics of the industry today, or not today but where we’re
projecting forward in a year or two. The only thing that makes that not the
case is if you get less demand than $50 billion. Then you have more than 50% of your data center
for research and you're not profitable. So you train stronger models,
but you're not profitable. If you get more demand than you thought, then
research gets squeezed, but you're kind of able to support more inference and you're more profitable.
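That toy model in code, using the stylized numbers from the conversation rather than real figures:

```python
# Stylized single-year economics: buy compute ahead of time, plan to split it
# evenly between inference and training, and let realized demand decide the rest.
def year_pnl(compute_bill=100e9, inference_share=0.5, gross_margin=2 / 3,
             demand_revenue=150e9):
    inference_capacity = compute_bill * inference_share      # ~$50B of serving
    max_revenue = inference_capacity / (1 - gross_margin)    # ~$150B supportable
    revenue = min(demand_revenue, max_revenue)
    return revenue - compute_bill   # the rest of the compute bill goes to training

for demand in (150e9, 80e9, 300e9):
    print(f"demand ${demand / 1e9:.0f}B -> P&L {year_pnl(demand_revenue=demand) / 1e9:+.0f}B")
# Demand as planned: +50B. Demand overestimated: -20B, with spare compute going to
# research. Demand underestimated: capped at +50B unless research gets squeezed.
```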
Maybe I'm not explaining it well, but the thing I'm trying to say is that you
decide the amount of compute first. Then you have some target desire of
inference versus training, but that gets determined by demand.
It doesn't get determined by you. What I'm hearing is the reason
you're predicting profit is that you are systematically underinvesting in compute?
No, no, no. I'm saying it's hard to predict. These things about 2028 and when it will happen,
that's our attempt to do the best we can with investors.
All of this stuff is really uncertain because of the cone of uncertainty.
We could be profitable in 2026 if the revenue grows fast enough.
If we overestimate or underestimate the next year, that could swing wildly.
What I'm trying to get at is that you have a model in your head of a business that invests,
invests, invests, gets scale and then becomes profitable.
There's a single point at which things turn around.
I don't think the economics of this industry work that way.
I see. So if I'm understanding correctly, you're saying that because of the discrepancy
between the amount of compute we should have gotten and the amount of compute we got,
we were sort of forced to make profit. But that doesn't mean we're going
to continue making profit. We're going to reinvest the money
because now AI has made so much progress and we want a bigger country of geniuses.
So we're back to a situation where revenue is high, but losses are also high.
If every year we predict exactly what the demand is going to be, we'll be profitable every year.
Because spending 50% of your compute on research, roughly, plus a gross margin that's higher than
50% and correct demand prediction leads to profit. That's the profitable business model that I think
is kind of there, but obscured by these building ahead and prediction errors.
I guess you're treating the 50% as a sort of given constant, whereas in fact,
if AI progress is fast and you can increase the progress by scaling up more, you should just have
more than 50% and not make profit. But here's what I'll say. You
might want to scale it up more. Remember the log returns to scale.
If 70% would get you a very little bit of a smaller model through a factor of 1.4x...
That extra $20 billion, each dollar there is worth much less to you because of the log-linear setup.
So you might find that it's better to invest that $20 billion in serving
inference or in hiring engineers who are kind of better at what they're doing.
So the reason I said 50%... That's not exactly our target. It's not exactly going to be 50%.
It’ll probably vary over time. What I'm saying is the log-linear return, what it leads to is you
spend of order one fraction of the business. Like not 5%, not 95%. Then you get diminishing returns.
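In symbols, the diminishing-returns point, as a sketch with an illustrative exponent rather than a measured one: if the capability gain from extra training compute scales like a small power of the compute ratio, going from 50% to 70% of a fixed budget is only a 1.4x factor, for example

```latex
\left( \frac{0.7\,C}{0.5\,C} \right)^{\alpha} = 1.4^{\,\alpha} \approx 1.03
\quad \text{for } \alpha \approx 0.1
```

Roughly a 3% improvement for 40% more training spend, which is why each marginal dollar there can be worth less than a dollar spent on inference or engineering.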
I feel strange that I'm convincing Dario to believe in AI progress or something.
Okay, you don't invest in research because it has diminishing returns,
but you invest in the other things you mentioned. I think profit at a sort of macro level—
Again, I'm talking about diminishing returns, but after you're spending $50 billion a year.
This is a point I'm sure you would make, but diminishing returns on a genius could
be quite high. More generally,
what is profit in a market economy? Profit is basically saying other
companies in the market can do more things with this money than I can.
Put aside Anthropic. I don't want to give information about Anthropic.
That’s why I'm giving these stylized numbers. But let's just derive the
equilibrium of the industry. Why doesn't everyone spend 100% of their
compute on training and not serve any customers? It's because if they didn't get any revenue,
they couldn't raise money, they couldn't do compute deals,
they couldn't buy more compute the next year. So there's going to be an equilibrium where every
company spends less than 100% on training and certainly less than 100% on inference.
It should be clear why you don't just serve the current models and never train another model,
because then you don't have any demand because you'll fall behind. So there's some equilibrium.
It's not gonna be 10%, it's not gonna be 90%. Let's just say as a stylized fact, it's 50%.
That's what I'm getting at. I think we're gonna be in a position where that equilibrium of how much
you spend on training is less than the gross margins that you're able to get on compute.
So the underlying economics are profitable. The problem is you have this hellish demand
prediction problem when you're buying the next year of compute and you might guess under and be
very profitable but have no compute for research. Or you might guess over and you are not
profitable and you have all the compute for research in the world. Does that make sense?
Just as a dynamic model of the industry? Maybe stepping back, I'm not saying I think
the "country of geniuses" is going to come in two years and therefore you should buy this compute.
To me, the end conclusion you're arriving at makes a lot of sense.
But that's because it seems like "country of geniuses" is hard and there's a long way to go.
So stepping back, the thing I'm trying to get at is more that it seems like your worldview
is compatible with somebody who says, "We're like 10 years away from a world in which we're
generating trillions of dollars of value." That's just not my view. So I'll make
another prediction. It is hard for me to see that there won't be trillions
of dollars in revenue before 2030. I can construct a plausible world.
It takes maybe three years. That would be the end of what I think is plausible.
Like in 2028, we get the real "country of geniuses in the data center".
The revenue's going into the low hundreds of billions by 2028, and then the country
of geniuses accelerates it to trillions. We’re basically on the slow end of diffusion.
It takes two years to get to the trillions. That would be the world where it takes until 2030.
I suspect even composing the technical exponential and diffusion exponential,
we’ll get there before 2030. So you laid out a model where Anthropic makes
profit because it seems like fundamentally we're in a compute-constrained world.
So eventually we keep growing compute— I think the way the profit comes is… Again,
let's just abstract the whole industry here. Let's just imagine we're in an economics textbook.
We have a small number of firms. Each can invest a limited amount.
Each can invest some fraction in R&D. They have some marginal cost to serve.
The gross profit margins on that marginal cost are very high because inference is efficient.
There's some competition, but the models are also differentiated.
Companies will compete to push their research budgets up.
But because there's a small number of players, we have the... What is it called?
The Cournot equilibrium, I think, is what the small-number-of-firms equilibrium is called.
The point is it doesn't equilibrate to perfect competition with zero margins.
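For reference, the textbook symmetric Cournot result makes this concrete. This is a generic sketch with linear inverse demand P = a - bQ, constant marginal cost c, and n identical firms, not figures from this conversation:

$$ q^* = \frac{a - c}{b(n+1)}, \qquad P^* = \frac{a + nc}{n+1}, \qquad P^* - c = \frac{a - c}{n+1} $$

The margin P^* - c stays strictly positive for any finite n and only approaches zero as the number of firms grows without bound; with n = 3 it is a quarter of the gap a - c.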
If there's three firms in the economy and all are kind of independently behaving rationally,
it doesn't equilibrate to zero. Help me understand that, because
right now we do have three leading firms and they're not making profit. So what is changing?
Again, the gross margins right now are very positive.
What's happening is a combination of two things. One is that we're still in the exponential
scale-up phase of compute. A model gets trained. Let's say a model got
trained that costs $1 billion last year. Then this year it produced $4 billion of
revenue and cost $1 billion to inference from. Again, I'm using stylized numbers here, but that
would be 75% gross margins and this 25% tax. So that model as a whole makes $2 billion.
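To make the stylized arithmetic explicit, here is a minimal sketch in Python using the illustrative numbers from this exchange, including the $10 billion next-model training run mentioned just below; these are not actual company figures:

# Per-model vs. company-level P&L with the stylized numbers above.
training_cost = 1e9        # what the current model cost to train last year
revenue = 4e9              # revenue it produces this year
inference_cost = 1e9       # cost of serving that revenue
next_training_run = 10e9   # the exponentially larger next model (see below)

gross_margin = (revenue - inference_cost) / revenue               # 0.75
model_pnl = revenue - inference_cost - training_cost              # +$2B
company_cash_flow = revenue - inference_cost - next_training_run  # -$7B

print(f"gross margin {gross_margin:.0%}, "
      f"model P&L ${model_pnl / 1e9:+.0f}B, "
      f"company cash flow ${company_cash_flow / 1e9:+.0f}B")
# Each model is profitable on its own; the company shows a loss only
# because the next training run is 10x the last one.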
But at the same time, we're spending $10 billion to train the next model because
there's an exponential scale-up. So the company loses money. Each model
makes money, but the company loses money. The equilibrium I'm talking about is an
equilibrium where we have the "country of geniuses in a data center", but that
model training scale-up has equilibrated more. Maybe it's still going up. We're still trying to
predict the demand, but it's more leveled out. I'm confused about a couple of things there.
Let's start with the current world. In the current world, you're right that,
as you said before, if you treat each individual model as a company, it's profitable.
But of course, a big part of the production function of being a frontier lab is training
the next model, right? Yes, that's right.
If you didn't do that, then you'd make profit for two months and then
you wouldn't have margins because you wouldn't have the best model.
But at some point that reaches the biggest scale that it can reach.
And then in equilibrium, we have algorithmic improvements, but we're spending roughly the
same amount to train the next model as we spend to train the current model.
At some point you run out of money in the economy. A fixed lump of labor fallacy… The economy is
going to grow, right? That's one of your predictions. We're going
to have the data centers in space. Yes, but this is another example
of the theme I was talking about. The economy will grow much faster
with AI than I think it ever has before. Right now the compute is growing 3x a year.
I don't believe the economy is gonna grow 3x a year.
I said this in "Machines of Loving Grace", I think we may get 10-20%
per year growth in the economy, but we're not gonna get 3x growth in the economy.
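To see why a 3x-per-year compute scale-up cannot continue indefinitely against 10-20% economic growth, here is a toy projection; the starting figures are assumptions for illustration only:

# Toy projection: compute spend tripling yearly vs. the whole economy growing 20% yearly.
compute_spend = 0.3e12   # assume ~$0.3T/year of AI compute spend today (illustrative)
world_gdp = 110e12       # roughly $110T of world GDP
years = 0
while compute_spend < world_gdp:
    compute_spend *= 3.0
    world_gdp *= 1.20
    years += 1
print(f"Compute spend would exceed world GDP within about {years} years.")
# With these assumptions that happens in roughly seven years, so the 3x
# scale-up has to level off well before compute is "most of the economy".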
So I think in the end, if compute becomes the majority of what the economy produces,
it's gonna be capped by that. So let's assume a model
where compute stays capped. The world where frontier labs are making money
is one where they continue to make fast progress. Because fundamentally your margin is limited by
how good the alternative is. So you are able to make money
because you have a frontier model. If you didn't have a frontier model
you wouldn't be making money. So this model requires there
never to be a steady state. Forever and ever you keep
making more algorithmic progress. I don't think that's true. I mean,
I feel like we're in an economics class. Do you know the Tyler Cowen quote?
We never stop talking about economics. We never stop talking about economics.
So no, I don't think this field's going to be a monopoly.
All my lawyers never want me to say the word "monopoly".
But I don't think this field's going to be a monopoly.
You do get industries in which there are a small number of players.
Not one, but a small number of players. Ordinarily, the way you get monopolies
like Facebook or Meta—I always call them Facebook—is these kinds of network effects.
The way you get industries in which there are a small number of players,
is very high costs of entry. Cloud is like this. I think cloud is a good example of this.
There are three, maybe four, players within cloud. I think that's the same for AI, three, maybe four.
The reason is that it's so expensive. It requires so much expertise and so
much capital to run a cloud company. You have to put up all this capital.
In addition to putting up all this capital, you have to get all of this other stuff
that requires a lot of skill to make it happen. So if you go to someone and you're like, "I want
to disrupt this industry, here's $100 billion." You're like, "okay, I'm putting in $100 billion
and also betting that you can do all these other things that these people have been doing."
Only to decrease the profit. The effect of your entering
is that profit margins go down. So, we have equilibria like this
all the time in the economy where we have a few players. Profits are not astronomical. Margins
are not astronomical, but they're not zero. That's what we see on cloud. Cloud is very
undifferentiated. Models are more differentiated than cloud.
Everyone knows Claude is good at different things than GPT is good at, than Gemini is good at.
It's not just that Claude's good at coding, GPT is good at math and reasoning.
It's more subtle than that. Models are good at different types of coding. Models have different
styles. I think these things are actually quite different from each other, and so I would expect
more differentiation than you see in cloud. Now, there actually is one counter-argument.
That counter-argument is that if AI models can themselves do the process of producing models,
then that capability could spread throughout the economy.
But that is not an argument for commoditizing AI models in general.
That's kind of an argument for commoditizing the whole economy at once.
I don't know what quite happens in that world where basically anyone
can do anything, anyone can build anything, and there's no moat around anything at all.
I don't know, maybe we want that world. Maybe that's the end state here.
Maybe when AI models can do everything, if we've solved all the safety and security problems,
that's one of the mechanisms for the economy just flattening itself again.
But that's kind of far post-"country of geniuses in the data center."
Maybe a finer way to put that potential point is: 1) it seems like AI research is especially
loaded on raw intellectual power, which will be especially abundant in the world of AGI.
And 2) if you just look at the world today, there are very few technologies that seem to be
diffusing as fast as AI algorithmic progress. So that does hint that this industry is
sort of structurally diffusive. I think coding is going fast, but
I think AI research is a superset of coding and there are aspects of it that are not going fast.
But I do think, again, once we get coding, once we get AI models going fast, then that will speed up
the ability of AI models to do everything else. So while coding is going fast now, I think once
the AI models are building the next AI models and building everything else,
the whole economy will kind of go at the same pace. I am worried geographically, though.
I'm a little worried that just proximity to AI, having heard about AI, may be one differentiator.
So when I said the 10-20% growth rate, a worry I have is that the growth rate could be like 50%
in Silicon Valley and parts of the world that are socially connected to Silicon Valley, and not that
much faster than its current pace elsewhere. I think that'd be a pretty messed up world.
So one of the things I think about a lot is how to prevent that.
Do you think that once we have this country of geniuses in a data center, that
robotics is sort of quickly solved afterwards? Because it seems like a big problem with robotics
is that a human can learn how to teleoperate current hardware, but current AI models can't,
at least not in a way that's super productive. And so if we have this ability to learn like
a human, shouldn't it solve robotics immediately as well?
I don't think it's dependent on learning like a human.
It could happen in different ways. Again, we could have trained the model on
many different video games, which are like robotic controls, or many different simulated robotics
environments, or just train them to control computer screens, and they learn to generalize.
So it will happen... it's not necessarily dependent on human-like learning.
Human-like learning is one way it could happen. If the model's like, "Oh, I pick up a robot,
I don't know how to use it, I learn," that could happen because we discovered continual learning.
That could also happen because we trained the model on a bunch of environments and
then generalized, or it could happen because the model learns that in the context length.
It doesn't actually matter which way. If we go back to the discussion we had
an hour ago, that type of thing can happen in several different ways.
But I do think when for whatever reason the models have those skills, then robotics will be
revolutionized—both the design of robots, because the models will be much better than humans at
that, and also the ability to control robots. So we'll get better at building the physical
hardware, building the physical robots, and we'll also get better at controlling it.
Now, does that mean the robotics industry will also be generating
trillions of dollars of revenue? My answer there is yes, but there will be
the same extremely fast, but not infinitely fast diffusion. So will robotics be revolutionized?
Yeah, maybe tack on another year or two. That's the way I think about these things.
Makes sense. There's a general skepticism about extremely fast progress. Here's my view. It sounds
like you are going to solve continual learning one way or another within a matter of years.
But just as people weren't talking about continual learning a couple of years ago,
and then we realized, "Oh, why aren't these models as useful as they could be right now,
even though they are clearly passing the Turing test and are experts in so many different domains?
Maybe it's this thing." Then we solve this thing and we realize, actually, there's another thing
that human intelligence can do that's a basis of human labor that these models can't do.
So why not think there will be more things like this, where
we keep finding more missing pieces of human intelligence? Well, to be clear, I think continual learning, as
I've said before, might not be a barrier at all. I think we may just get there by pre-training
generalization and RL generalization. I think there just
might not be such a thing at all. In fact, I would point to the history
in ML of people coming up with things that are barriers that end up kind of
dissolving within the big blob of compute. People talked about, "How do your models
keep track of nouns and verbs?" "They can understand syntactically,
but they can't understand semantically; it's only statistical correlations."
"You can understand a word, but you can't understand a paragraph."
"There's reasoning: you can't do reasoning." But then suddenly it turns out you can
do code and math very well. So I think there's actually a
stronger history of some of these things seeming like a big deal and then kind of dissolving. Some
of them are real. The need for data is real, maybe continual learning is a real thing.
But again, I would ground us in something like code.
I think we may get to the point in a year or two where the models can
just do SWE end-to-end. That's a whole task. That's a whole sphere of human activity that
we're just saying models can do now. When you say end-to-end, do you mean
setting technical direction, understanding the context of the problem, et cetera?
Yes. I mean all of that. Interesting. I feel like that is AGI-complete,
which maybe is internally consistent. But it's not like saying 90%
of code or 100% of code. No, I gave this spectrum:
90% of code, 100% of code, 90% of end-to-end SWE, 100% of end-to-end SWE.
New tasks are created for SWEs. Eventually those get done as well.
It's a long spectrum there, but we're traversing the spectrum very quickly.
I do think it's funny that I've seen a couple of podcasts you've done where
the hosts will be like, "But Dwarkesh wrote the essay about the continuous learning thing."
It always makes me crack up because you've been an AI researcher for 10 years.
I'm sure there's some feeling of, "Okay, so a podcaster wrote an essay,
and every interview I get asked about it." The truth of the matter is that we're all
trying to figure this out together. There are some ways in which I'm
able to see things that others aren't. These days that probably has more to do
with seeing a bunch of stuff within Anthropic and having to make a bunch of decisions than I have
any great research insight that others don't. I'm running a 2,500-person company.
It's actually pretty hard for me to have concrete research insight, much harder than it would have
been 10 years ago or even two or three years ago. As we go towards a world of a full drop-in
remote worker replacement, does an API pricing model still make the most sense?
If not, what is the correct way to price AGI, or serve AGI?
I think there's going to be a bunch of different business models here, all at once,
that are going to be experimented with. I actually do think that the API
model is more durable than many people think. One way I think about it is if the technology
is advancing quickly, if it's advancing exponentially, what that means is there's
always a surface area of new use cases that have been developed in the last three months.
Any kind of product surface you put in place is always at risk of sort of becoming irrelevant.
Any given product surface probably makes sense for a range of capabilities of the model.
The chatbot is already running into limitations where making it smarter doesn't really help the
average consumer that much. But I don't think that's
a limitation of AI models. I don't think that's evidence
that the models are good enough and that getting better doesn't matter to the economy;
it just doesn't matter as much to that particular product. So I think the value of the API is that the API
always offers an opportunity, very close to the bare metal, to build on what the latest thing is.
There's always going to be this front of new startups and new ideas that
weren't possible a few months ago and are possible because the model is advancing.
I actually predict that it's going to exist alongside other business models, but we're always going
to have the API business model because there's always going to be a need for a thousand different
people to try experimenting with the model in a different way. 100 of them become startups and
ten of them become big successful startups. Two or three really end up being the way
that people use the model of a given generation. So I basically think it's always going to exist.
At the same time, I'm sure there are going to be other business models as well.
Not every token that's output by the model is worth the same amount.
Think about what is the value of the tokens that the model outputs when someone calls
them up and says, "My Mac isn't working," or something, the model's like, "restart it."
That person hasn't heard it before, but the model has said it 10 million times.
Maybe that's worth like a dollar or a few cents or something.
Whereas if the model goes to one of the pharmaceutical companies and it says, "Oh,
you know, this molecule you're developing, you should take the aromatic ring from that end of the
molecule and put it on that end of the molecule. If you do that, wonderful things will happen."
Those tokens could be worth tens of millions of dollars.
So I think we're definitely going to see business models that recognize that.
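As a back-of-the-envelope illustration of why a single per-token price struggles here; both dollar values are hypothetical, following the examples above:

# Same-length answers, wildly different value per token.
tokens_per_answer = 50
value_restart_tip = 0.05          # "restart your Mac": worth a few cents
value_molecule_hint = 10_000_000  # a drug-design suggestion: worth tens of millions

per_token_support = value_restart_tip / tokens_per_answer
per_token_pharma = value_molecule_hint / tokens_per_answer
print(f"~${per_token_support:.4f} vs. ~${per_token_pharma:,.0f} of value per token, "
      f"a {value_molecule_hint / value_restart_tip:,.0f}x spread")
# A flat per-token price can't capture a spread of eight orders of magnitude,
# which is why outcome-based or labor-like pricing comes up.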
At some point we're going to see "pay for results" in some form, or we may see forms of compensation
that are like labor, that kind of work by the hour. I don't know. I think because it's a new
industry, a lot of things are going to be tried. I don't know what will turn out to
be the right thing. I take your point that
people will have to try things to figure out what is the best way to use this blob of intelligence.
But what I find striking is Claude Code. I don't think in the history of startups
there has been a single application that has been as hotly competed in as coding agents.
Claude Code is a category leader here. That seems surprising to me. It doesn't seem
intrinsically that Anthropic had to build this. I wonder if you have an accounting of why it had
to be Anthropic or how Anthropic ended up building an application in addition
to the model underlying it that was successful. So it actually happened in a pretty simple way,
which is that we had our own coding models, which were good at coding.
Around the beginning of 2025, I said, "I think the time has come where you can have
nontrivial acceleration of your own research if you're an AI company by using these models."
Of course, you need an interface, you need a harness to use them.
So I encouraged people internally. I didn't say this is one thing that you have to use.
I just said people should experiment with this. I think it might have been originally
called Claude CLI, and then the name eventually got changed to Claude Code.
Internally, it was the thing that everyone was using and it was seeing fast internal adoption.
I looked at it and I said, "Probably we should launch this externally, right?"
It's seen such fast adoption within Anthropic. Coding is a lot of what we do.
We have an audience of many, many hundreds of people that's in some ways at least
representative of the external audience. So it looks like we already have product
market fit. Let's launch this thing. And then we launched it. I think just the fact that we
ourselves are developing the model and we ourselves know what we most need the model for,
I think it creates this feedback loop. I see. In the sense that, let's say, a
developer at Anthropic is like, "Ah, it would be better if it was better at this X thing."
Then you bake that into the next model that you build.
That's one version of it, but then there's just the ordinary product iteration.
We have a bunch of coders within Anthropic, they use Claude Code
every day and so we get fast feedback. That was more important in the early days.
Now, of course, there are millions of people using it, and so we get
a bunch of external feedback as well. But it's just great to be able to get
kind of fast internal feedback. I think this is the reason why we
launched a coding model and didn't launch a pharmaceutical company.
My background's in biology, but we don't have any of the resources that
are needed to launch a pharmaceutical company. Let me now ask you about making AI go well.
It seems like whatever vision we have about how AI goes well has to be compatible with two things:
1) the ability to build and run AIs is diffusing extremely rapidly and 2) the
population of AIs, the number we have and their intelligence, will also increase very rapidly.
That means that lots of people will be able to build huge populations of misaligned AIs,
or AIs that are just like companies trying to increase their
footprint, or that have weird psyches like Sydney Bing, but now superhuman.
What is a vision for a world in which we have an equilibrium that is compatible
with lots of different AIs, some of which are misaligned, running around?
I think in "The Adolescence of Technology", I was skeptical of the balance of power.
But the thing I was specifically skeptical of is the idea that, with three or four of these companies
all building models derived from the same thing, they would check each other.
Or even that any number of them would check each other.
We might live in an offense-dominant world where one person or one AI model is smart enough to do
something that causes damage to everyone else. In the short run, we have a limited number
of players now. So we can start
within the limited number of players. We need to put in place the safeguards.
We need to make sure everyone does the right alignment work.
We need to make sure everyone has bioclassifiers. Those are the immediate things we need to do.
I agree that that doesn't solve the problem in the long run, particularly if the ability of
AI models to make other AI models proliferates, then the whole thing can become harder to solve.
I think in the long run we need some architecture of governance.
We need some architecture of governance that preserves human freedom,
but also allows us to govern a very large number of human systems, AI systems, hybrid
human-AI companies or economic units. So we're gonna need to think about:
how do we protect the world against bioterrorism? How do we protect the world against mirror life?
Probably we're gonna need some kind of AI monitoring system
that monitors for all of these things. But then we need to build this in a way
that preserves civil liberties and our constitutional rights.
So I think just as anything else, it's a new security landscape with a new set of
tools and a new set of vulnerabilities. My worry is, if we had 100 years for this
to happen all very slowly, we'd get used to it. We've gotten used to the presence of explosives
in society or the presence of various new weapons or the presence of video cameras.
We would get used to it over 100 years and we’d develop governance mechanisms. We'd
make our mistakes. My worry is just that this is happening all so fast.
So maybe we need to do our thinking faster about how to make these governance mechanisms work.
It seems like in an offense-dominant world, over the course of the next century—the idea is that AI
is making the progress that would happen over the next century happen in some period of five to ten
years—we would still need the same mechanisms, or balance of power would be similarly intractable,
even if humans were the only game in town. I guess we have the advice of AI.
But it fundamentally doesn't seem like a totally different ball game here.
If checks and balances were going to work with AIs, they would work with humans as well.
If they aren't going to work with AIs, they wouldn't work with humans either.
So maybe this just dooms human checks and balances as well.
Again, I think there's some way to make this happen.
The governments of the world may have to work together to make it happen.
We may have to talk to AIs about building societal structures in such a way that these
defenses are possible. I don't know. I don't want to say this is so far ahead in time,
but it's so far ahead in technological ability, even though that ability may arrive over a short period of time,
that it's hard for us to anticipate it in advance. Speaking of governments getting involved,
on December 26, the Tennessee legislature introduced a bill which said, "It would
be an offense for a person to knowingly train artificial intelligence to provide
emotional support, including through open-ended conversations with a user."
Of course, one of the things that Claude attempts to do is be a thoughtful, knowledgeable friend.
In general, it seems like we're going to have this patchwork of state laws.
A lot of the benefits that normal people could experience as a result of AI are going to be
curtailed, especially when we get into the kinds of things you discuss in "Machines
of Loving Grace": biological freedom, mental health improvements, et cetera.
It seems easy to imagine worlds in which these get Whac-A-Moled away by different laws, whereas
bills like this don't seem to address the actual existential threats that you're concerned about.
I'm curious to understand, in the context of things like this, Anthropic's position
against the federal moratorium on state AI laws. There are many different things going on at once.
I think that particular law is dumb. It was clearly made by legislators
who just probably had little idea what AI models can and can't do.
They're like, "AI models serving us, that just sounds scary.
I don't want that to happen." So we're not in favor of that.
But that wasn't the thing that was being voted on. The thing that was being voted on is:
we're going to ban all state regulation of AI for 10 years with no apparent plan to do any
federal regulation of AI, which would take Congress to pass, which is a very high bar.
So the idea that we'd ban states from doing anything for 10 years… People said they had
a plan for the federal government, but there was no actual proposal on the table. There was
no actual attempt. Given the serious dangers that I lay out in "Adolescence of Technology"
around things like biological weapons and bioterrorism, autonomy risk, and the
timelines we've been talking about—10 years is an eternity—I think that's a crazy thing to do.
So if that's the choice, if that's what you force us to choose, then we're going
to choose not to have that moratorium. I think the benefits of that position
exceed the costs, but it's not a perfect position if that's the choice.
Now, I think the thing that we should do, the thing that I would support, is the federal
government should step in, not saying "states you can't regulate", but "Here's what we're going to
do, and states you can't differ from this." I think preemption is fine in the sense of
saying that the federal government says, "Here is our standard. This applies to everyone.
States can't do something different." That would be something I would support
if it would be done in the right way. But this idea of states, "You can't do
anything and we're not doing anything either," that struck us as very much not making sense.
I think it will not age well, it is already starting to not age well with
all the backlash that you've seen. Now, in terms of what we would want,
the things we've talked about are starting with transparency standards in order to monitor some
of these autonomy risks and bioterrorism risks. As the risks become more serious, as we get more
evidence for them, then I think we could be more aggressive in some targeted ways and say, "Hey,
AI bioterrorism is really a threat. Let's pass a law that forces
people to have classifiers." I could even imagine… It depends.
It depends how serious the threat ends up being. We don't know for sure. We need to pursue
this in an intellectually honest way where we say that ahead of time, the risk has not emerged yet.
But I could certainly imagine, with the pace that things are going at,
a world where later this year we say, "Hey, this AI bioterrorism stuff is really serious.
We should do something about it. We should put it in a federal standard.
If the federal government won't act, we should put it in a state standard." I could totally see that.
I'm concerned about a world where if you just consider the pace of progress you're expecting,
the life cycle of legislation... The benefits arrive, as you say, slowly enough because
of diffusion lag that I really do think this patchwork of state
laws, on the current trajectory, would curtail them. I mean if having an emotional chatbot friend is
something that freaks people out, then just imagine the kinds of actual benefits from AI
we want normal people to be able to experience. From improvements in health and healthspan and
improvements in mental health and so forth. Whereas at the same time, it seems like you
think the dangers are already on the horizon and I just don't see that much… It seems like it would
be especially injurious to the benefits of AI as compared to the dangers of AI.
So that's maybe where the cost-benefit calculus makes less sense to me.
So there's a few things here. People talk about there being
thousands of these state laws. First of all, the vast,
vast majority of them do not pass. The world works a certain way in theory,
but just because a law has been passed doesn't mean it's really enforced.
The people implementing it may be like, "Oh my God, this is stupid.
It would mean shutting off everything that's ever been built in Tennessee."
Very often, laws are interpreted in a way that makes them not as dangerous or harmful.
Of course, the same applies on the other side: if you're passing a law to stop a bad thing,
you have this problem as well. My basic view is that if we could
decide what laws were passed and how things were done—and we’re only one small input
into that—I would deregulate a lot of the stuff around the health benefits of AI.
I don't worry as much about the chatbot laws. I actually worry more about the drug approval
process, where I think AI models are going to greatly accelerate the rate at which we discover
drugs, and the pipeline will get jammed up. The pipeline will not be prepared to process
all the stuff that's going through it. I think reform of the regulatory process
should lean into the fact that we have a lot of things coming where the safety and
efficacy are actually going to be really crisp and clear, a beautiful thing, and really effective.
Maybe we don't need all this superstructure around it that was designed around an era of drugs that
barely work and often have serious side effects. At the same time, I think we should be
ramping up quite significantly the safety and security legislation.
Like I've said, starting with transparency is my view of trying not to hamper the industry,
trying to find the right balance. I'm worried about it. Some people criticize
my essay, saying, "That's too slow. The dangers of AI will come too soon
if we do that." Well, basically,
I think the last six months and maybe the next few months are going to be about transparency.
Then, if these risks emerge when we're more certain of them—which
I think we might be as soon as later this year—then I think we need to act very fast
in the areas where we've actually seen the risk. I think the only way to do this is to be nimble.
Now, the legislative process is normally not nimble, but we need to emphasize the
urgency of this to everyone involved. That's why I'm sending this message of urgency.
That's why I wrote "Adolescence of Technology". I wanted policymakers, economists, national
security professionals, and decision-makers to read it so that they have some hope of acting
faster than they would have otherwise. Is there anything you can do or advocate
that would make it more certain that the benefits of AI are better instantiated?
I feel like you have worked with legislatures to say, "Okay,
we're going to prevent bioterrorism here. We're going to increase transparency, we're
going to increase whistleblower protection." But I think by default, the actual benefits
we're looking forward to seem very fragile to different kinds of moral panics or
political economy problems. I don't actually agree that
much regarding the developed world. I feel like in the developed world,
markets function pretty well. When there's a lot of money to
be made on something and it's clearly the best available alternative, it's actually hard for
the regulatory system to stop it. We're seeing that in AI itself.
A thing I've been trying to fight for is export controls on chips to China.
That's in the national security interest of the US.
That's squarely within the policy beliefs of almost everyone in Congress of both parties.
The case is very clear. The counterarguments against it, I'll politely call them fishy.
Yet it doesn't happen and we sell the chips because there's so much money riding on it.
That money wants to be made. In that case, in my opinion, that's a bad thing.
But it also applies when it's a good thing. So if we're talking about drugs and benefits of
the technology, I am not as worried about those benefits being hampered in the developed world.
I am a little worried about them going too slow. As I said, I do think we should work to speed
the approval process in the FDA. I do think we should fight against
these chatbot bills that you're describing. Taken individually, I'm against them. I
think they're stupid. But I actually think the bigger worry is the developing world, where we
don't have functioning markets and where we often can't build on the technology that we've had.
I worry more that those folks will get left behind.
And I worry that even if the cures are developed, maybe there's someone in rural
Mississippi who doesn't get them either. That's a smaller version of the concern
we have in the developing world. So the things we've been doing
are working with philanthropists. We work with folks who deliver medicine and
health interventions to the developing world, to sub-Saharan Africa, India, Latin America,
and other developing parts of the world. That's the thing I think that
won't happen on its own. You mentioned export controls.
Why shouldn't the US and China both have a "country of geniuses in a data center"?
Why won’t it happen or why shouldn't it happen? Why shouldn't it happen.
If this does happen, we could have a few situations.
If we have an offense-dominant situation, we could have a situation
like nuclear weapons, but more dangerous. Either side could easily destroy everything.
We could also have a world where it's unstable. The nuclear equilibrium is
stable because of deterrence. But let's say there was uncertainty about,
if the two AIs fought, which AI would win? That could create instability. You often have
conflict when the two sides have a different assessment of their likelihood of winning.
If one side is like, "Oh yeah, there's a 90% chance I'll win," and the other side thinks
the same, then a fight is much more likely. They can't both be right,
but they can both think that. But this seems like a fully general argument
against the diffusion of AI technology. That's the implication of this world.
Let me just go on, because I think we will get diffusion eventually.
The other concern I have is that governments will oppress their own people with AI.
I'm worried about a world where you have a country in which there’s already a government that's
building a high-tech authoritarian state. To be clear, this is about the government.
This is not about the people. We need to find a way for
people everywhere to benefit. My worry here is about governments.
My worry is if the world gets carved up into two pieces, one of those two pieces
could be authoritarian or totalitarian in a way that's very difficult to displace.
Now, will governments eventually get powerful AI, and is there a risk of authoritarianism?
Yes. Will governments eventually get powerful AI, and is there a risk of
bad equilibria? Yes, I think both things. But the initial conditions matter. At some point, we're
going to need to set up the rules of the road. I'm not saying that one country, either the United
States or a coalition of democracies—which I think would be a better setup, although it
requires more international cooperation than we currently seem to want to make—should just say,
"These are the rules of the road." There's going to be some negotiation.
The world is going to have to grapple with this. What I would like is for the democratic nations of
the world—those whose governments represent something closer to pro-human values—to hold the
stronger hand and have more leverage when the rules of the road are set.
So I'm very concerned about that initial condition.
I was re-listening to the interview from three years ago, and one of the ways it
aged poorly is that I kept asking questions assuming there was going to be some key
fulcrum moment two to three years out. In fact, now that we are that far out, it just seems
like progress continues, AI improves, AI is more diffused, and people will use it for more things.
It seems like you're imagining a world in the future where the countries get together, and
"Here's the rules of the road, here's the leverage we have, and here's the leverage you have."
But on the current trajectory, everybody will have more AI.
Some of that AI will be used by authoritarian countries.
Some of that within the authoritarian countries will be used by private
actors versus state actors. It's not clear who will benefit more.
It's always hard to tell in advance. It seems like the internet privileged
authoritarian countries more than you would've expected.
Maybe AI will be the opposite way around. I want to better understand what
you're imagining here. Just to be precise about it,
I think the exponential of the underlying technology will continue as it has before.
The models get smarter and smarter, even when they get to a "country of geniuses in a data center."
I think you can continue to make the model smarter.
There's a question of getting diminishing returns on their value in the world.
How much does it matter after you've already solved human biology?
At some point you can do harder, more abstruse math problems, but nothing after that matters.
Putting that aside, I do think the exponential will continue, but there will be certain
distinguished points on the exponential. Companies, individuals, and countries
will reach those points at different times. In "The Adolescence of Technology" I talk about:
Is a nuclear deterrent still stable in the world of AI?
I don't know, but that's an example of one thing we've taken for granted.
The technology could reach such a level that we can no longer be certain of it.
Think of others. There are points where if you reach a certain level, maybe you have offensive
cyber dominance, and every computer system is transparent to you after that unless the
other side has an equivalent defense. I don't know what the critical moment
is or if there's a single critical moment. But I think there will be either a critical
moment, a small number of critical moments, or some critical window where AI confers
some large advantage from the perspective of national security, and one country or
coalition has reached it before others. I'm not advocating that they just say,
"Okay, we're in charge now." That's not how I think about it.
The other side is always catching up. There are extreme actions you're not
willing to take, and it's not right to take complete control anyway.
But at the point that happens, people are going to understand that the world has changed.
There's going to be some negotiation, implicit or explicit, about what the
post-AI world order looks like. My interest is in making that
negotiation be one in which classical liberal democracy has a strong hand.
I want to understand what that better means, because you say in the essay,
"Autocracy is simply not a form of government that people can accept in the post-powerful AI age."
That sounds like you're saying the CCP as an institution cannot exist after we get AGI.
That seems like a very strong demand, and it seems to imply a world where the leading lab
or the leading country will be able to—and by that language, should get to—determine
how the world is governed or what kinds of governments are, and are not, allowed.
I believe that paragraph said something like, "You could take it even further and say X."
I wasn't necessarily endorsing that view. I was saying,
"Here's a weaker thing that I believe. We have to worry a lot about authoritarians and
we should try to check them and limit their power. You could take this much further and have a more
interventionist view that says authoritarian countries with AI are these self-fulfilling
cycles that are very hard to displace, so you just need to get rid of them from the beginning."
That has exactly all the problems you say. If you were to make a commitment to
overthrowing every authoritarian country, they would take a bunch of actions now
that could lead to instability. That just may not be possible.
But the point I was making that I do endorse is that it is quite possible that...
Today, my view, and the view in most of the Western world, is that democracy is a better form of
government than authoritarianism. But if a country’s authoritarian,
we don’t react the way we’d react if they committed a genocide or something.
I guess what I'm saying is I'm a little worried that in the age of AGI, authoritarianism will
have a different meaning. It will be a graver thing.
We have to decide one way or another how to deal with that.
The interventionist view is one possible view. I was exploring such views. It may end up being the
right view, or it may end up being too extreme. But I do have hope. One piece of hope I have is
that we have seen that as new technologies are invented, forms of government become obsolete.
I mentioned this in "Adolescence of Technology", where I said feudalism
was basically a form of government, and when we invented industrialization, feudalism was no
longer sustainable. It no longer made sense. Why is that hopeful? Couldn't that imply that
democracy is no longer going to be a competitive system?
Right, it could go either way. But these problems with
authoritarianism get deeper. I wonder if that's an indicator of
other problems that authoritarianism will have. In other words, because authoritarianism becomes
worse, people are more afraid of it. They work harder to stop it.
You have to think in terms of total equilibrium. I just wonder if it will motivate new ways
of thinking about how to preserve and protect freedom with the new technology.
Even more optimistically, will it lead to a collective reckoning and a more emphatic
realization of how important some of the things we take as individual rights are?
A more emphatic realization that we really can't give these away.
We've seen there's no other way to live that actually works.
I am actually hopeful that—it sounds too idealistic, but I believe it could be the
case—dictatorships become morally obsolete. They become morally unworkable forms of
government and the crisis that that creates is sufficient to force us to find another way.
I think there is genuinely a tough question here which I'm not sure how you resolve.
We've had to come out one way or another on it through history.
With China in the '70s and '80s, we decided that even though it's an
authoritarian system, we will engage with it. I think in retrospect that was the right call,
because it's stayed an authoritarian system, but a billion-plus people are much wealthier and
better off than they would've otherwise been. It's not clear that it would've stopped being
an authoritarian country otherwise. You can just look at North Korea
as an example of that. I don't know if it takes
that much intelligence to remain an authoritarian country that continues to consolidate its own power.
You can imagine a North Korea with an AI that's much worse than everybody else's,
but still enough to keep power. In general, it seems like we should just
have this attitude that the benefits of AI—in the form of all these empowerments
of humanity and health—will be big. Historically, we have decided it's good
to spread the benefits of technology widely, even to people whose governments are authoritarian.
It is a tough question, how to think about it with AI, but historically we have said, "yes,
this is a positive-sum world, and it's still worth diffusing the technology."
There are a number of choices we have. Framing this as a government-to-government
decision in national security terms is one lens, but there are a lot of other lenses.
You could imagine a world where we produce all these cures to diseases.
The cures are fine to sell to authoritarian countries, but the data centers just aren't.
The chips and the data centers aren't, and the AI industry itself isn't.
Another possibility I think folks should think about is this.
Could there be developments we can make—either that naturally happen as a result of AI,
or that we could make happen by building technology on AI—that
create an equilibrium where it becomes infeasible for authoritarian countries
to deny their people private use of the benefits of the technology?
Are there equilibria where we can give everyone in an authoritarian country their own AI model that
defends them from surveillance and there isn't a way for the authoritarian country to crack
down on this while retaining power? I don't know. That sounds to me like if that went far enough,
it would be a reason why authoritarian countries would disintegrate from the inside.
But maybe there's a middle world where there's an equilibrium where, if they want to hold on
to power, the authoritarians can't deny individualized access to the technology.
But I actually do have a hope for the more radical version.
Is it possible that the technology might inherently have properties—or
that by building on it in certain ways we could create properties—that have this
dissolving effect on authoritarian structures? Now, we hoped originally—think back to the
beginning of the Obama administration—that social media and the internet would have
that property, and it turned out not to. But what if we could try again with the
knowledge of how many things could go wrong, and that this is a different technology?
I don't know if it would work, but it's worth a try.
It's just very unpredictable. There are first principles reasons why
authoritarianism might be privileged. It's all very unpredictable. We just
have to recognize the problem and come up with 10 things we can try, try those,
and then assess which ones are working, if any. Then try new ones if the old ones aren't working.
But I guess that nets out today, as you say, to not selling data centers,
chips, or the ability to make chips to China. So in some sense, you are denying… there would be
some benefits to the Chinese economy, the Chinese people, et cetera, that get denied because we're doing that.
Then there'd also be benefits to the American economy because it's a positive-sum world.
We could trade. They could have their country's data centers doing one thing.
We could have ours doing another. Already, you're saying it's not worth that
positive-sum stipend to empower those countries? What I would say is that we are about to be
in a world where growth and economic value will come very easily if we're
able to build these powerful AI models. What will not come easily is distribution
of benefits, distribution of wealth, political freedom.
These are the things that are going to be hard to achieve.
So when I think about policy, I think that the technology and the market will deliver all the
fundamental benefits, this is my fundamental belief, almost faster than we can take them.
These questions about distribution and political freedom and rights are the ones that will actually
matter and that policy should focus on. Speaking of distribution, as you were
mentioning, we have developing countries. In many cases, catch-up growth has been
weaker than we would have hoped for. But when catch-up growth does happen,
it's fundamentally because they have underutilized labor.
We can bring the capital and know-how from developed countries to these countries,
and then they can grow quite rapidly. Obviously, in a world where labor is no
longer the constraining factor, this mechanism no longer works.
So is the hope basically to rely on philanthropy from
the people or countries who immediately get wealthy from AI? What is the hope?
Philanthropy should obviously play some role, as it has in the past.
But I think growth is always better and stronger if we can make it endogenous.
What are the relevant industries in an AI-driven world?
I said we shouldn't build data centers in China, but there's no reason we shouldn't
build data centers in Africa. In fact, I think it'd be
great to build data centers in Africa. As long as they're not owned by China,
we should build data centers in Africa. I think that's a great thing to do.
There's no reason we can't build a pharmaceutical industry that's AI-driven.
If AI is accelerating drug discovery, then there will be a bunch of biotech startups.
Let's make sure some of those happen in the developing world.
Certainly, during the transition—we can talk about the point where humans have no
role—humans will still have some role in starting up these companies and supervising the AI models.
So let's make sure some of those humans are in the developing world
so that fast growth can happen there as well. You guys recently announced that Claude is going
to have a constitution that's aligned to a set of values, and not necessarily just to the end user.
There's a world I can imagine where if it is aligned to the end user,
it preserves the balance of power we have in the world today because everybody gets to have their
own AI that's advocating for them. The ratio of bad actors to
good actors stays constant. It seems to work out for our world today.
Why is it better not to do that, but to have a specific set of values that the
AI should carry forward? I'm not sure I'd quite
draw the distinction in that way. There may be two relevant distinctions here.
I think you're talking about a mix of the two. One is, should we give the model a set of
instructions about "do this" versus "don't do this"?
The other is, should we give the model a set of principles for how to act?
It's kind of purely a practical and empirical thing that we've observed.
By teaching the model principles, getting it to learn from principles,
its behavior is more consistent, it's easier to cover edge cases, and the model is more
likely to do what people want it to do. In other words, if you give it a list of
rules—"don't tell people how to hot-wire a car, don't speak in Korean"—it doesn't
really understand the rules, and it's hard to generalize from them.
It's just a list of dos and don'ts. Whereas if you give it principles—it
has some hard guardrails like "Don't make biological weapons" but—overall you're
trying to understand what it should be aiming to do, how it should be aiming to operate.
So just from a practical perspective, that turns out to be a more effective way to train the model.
That's the rules versus principles trade-off. Then there's another thing you're talking about,
which is the corrigibility versus intrinsic motivation trade-off.
How much should the model be a kind of "skin suit" where it just directly
follows the instructions given to it by whoever is giving those instructions,
versus how much should the model have an inherent set of values and go off and do things on its own?
There I would actually say everything about the model is closer to the direction that
it should mostly do what people want. It should mostly follow instructions.
We're not trying to build something that goes off and runs the world on its own.
We're actually pretty far on the corrigible side. Now, what we do say is there are certain
things that the model won't do. I think we say it in various ways in the
constitution, that under normal circumstances, if someone asks the model to do a task, it should do
that task. That should be the default. But if you've asked it to do something dangerous, or
to harm someone else, then the model is unwilling to do that.
So I actually think of it as a mostly corrigible model that has some limits,
but those limits are based on principles. Then the fundamental question is,
how are those principles determined? This is not a special question for Anthropic.
This would be a question for any AI company. But because you have been the ones to actually
write down the principles, I get to ask you this question.
Normally, a constitution is written down, set in stone, and there's a process of
updating it and changing it and so forth. In this case, it seems like a document
that people at Anthropic write, that can be changed at any time,
that guides the behavior of systems that are going to be the basis of a lot of economic activity.
How do you think about how those principles should be set?
I think there are maybe three sizes of loop here, three ways to iterate.
One is we iterate within Anthropic. We train the model, we're not happy with it,
and we change the constitution. I think that's good to do.
Putting out public updates to the constitution every once in a while
is good because people can comment on it. The second level of loop is different companies
having different constitutions. I think it’s useful. Anthropic puts out a constitution,
Gemini puts out a constitution, and other companies put out a constitution.
People can look at them and compare. Outside observers can critique and say,
"I like this thing from this constitution and this thing from that constitution."
That creates a soft incentive and feedback for all the companies to
take the best of each element and improve. Then I think there's a third loop, which is
society beyond the AI companies and beyond just those who comment without hard power.
There we've done some experiments. A couple years ago, we did an experiment with the Collective
Intelligence Project to basically poll people and ask them what should be in our AI constitution.
At the time, we incorporated some of those changes.
So you could imagine doing something like that with the new approach we've
taken to the constitution. It's a little harder because
it was an easier approach to take when the constitution was a list of dos and don'ts.
At the level of principles, it has to have a certain amount of coherence.
But you could still imagine getting views from a wide variety of people.
You could also imagine—and this is a crazy idea, but this whole
interview is about crazy ideas—systems of representative government having input.
I wouldn't do this today because the legislative process is so slow.
This is exactly why I think we should be careful about the legislative process and AI regulation.
But there's no reason you couldn't, in principle, say, "All AI models have to have a constitution
that starts with these things, and then you can append other things after it, but there has to
be this special section that takes precedence." I wouldn't do that. That's too rigid and sounds
overly prescriptive in a way that I think overly aggressive legislation is.
But that is a thing you could try to do. Is there some much less heavy-handed
version of that? Maybe. I really like control loop two.
Obviously, this is not how constitutions of actual governments do or should work.
There's not this vague sense in which the Supreme Court will feel out how people
are feeling—what are the vibes—and update the constitution accordingly.
With actual governments, there's a more formal, procedural process.
But you have a vision of competition between constitutions, which is actually very reminiscent
of how some libertarian charter-city people used to talk about what an archipelago of different
kinds of governments would look like. There would be selection among them of
who could operate the most effectively and where people would be the happiest.
In a sense, you're recreating that vision of a utopia of archipelagos.
I think that vision has things to recommend it and things that will go wrong with it.
It's an interesting, in some ways compelling, vision, but things will
go wrong that you hadn't imagined. So I like loop two as well,
but I feel like the whole thing has got to be some mix of loops one, two, and three,
and it's a matter of the proportions. I think that's gotta be the answer.
When somebody eventually writes the equivalent of The Making of the Atomic Bomb for this era,
what is the thing that will be hardest to glean from the historical record that
they're most likely to miss? I think a few things. One is,
at every moment of this exponential, the extent to which the world outside it didn't understand it.
This is a bias that's often present in history. Anything that actually happened looks
inevitable in retrospect. When people look back, it will
be hard for them to put themselves in the place of people who were actually making a bet on this
thing happening when it wasn't inevitable, that we had these arguments, like the arguments I make for
scaling or that continual learning will be solved. Some of us internally put a high probability
on this happening, but there's a world outside us that's not acting on that at all.
I think the weirdness of it, unfortunately the insularity of it...
If we're one year or two years away from it happening,
the average person on the street has no idea. That's one of the things I'm trying to change with
the memos, with talking to policymakers. I don’t know but I think
that's just a crazy thing. Finally, I would say—and this
probably applies to almost all historical moments of crisis—how absolutely fast it was happening,
how everything was happening all at once. Decisions that you might think were
carefully calculated, well actually you have to make that decision,
and then you have to make 30 other decisions on the same day because it's all happening so fast.
You don't even know which decisions are going to turn out to be consequential.
One of my worries, although it's also an insight into what's happening, is that some
very critical decision will be one where someone just comes into my office and
is like, "Dario, you have two minutes. Should we do thing A or thing B on this?"
Someone gives me this random half-page memo and asks, "Should we do A or B?" I'm like, "I
don't know. I have to eat lunch. Let's do B." That ends up being the most consequential thing ever.
So, final question. There aren't many tech CEOs who write 50-page memos every few months.
It seems like you have managed to build a role for yourself, and a company around
you, that is compatible with this more intellectual kind of CEO role.
I want to understand how you construct that. How does that work? Do you just go away for
a couple of weeks and then you tell your company, "This is the memo. Here's what
we're doing"? It's also reported that you write a bunch of these internally.
For this particular one, I wrote it over winter break.
I was having a hard time finding the time to actually write it.
But I think about this in a broader way. I think it relates to the culture of the company.
I probably spend a third, maybe 40%, of my time making sure the culture of Anthropic is good.
As Anthropic has gotten larger, it's gotten harder to get directly involved in the training
of the models, the launch of the models, the building of the products. It's 2,500
people. I have certain instincts, but it's very difficult to get involved in every single detail.
I try as much as possible, but one thing that's very leveraged is making sure Anthropic is a good
place to work, people like working there, everyone thinks of themselves as team members, and everyone
works together instead of against each other. As some of the other AI companies
have grown, without naming any names, we're starting to see decoherence and people fighting each other.
I would argue there was even a lot of that from the beginning, but it's gotten worse.
I think we've done an extraordinarily good job, even if not perfect, of holding the
company together, making everyone feel the mission, that we're sincere about the mission,
and that everyone has faith that everyone else there is working for the right reason.
That we're a team, that people aren't trying to get ahead at each other's expense or
backstab each other, which again, I think happens a lot at some of the other places.
How do you make that the case? It's a lot of things. It's me,
it's Daniela, who runs the company day to day, it's the co-founders,
it's the other people we hire, it's the environment we try to create.
But I think an important part of the culture is that the other leaders, and especially me,
have to articulate what the company is about, why it's doing what it's doing,
what its strategy is, what its values are, what its mission is, and what it stands for.
When you get to 2,500 people, you can't do that person by person.
You have to write, or you have to speak to the whole company.
This is why I get up in front of the whole company every two weeks and speak for an hour.
I wouldn't say I write essays internally. I do two things. One, I write this thing
called a DVQ, Dario Vision Quest. I wasn't the one who named it that.
That's the name it received, and it's one of these names that I tried to fight because it made it
sound like I was going off and smoking peyote or something. But the name just stuck. So I get up
in front of the company every two weeks. I have a three or four-page document,
and I just talk through three or four different topics about what's going on internally,
the models we're producing, the products, the outside industry, the world as a whole
as it relates to AI and to geopolitics in general. Just some mix of that. I go
through very honestly and I say, "This is what I'm thinking, and this is what Anthropic leadership
is thinking," and then I answer questions. That direct connection has a lot of value that
is hard to achieve when you're passing things down the chain six levels deep.
A large fraction of the company comes to attend, either in person or virtually.
It really means that you can communicate a lot. The other thing I do is I have a channel in
Slack where I just write a bunch of things and comment a lot.
Often that's in response to things I'm seeing at the company or questions people ask.
We do internal surveys, and when there are things people are concerned about, I'll write them up.
I'm just very honest about these things. I just say them very directly.
The point is to get a reputation for telling the company the truth about what's happening, to call
things what they are, to acknowledge problems, to avoid the sort of corpo speak, the kind of
defensive communication that often is necessary in public because the world is very large and full of
people who are interpreting things in bad faith. But if you have a company of people who you trust,
and we try to hire people that we trust, then you can really just be entirely unfiltered.
I think that's an enormous strength of the company.
It makes it a better place to work, it makes people more than the sum of their parts,
and increases the likelihood that we accomplish the mission because everyone is on the same page
about the mission, and everyone is debating and discussing how best to accomplish the mission.
Well, in lieu of an external Dario Vision Quest, we have this interview.
This interview is a little like that. This has been fun, Dario. Thanks for doing it.
Thank you, Dwarkesh.