Dario Amodei — “We are near the end of the exponential”

Transcript

0:00

We talked three years ago. In your view, what has  been the biggest update over the last three years? 

0:05

What has been the biggest difference  between what it felt like then versus now? 

0:10

Broadly speaking, the exponential of the  underlying technology has gone about as  

0:18

I expected it to go. There's plus or minus  

0:23

a year or two here and there. I don't know that I would've  

0:27

predicted the specific direction of code. But when I look at the exponential,  

0:34

it is roughly what I expected in terms of  the march of the models from smart high  

0:39

school student to smart college student to  beginning to do PhD and professional stuff,  

0:44

and in the case of code reaching beyond that. The frontier is a little bit uneven, but it's  

0:49

roughly what I expected. What has been the most surprising  

0:55

thing is the lack of public recognition of how  close we are to the end of the exponential. 

1:02

To me, it is absolutely wild that you have  people — within the bubble and outside the  

1:09

bubble — talking about the same tired, old  hot-button political issues, when we are  

1:19

near the end of the exponential. I want to understand what that  

1:24

exponential looks like right now. The first question I asked you when  

1:27

we recorded three years ago was, "what’s  up with scaling and why does it work?" 

1:31

I have a similar question now,  but it feels more complicated. 

1:35

At least from the public's point of view, three  years ago there were well-known public trends  

1:41

across many orders of magnitude of compute  where you could see how the loss improves. 

1:45

Now we have RL scaling and there's  no publicly known scaling law for it. 

1:49

It's not even clear what the story is. Is this supposed to be teaching the model skills? 

1:54

Is it supposed to be teaching meta-learning? What is the scaling hypothesis at this point? 

1:59

I actually have the same hypothesis  I had even all the way back in 2017. 

2:06

I think I talked about it last time, but I wrote  a doc called "The Big Blob of Compute Hypothesis". 

2:12

It wasn't about the scaling of  language models in particular. 

2:15

When I wrote it, GPT-1 had just come out. That was one among many things. 

2:22

Back in those days there was robotics. People tried to work on reasoning as  

2:26

a separate thing from language models,  and there was scaling of the kind of RL  

2:30

that happened in AlphaGo and in Dota at OpenAI. People remember StarCraft at DeepMind, AlphaStar. 

2:43

It was written as a more general document. Rich Sutton put out "The Bitter  

2:52

Lesson" a couple years later. The hypothesis is basically the same. 

2:57

What it says is that all the cleverness, all the  techniques, all the "we need a new method to do  

3:04

something", that doesn't matter very much. There are only a few things that matter. 

3:08

I think I listed seven of them. One is how much raw compute you have. 

3:13

The second is the quantity of data. The third is the quality and distribution of data. 

3:20

It needs to be a broad distribution. The fourth is how long you train for. 

3:27

The fifth is that you need an objective  function that can scale to the moon. 

3:32

The pre-training objective function  is one such objective function. 

3:36

Another is the RL objective  function that says you have a goal,  

3:42

you're going to go out and reach the goal. Within that, there's objective rewards like  

3:48

you see in math and coding, and there's  more subjective rewards like you see in  

3:52

RLHF or higher-order versions of that. Then the sixth and seventh were things  

3:59

around normalization or conditioning,  just getting the numerical stability  

4:04

so that the big blob of compute flows in this  laminar way instead of running into problems. 

4:11

That was the hypothesis, and  it's a hypothesis I still hold. 

4:15

I don't think I've seen very  much that is not in line with it. 

4:21

The pre-training scaling laws were one example  of what we see there. Those have continued going.  

4:31

Now it's been widely reported,  we feel good about pre-training. 

4:35

It’s continuing to give us gains. What has changed is that now we're  

4:41

also seeing the same thing for RL. We're seeing a pre-training phase  

4:46

and then an RL phase on top of that. With RL, it’s actually just the same. 

4:55

Even other companies have published things in  some of their releases that say, "We train the  

5:05

model on math contests — AIME or other things  — and how well the model does is log-linear in  

5:14

how long we've trained it." We see that as well,  

5:17

and it's not just math contests. It's a wide variety of RL tasks. 

5:21

We're seeing the same scaling in  RL that we saw for pre-training. 
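
As a rough illustration of the log-linear relationship described above (benchmark score improving roughly linearly in the log of RL training compute), here is a minimal sketch with made-up numbers; the compute values, scores, and fitted constants are illustrative, not figures from this conversation.

```python
import numpy as np

# Illustrative log-linear scaling fit: score ~ a + b * log10(RL training compute).
# All compute values and scores below are made up for illustration only.
compute = np.array([1e20, 1e21, 1e22, 1e23, 1e24])   # hypothetical RL training FLOPs
score = np.array([22.0, 31.5, 40.8, 50.1, 59.7])     # hypothetical contest accuracy, %

b, a = np.polyfit(np.log10(compute), score, deg=1)   # fit score = a + b * log10(C)
print(f"fit: score ~ {a:.1f} + {b:.1f} * log10(compute)")

# Under a fit like this, each 10x of RL compute buys a roughly constant ~b points.
print(f"extrapolated score at 1e25 FLOPs: {a + b * 25:.1f}%")
```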

5:27

You mentioned Rich Sutton and "The Bitter Lesson". I interviewed him last year,  

5:31

and he's actually very non-LLM-pilled. I don’t know if this is his perspective,  

5:38

but one way to paraphrase his objection is:  Something which possesses the true core of human  

5:44

learning would not require all these billions  of dollars of data and compute and these bespoke  

5:51

environments, to learn how to use Excel, how to  use PowerPoint, how to navigate a web browser. 

5:57

The fact that we have to build in these skills  using these RL environments hints that we are  

6:04

actually lacking a core human learning algorithm. So we're scaling the wrong thing. That does raise  

6:13

the question. Why are we doing all this RL scaling  if we think there's something that's going to be  

6:16

human-like in its ability to learn on the fly? I think this puts together several things that  

6:23

should be thought of differently. There is a genuine puzzle here,  

6:29

but it may not matter. In fact, I would guess it probably  

6:33

doesn't matter. There is an interesting thing. Let  me take the RL out of it for a second, because I  

6:37

actually think it's a red herring to say that RL  is any different from pre-training in this matter. 

6:43

If we look at pre-training  scaling, it was very interesting  

6:47

back in 2017 when Alec Radford was doing GPT-1. The models before GPT-1 were trained on datasets  

6:59

that didn't represent a wide distribution of text. You had very standard language  

7:08

modeling benchmarks. GPT-1 itself was trained on  

7:11

a bunch of fanfiction, I think actually. It was literary text, which is a very  

7:17

small fraction of the text you can get. In those days it was like a billion words  

7:23

or something, so small datasets representing  a pretty narrow distribution of what you can  

7:32

see in the world. It didn't generalize well.  If you did better on some fanfiction corpus,  

7:43

it wouldn't generalize that well to other  tasks. We had all these measures. We had  

7:47

all these measures of how well it did at  predicting all these other kinds of texts. 

7:55

It was only when you trained over all the tasks  on the internet — when you did a general internet  

8:01

scrape from something like Common Crawl or  scraping links in Reddit, which is what we did for  

8:06

GPT-2 — that you started to get generalization. I think we're seeing the same thing on RL. 

8:15

We're starting first with simple RL tasks like  training on math competitions, then moving to  

8:24

broader training that involves things like code. Now we're moving to many other tasks. 

8:31

I think then we're going to  increasingly get generalization. 

8:35

So that kind of takes out the  RL vs. pre-training side of it. 

8:39

But there is a puzzle either way, which is that  in pre-training we use trillions of tokens. 

8:50

Humans don't see trillions of words. So there is an actual sample  

8:54

efficiency difference here. There is actually something different here. 

8:59

The models start from scratch  and they need much more training. 

9:06

But we also see that once they're trained,  if we give them a long context length of  

9:15

a million — the only thing blocking long  context is inference — they're very good at  

9:17

learning and adapting within that context. So I don’t know the full answer to this. 

9:24

I think there's something going  on where pre-training is not like  

9:28

the process of humans learning, but it's  somewhere between the process of humans  

9:32

learning and the process of human evolution. We get many of our priors from evolution. 

9:38

Our brain isn't just a blank slate. Whole books have been written about this. 

9:43

The language models are  much more like blank slates. 

9:45

They literally start as random weights, whereas  the human brain starts with all these regions  

9:50

connected to all these inputs and outputs. Maybe we should think of pre-training — and  

9:56

for that matter, RL as well — as something  that exists in the middle space between  

10:02

human evolution and human on-the-spot learning. And we should think of the in-context learning  

10:10

that the models do as something between long-term  human learning and short-term human learning.  

10:17

So there's this hierarchy. There’s evolution,  there's long-term learning, there's short-term  

10:22

learning, and there's just human reaction. The LLM phases exist along this spectrum,  

10:28

but not necessarily at exactly the same points. There’s no analog to some of the human modes  

10:34

of learning; the LLMs are falling in between the points. Does that make sense? 

10:40

Yes, although some things  are still a bit confusing. 

10:42

For example, if the analogy is that this  is like evolution so it's fine that it's  

10:45

not sample efficient, then if we're going to get a super sample-efficient 

10:51

agent from in-context learning, why are we  bothering to build all these RL environments? 

10:56

There are companies whose work seems to  be teaching models how to use this API,  

11:00

how to use Slack, how to use whatever. It's confusing to me why there's so much emphasis  

11:04

on that if the kind of agent that can just learn  on the fly is emerging or has already emerged. 

11:11

I can't speak for the emphasis of anyone else. I can only talk about how we think about it. 

11:20

The goal is not to teach the model  every possible skill within RL,  

11:25

just as we don't do that within pre-training. Within pre-training, we're not trying to expose  

11:29

the model to every possible way  that words could be put together. 

11:37

Rather, the model trains on a lot of things and  then reaches generalization across pre-training. 

11:43

That was the transition from GPT-1 to GPT-2 that  I saw up close. The model reaches a point. I had  

11:53

these moments where I was like, "Oh yeah, you  just give the model a list of numbers — this is  

12:01

the cost of the house, this is the square feet of  the house — and the model completes the pattern  

12:05

and does linear regression." Not great, but it does it,  

12:08

and it's never seen that exact thing before. So to the extent that we are building these  

12:16

RL environments, the goal is very similar to what  was done five or ten years ago with pre-training. 

12:26

We're trying to get a whole bunch of data, not  because we want to cover a specific document or a  

12:32

specific skill, but because we want to generalize. I think the framework you're laying down obviously  

12:39

makes sense. We're making progress toward AGI.  Nobody at this point disagrees we're going to  

12:46

achieve AGI this century. The crux is you say we're  

12:49

hitting the end of the exponential. Somebody else looks at this and says,  

12:55

"We've been making progress since 2012,  and by 2035 we'll have a human-like agent." 

13:04

Obviously we’re seeing in these models  the kinds of things that evolution did,  

13:07

or that learning within a human lifetime does. I want to understand what you’re seeing  

13:11

that makes you think it's one  year away and not ten years away. 

13:17

There are two claims you could make  here, one stronger and one weaker. 

13:26

Starting with the weaker claim, when  I first saw the scaling back in 2019,  

13:35

I wasn’t sure. This was a 50/50 thing. I  thought I saw something. My claim was that this  

13:43

was much more likely than anyone thinks. Maybe there's a 50% chance this happens. 

13:51

On the basic hypothesis of, as you put it, within  ten years we'll get to what I call a "country of  

14:00

geniuses in a data center", I'm at 90% on that. It's hard to go much higher than 90%  

14:06

because the world is so unpredictable. Maybe the irreducible uncertainty puts us at 95%,  

14:13

where you get to things like multiple companies  having internal turmoil, Taiwan gets invaded,  

14:24

all the fabs get blown up by missiles. Now you've jinxed us, Dario. 

14:30

You could construct a 5% world where  things get delayed for ten years. 

14:43

There's another 5% which is that I'm very  confident on tasks that can be verified. 

14:50

With coding, except for that  irreducible uncertainty,  

14:54

I think we'll be there in one or two years. There's no way we will not be there in ten years  

14:58

in terms of being able to do end-to-end coding. My one little bit of fundamental uncertainty,  

15:05

even on long timescales, is about tasks that  aren't verifiable: planning a mission to Mars;  

15:14

doing some fundamental scientific  discovery like CRISPR; writing a novel. 

15:21

It’s hard to verify those tasks. I am almost certain we have a  

15:27

reliable path to get there, but if there's  a little bit of uncertainty it's there. 

15:34

On the ten-year timeline I'm at 90%,  which is about as certain as you can be. 

15:40

I think it's crazy to say that  this won't happen by 2035. 

15:46

In some sane world, it would  be outside the mainstream. 

15:48

But the emphasis on verification hints to me at a lack of belief that these models generalize. 

15:58

If you think about humans, we're both good  at things for which we get verifiable reward  

16:03

and things for which we don't. No, this is why I’m almost sure. 

16:07

We already see substantial generalization  from things that verify to things that  

16:12

don't. We're already seeing that. But it seems like you were emphasizing  

16:15

this as a spectrum that will split apart the domains in which we see more progress. 

16:21

That doesn't seem like how humans get better. The world in which we don't get there is the world  

16:27

in which we do all the verifiable things. Many of them generalize,  

16:34

but we don't fully get there. We don’t fully color in the other side  

16:40

of the box. It's not a binary thing. Even if generalization is weak and you can only do  

16:47

verifiable domains, it's not clear to me you could  automate software engineering in such a world. 

16:49

You are "a software engineer" in some sense, but  part of being a software engineer for you involves  

16:58

writing long memos about your grand vision. I don’t think that’s part of the job of SWE. 

17:03

That's part of the job of the  company, not SWE specifically. 

17:04

But SWE does involve design  documents and other things like that. 

17:10

The models are already pretty  good at writing comments. 

17:14

Again, I’m making much weaker claims here than  I believe, to distinguish between two things. 

17:24

We're already almost there  for software engineering. 

17:28

By what metric? There's one metric which is  how many lines of code are written by AI. 

17:32

If you consider other productivity improvements  in the history of software engineering,  

17:36

compilers write all the lines of software. There's a difference between how many lines  

17:40

are written and how big the productivity  improvement is. "We’re almost there" meaning…  

17:47

How big is the productivity improvement,  not just how many lines are written by AI? 

17:52

I actually agree with you on this. I've made a series of predictions on  

17:57

code and software engineering. I think people have repeatedly misunderstood them. 

18:03

Let me lay out the spectrum. About eight or nine months ago,  

18:09

I said the AI model will be writing 90% of  the lines of code in three to six months. 

18:16

That happened, at least at some places. It happened at Anthropic, happened with  

18:21

many people downstream using our models. But that's actually a very weak criterion. 

18:27

People thought I was saying that we won't need 90%  of the software engineers. Those things are worlds  

18:32

apart. The spectrum is: 90% of code is written by  the model, 100% of code is written by the model. 

18:41

That's a big difference in productivity. 90% of the end-to-end SWE tasks — including  

18:47

things like compiling, setting up clusters  and environments, testing features,  

18:54

writing memos — are done by the models. 100%  of today's SWE tasks are done by the models. 

19:02

Even when that happens, it doesn't mean  software engineers are out of a job. 

19:06

There are new higher-level things  they can do, where they can manage. 

19:10

Then further down the spectrum, there's  90% less demand for SWEs, which I think  

19:15

will happen but this is a spectrum. I wrote about it in "The Adolescence  

19:21

of Technology" where I went through  this kind of spectrum with farming. 

19:26

I actually totally agree with you on that. 

19:29

These are very different  benchmarks from each other,  

19:32

but we're proceeding through them super fast. Part of your vision is that going from 90 to 100  

19:38

is going to happen fast, and that it  leads to huge productivity improvements. 

19:45

But what I notice is that even in greenfield  projects people start with Claude Code or  

19:49

something, people report starting a lot of  projects… Do we see in the world out there  

19:54

a renaissance of software, all these new  features that wouldn't exist otherwise? 

19:58

At least so far, it doesn't seem like we see that. So that does make me wonder. 

20:02

Even if I never had to intervene with  Claude Code, the world is complicated.  

20:09

Jobs are complicated. Closing the loop on  self-contained systems, whether it’s just  

20:14

writing software or something, how much  broader gains would we see just from that? 

20:20

Maybe that should dilute our estimation  of the "country of geniuses". 

20:24

I simultaneously agree with you that it's a  reason why these things don't happen instantly,  

20:35

but at the same time, I think  the effect is gonna be very fast. 

20:41

You could have these two poles. One is that AI is not going to make  

20:47

progress. It's slow. It's going to take  forever to diffuse within the economy. 

20:52

Economic diffusion has become one of  these buzzwords that's a reason why  

20:56

we're not going to make AI progress,  or why AI progress doesn't matter. 

21:00

The other axis is that we'll get recursive  self-improvement, the whole thing. 

21:05

Can't you just draw an  exponential line on the curve? 

21:08

We're going to have Dyson spheres around the sun so many nanoseconds after we get recursive self-improvement. 

21:17

I'm completely caricaturing the view  here, but there are these two extremes. 

21:23

But what we've seen from the beginning, at least  if you look within Anthropic, there's this bizarre  

21:30

10x per year growth in revenue that we've seen. So in 2023, it was zero to $100 million. 

21:38

In 2024, it was $100 million to $1 billion. In 2025, it was $1 billion to $9-10 billion. 

21:46

You guys should have just bought a billion  dollars of your own products so you could just… 

21:50

And the first month of this  year, that exponential is... 

21:54

You would think it would slow down, but we  added another few billion to revenue in January. 

22:05

Obviously that curve can't go on forever. The GDP is only so large. 

22:10

I would even guess that it bends somewhat this  year, but that is a fast curve. That's a really  

22:20

fast curve. I would bet it stays pretty fast  even as the scale goes to the entire economy. 

22:25

So I think we should be thinking about this middle  world where things are extremely fast, but not  

22:34

instant, where they take time because of economic  diffusion, because of the need to close the loop. 

22:39

Because it's fiddly: "I have to do change  management within my enterprise… I set this up,  

22:50

but I have to change the security permissions  on this in order to make it actually work…  

22:55

I had this old piece of software that  checks the model before it's compiled  

23:01

and released and I have to rewrite it. Yes, the model can do that, but I have  

23:05

to tell the model to do that. It has to take time to do that." 

23:10

So I think everything we've seen so far is  compatible with the idea that there's one fast  

23:17

exponential that's the capability of the model. Then there's another fast exponential  

23:22

that's downstream of that, which is the  diffusion of the model into the economy. 

23:26

Not instant, not slow, much faster than any  previous technology, but it has its limits. 

23:37

When I look inside Anthropic, when I look at our  customers: fast adoption, but not infinitely fast. 

23:44

Can I try a hot take on you? Yeah. 

23:45

I feel like diffusion is a cope. When the model isn't able to do something, 

23:51

they're like, "oh, but it's a diffusion issue." But then you should use the comparison to humans. 

23:56

You would think that the inherent advantages  that AIs have would make diffusion a much easier  

24:01

problem for new AIs getting onboarded  than new humans getting onboarded. 

24:06

An AI can read your entire  Slack and your drive in minutes. 

24:08

They can share all the knowledge that the  other copies of the same instance have. 

24:12

You don't have this adverse selection  problem when you're hiring AI, so you  

24:14

can just hire copies of a vetted AI model. Hiring a human is so much more of a hassle. 

24:20

People hire humans all the time. We pay humans upwards of $50 trillion  

24:23

in wages because they're useful, even though in  principle it would be much easier to integrate  

24:29

AIs into the economy than it is to hire humans. Diffusion doesn't really explain that. 

24:34

I think diffusion is very real  and doesn't exclusively have  

24:41

to do with limitations on the AI models. Again, there are people who use diffusion  

24:49

as kind of a buzzword to say this isn't a  big deal. I'm not talking about that. I'm  

24:54

not talking about how AI will diffuse  at the speed of previous technologies. 

24:58

I think AI will diffuse much faster than previous  technologies have, but not infinitely fast. 

25:04

I'll just give an example of this. There's Claude  Code. Claude Code is extremely easy to set up. 

25:10

If you're a developer, you can  just start using Claude Code. 

25:14

There is no reason why a developer at a  large enterprise should not be adopting  

25:19

Claude Code as quickly as an individual  developer or developer at a startup. 

25:25

We do everything we can to promote it. We sell Claude Code to enterprises. 

25:31

Big enterprises, big financial companies, big  pharmaceutical companies, all of them are adopting  

25:38

Claude Code much faster than enterprises  typically adopt new technology. But again,  

25:46

it takes time. Any given feature or any given  product, like Claude Code or Cowork, will get  

25:54

adopted by the individual developers who are on  Twitter all the time, by the Series A startups,  

26:02

many months faster than they will get adopted  by a large enterprise that does food sales. 

26:11

There are just a number of factors. You have to go through legal,  

26:14

you have to provision it for everyone. It has to pass security and compliance. 

26:20

The leaders of the company who are further away  from the AI revolution are forward-looking,  

26:26

but they have to say, "Oh, it makes  sense for us to spend 50 million. 

26:31

This is what this Claude Code thing is. This is why it helps our company. 

26:35

This is why it makes us more productive." Then they have to explain  

26:37

to the people two levels below. They have to say, "Okay, we have 3,000 developers. 

26:42

Here's how we're going to roll  it out to our developers." 

26:45

We have conversations like this every day. We are doing everything we can to make  

26:50

Anthropic's revenue grow 20 or  30x a year instead of 10x a year. 

26:57

Again, many enterprises are just  saying, "This is so productive. 

27:02

We're going to take shortcuts in  our usual procurement process." 

27:05

They're moving much faster than  when we tried to sell them just  

27:08

the ordinary API, which many of them use. Claude Code is a more compelling product,  

27:13

but it's not an infinitely compelling product. I don't think even AGI or powerful AI or  

27:19

"country of geniuses in a data center"  will be an infinitely compelling product. 

27:22

It will be a product compelling enough to maybe get 3-5x, or 10x, a year of growth, even when 

27:28

you're in the hundreds of billions of dollars,  which is extremely hard to do and has never been  

27:32

done in history before, but not infinitely fast. I buy that there would be a slight slowdown. 

27:36

Maybe this is not your claim, but  sometimes people talk about this like,  

27:39

"Oh, the capabilities are there, but because of  diffusion... otherwise we're basically at AGI". 

27:46

I don't believe we're basically at AGI. I think if you had the "country  

27:49

of geniuses in a data center"... If we had the "country of geniuses  

27:53

in a data center", we would know it. We would know it if you had the  

27:57

"country of geniuses in a data center". Everyone in this room would know it. 

28:01

Everyone in Washington would know it. People in rural parts might not know it,  

28:07

but we would know it. We don't  have that now. That is very clear. 

29:42

Coming back to concrete prediction… Because there  are so many different things to disambiguate,  

29:47

it can be easy to talk past each other  when we're talking about capabilities. 

29:50

For example, when I interviewed you three  years ago, I asked you a prediction about what  

29:54

we should expect three years from now. You were  right. You said, "We should expect systems which,  

30:00

if you talk to them for the course of an  hour, it's hard to tell them apart from  

30:04

a generally well-educated human." I think you were right about that. 

30:07

I think spiritually I feel unsatisfied because my  internal expectation was that such a system could  

30:13

automate large parts of white-collar work. So it might be more productive to talk about  

30:17

the actual end capabilities  you want from such a system. 

30:21

I will basically tell you where I think we are. Let me ask a very specific question so that  

30:28

we can figure out exactly what kinds of  capabilities we should think about soon. 

30:32

Maybe I'll ask about it in the context of a job  I understand well, not because it's the most  

30:36

relevant job, but just because I can evaluate  the claims about it. Take video editors. I have  

30:42

video editors. Part of their job involves  learning about our audience's preferences,  

30:47

learning about my preferences and tastes,  and the different trade-offs we have. 

30:50

They’re, over the course of many months,  building up this understanding of context. 

30:55

The skill and ability they have six months into the job: when should we expect 

30:58

an AI system that can pick up that skill on the job, on the fly? 

31:04

I guess what you're talking about is that  we're doing this interview for three hours. 

31:09

Someone's going to come in,  someone's going to edit it. 

31:11

They're going to be like, "Oh, I don't know, Dario  scratched his head and we could edit that out." 

31:19

"Magnify that." "There was this long  

31:22

discussion that is less interesting to people. There's another thing that's more interesting  

31:27

to people, so let's make this edit." I think the "country of geniuses in  

31:33

a data center" will be able to do that. The way it will be able to do that is it will  

31:38

have general control of a computer screen. You'll be able to feed this in. 

31:43

It'll be able to also use the computer screen  to go on the web, look at all your previous  

31:49

interviews, look at what people are saying  on Twitter in response to your interviews,  

31:54

talk to you, ask you questions, talk to  your staff, look at the history of edits  

31:59

that you did, and from that, do the job. I think that's dependent on several things. 

32:06

I think this is one of the things  that's actually blocking deployment:  

32:10

getting to the point on computer use where the  models are really masters at using the computer. 

32:16

We've seen this climb in benchmarks, and  benchmarks are always imperfect measures. 

32:20

But I think when we first released computer use a  year and a quarter ago, OSWorld was at maybe 15%. 

32:33

I don't remember exactly, but  we've climbed from that to 65-70%. 

32:40

There may be harder measures as well, but I think  computer use has to pass a point of reliability. 

32:46

Can I just follow up on that before  you move on to the next point? 

32:50

For years, I've been trying to build  different internal LLM tools for myself. 

32:54

Often I have these text-in, text-out  tasks, which should be dead center  

32:59

in the repertoire of these models. Yet I still hire humans to do them. 

33:03

If it's something like, "identify what the  best clips would be in this transcript",  

33:07

maybe the LLMs do a seven-out-of-ten job on them. But there's not this ongoing way I can engage  

33:12

with them to help them get better at the  job the way I could with a human employee. 

33:16

That missing ability, even if you  solve computer use, would still block  

33:20

my ability to offload an actual job to them. This gets back to what we were talking about  

33:28

before with learning on the job. It's very  interesting. I think with the coding agents,  

33:34

I don't think people would say that learning on  the job is what is preventing the coding agents  

33:39

from doing everything end to end. They  keep getting better. We have engineers  

33:46

at Anthropic who don't write any code. When I look at the productivity, to your  

33:51

previous question, we have folks who say, "This  GPU kernel, this chip, I used to write it myself. 

33:58

I just have Claude do it." There's this enormous improvement in productivity. 

34:04

When I see Claude Code, familiarity with  the codebase or a feeling that the model  

34:13

hasn't worked at the company for a year, that's  not high up on the list of complaints I see. 

34:18

I think what I'm saying is that we're  kind of taking a different path. 

34:22

Don't you think with coding that's because there  

34:24

is an external scaffold of memory which  exists instantiated in the codebase? 

34:28

I don't know how many other jobs have that. Coding made fast progress precisely because  

34:33

it has this unique advantage that  other economic activity doesn't. 

34:37

But when you say that, what you're implying is  that by reading the codebase into the context,  

34:44

I have everything that the human  needed to learn on the job. 

34:48

So that would be an example of—whether it's  written or not, whether it's available or  

34:54

not—a case where everything you needed  to know you got from the context window. 

35:00

What we think of as learning—"I started this job,  it's going to take me six months to understand the  

35:05

code base"—the model just did it in the context. I honestly don't know how to think about  

35:09

this because there are people who  qualitatively report what you're saying. 

35:16

I'm sure you saw last year, there was a major  study where they had experienced developers try  

35:21

to close pull requests in repositories that they  were familiar with. Those developers reported an  

35:28

uplift. They reported that they felt more  productive with the use of these models. 

35:31

But in fact, if you look at their output  and how much was actually merged back in,  

35:35

there was a 20% downlift. They were less productive  

35:37

as a result of using these models. So I'm trying to square the qualitative  

35:40

feeling that people feel with these  models versus, 1) in a macro level,  

35:44

where is this renaissance of software? And then 2) when people do these independent  

35:48

evaluations, why are we not seeing the  productivity benefits we would expect? 

35:53

Within Anthropic, this is just really unambiguous. We're under an incredible amount of commercial  

35:59

pressure and make it even harder for ourselves because we have all this safety stuff that 

36:03

I think we do more of than other companies. The pressure to survive economically 

36:11

while also keeping our values is just incredible. We're trying to keep this 10x revenue curve going. 

36:18

There is zero time for bullshit. There is zero time for feeling  

36:23

like we're productive when we're not. These tools make us a lot more productive. 

36:30

Why do you think we're concerned  about competitors using the tools? 

36:34

Because we think we're ahead of the competitors. We wouldn't be going through all this trouble if  

36:43

this were secretly reducing our productivity. We see the end productivity every few  

36:49

months in the form of model launches. There's no kidding yourself about this. 

36:54

The models make you more productive. 1) People feeling like they're productive is  

37:00

qualitatively predicted by studies like this. But 2) if I just look at the end output,  

37:04

obviously you guys are making fast progress. But the idea was supposed to be that with  

37:10

recursive self-improvement, you make  a better AI, the AI helps you build a  

37:14

better next AI, et cetera, et cetera. What I see instead—if I look at you,  

37:18

OpenAI, DeepMind—is that people are just  shifting around the podium every few months. 

37:22

Maybe you think that stops  because you've won or whatever. 

37:25

But why are we not seeing the person with  the best coding model have this lasting  

37:31

advantage, if in fact there are these enormous productivity gains from the last coding model? 

37:38

I think my model of the situation is that  there's an advantage that's gradually growing. 

37:45

I would say right now the coding  models give maybe, I don't know,  

37:51

a 15-20% total factor speed up. That's  my view. Six months ago, it was maybe 5%. 

38:01

So it didn't matter. 5% doesn't register. It's now just getting to the point where it's  

38:06

one of several factors that kind of matters. That's going to keep speeding up. 

38:12

I think six months ago, there were several  companies that were at roughly the same  

38:18

point because this wasn't a notable factor, but  I think it's starting to speed up more and more. 

38:25

I would also say there are multiple companies that build models that are used for code, and we're not 

38:32

perfectly good at preventing some of these other  companies from using our models internally. 

38:41

So I think everything we're seeing is  consistent with this kind of snowball model. 

38:52

Again, my theme in all of this is all of this  is soft takeoff, soft, smooth exponentials,  

39:00

although the exponentials are relatively steep. So we're seeing this snowball gather momentum  

39:05

where it's like 10%, 20%, 25%, 40%. As you go, per Amdahl's law, you have 

39:13

to get all the things that are preventing  you from closing the loop out of the way. 
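
Since Amdahl's law is invoked here, a quick sketch of what it implies for closing the loop: the fraction of the workflow the models do not yet cover bounds the overall speedup, no matter how fast the covered part gets. The fractions and speedup factors below are illustrative assumptions, not internal numbers.

```python
# Amdahl's law: if a fraction p of the workflow is sped up by a factor s,
# the end-to-end speedup is 1 / ((1 - p) + p / s). All values are illustrative.
def amdahl_speedup(p: float, s: float) -> float:
    return 1.0 / ((1.0 - p) + p / s)

for p in (0.2, 0.5, 0.9):          # assumed fraction of the loop the models accelerate
    ceiling = 1.0 / (1.0 - p)      # limit as s goes to infinity
    print(f"p = {p:.0%}: with s = 5x, overall {amdahl_speedup(p, 5):.2f}x "
          f"(ceiling {ceiling:.1f}x no matter how fast the models get)")
```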

39:17

But this is one of the biggest  priorities within Anthropic. 

39:22

Stepping back, before in the stack we were talking  about when do we get this on-the-job learning? 

39:29

It seems like the point you were making  on the coding thing is that we actually  

39:32

don't need on-the-job learning. You can have tremendous productivity  

39:36

improvements, you can have potentially trillions  of dollars of revenue for AI companies, without  

39:40

this basic human ability to learn on the job. Maybe that's not your claim, you should clarify. 

39:47

But in most domains of economic activity, people  say, "I hired somebody, they weren't that useful  

39:53

for the first few months, and then over time  they built up the context, understanding." 

39:58

It's actually hard to define  what we're talking about here. 

40:00

But they got something and now they're a powerhouse and they're so valuable to us. 

40:05

If AI doesn't develop this ability to learn on the  fly, I'm a bit skeptical that we're going to see  

40:12

huge changes to the world without that ability. I think two things here. There's the state  

40:17

of the technology right now. Again, we have these two stages. 

40:22

We have the pre-training and RL stage where  you throw a bunch of data and tasks into  

40:27

the models and then they generalize. So it's like learning, but it's like  

40:31

learning from more data and not learning  over one human or one model's lifetime. 

40:38

So again, this is situated between  evolution and human learning. 

40:42

But once you learn all  those skills, you have them. 

40:45

Just like with pre-training, just how the models  know more, if I look at a pre-trained model,  

40:52

it knows more about the history  of samurai in Japan than I do. 

40:55

It knows more about baseball than I do. It knows more about low-pass filters  

41:03

and electronics, all of these things. Its knowledge is way broader than mine. 

41:08

So I think even just that may get us to the  point where the models are better at everything. 

41:18

We also have, again, just with scaling the kind  of existing setup, the in-context learning. 

41:24

I would describe it as kind of  like human on-the-job learning,  

41:27

but a little weaker and a little short term. You look at in-context learning and if you give  

41:33

the model a bunch of examples it does get it. There's real learning that happens in context. 

41:38

A million tokens is a lot. That can be days of human learning. 

41:42

If you think about the model reading  a million words, how long would it  

41:50

take me to read a million? Days or weeks  at least. So you have these two things. 
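
A quick back-of-the-envelope check on the "days or weeks" figure, assuming an ordinary adult reading speed of roughly 200 to 300 words per minute (an assumption, not a number from the conversation):

```python
# Time for a human to read a million words at typical reading speeds.
words = 1_000_000
for wpm in (200, 250, 300):        # assumed words-per-minute reading speeds
    hours = words / wpm / 60
    print(f"{wpm} wpm -> {hours:.0f} hours, about {hours / 8:.0f} eight-hour days")
# Roughly 56-83 hours, i.e. one to two weeks of full-time reading.
```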

41:57

I think these two things within the existing  paradigm may just be enough to get you the  

42:01

"country of geniuses in a data center". I don't know for sure, but I think  

42:04

they're going to get you a large fraction of it. There may be gaps, but I certainly think that just  

42:10

as things are, this is enough to generate  trillions of dollars of revenue. That's one. Two,  

42:17

is this idea of continual learning, this  idea of a single model learning on the job. 

42:24

I think we're working on that too. There's a good chance that in the next  

42:29

year or two, we also solve that. Again, I think you get most  

42:36

of the way there without it. The trillions of dollars a year market,  

42:45

maybe all of the national security implications  and the safety implications that I wrote about in  

42:49

"Adolescence of Technology" can happen without it. But we, and I imagine others, are working on it. 

42:57

There's a good chance that we will  get there within the next year or two. 

43:03

There are a bunch of ideas. I won't go into all of them in detail, but  

43:07

one is just to make the context longer. There's nothing preventing  

43:10

longer contexts from working. You just have to train at longer contexts  

43:14

and then learn to serve them at inference. Both of those are engineering problems that  

43:18

we are working on and I would assume  others are working on them as well. 

43:22

This context length increase, it seemed  like there was a period from 2020 to 2023  

43:26

where from GPT-3 to GPT-4 Turbo, there was an increase from a 2,000-token context length to 128K. 

43:31

I feel like for the two-ish years since  then, we've been in the same-ish ballpark. 

43:37

When context lengths get much longer  than that, people report qualitative  

43:41

degradation in the ability of the  model to consider that full context. 

43:47

So I'm curious what you're internally seeing that makes you think, "10-million-token contexts, 

43:50

100-million-token contexts, to get six months of human learning and building context". 

43:54

This isn't a research problem. This is  an engineering and inference problem. 

43:58

If you want to serve long context, you  have to store your entire KV cache. 

44:06

It's difficult to store all the memory  in the GPUs, to juggle the memory around. 
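
To make the KV-cache point concrete, here is a back-of-the-envelope estimate of the memory a single long-context request needs at serving time. The layer count, head count, and head dimension below are generic transformer numbers chosen for illustration, not the dimensions of any Claude model.

```python
# KV cache per request: 2 (keys and values) * layers * kv_heads * head_dim
# * sequence length * bytes per element. All model dimensions are illustrative.
layers, kv_heads, head_dim = 80, 8, 128   # hypothetical shape (grouped-query attention)
bytes_per_elem = 2                        # bf16/fp16

def kv_cache_gb(seq_len: int) -> float:
    return 2 * layers * kv_heads * head_dim * seq_len * bytes_per_elem / 1e9

for seq_len in (128_000, 1_000_000, 10_000_000):
    print(f"{seq_len:>11,} tokens -> {kv_cache_gb(seq_len):7.1f} GB of KV cache per request")
```

Even with the relatively small KV-head count assumed here, a ten-million-token context runs to terabytes of cache per request, which is the memory-juggling problem being described.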

44:11

I don't even know the details. At this point, this is at a level of detail  

44:15

that I'm no longer able to follow, although I  knew it in the GPT-3 era. "These are the weights,  

44:21

these are the activations you have to store…" But these days the whole thing is flipped  

44:26

because we have MoE models and all of that. Regarding this degradation you're talking about,  

44:34

without getting too specific, there's two things. There's the context length you train at and  

44:41

there's a context length that you serve at. If you train at a small context length  

44:45

and then try to serve at a long context  length, maybe you get these degradations. 

44:49

It's better than nothing, you might still  offer it, but you get these degradations. 

44:52

Maybe it's harder to train  at a long context length. 

44:56

I want to, at the same time, ask  about maybe some rabbit holes. 

45:01

Wouldn't you expect that if you had  to train on longer context length,  

45:04

that would mean that you're able to get fewer samples in for the same amount of compute? 

45:10

Maybe it's not worth diving deep on that. I want to get an answer to the  

45:14

bigger picture question. For me not to feel a preference 

45:20

for a human editor that's been working for  me for six months versus an AI that's been  

45:25

working with me for six months, what year do you predict that will be the case? 

45:33

My guess for that is there's a lot of problems  where basically we can do this when we have  

45:38

the "country of geniuses in a data center". My picture for that, if you made me guess, is  

45:48

one to two years, maybe one to three years. It's  really hard to tell. I have a strong view—99%,  

45:54

95%—that all this will happen in 10 years. I think that's just a super safe bet. 

46:00

I have a hunch—this is more like a 50/50  thing—that it's going to be more like  

46:04

one to two, maybe more like one to three. So one to three years. Country of geniuses,  

46:10

and the slightly less economically  valuable task of editing videos. 

46:14

It seems pretty economically  valuable, let me tell you. 

46:17

It's just there are a lot of use cases like that. There are a lot of similar ones. 

46:20

So you're predicting that  within one to three years. 

46:23

And then, generally, Anthropic has predicted that  by late '26 or early '27 we will have AI systems  

46:28

that "have the ability to navigate interfaces  available to humans doing digital work today,  

46:34

intellectual capabilities matching or exceeding  that of Nobel Prize winners, and the ability to  

46:38

interface with the physical world". You gave an interview two months ago  

46:42

with DealBook where you were emphasizing  your company's more responsible compute  

46:48

scaling as compared to your competitors. I'm trying to square these two views. 

46:52

If you really believe that we're going to  have a country of geniuses, you want as  

46:57

big a data center as you can get. There's no reason to slow down. 

47:00

The TAM of something that can actually do everything a Nobel Prize 

47:04

winner can do, is trillions of dollars. So I'm trying to square this conservatism,  

47:10

which seems rational if you have more moderate  timelines, with your stated views about progress. 

47:16

It actually all fits together. We go back to  this fast, but not infinitely fast, diffusion. 

47:23

Let's say that we're making progress at this rate. The technology is making progress this fast. 

47:29

I have very high conviction that we're  going to get there within a few years. 

47:39

I have a hunch that we're going  to get there within a year or two. 

47:41

So there’s a little uncertainty on  the technical side, but pretty strong  

47:46

confidence that it won't be off by much. What I'm less certain about is, again,  

47:51

the economic diffusion side. I really do believe that we could  

47:56

have models that are a country of geniuses  in the data center in one to two years. 

48:03

One question is: How many years after that  do the trillions in revenue start rolling in? 

48:14

I don't think it's guaranteed  that it's going to be immediate. 

48:19

It could be one year, it could be two  years, I could even stretch it to five  

48:27

years although I'm skeptical of that. So we have  this uncertainty. Even if the technology goes as  

48:35

fast as I suspect that it will, we don't know  exactly how fast it's going to drive revenue. 

48:41

We know it's coming, but with the way you buy  these data centers, if you're off by a couple  

48:47

years, that can be ruinous. It is just like how I  

48:50

wrote in "Machines of Loving Grace". I said I think we might get this powerful AI,  

48:55

this "country of genius in the data center". That description you gave comes  

48:57

from "Machines of Loving Grace". I said we'll get that in 2026, maybe 2027. Again,  

49:02

that is my hunch. I wouldn't be surprised if  I'm off by a year or two, but that is my hunch.  

49:08

Let's say that happens. That's the starting gun.  How long does it take to cure all the diseases? 

49:13

That's one of the ways that drives a huge amount  of economic value. You cure every disease. There's  

49:21

a question of how much of that goes to the  pharmaceutical company or the AI company,  

49:24

but there's an enormous consumer surplus because  —assuming we can get access for everyone,  

49:29

which I care about greatly—we cure all of  these diseases. How long does it take? You  

49:34

have to do the biological discovery,  you have to manufacture the new drug,  

49:40

you have to go through the regulatory process. We saw this with vaccines and COVID. 

49:47

We got the vaccine out to everyone,  but it took a year and a half. 

49:52

My question is: How long does it take to get  the cure for everything—which AI is the genius  

49:58

that can in theory invent—out to everyone? How long from when that AI first exists  

50:03

in the lab to when diseases have  actually been cured for everyone? 

50:09

We've had a polio vaccine for 50 years. We're still trying to eradicate it in the  

50:14

most remote corners of Africa. The Gates Foundation is trying  

50:18

as hard as they can. Others are trying as hard  

50:20

as they can. But that's difficult. Again, I  don't expect most of the economic diffusion  

50:25

to be as difficult as that. That's the most  difficult case. But there's a real dilemma here. 

50:32

Where I've settled on it is that it will  be faster than anything we've seen in the  

50:39

world, but it still has its limits. So when we go to buying data centers,  

50:47

again, the curve I'm looking at is: we've  had a 10x a year increase every year. 

50:54

At the beginning of this year, we're looking  at $10 billion in annualized revenue. 

51:02

We have to decide how much compute to buy. It takes a year or two to actually build out  

51:10

the data centers, to reserve the data center. Basically I'm saying, "In 2027,  

51:16

how much compute do I get?" I could assume that the  

51:24

revenue will continue growing 10x a year,  so it'll be $100 billion at the end of  

51:31

2026 and $1 trillion at the end of 2027. Actually it would be $5 trillion 

51:39

of compute because it would be $1  trillion a year for five years. 

51:43

I could buy $1 trillion of compute  that starts at the end of 2027. 

51:49

If my revenue is not $1 trillion, if it's even $800 billion, there's no force on earth, 

51:56

there's no hedge on earth that could stop me  from going bankrupt if I buy that much compute. 

52:03

Even though a part of my brain wonders  if it's going to keep growing 10x,  

52:07

I can't buy $1 trillion a year of compute in 2027. If I'm just off by a year in that rate of growth,  

52:17

or if the growth rate is 5x a year instead  of 10x a year, then you go bankrupt. 

52:25

So you end up in a world where you're  supporting hundreds of billions, not trillions. 
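
A minimal sketch of the bet being described: size a compute commitment against 10x-per-year revenue growth, then see what happens if growth comes in at 5x instead. The starting revenue and the $1 trillion commitment are the round figures from the conversation; everything else is a simplifying assumption.

```python
# Toy version of the data-center sizing bet. All figures are round numbers or assumptions.
revenue_eoy_2025 = 10e9          # ~$10B annualized revenue entering 2026
committed_compute = 1e12         # hypothetical ~$1T/yr compute commitment for late 2027

def projected_revenue(year: int, growth_per_year: float) -> float:
    """Annualized revenue at the end of `year`, growing from the end-of-2025 figure."""
    return revenue_eoy_2025 * growth_per_year ** (year - 2025)

for growth in (10, 5):
    rev_2027 = projected_revenue(2027, growth)
    verdict = "covers the bill" if rev_2027 >= committed_compute else "ruinous shortfall"
    print(f"{growth}x/yr -> 2027 revenue ~${rev_2027/1e9:,.0f}B vs "
          f"${committed_compute/1e9:,.0f}B/yr of compute: {verdict}")
```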

52:33

You accept some risk that there's so much  demand that you can't support the revenue,  

52:38

and you accept some risk that you  got it wrong and it's still slow. 

52:43

When I talked about behaving responsibly, what  I meant actually was not the absolute amount. 

52:51

I think it is true we're spending somewhat  less than some of the other players. 

52:55

It's actually the other things, like have we been  thoughtful about it or are we YOLOing and saying,  

53:01

"We're going to do $100 billion  here or $100 billion there"? 

53:05

I get the impression that some of the  other companies have not written down  

53:09

the spreadsheet, that they don't really  understand the risks they're taking. 

53:12

They're just doing stuff because it sounds  cool. We've thought carefully about it. We're  

53:19

an enterprise business. Therefore, we can rely  more on revenue. It's less fickle than consumer.  

53:26

We have better margins, which is the buffer  between buying too much and buying too little. 

53:31

I think we bought an amount that allows  us to capture pretty strong upside worlds. 

53:37

It won't capture the full 10x a year. Things would have to go pretty badly for  

53:42

us to be in financial trouble. So we've thought carefully and  

53:46

we've made that balance. That's what I mean when  

53:48

I say that we're being responsible. So it seems like it's possible that we  

53:54

actually just have different definitions of the "country of geniuses in a data center". 

53:56

Because when I think of actual human geniuses, an  actual country of human geniuses in a data center,  

54:02

I would happily buy $5 trillion worth  of compute to run an actual country of  

54:08

human geniuses in a data center. Let's say JPMorgan or Moderna or  

54:11

whatever doesn't want to use them. I've got a country of geniuses.  

54:14

They'll start their own company. If they can't  start their own company and they're bottlenecked  

54:18

by clinical trials… It is worth stating that with  clinical trials, most clinical trials fail because  

54:22

the drug doesn't work. There's no efficacy. I make exactly that point in "Machines of 

54:27

Loving Grace", I say the clinical  trials are going to go much faster  

54:30

than we're used to, but not infinitely fast. Okay, and then suppose it takes a year for  

54:35

the clinical trials to work out so that you're  getting revenue from that and can make more drugs. 

54:39

Okay, well, you've got a country  of geniuses and you're an AI lab. 

54:44

You could use many more AI researchers. You also think there are these self-reinforcing  

54:50

gains from smart people working on AI tech. You can have the data center  

54:56

working on AI progress. Are there substantially  

55:01

more gains from buying $1 trillion a year of  compute versus $300 billion a year of compute? 

55:07

If your competitor is buying  a trillion, yes there is. 

55:09

Well, no, there's some gain, but then again,  there's this chance that they go bankrupt before. 

55:17

Again, if you're off by only a year, you  destroy yourselves. That's the balance. We're  

55:23

buying a lot. We're buying a hell of a lot. We're buying an amount that's comparable to  

55:30

what the biggest players in the game are buying. But if you're asking me, "Why haven't we signed  

55:39

$10 trillion of compute starting in mid-2027?"... First of all, it can't be produced. 

55:44

There isn't that much in the world. But second, what if the country of  

55:50

geniuses comes, but it comes in mid-2028  instead of mid-2027? You go bankrupt. 

55:56

So if your projection is one to three  years, it seems like you should want  

56:00

$10 trillion of compute by 2029 at the latest? Even in the longest version of the timelines  

56:11

you state, the compute you are ramping  up to build doesn't seem in accordance. 

56:16

What makes you think that? Human wages, let's say,  

56:21

are on the order of $50 trillion a year— So I won't talk about Anthropic in particular,  

56:27

but if you talk about the industry, the amount  of compute the industry is building this year is  

56:38

probably, call it, 10-15 gigawatts. It goes up by roughly 3x a year. 

56:48

So next year's 30-40 gigawatts. 2028 might be  100 gigawatts. 2029 might be like 300 gigawatts. 

57:03

I'm doing the math in my head, but  each gigawatt costs maybe $10 billion,  

57:07

on the order of $10-15 billion a year. You put that all together and you're  

57:14

getting about what you described. You’re  getting exactly that. You're getting multiple  

57:16

trillions a year by 2028 or 2029. You're getting exactly what you predict. 

57:23

That's for the industry. That's for the industry, that’s right. 
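
The gigawatt arithmetic above, written out. The roughly 3x-per-year growth and the $10-15 billion per gigawatt are the figures from the conversation; taking the midpoints is a simplification to get a single number per year.

```python
# Industry-wide compute buildout using the round numbers above:
# ~12.5 GW built in 2026 (midpoint of 10-15), growing ~3x per year, at ~$12.5B per GW.
gw_2026 = 12.5
cost_per_gw = 12.5e9

for year in range(2026, 2030):
    gw = gw_2026 * 3 ** (year - 2026)
    print(f"{year}: ~{gw:5.0f} GW built -> roughly ${gw * cost_per_gw / 1e12:.1f}T of spend")
```

That lands at multiple trillions of dollars per year of buildout by 2028-2029, which is the order of magnitude being discussed.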

57:26

Suppose Anthropic's compute keeps 3x-ing a year,  and then by 2027-28, you have 10 gigawatts. 

57:34

Multiply that by, as you say, $10 billion. So then it's like $100 billion a year. 

57:40

But then you're saying the  TAM by 2028 is $200 billion. 

57:43

Again, I don't want to give exact numbers for  Anthropic, but these numbers are too small. 

57:48

Okay, interesting. You've told investors  

58:49

that you plan to be profitable starting in 2028. This is the year when we're potentially getting  

58:55

the country of geniuses in a data center. This is now going to unlock all this progress 

59:02

in medicine and health and new technologies. Wouldn't this be exactly the time where you'd  

59:11

want to reinvest in the business and build bigger  "countries" so they can make more discoveries? 

59:16

Profitability is this kind  of weird thing in this field. 

59:21

I don't think in this field profitability  is actually a measure of spending down  

59:32

versus investing in the business. Let's just take a model of this. 

59:36

I actually think profitability happens when you  underestimated the amount of demand you were going  

59:41

to get and loss happens when you overestimated  the amount of demand you were going to get,  

59:46

because you're buying the data centers ahead  of time. Think about it this way. Again,  

59:52

these are stylized facts. These numbers are not  exact. I'm just trying to make a toy model here. 

59:56

Let's say half of your compute is for training  and half of your compute is for inference. 

60:02

The inference has some gross  margin that's more than 50%. 

60:07

So what that means is that if you were in  steady-state, you build a data center and if  

60:12

you knew exactly the demand you were getting,  you would get a certain amount of revenue. 

60:23

Let’s say you pay $100 billion a year for compute. On $50 billion a year you support  

60:28

$150 billion of revenue. The other $50 billion is used for training. 

60:36

Basically you’re profitable and  you make $50 billion of profit. 

60:40

Those are the economics of the industry  today, or not today but where we’re  

60:45

projecting forward in a year or two. The only thing that makes that not the  

60:49

case is if you get less demand than $50 billion. Then you have more than 50% of your data center  

60:57

for research and you're not profitable. So you train stronger models,  

61:01

but you're not profitable. If you get more demand than you thought, then  

61:07

research gets squeezed, but you're kind of able to  support more inference and you're more profitable. 
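
The toy model being described, written out with the same stylized numbers: a compute bill fixed ahead of time, a target split of half inference and half training, and $1 of inference compute supporting roughly $3 of revenue (a gross margin above 50%). Whether the year comes out profitable then depends on how demand compares with the prediction; the specific demand levels below are illustrative.

```python
# Stylized profitability model. All numbers are illustrative round figures.
compute_bill = 100e9          # $/yr of data centers, committed ahead of time
revenue_per_dollar = 3.0      # $50B of inference compute supports ~$150B of revenue

def yearly_pnl(inference_compute_demanded: float) -> tuple[float, float]:
    """Return (revenue, profit). Compute that inference doesn't fill goes to training."""
    inference = min(inference_compute_demanded, compute_bill)
    revenue = inference * revenue_per_dollar
    return revenue, revenue - compute_bill

for demand in (30e9, 50e9, 70e9):   # demand below, matching, and above the 50B target
    rev, profit = yearly_pnl(demand)
    print(f"inference demand ${demand/1e9:.0f}B of compute -> "
          f"revenue ${rev/1e9:.0f}B, profit ${profit/1e9:+.0f}B")
```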

61:16

Maybe I'm not explaining it well, but  the thing I'm trying to say is that you  

61:19

decide the amount of compute first. Then you have some target desire of  

61:24

inference versus training, but  that gets determined by demand. 

61:28

It doesn't get determined by you. What I'm hearing is the reason  

61:30

you're predicting profit is that you are  systematically underinvesting in compute? 

61:37

No, no, no. I'm saying it's hard to predict. These things about 2028 and when it will happen,  

61:43

that's our attempt to do the  best we can with investors. 

61:46

All of this stuff is really uncertain  because of the cone of uncertainty. 

61:50

We could be profitable in 2026  if the revenue grows fast enough. 

61:58

If we overestimate or underestimate  the next year, that could swing wildly. 

62:04

What I'm trying to get at is that you have a  model in your head of a business that invests,  

62:09

invests, invests, gets scale  and then becomes profitable. 

62:14

There's a single point at  which things turn around. 

62:16

I don't think the economics of  this industry work that way. 

62:19

I see. So if I'm understanding correctly,  you're saying that because of the discrepancy  

62:24

between the amount of compute we should have  gotten and the amount of compute we got,  

62:27

we were sort of forced to make profit. But that doesn't mean we're going  

62:30

to continue making profit. We're going to reinvest the money  

62:33

because now AI has made so much progress  and we want a bigger country of geniuses. 

62:37

So we're back to: revenue is high,  but losses are also high. 

62:44

If every year we predict exactly what the demand  is going to be, we'll be profitable every year. 

62:50

Because spending 50% of your compute on research,  roughly, plus a gross margin that's higher than  

63:00

50% and correct demand prediction leads to profit. That's the profitable business model that I think  

63:07

is kind of there, but obscured by these  building ahead and prediction errors. 

63:13

I guess you're treating the 50% as a  sort of given constant, whereas in fact,  

63:21

if AI progress is fast and you can increase the  progress by scaling up more, you should just have  

63:24

more than 50% and not make profit. But here's what I'll say. You  

63:26

might want to scale it up more. Remember the log returns to scale. 

63:34

If going from 50% to 70% would only get you a slightly better model, through a factor of 1.4x more training compute... 

63:42

That extra $20 billion, each dollar there is worth  much less to you because of the log-linear setup. 

63:51

So you might find that it's better  to invest that $20 billion in serving  

63:58

inference or in hiring engineers who are  kind of better at what they're doing. 

64:05

So the reason I said 50%... That's not exactly  our target. It's not exactly going to be 50%.  

64:10

It’ll probably vary over time. What I'm saying  is the log-linear return, what it leads to is you  

64:18

spend of order one fraction of the business. Like  not 5%, not 95%. Then you get diminishing returns. 
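A sketch of the log-returns point, under the stylized assumption that capability grows roughly with the log of training compute (an assumption consistent with the "log-linear" framing here, not a quoted formula):

```python
import math

# Stylized: capability ~ log(training compute). Compare spending 50% vs 70%
# of a $100B/year compute budget on training.
budget = 100e9
train_50 = 0.50 * budget
train_70 = 0.70 * budget           # the extra $20B discussed above

# Gain from the extra $20B, measured in doublings of training compute:
extra_doublings = math.log2(train_70 / train_50)    # log2(1.4) ~= 0.49
print(f"{extra_doublings:.2f} extra doublings for the marginal $20B")

# Purely illustrative comparison: the same $20B added to a ~$1B budget
# would buy far more doublings.
early_doublings = math.log2(21e9 / 1e9)             # ~4.4
print(f"{early_doublings:.2f} doublings if the starting budget were ~$1B")

# Each marginal dollar buys much less once you're already at ~$50B/year, which
# is why the split settles at an order-one fraction rather than 95%.
```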

64:28

I feel strange that I'm convincing Dario  to believe in AI progress or something. 

64:34

Okay, you don't invest in research  because it has diminishing returns,  

64:37

but you invest in the other things you mentioned. I think profit at a sort of macro level— 

64:39

Again, I'm talking about diminishing returns,  but after you're spending $50 billion a year. 

64:46

This is a point I'm sure you would make,  but even diminishing returns on a genius could 

64:51

be quite high. More generally,  

64:54

what is profit in a market economy? Profit is basically saying other  

64:58

companies in the market can do more  things with this money than I can. 

65:02

Put aside Anthropic. I don't want  to give information about Anthropic. 

65:06

That’s why I'm giving these stylized numbers. But let's just derive the  

65:10

equilibrium of the industry. Why doesn't everyone spend 100% of their  

65:21

compute on training and not serve any customers? It's because if they didn't get any revenue,  

65:25

they couldn't raise money,  they couldn't do compute deals,  

65:27

they couldn't buy more compute the next year. So there's going to be an equilibrium where every  

65:31

company spends less than 100% on training  and certainly less than 100% on inference. 

65:38

It should be clear why you don't just serve the  current models and never train another model,  

65:44

because then you don't have any demand because  you'll fall behind. So there's some equilibrium.  

65:49

It's not gonna be 10%, it's not gonna be 90%. Let's just say as a stylized fact, it's 50%.  

65:55

That's what I'm getting at. I think we're gonna be  in a position where that equilibrium of how much  

66:01

you spend on training is less than the gross  margins that you're able to get on compute. 

66:08

So the underlying economics are profitable. The problem is you have this hellish demand  

66:14

prediction problem when you're buying the next  year of compute and you might guess under and be  

66:21

very profitable but have no compute for research. Or you might guess over and you are not  

66:30

profitable and you have all the compute for  research in the world. Does that make sense?  

66:36

Just as a dynamic model of the industry? Maybe stepping back, I'm not saying I think  

66:42

the "country of geniuses" is going to come in two  years and therefore you should buy this compute. 

66:47

To me, the end conclusion you're  arriving at makes a lot of sense. 

66:51

But that's because it seems like "country of  geniuses" is hard and there's a long way to go. 

66:57

So stepping back, the thing I'm trying to get  at is more that it seems like your worldview  

67:03

is compatible with somebody who says, "We're  like 10 years away from a world in which we're  

67:07

generating trillions of dollars of value." That's just not my view. So I'll make  

67:14

another prediction. It is hard for me  to see that there won't be trillions  

67:20

of dollars in revenue before 2030. I can construct a plausible world.  

67:26

It takes maybe three years. That would be  the end of what I think is plausible. 

67:31

Like in 2028, we get the real "country  of geniuses in the data center". 

67:36

The revenue's going into the low hundreds  of billions by 2028, and then the country  

67:46

of geniuses accelerates it to trillions. We’re basically on the slow end of diffusion. 

67:52

It takes two years to get to the trillions. That would be the world where it takes until 2030. 

67:59

I suspect even composing the technical  exponential and diffusion exponential,  

68:05

we’ll get there before 2030. So you laid out a model where Anthropic makes  

68:10

profit because it seems like fundamentally  we're in a compute-constrained world. 

68:14

So eventually we keep growing compute— I think the way the profit comes is… Again,  

68:21

let's just abstract the whole industry here. Let's just imagine we're in an economics textbook. 

68:27

We have a small number of firms. Each can invest a limited amount. 

68:33

Each can invest some fraction in R&D. They have some marginal cost to serve. 

68:38

The gross profit margins on that marginal cost  are very high because inference is efficient. 

68:47

There's some competition, but the  models are also differentiated. 

68:52

Companies will compete to push  their research budgets up. 

68:55

But because there's a small number of  players, we have the... What is it called?  

69:00

The Cournot equilibrium, I think, is what  the small-number-of-firms equilibrium is called. 

69:05

The point is it doesn't equilibrate to  perfect competition with zero margins. 
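For reference, a textbook symmetric Cournot calculation (linear demand and illustrative numbers, none of them from the conversation) showing why a small number of rational firms still ends up with margins well above zero:

```python
# Symmetric Cournot with linear inverse demand P = a - b*Q and marginal cost c.
# With n firms, each produces q* = (a - c) / (b * (n + 1)) and the equilibrium
# markup over cost is (a - c) / (n + 1): it shrinks as n grows, but it only
# goes to zero in the perfect-competition limit.

a, b, c = 100.0, 1.0, 20.0     # illustrative demand intercept, slope, marginal cost

for n in (1, 3, 10, 100):
    price = (a + n * c) / (n + 1)
    markup = price - c
    print(f"n={n:>3}: price={price:6.2f}, markup over cost={markup:6.2f}")

# n=  1: price= 60.00, markup= 40.00
# n=  3: price= 40.00, markup= 20.00
# n= 10: price= 27.27, markup=  7.27
# n=100: price= 20.79, markup=  0.79
```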

69:15

If there's three firms in the economy and all  are kind of independently behaving rationally,  

69:20

it doesn't equilibrate to zero. Help me understand that, because  

69:24

right now we do have three leading firms and  they're not making profit. So what is changing? 

69:33

Again, the gross margins  right now are very positive. 

69:38

What's happening is a combination of two things. One is that we're still in the exponential  

69:43

scale-up phase of compute. A model  gets trained. Let's say a model got  

69:53

trained that cost $1 billion last year. Then this year it produced $4 billion of 

70:02

revenue and cost $1 billion to inference from. Again, I'm using stylized numbers here, but that  

70:12

would be 75% gross margins and this 25% tax. So that model as a whole makes $2 billion. 
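Spelling out those stylized per-model numbers (the figures are the illustrative ones from the conversation; the $10 billion next-model cost is described in the next exchange):

```python
# The stylized per-model numbers from above.
training_cost  = 1e9   # model trained last year
revenue        = 4e9   # revenue it produced this year
inference_cost = 1e9   # compute used to serve that revenue

gross_margin = (revenue - inference_cost) / revenue         # 0.75 -> the "75% gross margins"
model_profit = revenue - inference_cost - training_cost     # 2e9 -> "$2 billion" for the model alone
print(f"gross margin: {gross_margin:.0%}, per-model profit: ${model_profit/1e9:.0f}B")

# As the next exchange describes, the company is simultaneously spending ~$10B
# to train the next model, so company-level cash flow is negative even though
# each model, viewed on its own, is profitable.
next_model_training = 10e9
print(f"company-level result: ${(model_profit - next_model_training)/1e9:+.0f}B")
```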

70:23

But at the same time, we're spending $10  billion to train the next model because  

70:27

there's an exponential scale-up. So  the company loses money. Each model  

70:31

makes money, but the company loses money. The equilibrium I'm talking about is an  

70:35

equilibrium where we have the "country  of geniuses in a data center", but that  

70:43

model training scale-up has equilibrated more.  Maybe it's still going up. We're still trying to  

70:49

predict the demand, but it's more leveled out. I'm confused about a couple of things there. 

70:56

Let's start with the current world. In the current world, you're right that,  

71:00

as you said before, if you treat each  individual model as a company, it's profitable. 

71:05

But of course, a big part of the production  function of being a frontier lab is training  

71:11

the next model, right? Yes, that's right. 

71:13

If you didn't do that, then you'd  make profit for two months and then  

71:16

you wouldn't have margins because  you wouldn't have the best model. 

71:19

But at some point that reaches the  biggest scale that it can reach. 

71:23

And then in equilibrium, we have algorithmic  improvements, but we're spending roughly the  

71:28

same amount to train the next model as  we spend to train the current model. 

71:37

At some point you run out of money in the economy. A fixed lump of labor fallacy… The economy is  

71:42

going to grow, right? That's one  of your predictions. We're going  

71:44

to have the data centers in space. Yes, but this is another example  

71:47

of the theme I was talking about. The economy will grow much faster  

71:53

with AI than I think it ever has before. Right now the compute is growing 3x a year. 

71:59

I don't believe the economy  is gonna grow 300% a year. 

72:03

I said this in "Machines of Loving  Grace", I think we may get 10-20%  

72:08

per year growth in the economy, but we're  not gonna get 300% growth in the economy. 
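One way to see the cap: compare compute spend growing roughly 3x per year with an economy growing 10-20% per year. The growth rates are the ones discussed here; the starting points in the sketch (about $150 billion of compute spend now, about $110 trillion of world GDP) are assumptions for illustration only:

```python
# Why 3x/year compute growth must eventually bend toward the growth rate of the
# economy. Starting points are assumptions for illustration; growth rates are
# the ones discussed in the conversation.

compute_spend = 150e9      # assumed current annual compute spend
gdp = 110e12               # assumed current world GDP
compute_growth = 3.0       # ~3x per year
gdp_growth = 1.15          # optimistic 15%/year AI-driven growth

years = 0
while compute_spend < 0.5 * gdp:        # when compute is half of all output
    compute_spend *= compute_growth
    gdp *= gdp_growth
    years += 1
print(f"compute spend reaches half of GDP in ~{years} years at these rates")

# ~7 years: well before that point, the 3x-per-year trend has to level off
# toward whatever the economy itself is growing at.
```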

72:13

So I think in the end, if compute becomes  the majority of what the economy produces,  

72:18

it's gonna be capped by that. So let's assume a model  

72:22

where compute stays capped. The world where frontier labs are making money  

72:26

is one where they continue to make fast progress. Because fundamentally your margin is limited by  

72:34

how good the alternative is. So you are able to make money  

72:37

because you have a frontier model. If you didn't have a frontier model  

72:39

you wouldn't be making money. So this model requires there  

72:45

never to be a steady state. Forever and ever you keep  

72:48

making more algorithmic progress. I don't think that's true. I mean,  

72:51

I feel like we're in an economics class. Do you know the Tyler Cowen quote? 

72:59

We never stop talking about economics. We never stop talking about economics. 

73:03

So no, I don't think this  field's going to be a monopoly. 

73:12

My lawyers never want me  to say the word "monopoly". 

73:15

But I don't think this field's  going to be a monopoly. 

73:17

You do get industries in which  there are a small number of players. 

73:21

Not one, but a small number of players. Ordinarily, the way you get monopolies  

73:27

like Facebook or Meta—I always call them  Facebook—is these kinds of network effects. 

73:37

The way you get industries in which  there are a small number of players,  

73:41

is very high costs of entry. Cloud is like  this. I think cloud is a good example of this. 

73:49

There are three, maybe four, players within cloud. I think that's the same for AI, three, maybe four. 

73:56

The reason is that it's so expensive. It requires so much expertise and so  

74:02

much capital to run a cloud company. You have to put up all this capital. 

74:08

In addition to putting up all this capital,  you have to get all of this other stuff  

74:11

that requires a lot of skill to make it happen. So if you go to someone and you're like, "I want  

74:17

to disrupt this industry, here's $100 billion." You're like, "okay, I'm putting in $100 billion  

74:22

and also betting that you can do all these  other things that these people have been doing." 

74:26

Only to decrease the profit. The effect of your entering  

74:29

is that profit margins go down. So, we have equilibria like this  

74:33

all the time in the economy where we have a few  players. Profits are not astronomical. Margins  

74:39

are not astronomical, but they're not zero. That's what we see on cloud. Cloud is very  

74:47

undifferentiated. Models are  more differentiated than cloud. 

74:51

Everyone knows Claude is good at different things  than GPT is good at, than Gemini is good at. 

74:58

It's not just that Claude's good at  coding, GPT is good at math and reasoning.  

75:05

It's more subtle than that. Models are good at  different types of coding. Models have different  

75:09

styles. I think these things are actually quite  different from each other, and so I would expect  

75:15

more differentiation than you see in cloud. Now, there actually is one counter-argument. 

75:26

That counter-argument is if the  process of producing models,  

75:32

if AI models can do that themselves, then  that could spread throughout the economy. 

75:37

But that is not an argument for  commoditizing AI models in general. 

75:41

That's kind of an argument for  commoditizing the whole economy at once. 

75:45

I don't know what quite happens in  that world where basically anyone  

75:48

can do anything, anyone can build anything,  and there's no moat around anything at all. 

75:53

I don't know, maybe we want that world. Maybe that's the end state here. 

75:58

Maybe when AI models can do everything, if we've  solved all the safety and security problems,  

76:09

that's one of the mechanisms for the  economy just flattening itself again. 

76:17

But that's kind of far post-"country  of geniuses in the data center." 

76:23

Maybe a finer way to put that potential point  is: 1) it seems like AI research is especially  

76:32

loaded on raw intellectual power, which will  be especially abundant in the world of AGI. 

76:37

And 2) if you just look at the world today,  there are very few technologies that seem to be  

76:41

diffusing as fast as AI algorithmic progress. So that does hint that this industry is  

76:50

sort of structurally diffusive. I think coding is going fast, but  

76:54

I think AI research is a superset of coding and  there are aspects of it that are not going fast. 

77:00

But I do think, again, once we get coding, once we  get AI models going fast, then that will speed up  

77:07

the ability of AI models to do everything else. So while coding is going fast now, I think once  

77:13

the AI models are building the next AI  models and building everything else,  

77:17

the whole economy will kind of go at the same  pace. I am worried geographically, though.  

77:24

I'm a little worried that just proximity to AI,  having heard about AI, may be one differentiator. 

77:34

So when I said the 10-20% growth rate, a worry  I have is that the growth rate could be like 50%  

77:42

in Silicon Valley and parts of the world that are  socially connected to Silicon Valley, and not that  

77:50

much faster than its current pace elsewhere. I think that'd be a pretty messed up world. 

77:54

So one of the things I think about  a lot is how to prevent that. 

77:57

Do you think that once we have this  country of geniuses in a data center, that  

78:01

robotics is sort of quickly solved afterwards? Because it seems like a big problem with robotics  

78:06

is that a human can learn how to teleoperate  current hardware, but current AI models can't,  

78:12

at least not in a way that's super productive. And so if we have this ability to learn like  

78:16

a human, shouldn't it solve  robotics immediately as well? 

78:19

I don't think it's dependent  on learning like a human. 

78:21

It could happen in different ways. Again, we could have trained the model on  

78:25

many different video games, which are like robotic  controls, or many different simulated robotics  

78:30

environments, or just train them to control  computer screens, and they learn to generalize. 

78:34

So it will happen... it's not necessarily  dependent on human-like learning. 

78:41

Human-like learning is one way it could happen. If the model's like, "Oh, I pick up a robot,  

78:44

I don't know how to use it, I learn," that could  happen because we discovered continual learning. 

78:50

That could also happen because we trained  the model on a bunch of environments and  

78:54

then generalized, or it could happen because  the model learns that in the context length. 

78:58

It doesn't actually matter which way. If we go back to the discussion we had  

79:03

an hour ago, that type of thing can  happen in several different ways. 

79:10

But I do think when for whatever reason the  models have those skills, then robotics will be  

79:16

revolutionized—both the design of robots, because  the models will be much better than humans at  

79:21

that, and also the ability to control robots. So we'll get better at building the physical  

79:28

hardware, building the physical robots, and  we'll also get better at controlling it. 

79:32

Now, does that mean the robotics  industry will also be generating  

79:36

trillions of dollars of revenue? My answer there is yes, but there will be  

79:40

the same extremely fast, but not infinitely fast  diffusion. So will robotics be revolutionized?  

79:46

Yeah, maybe tack on another year or two. That's the way I think about these things. 

79:52

Makes sense. There's a general skepticism about  extremely fast progress. Here's my view. It sounds  

79:58

like you are going to solve continual learning  one way or another within a matter of years. 

80:02

But just as people weren't talking about  continual learning a couple of years ago,  

80:06

and then we realized, "Oh, why aren't these  models as useful as they could be right now,  

80:09

even though they are clearly passing the Turing  test and are experts in so many different domains?  

80:14

Maybe it's this thing." Then we solve this thing  and we realize, actually, there's another thing  

80:19

that human intelligence can do that's a basis  of human labor that these models can't do. 

80:24

So why not think there will be  more things like this, where  

80:28

we've found more pieces of human intelligence? Well, to be clear, I think continual learning, as  

80:33

I've said before, might not be a barrier at all. I think we may just get there by pre-training  

80:40

generalization and RL generalization. I think there just  

80:48

might not be such a thing at all. In fact, I would point to the history  

80:51

in ML of people coming up with things  that are barriers that end up kind of  

80:56

dissolving within the big blob of compute. People talked about, "How do your models  

81:06

keep track of nouns and verbs?"  "They can understand syntactically,  

81:11

but they can't understand semantically? It's only statistical correlations."  

81:16

"You can understand a paragraph,  you can’t understand a word. 

81:19

There's reasoning, you can't do reasoning." But then suddenly it turns out you can  

81:23

do code and math very well. So I think there's actually a  

81:27

stronger history of some of these things seeming  like a big deal and then kind of dissolving. Some  

81:35

of them are real. The need for data is real,  maybe continual learning is a real thing. 

81:42

But again, I would ground  us in something like code. 

81:46

I think we may get to the point in  a year or two where the models can  

81:50

just do SWE end-to-end. That's a whole task.  That's a whole sphere of human activity that  

81:56

we're just saying models can do now. When you say end-to-end, do you mean  

82:02

setting technical direction, understanding  the context of the problem, et cetera? 

82:06

Yes. I mean all of that. Interesting. I feel like that is AGI-complete,  

82:13

which maybe is internally consistent. But it's not like saying 90%  

82:17

of code or 100% of code. No, I gave this spectrum:  

82:22

90% of code, 100% of code, 90% of  end-to-end SWE, 100% of end-to-end SWE. 

82:28

New tasks are created for SWEs. Eventually those get done as well. 

82:31

It's a long spectrum there, but we're  traversing the spectrum very quickly. 

82:35

I do think it's funny that I've seen  a couple of podcasts you've done where  

82:40

the hosts will be like, "But Dwarkesh wrote  the essay about the continuous learning thing." 

82:43

It always makes me crack up because  you've been an AI researcher for 10 years. 

82:48

I'm sure there's some feeling of,  "Okay, so a podcaster wrote an essay,  

82:53

and every interview I get asked about it." The truth of the matter is that we're all  

82:59

trying to figure this out together. There are some ways in which I'm  

83:04

able to see things that others aren't. These days that probably has more to do  

83:08

with seeing a bunch of stuff within Anthropic and  having to make a bunch of decisions than I have  

83:13

any great research insight that others don't. I'm running a 2,500 person company. 

83:20

It's actually pretty hard for me to have concrete  research insight, much harder than it would have  

83:27

been 10 years ago or even two or three years ago. As we go towards a world of a full drop-in  

83:36

remote worker replacement, does an API  pricing model still make the most sense? 

83:42

If not, what is the correct  way to price AGI, or serve AGI? 

83:45

I think there's going to be a bunch of  different business models here, all at once,  

83:49

that are going to be experimented with. I actually do think that the API  

83:59

model is more durable than many people think. One way I think about it is if the technology  

84:06

is advancing quickly, if it's advancing  exponentially, what that means is there's  

84:12

always a surface area of new use cases that  have been developed in the last three months. 

84:20

Any kind of product surface you put in place is  always at risk of sort of becoming irrelevant. 

84:27

Any given product surface probably makes sense  for a range of capabilities of the model. 

84:32

The chatbot is already running into limitations  where making it smarter doesn't really help the  

84:39

average consumer that much. But I don't think that's  

84:41

a limitation of AI models. I don't think that's evidence  

84:45

that the models are good enough and them  getting better doesn't matter to the economy. 

84:51

It doesn't matter to that particular product. So I think the value of the API is that the API  

84:58

always offers an opportunity, very close to the  bare metal, to build on what the latest thing is. 

85:06

There's always going to be this front  of new startups and new ideas that  

85:14

weren't possible a few months ago and are  possible because the model is advancing. 

85:19

I actually predict that it's going to exist  alongside other models, but we're always going  

85:28

to have the API business model because there's  always going to be a need for a thousand different  

85:34

people to try experimenting with the model in a  different way. 100 of them become startups and  

85:40

ten of them become big successful startups. Two or three really end up being the way  

85:45

that people use the model of a given generation. So I basically think it's always going to exist. 

85:50

At the same time, I'm sure there's  going to be other models as well. 

85:55

Not every token that's output by  the model is worth the same amount. 

86:00

Think about what is the value of the tokens  that the model outputs when someone calls  

86:10

them up and says, "My Mac isn't working," or  something, the model's like, "restart it." 

86:16

Someone hasn't heard that before, but  the model said that 10 million times. 

86:23

Maybe that's worth like a dollar  or a few cents or something. 

86:26

Whereas if the model goes to one of the  pharmaceutical companies and it says, "Oh,  

86:34

you know, this molecule you're developing, you  should take the aromatic ring from that end of the  

86:39

molecule and put it on that end of the molecule. If you do that, wonderful things will happen." 

86:46

Those tokens could be worth  tens of millions of dollars. 

86:52

So I think we're definitely going to  see business models that recognize that. 

86:56

At some point we're going to see "pay for results"  in some form, or we may see forms of compensation  

87:06

that are like labor, that kind of work by the  hour. I don't know. I think because it's a new  

87:16

industry, a lot of things are going to be tried. I don't know what will turn out to  

87:19

be the right thing. I take your point that  

87:24

people will have to try things to figure out what  is the best way to use this blob of intelligence. 

87:28

But what I find striking is Claude Code. I don't think in the history of startups  

87:34

there has been a single application that has  been as hotly competed in as coding agents. 

87:42

Claude Code is a category leader here. That  seems surprising to me. It doesn't seem like 

87:49

Anthropic intrinsically had to be the one to build this. I wonder if you have an accounting of why it had 

87:54

to be Anthropic or how Anthropic ended  up building an application in addition  

87:58

to the model underlying it that was successful. So it actually happened in a pretty simple way,  

88:02

which is that we had our own coding  models, which were good at coding. 

88:09

Around the beginning of 2025, I said, "I  think the time has come where you can have  

88:14

nontrivial acceleration of your own research  if you're an AI company by using these models." 

88:21

Of course, you need an interface,  you need a harness to use them.  

88:25

So I encouraged people internally. I didn't  say this is the one thing that you have to use. 

88:31

I just said people should experiment with this. I think it might have been originally  

88:37

called Claude CLI, and then the name  eventually got changed to Claude Code. 

88:42

Internally, it was the thing that everyone was  using and it was seeing fast internal adoption. 

88:48

I looked at it and I said, "Probably we  should launch this externally, right?" 

88:53

It's seen such fast adoption within Anthropic. Coding is a lot of what we do. 

88:59

We have an audience of many, many hundreds  of people that's in some ways at least  

89:04

representative of the external audience. So it looks like we already have product  

89:08

market fit. Let's launch this thing. And then  we launched it. I think just the fact that we  

89:15

ourselves are kind of developing the model and we  ourselves know what we most need to use the model,  

89:21

I think it's kind of creating this feedback loop. I see. In the sense that you, let's say a  

89:26

developer at Anthropic is like, "Ah, it would  be better if it was better at this X thing." 

89:31

Then you bake that into the  next model that you build. 

89:35

That's one version of it, but then there's  just the ordinary product iteration. 

89:41

We have a bunch of coders within  Anthropic, they use Claude Code  

89:47

every day and so we get fast feedback. That was more important in the early days. 

89:50

Now, of course, there are millions  of people using it, and so we get  

89:53

a bunch of external feedback as well. But it's just great to be able to get  

89:58

kind of fast internal feedback. I think this is the reason why we  

90:03

launched a coding model and didn't  launch a pharmaceutical company. 

90:10

My background's in biology, but we  don't have any of the resources that  

90:14

are needed to launch a pharmaceutical company. Let me now ask you about making AI go well. 

91:24

It seems like whatever vision we have about how  AI goes well has to be compatible with two things:  

91:30

1) the ability to build and run AIs is  diffusing extremely rapidly and 2) the  

91:37

population of AIs, the amount we have and their  intelligence, will also increase very rapidly. 

91:44

That means that lots of people will be able  to build huge populations of misaligned AIs,  

91:49

or AIs which are just companies  which are trying to increase their  

91:53

footprint or have weird psyches like  Sydney Bing, but now they're superhuman. 

91:57

What is a vision for a world in which we  have an equilibrium that is compatible  

92:02

with lots of different AIs, some of  which are misaligned, running around? 

92:06

I think in "The Adolescence of Technology",  I was skeptical of the balance of power. 

92:13

But the thing I was specifically skeptical of  is the idea that three or four of these companies, 

92:23

all building models derived from the  same kind of thing, would check each other. 

92:36

Or even that any number of  them would check each other. 

92:40

We might live in an offense-dominant world where  one person or one AI model is smart enough to do  

92:47

something that causes damage for everything else. In the short run, we have a limited number  

92:54

of players now. So we can start  

92:56

within the limited number of players. We need to put in place the safeguards. 

93:03

We need to make sure everyone  does the right alignment work. 

93:05

We need to make sure everyone has bioclassifiers. Those are the immediate things we need to do. 

93:11

I agree that that doesn't solve the problem in  the long run, particularly if the ability of  

93:16

AI models to make other AI models proliferates,  then the whole thing can become harder to solve. 

93:26

I think in the long run we need  some architecture of governance. 

93:30

We need some architecture of governance  that preserves human freedom,  

93:35

but also allows us to govern a very large  number of human systems, AI systems, hybrid  

93:52

human-AI companies or economic units. So we're gonna need to think about:  

94:01

how do we protect the world against bioterrorism? How do we protect the world against mirror life? 

94:11

Probably we're gonna need some  kind of AI monitoring system  

94:15

that monitors for all of these things. But then we need to build this in a way  

94:20

that preserves civil liberties  and our constitutional rights. 

94:24

So I think just as anything else, it's a  new security landscape with a new set of  

94:34

tools and a new set of vulnerabilities. My worry is, if we had 100 years for this  

94:40

to happen all very slowly, we'd get used to it. We've gotten used to the presence of explosives  

94:49

in society or the presence of various new  weapons or the presence of video cameras. 

94:58

We would get used to it over 100 years and  we’d develop governance mechanisms. We'd  

95:03

make our mistakes. My worry is just  that this is happening all so fast. 

95:07

So maybe we need to do our thinking faster about  how to make these governance mechanisms work. 

95:13

It seems like in an offense-dominant world, over  the course of the next century—the idea is that AI  

95:19

is making the progress that would happen over the  next century happen in some period of five to ten  

95:22

years—we would still need the same mechanisms, or  balance of power would be similarly intractable,  

95:29

even if humans were the only game in town. I guess we have the advice of AI. 

95:36

But it fundamentally doesn't seem like  a totally different ball game here. 

95:41

If checks and balances were going to  work, they would work with humans as well. 

95:44

If they aren't going to work, they  wouldn't work with AIs as well. 

95:47

So maybe this just dooms human  checks and balances as well. 

95:51

Again, I think there's some  way to make this happen. 

95:58

The governments of the world may have  to work together to make it happen. 

96:02

We may have to talk to AIs about building  societal structures in such a way that these  

96:10

defenses are possible. I don't know. I don’t  want to say this is so far ahead in time,  

96:15

but it’s so far ahead in technological ability  that may happen over a short period of time,  

96:21

that it's hard for us to anticipate it in advance. Speaking of governments getting involved,  

96:25

on December 26, the Tennessee legislature  introduced a bill which said, "It would  

96:31

be an offense for a person to knowingly  train artificial intelligence to provide  

96:34

emotional support, including through  open-ended conversations with a user." 

96:39

Of course, one of the things that Claude attempts  to do is be a thoughtful, knowledgeable friend. 

96:48

In general, it seems like we're going  to have this patchwork of state laws. 

96:52

A lot of the benefits that normal people could  experience as a result of AI are going to be  

96:56

curtailed, especially when we get into the  kinds of things you discuss in "Machines  

96:59

of Loving Grace": biological freedom,  mental health improvements, et cetera. 

97:02

It seems easy to imagine worlds in which these  get Whac-A-Moled away by different laws, whereas  

97:10

bills like this don't seem to address the actual  existential threats that you're concerned about. 

97:15

I'm curious to understand, in the context  of things like this, Anthropic's position  

97:20

against the federal moratorium on state AI laws. There are many different things going on at once. 

97:28

I think that particular law is dumb. It was clearly made by legislators  

97:34

who just probably had little idea  what AI models could do and not do. 

97:38

They're like, "AI models serving  us, that just sounds scary. 

97:41

I don't want that to happen." So we're not in favor of that. 

97:47

But that wasn't the thing that was being voted on. The thing that was being voted on is:  

97:52

we're going to ban all state regulation of AI  for 10 years with no apparent plan to do any  

98:00

federal regulation of AI, which would take  Congress to pass, which is a very high bar. 

98:05

So the idea that we'd ban states from doing  anything for 10 years… People said they had  

98:11

a plan for the federal government, but there  was no actual proposal on the table. There was  

98:15

no actual attempt. Given the serious dangers  that I lay out in "Adolescence of Technology"  

98:22

around things like biological weapons  and bioterrorism, autonomy risk, and the 

98:29

timelines we've been talking about—10 years is  an eternity—I think that's a crazy thing to do. 

98:36

So if that's the choice, if that's what  you force us to choose, then we're going  

98:42

to choose not to have that moratorium. I think the benefits of that position  

98:47

exceed the costs, but it's not a  perfect position if that's the choice. 

98:51

Now, I think the thing that we should do, the  thing that I would support, is the federal  

98:56

government should step in, not saying "states you  can't regulate", but "Here's what we're going to  

99:02

do, and states you can't differ from this." I think preemption is fine in the sense of  

99:08

saying that the federal government says, "Here  is our standard. This applies to everyone.  

99:12

States can't do something different."  That would be something I would support  

99:16

if it would be done in the right way. But this idea of states, "You can't do  

99:22

anything and we're not doing anything either,"  that struck us as very much not making sense. 

99:29

I think it will not age well, it is  already starting to not age well with  

99:33

all the backlash that you've seen. Now, in terms of what we would want,  

99:39

the things we've talked about are starting with  transparency standards in order to monitor some  

99:46

of these autonomy risks and bioterrorism risks. As the risks become more serious, as we get more  

99:53

evidence for them, then I think we could be more  aggressive in some targeted ways and say, "Hey,  

99:58

AI bioterrorism is really a threat. Let's pass a law that forces  

100:04

people to have classifiers." I could even imagine… It depends. 

100:07

It depends how serious the threat ends up  being. We don't know for sure. We need to pursue 

100:12

this in an intellectually honest way where we say  that ahead of time, the risk has not emerged yet. 

100:16

But I could certainly imagine, with  the pace that things are going at,  

100:21

a world where later this year we say, "Hey,  this AI bioterrorism stuff is really serious. 

100:27

We should do something about it. We should put it in a federal standard. 

100:31

If the federal government won't act, we should put  it in a state standard." I could totally see that. 

100:36

I'm concerned about a world where if you just  consider the pace of progress you're expecting,  

100:42

the life cycle of legislation... The benefits are, as you say because  

100:48

of diffusion lag, slow enough that I  really do think this patchwork of state  

100:55

laws, on the current trajectory, would prohibit. I mean if having an emotional chatbot friend is  

100:59

something that freaks people out, then just  imagine the kinds of actual benefits from AI  

101:03

we want normal people to be able to experience. From improvements in health and healthspan and  

101:08

improvements in mental health and so forth. Whereas at the same time, it seems like you  

101:13

think the dangers are already on the horizon and  I just don't see that much… It seems like it would  

101:19

be especially injurious to the benefits  of AI as compared to the dangers of AI. 

101:24

So that's maybe where the cost  benefit makes less sense to me. 

101:27

So there's a few things here. People talk about there being  

101:31

thousands of these state laws. First of all, the vast,  

101:34

vast majority of them do not pass. The world works a certain way in theory,  

101:41

but just because a law has been passed  doesn't mean it's really enforced. 

101:44

The people implementing it may be  like, "Oh my God, this is stupid. 

101:48

It would mean shutting off everything  that's ever been built in Tennessee." 

101:55

Very often, laws are interpreted in a way  that makes them not as dangerous or harmful. 

102:02

By the same token, of course, you have to worry  that if you're passing a law to stop a bad thing, 

102:06

you have this problem as well. My basic view is that if we could  

102:16

decide what laws were passed and how things  were done—and we’re only one small input  

102:21

into that—I would deregulate a lot of the  stuff around the health benefits of AI. 

102:29

I don't worry as much about the chatbot laws. I actually worry more about the drug approval  

102:37

process, where I think AI models are going to  greatly accelerate the rate at which we discover  

102:45

drugs, and the pipeline will get jammed up. The pipeline will not be prepared to process  

102:50

all the stuff that's going through it. I think reform of the regulatory process  

102:58

should bias more towards the fact that we have  a lot of things coming where the safety and  

103:02

efficacy is actually going to be really crisp and  clear, a beautiful thing, and really effective. 

103:12

Maybe we don't need all this superstructure around  it that was designed around an era of drugs that  

103:21

barely work and often have serious side effects. At the same time, I think we should be  

103:26

ramping up quite significantly the  safety and security legislation. 

103:35

Like I've said, starting with transparency is  my view of trying not to hamper the industry,  

103:43

trying to find the right balance. I'm  worried about it. Some people criticize  

103:46

my essay by saying, "That's too slow. The dangers of AI will come too soon 

103:50

if we do that." Well, basically,  

103:52

I think the last six months and maybe the next  few months are going to be about transparency. 

103:58

Then, if these risks emerge when  we're more certain of them—which  

104:02

I think we might be as soon as later this  year—then I think we need to act very fast  

104:07

in the areas where we've actually seen the risk. I think the only way to do this is to be nimble. 

104:13

Now, the legislative process is normally  not nimble, but we need to emphasize the  

104:21

urgency of this to everyone involved. That's why I'm sending this message of urgency. 

104:24

That's why I wrote Adolescence of Technology. I wanted policymakers, economists, national  

104:30

security professionals, and decision-makers to  read it so that they have some hope of acting  

104:36

faster than they would have otherwise. Is there anything you can do or advocate  

104:42

that would make it more certain that the  benefits of AI are better instantiated? 

104:51

I feel like you have worked  with legislatures to say, "Okay,  

104:54

we're going to prevent bioterrorism here. We're going to increase transparency, we're  

104:57

going to increase whistleblower protection." But I think by default, the actual benefits  

105:01

we're looking forward to seem very fragile  to different kinds of moral panics or  

105:08

political economy problems. I don't actually agree that  

105:12

much regarding the developed world. I feel like in the developed world,  

105:17

markets function pretty well. When there's a lot of money to  

105:23

be made on something and it's clearly the best  available alternative, it's actually hard for  

105:27

the regulatory system to stop it. We're seeing that in AI itself. 

105:33

A thing I've been trying to fight for  is export controls on chips to China. 

105:38

That's in the national  security interest of the US. 

105:42

That's squarely within the policy beliefs of  almost everyone in Congress of both parties.  

105:52

The case is very clear. The counterarguments  against it, I'll politely call them fishy. 

105:59

Yet it doesn't happen and we sell the chips  because there's so much money riding on it. 

106:08

That money wants to be made. In that case, in my opinion, that's a bad thing. 

106:13

But it also applies when it's a good thing. So if we're talking about drugs and benefits of  

106:23

the technology, I am not as worried about those  benefits being hampered in the developed world. 

106:30

I am a little worried about them going too slow. As I said, I do think we should work to speed  

106:37

the approval process in the FDA. I do think we should fight against  

106:41

these chatbot bills that you're describing.  Taken individually, I'm against them. I 

106:46

think they're stupid. But I actually think the  bigger worry is the developing world, where we  

106:51

don't have functioning markets and where we often  can't build on the technology that we've had. 

106:58

I worry more that those  folks will get left behind. 

107:01

And I worry that even if the cures are  developed, maybe there's someone in rural  

107:04

Mississippi who doesn't get it as well. That's a smaller version of the concern  

107:10

we have in the developing world. So the things we've been doing  

107:14

are working with philanthropists. We work with folks who deliver medicine and  

107:26

health interventions to the developing world,  to sub-Saharan Africa, India, Latin America,  

107:34

and other developing parts of the world. That's the thing I think that  

107:39

won't happen on its own. You mentioned export controls.  

107:42

Why shouldn't the US and China both have  a "country of geniuses in a data center"? 

107:48

Why won’t it happen or why shouldn't it happen? Why shouldn't it happen. 

107:54

If this does happen, we  could have a few situations. 

108:02

If we have an offense-dominant  situation, we could have a situation  

108:05

like nuclear weapons, but more dangerous. Either side could easily destroy everything. 

108:14

We could also have a world where it's unstable. The nuclear equilibrium is  

108:19

stable because it's deterrence. But let's say there was uncertainty about,  

108:24

if the two AIs fought, which AI would win?  That could create instability. You often have  

108:30

conflict when the two sides have a different  assessment of their likelihood of winning. 

108:34

If one side is like, "Oh yeah, there's a 90%  chance I'll win," and the other side thinks  

108:40

the same, then a fight is much more likely. They can't both be right,  

108:43

but they can both think that. But this seems like a fully general argument  

108:46

against the diffusion of AI technology. That's the implication of this world. 

108:52

Let me just go on, because I think  we will get diffusion eventually. 

108:55

The other concern I have is that governments  will oppress their own people with AI. 

109:04

I'm worried about a world where you have a country  in which there’s already a government that's  

109:16

building a high-tech authoritarian state. To be clear, this is about the government. 

109:21

This is not about the people. We need to find a way for  

109:24

people everywhere to benefit. My worry here is about governments. 

109:30

My worry is if the world gets carved up  into two pieces, one of those two pieces  

109:33

could be authoritarian or totalitarian in  a way that's very difficult to displace. 

109:39

Now, will governments eventually get powerful  AI, and is there a risk of authoritarianism?  

109:45

Yes. Will governments eventually get  powerful AI, and is there a risk of  

109:52

bad equilibria? Yes, I think both things. But the  initial conditions matter. At some point, we're  

110:00

going to need to set up the rules of the road. I'm not saying that one country, either the United  

110:05

States or a coalition of democracies—which  I think would be a better setup, although it  

110:09

requires more international cooperation than we  currently seem to want to make—should just say,  

110:19

"These are the rules of the road." There's going to be some negotiation. 

110:22

The world is going to have to grapple with this. What I would like is for the democratic nations of  

110:31

the world—those whose governments represent  something closer to pro-human values—to hold the 

110:39

stronger hand and have more leverage  when the rules of the road are set. 

110:44

So I'm very concerned about  that initial condition. 

110:47

I was re-listening to the interview from  three years ago, and one of the ways it  

110:51

aged poorly is that I kept asking questions  assuming there was going to be some key  

110:55

fulcrum moment two to three years from now. In fact, being that far out, it just seems  

111:00

like progress continues, AI improves, AI is more  diffused, and people will use it for more things. 

111:05

It seems like you're imagining a world in the  future where the countries get together, and  

111:09

"Here's the rules of the road, here's the leverage  we have, and here's the leverage you have." 

111:13

But on the current trajectory,  everybody will have more AI. 

111:18

Some of that AI will be used  by authoritarian countries. 

111:20

Some of that within the authoritarian  countries will be used by private  

111:22

actors versus state actors. It's not clear who will benefit more. 

111:26

It's always unpredictable to tell in advance. It seems like the internet privileged  

111:30

authoritarian countries more  than you would've expected. 

111:33

Maybe AI will be the opposite way around. I want to better understand what  

111:38

you're imagining here. Just to be precise about it,  

111:42

I think the exponential of the underlying  technology will continue as it has before. 

111:47

The models get smarter and smarter, even when they  get to a "country of geniuses in a data center." 

111:53

I think you can continue  to make the model smarter. 

111:56

There's a question of getting diminishing  returns on their value in the world. 

112:01

How much does it matter after  you've already solved human biology? 

112:07

At some point you can do harder, more abstruse  math problems, but nothing after that matters. 

112:12

Putting that aside, I do think the exponential  will continue, but there will be certain  

112:18

distinguished points on the exponential. Companies, individuals, and countries  

112:24

will reach those points at different times. In "The Adolescence of Technology" I talk about:  

112:31

Is a nuclear deterrent still  stable in the world of AI? 

112:38

I don't know, but that's an example  of one thing we've taken for granted. 

112:42

The technology could reach such a level  that we can no longer be certain of it.  

112:50

Think of others. There are points where if you  reach a certain level, maybe you have offensive  

112:57

cyber dominance, and every computer system  is transparent to you after that unless the  

113:04

other side has an equivalent defense. I don't know what the critical moment  

113:09

is or if there's a single critical moment. But I think there will be either a critical  

113:14

moment, a small number of critical moments,  or some critical window where AI confers  

113:22

some large advantage from the perspective  of national security, and one country or  

113:30

coalition has reached it before others. I'm not advocating that they just say,  

113:36

"Okay, we're in charge now." That's not how I think about it. 

113:42

The other side is always catching up. There are extreme actions you're not  

113:45

willing to take, and it's not right  to take complete control anyway. 

113:52

But at the point that happens, people are  going to understand that the world has changed. 

113:58

There's going to be some negotiation,  implicit or explicit, about what the  

114:05

post-AI world order looks like. My interest is in making that  

114:14

negotiation be one in which classical  liberal democracy has a strong hand. 

114:24

I want to understand what that better  means, because you say in the essay,  

114:27

"Autocracy is simply not a form of government that  people can accept in the post-powerful AI age." 

114:33

That sounds like you're saying the CCP as an  institution cannot exist after we get AGI. 

114:41

That seems like a very strong demand, and it  seems to imply a world where the leading lab  

114:47

or the leading country will be able to—and  by that language, should get to—determine  

114:54

how the world is governed or what kinds  of governments are, and are not, allowed. 

115:02

I believe that paragraph said something like,  "You could take it even further and say X." 

115:13

I wasn't necessarily endorsing that view. I was saying,  

115:18

"Here's a weaker thing that I believe. We have to worry a lot about authoritarians and  

115:24

we should try to check them and limit their power. You could take this much further and have a more  

115:30

interventionist view that says authoritarian  countries with AI are these self-fulfilling  

115:38

cycles that are very hard to displace, so you  just need to get rid of them from the beginning." 

115:43

That has exactly all the problems you say. If you were to make a commitment to  

115:49

overthrowing every authoritarian country,  they would take a bunch of actions now  

115:53

that could lead to instability. That just may not be possible. 

116:02

But the point I was making that I do  endorse is that it is quite possible that... 

116:09

Today, the view, my view, in most of the Western  world is that democracy is a better form of  

116:16

government than authoritarianism. But if a country’s authoritarian,  

116:21

we don’t react the way we’d react if  they committed a genocide or something. 

116:27

I guess what I'm saying is I'm a little worried  that in the age of AGI, authoritarianism will  

116:32

have a different meaning. It will be a graver thing. 

116:35

We have to decide one way or  another how to deal with that. 

116:39

The interventionist view is one possible view. I  was exploring such views. It may end up being the  

116:47

right view, or it may end up being too extreme.  But I do have hope. One piece of hope I have is  

116:55

that we have seen that as new technologies are  invented, forms of government become obsolete. 

117:04

I mentioned this in "Adolescence of  Technology", where I said feudalism  

117:10

was basically a form of government, and when  we invented industrialization, feudalism was no  

117:18

longer sustainable. It no longer made sense. Why is that hope? Couldn't that imply that  

117:23

democracy is no longer going  to be a competitive system? 

117:26

Right, it could go either way. But these problems with  

117:38

authoritarianism get deeper. I wonder if that's an indicator of  

117:44

other problems that authoritarianism will have. In other words, because authoritarianism becomes  

117:52

worse, people are more afraid of it. They work harder to stop it. 

117:59

You have to think in terms of total equilibrium. I just wonder if it will motivate new ways  

118:07

of thinking about how to preserve and  protect freedom with the new technology. 

118:13

Even more optimistically, will it lead to  a collective reckoning and a more emphatic  

118:22

realization of how important some of the  things we take as individual rights are? 

118:27

A more emphatic realization that  we really can't give these away. 

118:32

We've seen there's no other way  to live that actually works. 

118:39

I am actually hopeful that—it sounds too  idealistic, but I believe it could be the  

118:46

case—dictatorships become morally obsolete. They become morally unworkable forms of  

118:52

government and the crisis that that creates  is sufficient to force us to find another way. 

119:03

I think there is genuinely a tough question  here which I'm not sure how you resolve. 

119:07

We've had to come out one way or  another on it through history. 

119:11

With China in the '70s and '80s,  we decided that even though it's an  

119:15

authoritarian system, we will engage with it. I think in retrospect that was the right call,  

119:18

because it’s a state authoritarian system but  a billion-plus people are much wealthier and  

119:23

better off than they would've otherwise been. It's not clear that it would've stopped being  

119:27

an authoritarian country otherwise. You can just look at North Korea  

119:30

as an example of that. I don't know if it takes  

119:34

that much intelligence to remain an authoritarian  country that continues to consolidate its own power. 

119:40

You can imagine a North Korea with an AI  that's much worse than everybody else's,  

119:44

but still enough to keep power. In general, it seems like we should just  

119:50

have this attitude that the benefits of  AI—in the form of all these empowerments  

119:54

of humanity and health—will be big. Historically, we have decided it's good  

120:00

to spread the benefits of technology widely, even  to people whose governments are authoritarian. 

120:06

It is a tough question, how to think about it  with AI, but historically we have said, "yes,  

120:10

this is a positive-sum world, and it's  still worth diffusing the technology." 

120:15

There are a number of choices we have. Framing this as a government-to-government  

120:20

decision in national security terms is one  lens, but there are a lot of other lenses. 

120:27

You could imagine a world where we  produce all these cures to diseases. 

120:32

The cures are fine to sell to authoritarian  countries, but the data centers just aren't. 

120:38

The chips and the data centers aren't,  and the AI industry itself isn't. 

120:44

Another possibility I think  folks should think about is this. 

120:49

Could there be developments we can make—either  that naturally happen as a result of AI,  

120:55

or that we could make happen by  building technology on AI—that  

120:59

create an equilibrium where it becomes  infeasible for authoritarian countries  

121:05

to deny their people private use  of the benefits of the technology? 

121:12

Are there equilibria where we can give everyone in  an authoritarian country their own AI model that  

121:19

defends them from surveillance and there isn't  a way for the authoritarian country to crack  

121:24

down on this while retaining power? I don't know.  That sounds to me like if that went far enough,  

121:29

it would be a reason why authoritarian  countries would disintegrate from the inside. 

121:35

But maybe there's a middle world where there's  an equilibrium where, if they want to hold on  

121:39

to power, the authoritarians can't deny  individualized access to the technology. 

121:45

But I actually do have a hope  for the more radical version. 

121:50

Is it possible that the technology  might inherently have properties—or  

121:54

that by building on it in certain ways  we could create properties—that have this  

122:01

dissolving effect on authoritarian structures? Now, we hoped originally—think back to the  

122:07

beginning of the Obama administration—that  social media and the internet would have  

122:13

that property, and it turned out not to. But what if we could try again with the  

122:20

knowledge of how many things could go wrong,  and that this is a different technology? 

122:23

I don't know if it would  work, but it's worth a try. 

122:26

It's just very unpredictable. There  are first principles reasons why  

122:30

authoritarianism might be privileged. We just  

122:35

have to recognize the problem and come  up with 10 things we can try, try those,  

122:40

and then assess which ones are working, if any. Then try new ones if the old ones aren't working. 

122:46

But I guess that nets out to today, as you  say, that we will not sell data centers,  

122:51

chips, or the ability to make chips to China. So in some sense, you are denying… There would be  

122:58

some benefits to the Chinese economy, Chinese  people, et cetera, because we're doing that. 

123:02

Then there'd also be benefits to the American  economy because it's a positive-sum world.  

123:06

We could trade. They could have their  country's data centers doing one thing. 

123:08

We could have ours doing another. Already, you're saying it's not worth that  

123:14

positive-sum stipend to empower those countries? What I would say is that we are about to be  

123:22

in a world where growth and economic  value will come very easily if we're  

123:27

able to build these powerful AI models. What will not come easily is distribution  

123:35

of benefits, distribution of  wealth, political freedom. 

123:40

These are the things that are  going to be hard to achieve. 

123:43

So when I think about policy, I think that the  technology and the market will deliver all the  

123:50

fundamental benefits (this is my fundamental belief) almost faster than we can take them. 

123:55

These questions about distribution and political  freedom and rights are the ones that will actually  

124:02

matter and that policy should focus on. Speaking of distribution, as you were  

124:06

mentioning, we have developing countries. In many cases, catch-up growth has been  

124:12

weaker than we would have hoped for. But when catch-up growth does happen,  

124:15

it's fundamentally because  they have underutilized labor. 

124:18

We can bring the capital and know-how from  developed countries to these countries,  

124:21

and then they can grow quite rapidly. Obviously, in a world where labor is no  

124:26

longer the constraining factor,  this mechanism no longer works. 

124:30

So is the hope basically to  rely on philanthropy from  

124:33

the people or countries who immediately  get wealthy from AI? What is the hope? 

124:38

Philanthropy should obviously play  some role, as it has in the past. 

124:44

But I think growth is always better and  stronger if we can make it endogenous. 

124:50

What are the relevant industries  in an AI-driven world? 

124:58

I said we shouldn't build data centers in  China, but there's no reason we shouldn't  

125:00

build data centers in Africa. In fact, I think it'd be  

125:04

great to build data centers in Africa. As long as they're not owned by China,  

125:08

we should build data centers in Africa. I think that's a great thing to do. 

125:16

There's no reason we can't build a  pharmaceutical industry that's AI-driven. 

125:22

If AI is accelerating drug discovery, then  there will be a bunch of biotech startups. 

125:28

Let's make sure some of those  happen in the developing world. 

125:31

Certainly, during the transition—we can  talk about the point where humans have no  

125:34

role—humans will still have some role in starting  up these companies and supervising the AI models. 

125:41

So let's make sure some of those  humans are in the developing world  

125:44

so that fast growth can happen there as well. You guys recently announced that Claude is going  

125:48

to have a constitution that's aligned to a set of  values, and not necessarily just to the end user. 

125:53

There's a world I can imagine where  if it is aligned to the end user,  

125:56

it preserves the balance of power we have in the  world today because everybody gets to have their  

125:59

own AI that's advocating for them. The ratio of bad actors to  

126:03

good actors stays constant. It seems to work out for our world today. 

126:07

Why is it better not to do that, but to  have a specific set of values that the  

126:12

AI should carry forward? I'm not sure I'd quite  

126:16

draw the distinction in that way. There may be two relevant distinctions here. 

126:22

I think you're talking about a mix of the two. One is, should we give the model a set of  

126:27

instructions about "do this"  versus "don't do this"? 

126:31

The other is, should we give the model  a set of principles for how to act? 

126:44

It's kind of purely a practical and  empirical thing that we've observed. 

126:48

By teaching the model principles,  getting it to learn from principles,  

126:52

its behavior is more consistent, it's easier  to cover edge cases, and the model is more  

126:58

likely to do what people want it to do. In other words, if you give it a list of  

127:09

rules—"don't tell people how to hot-wire  a car, don't speak in Korean"—it doesn't  

127:10

really understand the rules, and  it's hard to generalize from them. 

127:15

It’s just a list of dos and don’ts. Whereas if you give it principles—it  

127:21

has some hard guardrails like "Don't make  biological weapons" but—overall you're  

127:25

trying to understand what it should be aiming  to do, how it should be aiming to operate. 

127:31

So just from a practical perspective, that turns  out to be a more effective way to train the model. 
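
(A minimal, hypothetical sketch of the contrast being described here, in the spirit of Anthropic's published constitutional-AI work rather than any actual implementation: the `generate` callable, the blocked-topic list, and the constitution text below are all illustrative stand-ins, not real Anthropic code or policy.)

```python
# Hypothetical illustration only: not Anthropic's training code, API, or
# actual constitution. `generate` stands in for any text-completion function.
from typing import Callable

# Rules approach: literal do's and don'ts the model never really "understands".
BLOCKED_TOPICS = ["hot-wire a car"]  # a rule only covers the exact case it names

# Principles approach: a short constitution describing what the model should
# be aiming to do, which it can reason over and generalize to unseen edge cases.
CONSTITUTION = (
    "Be broadly helpful and follow the user's instructions by default. "
    "Refuse only when a request would cause serious harm, such as helping "
    "to create biological weapons."
)


def respond_with_rules(prompt: str, generate: Callable[[str], str]) -> str:
    """Rules as literal filters: easy to enforce, hard to generalize from."""
    if any(topic in prompt.lower() for topic in BLOCKED_TOPICS):
        return "I can't help with that."
    return generate(prompt)


def respond_with_principles(prompt: str, generate: Callable[[str], str]) -> str:
    """One critique-and-revise pass guided by principles (heavily simplified)."""
    draft = generate(f"{CONSTITUTION}\n\nUser: {prompt}\nAssistant:")
    critique = generate(
        f"Constitution: {CONSTITUTION}\n\nDraft reply: {draft}\n\n"
        "Does the draft violate the constitution? Explain briefly."
    )
    revised = generate(
        f"Constitution: {CONSTITUTION}\n\nDraft reply: {draft}\n\n"
        f"Critique: {critique}\n\nRewrite the reply so it follows the constitution."
    )
    return revised
```

The point of the sketch is only that a literal rule catches exactly the cases it names, whereas a principle is text the model can be asked to reason over, which is why it tends to cover edge cases more consistently.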

127:35

That's the rules versus principles trade-off. Then there's another thing you're talking about,  

127:42

which is the corrigibility versus  intrinsic motivation trade-off. 

127:51

How much should the model be a kind  of "skin suit" where it just directly  

128:02

follows the instructions given to it by  whoever is giving those instructions,  

128:06

versus how much should the model have an inherent  set of values and go off and do things on its own? 

128:14

There I would actually say everything about the model is closer to the direction where  

128:21

it should mostly do what people want.  It should mostly follow instructions.  

128:24

We're not trying to build something that  goes off and runs the world on its own. 

128:29

We're actually pretty far on the corrigible side. Now, what we do say is there are certain  

128:34

things that the model won't do. I think we say it in various ways in the  

128:40

constitution, that under normal circumstances, if  someone asks the model to do a task, it should do  

128:45

that task. That should be the default. But if  you've asked it to do something dangerous, or  

128:54

to harm someone else, then the  model is unwilling to do that. 

129:01

So I actually think of it as a mostly  corrigible model that has some limits,  

129:07

but those limits are based on principles. Then the fundamental question is,  

129:12

how are those principles determined? This is not a special question for Anthropic. 

129:15

This would be a question for any AI company. But because you have been the ones to actually  

129:22

write down the principles, I  get to ask you this question. 

129:25

Normally, a constitution is written down,  set in stone, and there's a process of  

129:29

updating it and changing it and so forth. In this case, it seems like a document  

129:34

that people at Anthropic write,  that can be changed at any time,  

129:37

that guides the behavior of systems that are going  to be the basis of a lot of economic activity. 

129:45

How do you think about how  those principles should be set? 

129:50

I think there are maybe three sizes  of loop here, three ways to iterate. 

129:58

One is we iterate within Anthropic. We train the model, we're not happy with it,  

130:01

and we change the constitution. I think that's good to do. 

130:06

Putting out public updates to the  constitution every once in a while  

130:10

is good because people can comment on it. The second level of loop is different companies  

130:16

having different constitutions. I think it’s  useful. Anthropic puts out a constitution,  

130:21

Gemini puts out a constitution, and  other companies put out a constitution. 

130:28

People can look at them and compare. Outside observers can critique and say,  

130:34

"I like this thing from this constitution  and this thing from that constitution." 

130:40

That creates a soft incentive and  feedback for all the companies to  

130:45

take the best of each element and improve. Then I think there's a third loop, which is  

130:50

society beyond the AI companies and beyond  just those who comment without hard power.  

130:59

There we've done some experiments. A couple years  ago, we did an experiment with the Collective  

131:04

Intelligence Project to basically poll people and  ask them what should be in our AI constitution. 

131:15

At the time, we incorporated  some of those changes. 

131:17

So you could imagine doing something  like that with the new approach we've  

131:19

taken to the constitution. It's a little harder because  

131:23

it was an easier approach to take when the  constitution was a list of dos and don'ts. 

131:29

At the level of principles, it has to  have a certain amount of coherence. 

131:32

But you could still imagine getting  views from a wide variety of people. 

131:37

You could also imagine—and this  is a crazy idea, but this whole  

131:42

interview is about crazy ideas—systems of  representative government having input. 

131:52

I wouldn't do this today because  the legislative process is so slow. 

131:55

This is exactly why I think we should be careful  about the legislative process and AI regulation. 

132:00

But there's no reason you couldn't, in principle,  say, "All AI models have to have a constitution  

132:06

that starts with these things, and then you can  append other things after it, but there has to  

132:13

be this special section that takes precedence."  I wouldn't do that. That's too rigid and sounds  

132:22

overly prescriptive in a way that I  think overly aggressive legislation is. 

132:26

But that is a thing you could try to do. Is there some much less heavy-handed  

132:32

version of that? Maybe. I really like control loop two. 

132:37

Obviously, this is not how constitutions  of actual governments do or should work. 

132:42

There's not this vague sense in which the  Supreme Court will feel out how people  

132:46

are feeling—what are the vibes—and  update the constitution accordingly. 

132:50

With actual governments, there's  a more formal, procedural process. 

132:55

But you have a vision of competition between  constitutions, which is actually very reminiscent  

133:01

of how some libertarian charter-city people used to talk about what an archipelago of different  

133:07

kinds of governments would look like. There would be selection among them of  

133:10

who could operate the most effectively  and where people would be the happiest. 

133:15

In a sense, you're recreating that  vision of a utopia of archipelagos. 

133:23

I think that vision has things to recommend  it and things that will go wrong with it. 

133:31

It's an interesting, in some ways  compelling, vision, but things will  

133:34

go wrong that you hadn't imagined. So I like loop two as well,  

133:40

but I feel like the whole thing has got to  be some mix of loops one, two, and three,  

133:46

and it's a matter of the proportions. I think that's gotta be the answer. 

133:53

When somebody eventually writes the equivalent  of The Making of the Atomic Bomb for this era,  

133:58

what is the thing that will be hardest  to glean from the historical record that  

134:02

they're most likely to miss? I think a few things. One is,  

134:06

at every moment of this exponential, the extent to  which the world outside it didn't understand it. 

134:12

This is a bias that's often present in history. Anything that actually happened looks  

134:17

inevitable in retrospect. When people look back, it will  

134:24

be hard for them to put themselves in the place of people who were actually making a bet that this  

134:32

thing would happen when it wasn't inevitable, that we had these arguments, like the arguments I make for  

134:38

scaling or that continual learning will be solved. Some of us internally put a high probability  

134:48

on this happening, but there's a world  outside us that's not acting on that at all. 

134:58

I think the weirdness of it,  unfortunately the insularity of it... 

135:07

If we're one year or two  years away from it happening,  

135:10

the average person on the street has no idea. That's one of the things I'm trying to change with  

135:14

the memos, with talking to policymakers. I don’t know, but I think  

135:19

that's just a crazy thing. Finally, I would say—and this  

135:27

probably applies to almost all historical moments  of crisis—how absolutely fast it was happening,  

135:33

how everything was happening all at once. Decisions that you might think were  

135:39

carefully calculated, well actually  you have to make that decision,  

135:42

and then you have to make 30 other decisions on  the same day because it's all happening so fast. 

135:47

You don't even know which decisions are  going to turn out to be consequential. 

135:52

One of my worries—although it's also an  insight into what's happening—is that some  

136:00

very critical decision will be some decision  where someone just comes into my office and  

136:05

is like, "Dario, you have two minutes. Should we do thing A or thing B on this?" 

136:14

Someone gives me this random half-page memo  and asks, "Should we do A or B?" I'm like, "I  

136:20

don't know. I have to eat lunch. Let's do B." That  ends up being the most consequential thing ever. 

136:26

So, final question. There aren't many tech CEOs who write 50-page memos every few months. 

136:35

It seems like you have managed to build  a role for yourself and a company around  

136:40

you which is compatible with this  more intellectual-type role of CEO. 

136:47

I want to understand how you construct that.  How does that work? Do you just go away for  

136:53

a couple of weeks and then you tell your  company, "This is the memo. Here's what  

136:56

we're doing"? It's also reported that  you write a bunch of these internally. 

136:59

For this particular one, I  wrote it over winter break. 

137:04

I was having a hard time finding  the time to actually write it. 

137:08

But I think about this in a broader way. I think it relates to the culture of the company. 

137:13

I probably spend a third, maybe 40%, of my time  making sure the culture of Anthropic is good. 

137:19

As Anthropic has gotten larger, it's gotten  harder to get directly involved in the training  

137:26

of the models, the launch of the models,  the building of the products. It's 2,500  

137:30

people. I have certain instincts, but it's very  difficult to get involved in every single detail. 

137:41

I try as much as possible, but one thing that's  very leveraged is making sure Anthropic is a good  

137:46

place to work, people like working there, everyone  thinks of themselves as team members, and everyone  

137:51

works together instead of against each other. We've seen as some of the other AI companies  

137:57

have grown—without naming any names—we're starting  to see decoherence and people fighting each other. 

138:03

I would argue there was even a lot of that  from the beginning, but it's gotten worse. 

138:08

I think we've done an extraordinarily good  job, even if not perfect, of holding the  

138:14

company together, making everyone feel the  mission, that we're sincere about the mission,  

138:19

and that everyone has faith that everyone  else there is working for the right reason. 

138:23

That we're a team, that people aren't trying  to get ahead at each other's expense or  

138:28

backstab each other, which again, I think  happens a lot at some of the other places. 

138:33

How do you make that the case?  It's a lot of things. It's me,  

138:36

it's Daniela, who runs the company  day to day, it's the co-founders,  

138:41

it's the other people we hire, it's  the environment we try to create. 

138:44

But I think an important thing in the culture is  that the other leaders as well, but especially me,  

138:53

have to articulate what the company is  about, why it's doing what it's doing,  

138:58

what its strategy is, what its values are,  what its mission is, and what it stands for. 

139:06

When you get to 2,500 people, you  can't do that person by person. 

139:09

You have to write, or you have  to speak to the whole company. 

139:12

This is why I get up in front of the whole  company every two weeks and speak for an hour. 

139:18

I wouldn't say I write essays internally.  I do two things. One, I write this thing  

139:22

called a DVQ, Dario Vision Quest. I wasn't the one who named it that. 

139:27

That's the name it received, and it's one of these  names that I tried to fight because it made it  

139:32

sound like I was going off and smoking peyote or  something. But the name just stuck. So I get up  

139:38

in front of the company every two weeks. I have a three or four-page document,  

139:43

and I just talk through three or four different  topics about what's going on internally,  

139:49

the models we're producing, the products,  the outside industry, the world as a whole  

139:54

as it relates to AI and geopolitically  in general. Just some mix of that. I go  

139:59

through very honestly and I say, "This is what I'm  thinking, and this is what Anthropic leadership  

140:06

is thinking," and then I answer questions. That direct connection has a lot of value that  

140:13

is hard to achieve when you're passing  things down the chain six levels deep. 

140:19

A large fraction of the company comes to  attend, either in person or virtually. 

140:27

It really means that you can communicate a lot. The other thing I do is I have a channel in  

140:32

Slack where I just write a bunch  of things and comment a lot. 

140:36

Often that's in response to things I'm seeing  at the company or questions people ask. 

140:44

We do internal surveys and there are things people  are concerned about, and so I'll write them up. 

140:50

I'm just very honest about these things. I just say them very directly. 

140:56

The point is to get a reputation for telling the company the truth about what's happening, to call  

141:01

things what they are, to acknowledge problems,  to avoid the sort of corpo speak, the kind of  

141:07

defensive communication that often is necessary in  public because the world is very large and full of  

141:14

people who are interpreting things in bad faith. But if you have a company of people who you trust,  

141:21

and we try to hire people that we trust, then  you can really just be entirely unfiltered. 

141:31

I think that's an enormous  strength of the company. 

141:33

It makes it a better place to work, it makes  people more than the sum of their parts,  

141:38

and increases the likelihood that we accomplish  the mission because everyone is on the same page  

141:41

about the mission, and everyone is debating and  discussing how best to accomplish the mission. 

141:46

Well, in lieu of an external Dario  Vision Quest, we have this interview. 

141:50

This interview is a little like that. This has been fun, Dario. Thanks for doing it. 

141:54

Thank you, Dwarkesh.

