HomeVideos

David Reich – Bronze Age shock, the Neanderthal puzzle, & the sudden spread of farming

Now Playing

David Reich – Bronze Age shock, the Neanderthal puzzle, & the sudden spread of farming

Transcript

1513 segments

0:00

Humans, at least in this part of the  world, were wrenched into a way of  

0:04

living that was so different from how  their hunter-gatherer ancestors lived  

0:08

that the organism had to adapt very strongly. Maybe the degree of that wrenching process moving  

0:15

into the Bronze Age was qualitatively greater  than the degree of the wrenching process that  

0:20

happened from the initial transition to growing  plants, which is surprising, because our cartoon  

0:26

picture is that the big transition is farming. But the genetic data, the biological readout, is  

0:32

saying our genome is reacting much more strongly  to these events that happened 5,000 years ago. 

0:38

I am back with David Reich, who is a  professor of ancient DNA at Harvard. 

0:44

How do you describe what it is that you study? I'm a geneticist, and I work on human history  

0:51

and how ancient people relate to  each other and people living today. 

0:57

We did an interview two years ago,  which ended up being one of the  

1:03

most popular interviews I've ever done. I think people found it really compelling  

1:06

that there's so much about human history we don't  know and are just learning about now as a result  

1:10

of the kinds of techniques your lab is using. You have a new preprint that's very exciting,  

1:17

and I wanted to talk to you about it. Can you give me a little bit of context  

1:22

on what we're talking about today? The dream was that when this ancient  

1:27

DNA field started, more than 16 or 17 years  ago, we were going to learn a lot about  

1:33

biology —about how people's biology changed  over time— by getting DNA out of ancient  

1:38

human remains and tracking changes over time. And that dream has really not been realized  

1:45

since the beginning of this field. The field has been a big success  

1:49

with regard to learning about human history. It's resulted in surprising findings about  

1:55

human migrations —people not being descended  from the people who lived in the same place  

2:00

hundreds or thousands or tens of thousands of  years before— and mixture being common in human  

2:04

history, and sex-biased processes being common. And there have been things that were  

2:09

not expected from archaeology. The field's been a big success from  

2:12

that perspective, but what's not been successful  is learning about biology and biological change. 

2:19

One big reason has been that the  sample sizes have been too small. 

2:24

When you have a single person's DNA, it provides  a tremendous amount of information about history. 

2:30

That's because when you look at one person's  DNA, it's not a single person. It's many  

2:34

people. It's your two parents, your four  grandparents, your eight great-grandparents,  

2:39

16 great-great-grandparents, and so on. Going back in time, thousands,  

2:44

tens of thousands, even hundreds of thousands  of ancestors are contributing to people today. 

2:50

When you look at the DNA of a single  person's genome or a Neanderthal genome,  

2:54

you have effectively tens of thousands of  ancestors all represented in your data. 

2:59

And you can position that  individual exquisitely with  

3:02

respect to other people from whom you have data. But when you are interested in how a particular  

3:08

genetic variant—that affects something like  your skin pigmentation, or your ability to  

3:12

digest cow's milk into adulthood, or a behavioral  trait—changes over time, a single person gives  

3:19

you only one sample, or maybe two samples: the  one in their mother and the one in their father. 

3:25

To get a high-resolution picture of how the  frequency changes over time, you need very big  

3:31

sample sizes, truly very large numbers of people. We just didn't have that until the last few years. 

3:37

What motivates the study we're talking about  today, and the work that hopefully a number  

3:43

of groups will be doing in the coming years, is  the fact that we now finally have those numbers. 

3:48

We can do something with the data to  see how frequency changes over time. 

3:52

Can I ask a question? I'll be asking a lot of  naive questions through the next few hours, but  

3:56

why are frequency changes especially interesting? What we're interested in is using the experiment  

4:03

of nature that's occurred in our history, over  the last tens of thousands of years, to understand  

4:09

what's biologically significant in our DNA. If there has been a change in environment  

4:17

that a population has experienced—for example,  people shifted to agriculture, began living close  

4:23

to domesticated animals, or moved from a cold  place to a warm place, or a low place to a high  

4:29

place—then there's pressure on the population  to adapt to these new stresses and new needs. 

4:36

The way you're going to detect that  is by seeing that the frequency of a  

4:40

genetic variant—that for example might  allow you to live at higher altitude,  

4:46

or that might nudge you to have a different  behavioral pattern advantageous in the new  

4:51

situation—pushes systematically in some direction  in a way that is enough for you to detect. 

4:58

It's very hard to detect slight shifts in  frequency by a few percent or ten percent  

5:03

unless you have a very big sample size. What we're looking for are those changes  

5:08

in frequency that are too  extreme to be due to chance. 

5:11

That will tell us there have been pushes against  the biology as a result of the changes in  

5:16

environment that people have experienced. Interesting. What did you guys find? 

5:21

Seven years ago, Ali Akbari, who at the time was a  postdoctoral scientist in my laboratory and a few  

5:29

years later became a permanent staff scientist,  set out to use the data we were producing to  

5:36

learn about biological change over time. I think the reason he was interested in  

5:40

our laboratory rather than other places was  that a focus of our lab has been generating  

5:45

truly large amounts of data from ancient humans. We've been trying to industrialize the process,  

5:50

make it very inexpensive, make it high quality,  and generate large numbers of samples with lots  

5:56

of good data for this purpose. There's been this large amount  

6:00

of data we've generated. And it made it possible to  

6:03

conceive again of asking whether there  have been frequency changes over time. 

6:08

The mainstream view in human evolution in the  last several decades has been that natural  

6:15

selection has been pretty quiescent over the last  several hundred thousand years of human history. 

6:20

There are several lines of evidence that  have been deployed to document this. 

6:25

One is that if you compare diverse populations  from different continents around the world,  

6:30

for example Europeans and East Asians, and  you look at mutations that differ in frequency  

6:35

between these groups—all mutations differ a little  bit in frequency, sometimes a lot—you can say,  

6:40

"What are the most different mutations in terms  of frequency between Europeans and East Asians?" 

6:44

And there are almost no genetic  changes that are 100% different  

6:47

in frequency between Europeans and East Asians. Europeans and East Asians descend from a common  

6:52

ancestral population 40,000 or 50,000 years ago  that came out of Africa and the Middle East. 

6:58

This population had a set of gene frequencies,  and these variants bopped around randomly—a  

7:04

process known as genetic drift—or perhaps  under selection in one direction or another. 

7:09

The time that's passed since 40,000 or  50,000 years ago is sufficiently small on  

7:14

an evolutionary timescale that there's  just not much genetic differentiation  

7:18

on average between these two groups. However, if there's been natural selection, for  

7:24

example to help people in one place digest alcohol  better, or digest milk better, what you might  

7:33

expect is that there would be some mutation that  would have rocketed up to very high frequency. 

7:38

Forty or fifty thousand years is a lot of  time, it's maybe 1,500-2,000 generations. 

7:43

That might easily be enough time to  see a 100% difference in frequency. 

7:47

Yet you don't see any more than  what you would expect by chance. 

7:51

This combination of things made it seem  that selection has just been quiescent. 

7:54

Maybe a few hundred thousand years ago, the  ancestral human population got to some kind  

7:58

of optimum, and after that there hasn't been  much genetic change in one way or the other. 

8:04

There have been small amounts of natural  selection, or selection to remove bad mutations  

8:09

that are constantly raining down on the genome,  but not what we call directional selection. 

8:13

That would be newly arising mutations, or  mutations being pushed in a systematic direction,  

8:19

to help the population get to a different  adaptive set point more favorable for the  

8:26

conditions that population is living in. We were able to partition how much of the  

8:31

changes in frequencies of all the mutations that  we're seeing in the DNA—we're looking at about 10  

8:37

million positions that vary—is due to directional  selection (adaptation) versus other factors,  

8:43

especially genetic drift. And 98% of it is other  

8:47

factors, especially genetic drift. It's overwhelmingly migrations and population  

8:52

structure causing fluctuations in frequency. As a result, it's super hard to detect the signals  

8:58

of adaptive natural selection because they're  a tiny fraction of the total frequency change. 

9:04

The vast majority of it are  these migrations and mixtures. 

9:07

Nevertheless, there's so much natural  selection, as our study has shown,  

9:12

that it's actually been rampant in the genome. Can I ask a clarifying question here? 

9:16

Why are we discounting population  admixture or replacement as selection? 

9:22

If you think about it at a group  level, if one population replaces  

9:26

another population, isn't that selection? I remember from the last episode you were  

9:30

explaining how there have been huge changes in  what kinds of people are in a specific area. 

9:36

One population came in and replaced  the previous one, and then a new  

9:38

population came in and replaced that one. To the extent that the genetics are relevant  

9:42

to why that population replaced the other one, why  should that not count towards what we understand  

9:49

to be selection over the last 10,000 years? It could count, and may count, and probably  

9:55

should count in some respects. But it could also be that this  

9:59

population replacement is due to some  cultural phenomenon —technology held by  

10:04

one of these groups and not others. And maybe there are some genetic  

10:07

mutations that are contributing to  this. Who knows? It's possible. But  

10:11

what you're seeing is a whole-genome shift. What we're looking to see is whether there's one  

10:16

place in the DNA that is driving the change in a  way that's different from the rest of the genome. 

10:22

From a statistical point of view, what happens  at these times of migration is there are just  

10:26

huge fluctuations in frequencies. These are extremely uninformative  

10:30

times for detecting natural selection. The best moments to detect natural selection  

10:36

are when migrations and population admixtures  are not happening for a few hundred years. 

10:41

During these times, you can  actually see the mutation  

10:44

slowly blowing in one direction as a result. The way we think about the history of Europe and  

10:51

the Middle East for the purpose of this study is  as an archipelago of little populations in space  

10:57

and time, each pretty isolated from each other. You have a little population in Britain isolated  

11:02

for a few hundred years, or a little population  in Hungary isolated for a few hundred years,  

11:06

between big events of migration and mixture. In each of those little experiments of nature,  

11:10

we can ask: does this mutation  slightly increase in frequency? 

11:13

Does that same mutation  slightly increase in frequency? 

11:16

If all the arrows point in  the same direction, we win. 

11:20

They're telling us that  natural selection is occurring. 

11:23

For example, 4,500 years ago in Europe, almost  all mutations went through huge frequency changes. 

11:29

That's not because of natural selection. It's because of the steppe migration from  

11:33

north of the Black and Caspian Sea. 40-80% of  the DNA becomes Yamnaya from steppe pastoralists. 

11:40

Their frequencies of mutations were different  not because of selection necessarily, but just  

11:45

because they had evolved in different places  for thousands and tens of thousands of years. 

11:48

When you look at the descendant populations,  there are huge changes in frequency. 

11:53

What you need to do is see if natural  selection is explaining a shift more  

11:57

than you would expect by chance. Ok, in this next section David  

12:01

explained the nitty-gritty of the methodology  of this paper. It's honestly a bit technical,  

12:05

and I wanted you to get a sense of the results  first, so I've moved that section to the end.  

12:09

If you want to understand the methodology  just stick around for the full episode. 

12:13

So you found these locations that seem to be  under selection. I have another clarifying  

12:18

question. You say you found 3,800 locations  which you're 50% confident have been under  

12:28

selection in the last 10,000 years. It's 7,200 where we're 50% confident. 

12:33

We're getting about 7,200 positions in the  DNA that have 50% confidence of being real. 

12:40

Only half of those are real—we don't know  which ones—so 3,600 of them are real. 

12:48

Does that also mean that outside of  those 7,200, you're confident the other  

12:55

locations in the genome are not under selection? No. If you look at the 25% probability cutoff,  

13:00

there will be tens of thousands, and  there will be many real ones there too. 

13:04

In fact, multiple analyses we do suggest that  the genome is vibrating with natural selection. 

13:11

There are all sorts of weaker  effects that would be picked  

13:15

up in even larger studies than we've done. In fact, almost every position in the DNA is  

13:23

correlated to another position that  is being dragged in one way or the  

13:26

other by natural selection. Instead of being quiescent,  

13:30

natural selection is everywhere. Even though it's only 2% of the  

13:33

frequency change, it's tugging the positions  in one direction or the other everywhere. 

13:38

So we analyzed these positions that we  had identified, the hundreds of positions  

13:43

we were super confident about. We looked to see whether they  

13:46

were randomly distributed in the  DNA or whether they had patterns. 

13:51

We looked at maybe 100 or so traits where  there had been genome-wide association  

13:55

studies for all sorts of different traits,  associated with immunity or autoimmunity or  

14:01

behavior or metabolism, and other things. For each of these we could ask: do the  

14:08

genetic variations that are known to affect these  traits from genome-wide association studies have  

14:14

an unusual number of genetic selection signals? What we found is that there was a vast enrichment,  

14:20

by about four or five-fold, for immune traits. There was a super concentration of selected  

14:27

signals in immune traits. We also saw a strong enrichment  

14:32

for metabolic traits—things that might impact  obesity or fat traits or Type 2 diabetes—and  

14:38

almost no detectable enrichment, as far as we  could tell, for behavioral or psychiatric traits. 

14:47

Just to make sure I understand. This is not to say that behavioral or psychiatric  

14:51

or cognitive traits are not under selection. It's just that the individual sites where such  

14:57

traits are controlled are not especially  likely to be among the locations you've  

15:04

identified as under selection. That's exactly right. It might  

15:08

seem from the results of that analysis that  immune traits are highly selected and that  

15:15

there's been no selection for behavior in the  last 18,000 years in this part of the world. 

15:19

But that's a wrong conclusion, and we have  evidence that it's a wrong conclusion. 

15:25

There's clear evidence of selection  also on behavioral traits. 

15:29

The reason we think we see much weaker signals  for behavioral traits is that behavioral traits,  

15:38

we know from medical studies, are underpinned by  much larger numbers of genes than immune traits,  

15:45

which are underpinned by relatively  small numbers of genes of strong effect. 

15:48

Behavioral traits are shaped genetically by  a very large number of genes of weak effect,  

15:54

and we just don't have the statistical  power to detect these very weak signals. 

15:59

When we do an analysis looking at our very  strong signals of selection, that collection  

16:05

of very strong results is very effectively  querying the immune traits, but is not very  

16:10

effectively querying the behavioral traits. It may still be the case, and I guess it is,  

16:15

that immune traits are the most selected category. But it is not at all the case—and we  

16:20

can prove it's not the case—that  behavioral traits are not selected. 

16:27

Interesting. We've been able to  

16:30

prove that there are two ways to reconcile the  previous observations with our new observations. 

16:36

Remember, the previous observation is that  natural selection seems to have been quiescent  

16:42

over a timescale of hundreds of thousands or  many tens of thousands of years. Reason? That  

16:45

you don't see 100% difference in frequency  variance across Europeans and East Asians. 

16:50

Now we're seeing hundreds of positions  that are rocketing up in frequency with  

16:54

selection rates of 1% or more in a lot of cases. A 1% or more selection rate will mean a rapid  

17:01

doubling over periods of dozens of generations. Over the 1,500 or 2,000 generations separating  

17:08

Europeans and East Asians, shouldn't you see  many genetic variants that are 100% different  

17:13

in frequency across populations? We were able to show that this  

17:17

is explained by at least two factors. One is that in this part of the world—Europe  

17:22

and the Middle East—we are actually in a  period of accelerated natural selection. 

17:27

One way to see this is to look at the enrichment  pattern we're observing, where immune traits are  

17:33

unusually associated with these selection signals. We could compare the last 5,000 years of our  

17:39

time period, what's called the Bronze Age and  further onward, to the previous 5,000 years. 

17:44

What we see is that this intensification of  selection around immune traits, and similarly  

17:49

the intensification around metabolic traits,  has accelerated over this time period. 

17:54

It's not like natural selection has been  at the same rate over all places and times. 

17:59

It's increasing over the  time period we're analyzing. 

18:02

Plausibly the whole time period has  increased compared to previous periods. 

18:06

We're in a period of intensified selection. That's not implausible, because this is a  

18:12

population that went through a huge shock in  terms of the way people live and the culture. 

18:18

Almost everyone we're analyzing are farmers  or food producers in one way or another. 

18:22

Farming was invented for the first time  anywhere in the world in the Middle  

18:25

East 11,000 or 12,000 years ago. The people who invented farming  

18:30

exploded into Europe after 8,500 years ago,  spread across the continent, and expanded rapidly. 

18:35

In the Bronze Age, there was an  intensification of how people lived,  

18:38

with much higher population densities. People were living more and more next to  

18:42

their animals and getting their  diseases, and exchanging their  

18:45

diseases with the animals and with each other. This is a period of rapid change in how people  

18:50

are living, resulting in different  biological needs of this population. 

18:58

It's not surprising, perhaps, that in the context  of these dramatic changes, the biology of the  

19:04

population might not be ideally adapted. There might be what some people call an  

19:12

evolutionary mismatch, where you take  a genetic variation that evolved in  

19:17

hunter-gatherers and put it into farmers or  pastoralists, and it's not exactly right. 

19:23

What you're seeing is the DNA of this population,  which descended from hunter-gatherers only 10,000  

19:27

years ago, reacting to the shock of having  been moved into an agricultural, Bronze Age,  

19:33

high-population-density, urban environment. A hypothesis is that what we're seeing is  

19:38

the adaptation that occurs as a result. In the paper you have many examples of  

19:47

this intensification of  selection around the Bronze Age. 

19:52

It might be helpful to go through some of these. One of the things we do in this work is look  

19:59

carefully at many of these positions in the DNA. We actually have an internet browser called the  

20:06

AGES browser, which Ali and a colleague of  his—who's a co-author of our paper—built. 

20:11

It allows you to query each of these 10 million  positions and see the trajectories at each  

20:15

position and the evidence for selection. One of the things we see is that,  

20:20

while for the most part the signals of natural  selection we detect are consistent with constant  

20:26

natural selection over time, in a handful of  them we're able to see that there's been a  

20:30

reversal or a radical change in natural selection. Very often that occurs in the period between 5,000  

20:37

to 2,000 years ago, which is the Bronze Age  and the Iron Age, a period of rapid population  

20:43

growth and rapid movement to intensive use of many  technologies that were not used that way before. 

20:52

An example of this is the TYK2 genetic  variant that is a major risk factor for  

20:58

severe tuberculosis, which is the most important  infectious disease killer in the world today. 

21:05

If you look at this major risk factor for  tuberculosis, this variant rockets up in  

21:11

frequency from 8,000 or 6,000 years ago to  maybe 9% or 10% in this part of the world. 

21:16

Then it rockets down in frequency  in the last 3,000 years. 

21:20

In both cases, there's very clear  evidence of natural selection,  

21:23

in the first case to increase in frequency,  and in the next case to decrease in frequency. 

21:28

A possible reason is the spread of tuberculosis. It maybe becomes endemic in the  

21:33

population 2,000 or 3,000 years ago. That's potentially consistent with pathogen  

21:38

sequence data and other lines of evidence. And maybe this variant was protecting against  

21:43

something before then, but then tuberculosis  became significant after that point,  

21:49

and it was so bad that it pushed in the  opposite direction. That's speculative. 

21:53

The thing it was protecting against  was probably another disease? 

21:56

Maybe. Prepping for  

21:58

this episode required a full lit review. I needed to understand why other methods  

22:02

had failed to find evidence of natural  selection over the last 10,000 years. 

22:06

What exactly did Reich and Akbari do differently? Honestly, this was quite subtle because the most  

22:12

important points were distributed  across a bunch of different papers. 

22:15

And it was frustrating to talk to LLMs  about it because they kept getting confused. 

22:19

One of them would fail to  understand an important crux. 

22:22

And so I’d switch over to a different  model, and that one would get tripped  

22:25

up on the very next point. I ended up using Cursor to  

22:27

kick off a handful of models at the same  time and compare their results after. 

22:31

I could have one model critique  the response of another. 

22:34

This was super useful because  while I'm not a geneticist,  

22:38

I do have enough taste to be able to say, “hey,  this answer makes sense, these ones don't.” 

22:43

I also had Cursor turn this work into  flashcards so I could retain what I learned. 

22:47

Cursor started as a programming tool, but I  found it really great for this kind of research. 

22:51

There's no other interface where I can get answers  from a bunch of independent LLMs all while reading  

22:55

the relevant paper on the same screen. Go to cursor.com/dwarkesh to try it out. 

23:02

One of the big takeaways for me from  the paper was just that something  

23:06

weird happened in the Bronze Age. As you said, across trait after trait,  

23:15

the selection intensifies during the Bronze Age. This makes sense for some things. 

23:20

For example, why do we see lactase  persistence, where adults can process  

23:24

milk, intensified during this period? This is the time when we start using  

23:30

cattle not just for the meat, but also for  milk and wool and other secondary products. 

23:37

So it makes sense why lactase  persistence would matter more. 

23:42

But then there are other things  that seem like they should have  

23:43

been relevant since the dawn of agriculture. I forget the exact name of the allele, but was  

23:48

it FADS1, which helps convert plant fatty acids  into long-chain fatty acids that your body needs? 

23:57

That's obviously relevant when  you move from a diet of meat as  

24:00

a hunter-gatherer to a diet of cereals. That is also one I think you found was  

24:05

under especially high selection 5,000  to 3,000 years ago. So what's going on?  

24:14

Why is the Bronze Age so special across all  these different traits that you're observing? 

24:20

So this FADS1/2 variant is a  vegetarian/meat-eating adaptation. 

24:27

Already in work prior to this, Ian Mathieson,  who worked with me in 2015, identified this as  

24:36

a very strongly selected variant. It’s actually  ancient. You see copies in archaic humans too. 

24:43

One of the findings of our  paper is the ABO blood system. 

24:46

You get your blood typed as A, B, and O. The B variant has increased up to 10% at  

24:51

the expense of A, but previous work has shown  that A and B were both already present in the  

24:56

ancestor of humans and gibbons and other apes. Some of these mutations have been going back and  

25:03

forth and fluctuating over different time periods. But we're talking about changes in the Bronze Age. 

25:10

The TYK2 variant for tuberculosis risk, a multiple  sclerosis risk variant, inflected and increased in  

25:18

frequency before the Bronze Age, and then 2,000  or 3,000 years ago reversed in that period. 

25:23

There are differences in Northern Europe  where this process is super strong,  

25:27

very strong positive selection,  very strong negative selection. 

25:30

And then in Southern Europe, only a little bit,  and not even very strong negative selection. 

25:34

For hemochromatosis, which is pathogenic  iron buildup that causes problems in Europe,  

25:41

that too has reversed around this period. In some of the complex traits that maybe  

25:46

we'll talk about later, these traits too have  periods of intensification of natural selection. 

25:52

For example, depigmentation: Europeans have  gotten lighter skin over the last 10,000 years. 

25:59

You can see it in our data. The period of strongest  

26:01

depigmentation is between about 4,000 to 2,000  years ago, and then after that it's much less. 

26:07

This seems to be a very impactful, eventful,  important period where a lot of the processes  

26:13

we are seeing become very powerful. It's  surprising on first principles. You might think,  

26:19

before you walked into this genetic data, that  the big change is going to be starting to grow  

26:24

plants and maybe farm animals. That happens in the Neolithic,  

26:27

beginning 10,000-12,000 years ago, and  spreads into Europe after 8,500 years ago. 

26:32

But actually, the intensification happens  5,000 years ago, 4,000 years ago. It’s  

26:38

really interesting. This observation of that  being an inflection point tells us something  

26:45

about when humans, at least in this part  of the world, were wrenched into a way of  

26:50

living that was so different from how  their hunter-gatherer ancestors lived  

26:55

that the organism had to adapt very strongly. It may be that the degree of that wrenching  

27:00

process moving into the Bronze Age was  qualitatively greater than the degree of  

27:06

the wrenching process that happened from  the initial transition to growing plants. 

27:11

That's surprising, because our cartoon  picture is that the big transition is farming. 

27:16

But the biological readout is saying our  genome is reacting much more strongly to  

27:22

these events that happened 5,000 years ago. You did some work with Bhatia and many other  

27:28

colleagues in 2014 where you were looking at  20,000 or 30,000 African American genomes today. 

27:34

You were saying, "Look, there's 80% West  African DNA and then 20% European DNA. 

27:40

Can we look at their genomes today and see that  their allele frequencies are much different than  

27:49

we'd expect from this admixture?" Correct me if I’m wrong,  

27:54

but you found that they weren't. That is to say, over 200 or 300 years of extremely  

27:58

intense environmental change—going from chattel  slavery to a completely new environment—there's  

28:08

no effect of natural selection. So we see episodes like this where  

28:13

we don't see natural selection, but then  the Bronze Age apparently must have had an  

28:19

even stronger effect, where the change in  environment is even stronger than what we  

28:23

see from Africans in Africa being migrated  to the New World and living under slavery. 

28:31

That may be the case. It also may be the case that  that period is just too short to see much effect. 

28:37

In the Bhatia et al. paper, where we looked at  about 30,000 African Americans, we looked to  

28:42

see whether—instead of the average percentage of  around 80% West African ancestry—there were some  

28:48

places in the DNA with significantly more  than 80%, or significantly less than 80%. 

28:53

That’s what you would expect if there  were natural selection for some genetic  

28:57

variant from Europeans or from Africans. We didn't see any place in the DNA that  

29:02

was significantly different from  what you would expect by chance. 

29:07

One possible explanation is just that there's  only a handful of generations, maybe five,  

29:12

over which natural selection would operate. So if the selection was 2% a generation,  

29:17

you would still only see a 10% compounded effect,  and there's just not enough time to detect it. 

29:23

But the Bronze Age is not  300 years, it's 3,000 years. 

29:26

It's the power of compound interest, and you  have enough time to begin to see a strong effect. 

29:31

This really, really does seem to be a very  impactful time in terms of human history,  

29:37

and you can see it in our complex traits. Look at pigmentation, for example,  

29:43

which is the strongest signal of selection  for a complex trait in our data set. 

29:50

You look at genetic mutations that  are known to affect pigmentation. 

29:55

You add up their effect across all of the  DNA, there's dozens or hundreds of them. 

29:59

You look to see when natural  selection is strongest, and the  

30:04

time period is really 2,000 to 4,000 years ago. For some of these other traits as well, you see  

30:11

again that the time period over which selection  is strongest is 2,000 to 4,000 years ago. 

30:17

For example, if you look at genetic variants  that affect measures of cognitive performance,  

30:25

such as performance on intelligence  tests in white British people today. 

30:33

This is of course a very strange trait to  measure in the past because there were no  

30:36

intelligence tests and there was no school. But it is a predictor today, and you can  

30:40

look at how it's changed in the past. We see very strong natural selection  

30:44

for this combination of genetic variants  that predicts people's performance on IQ  

30:48

tests and is also highly correlated to the  predictor of the number of years of school  

30:52

or the household wealth of people. All crazy traits in the past because  

30:56

there was no wealth in the past,  there was no school in the past. 

30:59

But if you look at the predictors today, there  is a strong movement in a systematic direction,  

31:07

a large effect, about a standard deviation  on the scale of modern variation. 

31:11

We can do this trick of looking to see  whether there are periods of time when  

31:15

this natural selection has occurred  more intensely or less intensely. 

31:19

We drag a 2,000-year window through our data, and  we repeat our whole analysis, not on 18,000 years,  

31:25

but just on a short 2,000-year window. We can measure the strength of selection  

31:29

in each of these 2,000-year windows. What you see when you look at intelligence  

31:33

is that this maxes out in the Bronze  Age, between 5,000 and 2,000 years ago. 

31:40

The impact in the last 2,000  years is almost nothing. 

31:43

There's no evidence of natural selection at all. Your bias coming into this, my bias  

31:48

perhaps, might be that if there's any signal of  natural selection on this trait at all, that it  

31:52

would be unusually strong in the last 2,000 years. Maybe this is a time of industrialization. 

31:57

Maybe this is a time of greater  need for this particular trait. 

32:00

But in fact, there's no evidence of natural  selection at all in the last 2,000 years. 

32:04

There's very strong evidence between 2,000  and 4,000 years ago, where instead of a one  

32:10

standard deviation strength of selection,  it's a two standard deviation strength,  

32:14

averaged over this time period. The standard deviation here is how much  

32:18

the polygenic score for the trait itself moves? How much the polygenic score for the trait moves  

32:27

over a 10,000-year period within a population  that is held constant in terms of its ancestry. 

32:34

What we're actually doing is looking in our  data set at a heterogeneous group of people. 

32:39

There's Southern Europeans and Northern  Europeans and hunter-gatherers and farmers. 

32:44

At different times in the past, those  groups are more or less represented. 

32:48

The whole strength of the methodology Ali  Akbari developed is that it corrects for that  

32:55

changing ancestry over time. Really what's being asked here  

33:00

is that we've divided up our whole data set  into an archipelago of little populations  

33:05

in different places in space and time. We're asking in each place in space and time:  

33:10

a little pocket of people in Britain from 4,000  years ago to 3,500 years ago, a little pocket of  

33:16

people in Hungary, a little pocket of people in  Italy from 2,000 years ago to 1,500 years ago. 

33:21

In each of these places, where the ancestry is  relatively similar without being too disrupted in  

33:27

that short period by migrations, we watch to see  if the genetic changes blow in the same direction. 

33:36

We're measuring the strength of selection at  each point in time after correcting for the big  

33:41

population changes that have occurred. The effect here is huge then. 

33:48

One standard deviation above the median  would be somebody in the 85th percentile. 

33:54

You're saying the effect of selection has been  so strong that comparing 10,000 years ago to now,  

34:01

the median has gone to the 85th percentile. That's just a huge effect over the last  

34:06

10,000 years on something like intelligence  or the thing that predicts household income. 

34:17

Especially given that this is only 2% of the  change in allele frequencies, and the 98% is  

34:23

coming from migration… It's stupendous to think  about what the impact of migration is, if this  

34:29

alone is driving a standard deviation change in  these kinds of qualities, at least among the kind  

34:35

of variation we see in the world today. One thing you can see in the data  

34:38

is that the migration impact is huge. For example, if you look at the trajectory  

34:42

for measures of cognitive performance—scores  on intelligence tests in white British people  

34:48

today—but you look at the predictor of that  in people in ancient times, the estimate for  

34:53

the hunter-gatherers of Europe is three standard  deviations below the modern mean. So that's hugely  

34:59

different. Then you see a huge jump from them to  the farmers, who are at the mean, at zero. That's  

35:08

migration. What you're seeing is that those two  groups had different set points for those traits. 

35:14

And then the steppe pastoralists  have a lower set value. 

35:18

You see huge fluctuations in the predictor  of this trait over time. That doesn't prove  

35:23

selection. That's just migration. But what our  test is telling you is: in addition to those  

35:29

fluctuations due to migration, is there  a consistent effect of natural selection  

35:35

blowing the trait in the same direction over all  places and times? That's what we're detecting. 

35:41

There's this theory called the  collective intelligence hypothesis,  

35:44

which is the idea that selection for intelligence  has actually been in the opposite direction. 

35:54

As society has developed,  there's been more specialization,  

35:56

and if there's more specialization,  each person only needs to understand  

36:00

a smaller and smaller part of the world. Therefore, the ancients were actually much smarter  

36:07

than us, and we've evolved down in intelligence. Your results seem to point  

36:11

in the opposite direction. Although there hasn't been selection  

36:15

in the last 2,000 years as society has gotten  more complicated, at least when society began,  

36:21

there was more need for the kind of  thing that predicts intelligence today. 

36:25

The reason that's surprising is, if you  think about hunter-gatherers—reading  

36:30

your colleague Joseph Henrich’s book—the amount of  information they needed to hold onto and assess,  

36:38

everything from how to process food, to how  to build shelters, fire, et cetera, compared  

36:45

to my world, where I just need to know how  to set up mics and ask questions… It seems  

36:50

like the demands on intelligence should have  been way higher in the ancestral environment. 

36:53

So it's very surprising that the  beginnings of civilization increased  

37:00

the selection on intelligence. This is the power of data. 

37:06

I think if you asked Joe prior to this work what  the hunter-gatherer selection would be and where  

37:13

their set point for this particular trait would  have been… I think he probably wouldn't have  

37:18

made a very strong prediction, but he would  have said, "Maybe you would have expected  

37:21

it to have a high predicted value of this trait  because these people were really having to do a  

37:26

lot of things and figure a lot of stuff out. Maybe once you have more complex societies,  

37:31

there would be more of a collective brain, and  maybe there'd be selection against this trait." 

37:36

In fact, it's the opposite in some ways.  It's the power of data. It's not what you  

37:40

expect. It's actually the value of data  to try to make sense of all these things.  

37:47

It's very interesting. The genetic predictor of  intelligence, there are lots of things that are  

37:53

confusing about it, so it's worth talking about. Or the genetic predictor of years of schooling,  

37:58

which is highly correlated to  it and is measured even better. 

38:02

If you look at the genetic predictor of years  of schooling, there's another amazing study from  

38:06

2017 from a group in Iceland that looked at this  measure over the last hundred years in Iceland. 

38:12

It looked at older people and  younger people born more recently. 

38:17

There's an estimated 0.1 standard deviation  decrease in the genetic predictor of intelligence  

38:23

in Iceland just within one century. It's an absolutely huge  

38:27

effect over a short period. This is selection against years of schooling. 

38:32

If I said intelligence, I didn't mean to. It's selection against the genetic  

38:36

predictors of the number of years of school. One possible interpretation of this—hand-wavy—is  

38:44

that what's being measured here is not selection  for years of schooling or for real intelligence,  

38:50

but for another trait altogether  that's correlated to both of them. 

38:53

For example, the predictor of the number of years  of schooling is very strongly correlated to the  

38:59

age at which women have their first kid. If you control for that, all of the  

39:05

signal of years of schooling goes away. So maybe what you're measuring is women's  

39:10

decision about when to have children. If you have children earlier,  

39:14

you don't go to school as much. If you have children later, you go to school more. 

39:18

Maybe it's some kind of measurement of delaying  gratification or putting things off or planning. 

39:23

The same trait is correlated to body mass  index, to obesity, and to walking pace. 

39:29

So is this really intelligence as we think about  it, or is it something else that manifests itself  

39:36

differently at different times in the past? Obviously, a trait like years of schooling was  

39:43

not itself a meaningful thing in the past. The underlying things for it seem  

39:48

to have been under strong selection. Whatever in the genome predicts years of schooling  

39:52

seems to have been under strong selection. How should we think about this? 

39:57

What's the actual thing  that's changing in the genome? 

40:02

There are two things going on  that you need to think about. 

40:08

Years of schooling is connected to  so many other things genetically. 

40:12

If you look at the genetic predictor of  years of schooling—this trait has been  

40:16

measured in millions of people now—it's  correlated to really surprising things. 

40:20

It's correlated to the age at which women have  their first kid. It's correlated to people's  

40:25

obesity. It's correlated to people's walking pace. It's correlated to people's household wealth. 

40:31

It's correlated to a variety of other  traits that seem quite different from it. 

40:37

If you think you're actually measuring the  genetic prediction of intelligence, or actual  

40:45

studiousness, you should think again because  there are many things that it's correlated to. 

40:49

There seems to be some kind of general trait that  you could maybe think of as executive function  

40:55

or a propensity to defer gratification—I’m  just waving my hands—that is under selection. 

41:03

It pushes all these traits in the same direction  one way or the other, and at different times in  

41:08

the past, it's advantageous or disadvantageous. When we found this signal of the genetic  

41:19

propensity to go to school for more years as it  manifests itself in white British people today,  

41:26

we were incredulous. How could this be? Maybe  this is a problem. So we did a few tests to  

41:31

try to figure out whether this was real. One of the tests we did was that we looked  

41:36

for a study where this measurement of the  number of years of school was done not in  

41:39

Europeans, but in Chinese people in China. We looked at the effect size of many variants  

41:47

as they affected the number of years of school in  China, and we saw whether they had a correlation  

41:53

to the trajectory of those same genetic variants  in Europeans over the last 10,000 years. 

41:59

These are two parts of the world  where the populations have been  

42:01

essentially completely disconnected. There's no way by chance that the  

42:06

trajectory in Europeans over the last 10,000  years would have anything to do with the effect  

42:13

on years of schooling in China today. But there's actually a huge statistical  

42:17

correlation, a five or six standard deviation  correlation between the effect size of variants  

42:22

on the number of years of school in  China today and the trajectory in Europe. 

42:26

It’s just as strong, actually, as the effect size  of variants in Europeans on years of school to  

42:32

the trajectory in Europeans. We just could not see a way  

42:36

this could happen by chance. Once we saw that, we felt quite  

42:39

convinced that this was a real signal and that  somehow there has been natural selection to  

42:45

increase the genetic changes that today manifest  themselves as predicting more years of schooling. 

42:52

Just to make sure I understood, you're  looking at this ancient DNA in Europe. 

43:00

You're saying it seems to predict years  of schooling for modern people in Europe,  

43:06

or at least selection on that ancient DNA seems to  predict more years of schooling in modern Europe. 

43:13

You also find that the same variants predict more  years of schooling for Chinese people in China. 

43:23

So this is not just some weird artifact  from the way these GWAS were done in Europe. 

43:31

These parts of the genome seem to robustly predict  the kind of thing that actually leads to more  

43:35

years of schooling, at least in people today. Correct. 

43:38

Jane Street is pretty secretive, but I did learn  about one internal mechanism which illustrates  

43:42

how high trust and weird their culture is. Researchers aren't given compute allocations. 

43:48

Instead, Jane Streeters use an internal  currency called “hive bucks” to bid for  

43:52

compute in real-time auctions. Everybody can spend as many  

43:55

hive bucks as they want. But your hive buck bid is  

43:58

meant to represent the real dollar value  of the experiment that you want to run. 

44:02

Now notably during the auction,  anybody can change anybody else's bid. 

44:05

And after the auction, people  can even kill each other's jobs. 

44:08

People just trust each other to do this  in a way that benefits the whole firm. 

44:12

As a result, Jane Suite's allocations  reflect a near real-time consensus  

44:15

on the highest priority uses of compute. As Axel, one of their ML engineers, put it: 

44:19

“I think Jane Street is like pretty  bottom-up in terms of we have lots  

44:21

of different researchers who are all  training their own models, sequence models,  

44:26

all sorts of other weird and wonderful things.” By the way, with their new compute deal,  

44:29

they've just added a six billion dollar hive  buck stimulus to their internal economy. 

44:33

Jane Street is hiring researchers,  engineers, and interns. 

44:36

Go to janestreet.com/dwarkesh to learn more. Stepping back, I want to understand what  

44:47

this tells us about what actually changed in  our environments over the last 18,000 years. 

44:53

We talked a little about what  happened after the Bronze Age. 

44:59

We were talking about this during the collective  intelligence part of the conversation. 

45:02

It's surprising to me that things  like intelligence, or lack of  

45:10

schizophrenia—things that just seem robustly  good—were not maxed out before the Bronze Age. 

45:22

The diversity among different populations was so  big that you have the European hunter-gatherers  

45:29

having three standard deviations less  predicted value for what they would score  

45:35

on an intelligence test if it existed. But they were existing in the real  

45:40

world in a place where intelligence matters. How can it be that this was not a trait… You  

45:48

just look at the human body or any animal, and  evolution has been acting on it so strongly to  

45:53

make it functional for the things it needs to do. And this one thing, which seems so  

45:57

relevant—especially to what human  hunter-gatherers needed to do—doesn't seem  

46:04

to have been under that strong selection  in the Mesolithic or Paleolithic eras? 

46:09

I think that's a great question. As we talked about before,  

46:17

selection is very effective. It can move the mean value of  

46:20

traits within hundreds or thousands of  years in one direction or the other if  

46:24

that's adaptive in a particular environment. So you might wonder, isn't intelligence  

46:29

good in all contexts and places in time? There are a number of ways to think about that. 

46:35

First of all, we are speaking from the  point of view of a society which intensely  

46:40

values this particular trait, the ability to  score well on IQ tests or things like them,  

46:46

or to go to school for a long time. I think it's unprecedented in human  

46:51

history that we live in a time like this. If you look at the Hebrew and Christian Bible,  

46:56

and you look at how much intelligence  is valued, it's basically not at all. 

47:00

But when the Bible was being written,  especially the Old Testament, that’s  

47:04

exactly when selection for intelligence is at  the highest point it's apparently ever been. 

47:08

Exactly. But there it's about strength or courage  or religiosity. Those are the values. If you read  

47:16

Homer or the texts of other religions,  it's not intelligence. It's beauty and  

47:23

other things. This value system which has  a hyper-focus on smarts is not obviously a  

47:34

trait value that's been common in the past. You might think that in certain communities  

47:39

there might be valuation of things that  are more proximate to years of schooling. 

47:43

But really broadly, it's not been  a high value in the population. 

47:47

Obviously, the thing we care about is not direct  performance on an IQ test, especially in the past. 

47:55

The thing I'm trying to understand  better is intelligence more broadly. 

47:59

Maybe IQ-test intelligence is just not  that correlated with, "Here is a new-world  

48:06

environment, go figure out how to process food  there and make shelter and everything else." 

48:12

Your colleagues like Joseph Henrich  have talked about how modern people  

48:15

underestimate the difficulty of doing this  kind of thing with a small band of people. 

48:23

Maybe that's not IQ-test intelligence,  and that's why we don't see that strong  

48:26

a selection effect on this thing. But intuitively, regardless of  

48:30

the value system, it just seems very  valuable to have this trait maxed out. 

48:36

I'm being very speculative. Let me give you two  examples of how I'm thinking about this, not that  

48:44

I’m a particularly good authority on these things. As I mentioned, a lot of these traits,  

48:48

which are quite disparate, are  highly correlated to each other. 

48:51

Obesity, years of schooling, walking pace,  performance on IQ tests, household wealth,  

48:56

all these crazy traits seem to be  governed to a substantial extent by  

49:00

a shared combination of genetic variants. Let's think about what this might mean. 

49:05

In Iceland in the last hundred years, there's been  selection against this combination of variants. 

49:11

One possible interpretation is that  it's basically selection for two  

49:15

ways of investing in your children: having  many kids and not investing a lot in them,  

49:20

or having few kids and investing more in them. If you invest in deferring having kids,  

49:28

having more wealth, having more resources,  and putting more into each kid, you're going  

49:31

to have lower fertility and fewer kids. That’s going to result in lower fertility,  

49:35

but those kids might survive  more and do better in society. 

49:38

Alternatively, you can just have as many  kids as you can and invest less in them. 

49:42

They might individually have less good outcomes,  but in a time of plenty—which is potentially  

49:47

Iceland in the 20th century—it might make sense  to have more kids and invest less in them. 

49:53

There's a toggle between having more kids and  investing less in them, and having fewer kids  

50:02

and investing more in excelling in various ways. You can imagine that at different times and in  

50:09

different places… In ecology,  there are different ways. 

50:13

Mammals often invest a lot with a pregnancy and a  small number of children, whereas fish will spawn  

50:19

huge numbers of offspring into the river,  the great majority of whom will be eaten. 

50:26

But that is an effective way to produce  offspring in certain conditions. 

50:30

So there will be a toggle depending on  the environmental conditions back and  

50:33

forth between investing in large numbers of  offspring with less investment, or smaller  

50:38

numbers of offspring with more investment. Maybe we're just seeing that move back and  

50:42

forth over different places and times. Similarly, for schizophrenia and bipolar  

50:47

disease, how could this ever be advantageous? Maybe what we're seeing with these diseases is  

50:51

a readout of some spectrum of traits that  in some contexts might be advantageous. 

50:59

Maybe being anxious, imaginative, or neurotic  might be helpful in a shamanistic tradition or  

51:07

a religious tradition which values people  who can have visions or be creative. 

51:13

Maybe these are subclinical versions of  schizophrenia or bipolar disease that in  

51:18

certain times may be advantageous and  in other times may be disadvantageous. 

51:22

You might just be seeing selection for different  types of creativity or other thinking that can be  

51:28

valuable in different contexts. I'm waving my hands here,  

51:31

but my sense is that these complex traits  have not pushed in one direction because  

51:36

there are advantages to both ends of the  spectrum, and there are multidimensional  

51:46

impacts of these different traits. Julian Jaynes has this famous theory  

51:50

in The Origin of Consciousness in  the Breakdown of the Bicameral Mind. 

51:54

I'm butchering this, but fundamentally, the  way I understand it is that up until Homer,  

51:59

basically everybody was schizophrenic. People genuinely thought that gods were real  

52:08

people that you were communicating with. His claim is that ancient texts seem  

52:12

to show people behaving in this way. You're being asked to believe in visions. 

52:17

Even today, there's valuation in some religious  communities in communicating with God,  

52:22

having visions, and having supernatural  communions. So I just don't know. But I  

52:30

think it's super interesting to ask the question  of why certain traits are not always advantageous. 

52:36

For schizophrenia and bipolar  disease, there is a sense in  

52:38

which most of the mutations are disadvantageous. We can see that from the patterns of variation,  

52:43

where the variants that are  risk factors tend to be low  

52:46

frequency and they tend to be small effects. So another trait you find under selection is  

52:51

the trend away from body fat since the  agricultural revolution. Why is that? 

52:57

What you see is a reduction in the  combination of genetic mutations  

53:01

that make you at risk for obesity, body mass  index, and similarly very correlated to it,  

53:06

higher fat mass, higher waist-to-hip  ratio, and higher type 2 diabetes risk. 

53:12

There is clear selection, by about a standard  deviation on the scale of modern variation  

53:16

for these traits, reducing over the last  10,000 years in this part of the world. 

53:22

What can be going on there? Why wasn't there selection for  

53:25

this combination of traits before? There's a longstanding idea  

53:28

known as the thrifty gene hypothesis. The idea is that once you have hunter-gatherer  

53:35

populations that move into a farming environment  where there's plentiful food, there is no longer  

53:40

a need to the same extent to be able to build  up body fat to survive in times of stress,  

53:46

because there are more constant stores of food. As a result, there will be natural selection  

53:52

against body fat once you  move into an agricultural  

53:56

environment and into periods of food plenty. Maybe what you're seeing is that this group  

54:03

of people in Europe and the Middle East over  the last 10,000 years has moved into a period  

54:08

of relatively more stable food, where building up  stores of fat is not as advantageous, and there's  

54:13

been selection against this combination of traits. Europeans are actually relatively better protected  

54:19

genetically against type 2 diabetes than some  other populations around the world, like African  

54:23

Americans and Native Americans, that have perhaps  not been exposed to agriculture for as much time. 

54:29

So you may be seeing the effect of more  exposure to more stable food accessibility. 

54:35

This is also another way in which  the data go against a common story. 

54:41

The common story is that hunter-gatherers  actually had much more stable diets because  

54:45

they were more varied, and they weren't reliant  on a single cereal or crop for their calories. 

54:53

If one game went away, they had  other things they could scout for. 

54:57

They could move locations more easily because  they weren't tied down to the land. So they were  

55:01

more food-stable. But if there's been selection  against storage of body fat, that suggests that  

55:09

as unstable and as common as famines might have  been in agricultural societies, it's at least  

55:15

more stable than what the hunter-gatherers had. There's a timescale issue. You're absolutely  

55:19

right. As I understand it, I'm no anthropologist,  when there's a hunt in traditional societies  

55:30

or communities that hunt, people will often  gorge themselves, eat a huge amount, build up  

55:35

a temporary store of fat, and then go multiple  days without eating meat until the next hunt. 

55:41

There is this boom-and-bust access to  high-value nutrition that is not true  

55:47

to the same extent in farming communities. On the flip side, famines are something that  

55:55

occurs more commonly in agricultural societies,  but the timescale and the tempo of them is  

56:00

very different from the hunting tempo. Maybe there's a famine every three years. 

56:04

Indeed, if you look at the bones of  farmers, at least in some communities,  

56:08

there's more stress in them, maybe due to a  famine every three years or every five years. 

56:14

But selection might not be acting  on that three-year time period. 

56:17

Your fat store from the latest  hunt is not going to carry you  

56:21

through to the famine three years later. Survival of famines is a different thing  

56:27

than building up body fat to be  able to survive two weeks later. 

56:33

A random question I have. You were  mentioning that compared to these  

56:39

other things which matter much more for fitness  in the ancestral environment—the immune system,  

56:44

especially after the Bronze Age—all these other  things have mattered more than intelligence. 

56:49

They've been under much more  selective pressure than intelligence. 

56:51

That makes you wonder whether there's much  more room at the top for intelligence. 

56:56

If humans had been selected especially for  intelligence, they could have been much smarter. 

57:00

The reason that's relevant is that  we're currently building AI systems,  

57:04

which we're trying to make as smart as possible. 

57:05

In fact, the only goal of the  training process is intelligence. 

57:08

We don't have to worry about at the same  time making their immune systems powerful— 

57:12

We have lots of energy to spend on it. And at the same time making  

57:15

sure they're not schizophrenic. I guess we kind of do worry about that. 

57:17

But if intelligence has not been the dominant  trait under selection for humans over the last 10,  

57:25

20, or 100,000 years, does that mean  there's more room at the top for this trait? 

57:29

I think there's more room at the  top for a lot of these traits. 

57:33

You can move height extremely in one  direction, much more than it is today. 

57:37

You can move any of these traits much  more extreme in the other direction. 

57:41

There are probably very strong  negatives to doing that. 

57:43

You're probably sacrificing other  things, and there are trade-offs. 

57:49

But it's highly likely that if natural  selection pushed any of these traits more  

57:57

in one direction than it is, the mean would move. So all of this evolution since "Out of Africa" is  

58:04

acting on alleles that already existed in the  pool of human variation from that first group  

58:09

we were talking about last time, on the order  of 10,000 people, that exploded out of Africa. 

58:18

Is it surprising that across all these different  traits, from cognitive profiles to disease  

58:26

resistance to height, that one pool of people  contained so much latent variation that they  

58:39

could supply enough stretchiness to accommodate  all of these different traits you're studying now? 

58:50

That's a rich question, and I think the human  population has within it a tremendous amount  

58:57

of variation for complex traits. There's a huge amount of variation  

59:02

that affects height. There's a huge amount of  

59:05

variation that affects body mass index. If you take all these mutations and set  

59:09

them to the high-height variant, a person will be  extremely tall, like as tall as a tall building. 

59:15

Of course, that will never happen. But if you take all these variants  

59:18

that affect schizophrenia risk and you point them  all in the same direction, there will be extreme  

59:24

risk or extreme protection for schizophrenia. For complex traits, ones underpinned by many  

59:30

mutations, all the variation already exists to  move the population to a different adaptive set  

59:36

point that's optimal in the environment it's in. If you push the population into a new environment,  

59:42

within hundreds or thousands  of years, the population can  

59:45

rapidly move to a new adaptive set point. There are some unusual traits, like the ability  

59:50

to digest cow's milk or protection against sickle  cell anemia, that require a single very important  

59:56

mutation that may not yet exist in the population. You have to wait for the mutation to  

60:01

occur in some people. When the populations are  

60:04

relatively small, only 10,000 people, you  might have to wait dozens or hundreds of  

60:08

generations for that mutation to arise. But when the populations are large,  

60:12

there's no mutation limit anymore. Every mutation that can occur does occur. 

60:16

There are eight billion people in the world. There are maybe 30 new mutations every generation,  

60:23

so that's 240 billion new point  mutations every generation. 

60:28

There are only three billion DNA bases in the  genome, so every mutation that can occur does  

60:33

occur about 100 times every generation. We're  not mutation-limited anymore. The mutations  

60:40

can arise again. They do arise again.  But when the population is only 10,000,  

60:45

you sometimes have to wait dozens or hundreds  of generations for the new mutation to occur. 

60:49

How likely is it that the thing that changed  with the Bronze Age is just that the human  

60:52

population was big enough? By 3000 BC, you go to  

60:56

a population of 50 million-ish people. The population is big enough, and the gene flow  

61:02

between different areas is high enough, such that  things which don't have an overwhelming selection  

61:08

coefficient, which aren't overwhelmingly favored  by evolution, are finally visible to selection. 

61:13

I think that's not likely to be true, but it's  an extremely interesting thing to think about. 

61:18

Already when population sizes are on the order  of a million or so, every mutation that can occur  

61:24

does occur within a few generations. That's well before the Bronze Age if  

61:28

you take the population even of a place  like Europe, but also of other places. 

61:34

Or maybe it's at the dawn of the  Bronze Age or the farming period. 

61:38

The question you're asking is whether,  when the population is small, maybe natural  

61:42

selection doesn't work effectively. A common thing people think about with  

61:46

natural selection, which is true, is that in small  populations selection doesn't work effectively. 

61:52

That's because mutations bop around in  frequency from generation to generation a  

61:57

lot in a small population, just randomly. If you have a population size of 1,000,  

62:03

mutations will bop around by a frequency  of one over 1,000 every generation. 

62:08

If the selection coefficient is less than  that, it will be drowned in the random bopping  

62:12

around of frequencies due to genetic drift. But that is already for a population of 1,000. 

62:17

A 0.1% selection coefficient is very weak. We're talking about 1% effects,  

62:22

and that's very strong. It will work very well even  

62:25

in a population of size 1,000 or 10,000. If you are talking about mutations of the  

62:30

type that will start rising only in large  populations but not small populations,  

62:34

those are selection coefficients on the  scale of one over 10,000 or one over 100,000. 

62:40

Those will take 10,000 or 100,000 generations  to rise in frequency, which is hundreds  

62:45

of thousands or millions of years. That's not going to do anything over  

62:49

the timescale we're talking about. There's just  a timescale issue. We're talking about strong,  

62:54

measurable selection coefficients on the  order of half a percent or more in this study. 

62:59

All of those are going to work in  small populations or large populations. 

63:03

It's not going to be affected  by the population size. 

63:05

Interesting. You're saying that more generally,  once you hit a given threshold of population, the  

63:09

dominant factor is time span, not population size. Correct. It's very interesting,  

63:14

and it's actually not widely understood. Speaking of data contradicting what you  

63:20

might have otherwise assumed, one of the  papers you sent me beforehand, Mallick 2016,  

63:26

found that there are no fixed differences between  modern and archaic humans 50,000 years ago. 

63:36

We know this is the period in which the  so-called cognitive revolution happened,  

63:40

and modernity started, and people are making art. Does this suggest that nothing biological  

63:48

changed to make modern humans modern? The thing that happened was some cultural change? 

63:55

How do we understand what this data tells us? Right. 100,000 to 50,000 years ago, there's  

64:03

a quickening of the pace of change in culture. You see the first extensive representational art,  

64:14

bead necklaces, drawings on the wall,  and a rapidly increasing pace of  

64:20

innovation in the types of tools that people use. The thought might be that there would have been  

64:25

some important genetic switch, a kind of important  genetic change that occurred in the population and  

64:32

swept to high frequency that everybody soon had. That made it possible to do these things. 

64:40

Maybe some genes allowed people to have  complex, representational language, for example. 

64:47

One thing we did in 2016 in this paper  by Swapan Mallick and colleagues was look  

64:53

across the DNA for places that might  be expected to look like this, where  

64:59

nearly all people living today share a common  ancestor maybe 100,000 or 200,000 years ago. 

65:06

We looked really hard, and right  across all the DNA we could look at,  

65:09

we couldn't find anything more recent than  four or five hundred thousand years ago. 

65:14

This is a crazy result because it looks  like there are no key selective sweeps  

65:19

that have occurred in this period that  are ancestral to everyone living today. 

65:23

We talked before about no selective  sweeps between Europeans and East Asians,  

65:27

but there don't even seem to be any selective  sweeps shared between all humans in this really  

65:33

important period when a lot of evidence  in the material culture record appears. 

65:39

It could be that there's biological  adaptation in this period, but it's polygenic. 

65:44

There are lots of mutations that all shift in  the same direction to help the population move  

65:50

to a new set point, but there's no key biological  change that rises to high frequency in this time. 

65:55

This group 50,000 years ago, are  they the ancestors of everybody  

66:00

out of Africa or also some Africans? This is 100,000 to 50,000 years ago. 

66:07

This is the population that's  ancestral to West Africans,  

66:10

to most East Africans, to all non-Africans. There are a couple of populations in Africa  

66:16

that have substantial ancestry  coming from more divergent groups. 

66:21

For example, Khoisan from Southern  Africa or Central African rainforest  

66:25

hunter-gatherers have substantial fractions  of their ancestry from groups that diverged  

66:30

maybe 200,000 years ago from the other lineages. But all of these groups today are able to go to  

66:36

college and do everything everybody else does. There is no evidence that there is any key  

66:42

mutation lacking in some groups  that is not present in the others. 

66:48

The differences we see between different  groups of people, especially if this group  

66:53

50,000 to 100,000 years ago had a very small  population size… I think last time we were  

66:58

discussing on the order of 10,000 people. So almost everybody in the world,  

67:05

or the variance we see between different  humans today, was latent in this group. 

67:12

I get your point that if you just stack  up different things across the genome,  

67:22

stacking them up really has a big effect. But it's interesting that we have so many  

67:27

different groups in the world today, and all that  diversity comes from a very small population size. 

67:33

A lot of us in human genetics think that  our population contains within it the  

67:39

clay that's needed to make almost any trait. And that depending on environmental conditions  

67:45

or selection conditions, the mean value of  these traits will move in different directions. 

67:50

There's an empirical question about  how much selection there's been  

67:55

in different human populations over time. One of the things this new work we're  

67:58

involved in is showing is that at least in the  last 18,000 years in this part of the world,  

68:06

there has been significant movement, at  least for a handful of important traits. 

68:10

We looked at more than 500 traits. About 100 complex traits showed  

68:15

significant movement in a systematic  direction over this time period. 

68:21

It really does seem that there is a response  to the environments people are living in that  

68:26

has occurred over this period, and that is  potentially stronger than in previous periods. 

68:30

Crusoe has an amazing ML infra team that  keeps finding clever ways to squeeze more  

68:34

performance out of their hardware. For example, tokenization has become  

68:37

a real bottleneck for agentic workloads. Agentic prompts are often extremely long. 

68:41

They tend to have high KV cache hit rates,  which shrink the GPU's pre-fill work. 

68:45

This means that the tokenization step,  which is traditionally sequential,  

68:48

is a much larger fraction of time to first token. To solve this, Crusoe built fastokens,  

68:54

an open source Rust-based tokenizer which  parallelizes things in order to take  

68:58

advantage of all the cores on modern CPUs. Crusoe had to get creative here because  

69:02

the naive approach doesn't work. For example, for pre-tokenization,  

69:06

you can't just split your text into chunks  and run regex because you'd end up with  

69:09

issues whenever a word straddled the split. Crusoe solved this by giving each thread  

69:14

an authority zone plus the ability to  read one kilobyte past its own edges. 

69:18

This one kilobyte buffer guarantees that you  won't misprocess a token, and the authority zone  

69:23

guarantees that you won't end up with duplicates. No cross-thread coordination required. 

69:27

Crusoe combined this optimization with a  handful of other smart tweaks in order to  

69:31

get up to 40% faster time-to-first  token on real production workloads. 

69:35

To learn more, go to crusoe.ai/dwarkesh We were talking earlier about how there  

69:41

are no fixed differences between humans  30,000 years ago and humans today. 

69:47

So if there's no genetic basis for the kind of  thing that allowed humans to have more symbolic  

69:55

representation, have farming, et cetera—I think  I asked you this question last time we talked,  

70:00

but especially with this context—why no farming  before the Ice Age? Genetically we were there. 

70:05

That is such an interesting question.  Genetically we're there. The common  

70:08

ancestral population has all of the  ingredients for farming 50,000 years ago. 

70:14

These people are distributed into different  parts of the world: the Americas 15,000 years  

70:19

ago or whatever it is, New Guinea 40,000  years ago, East Asia, Europe, West Africa. 

70:27

No farming developed before  11,000 or 12,000 years ago. 

70:32

It only developed in the last 12,000  years, the period known as the Holocene,  

70:37

which is the end of the Ice Age. If you talk to climate scientists  

70:42

and archaeologists—I keep asking people  this question every time I meet someone  

70:46

who's an expert in this—how can it be  that farming develops in all these places? 

70:50

Are we really living in such an unusual time? People tell me, indeed, we're living in a very  

70:56

unusual time on a scale of two million years. That is, 12,000 years ago we switched into this  

71:01

period of not just warmth, but climate stability. It's hard to believe that we're  

71:09

living in such a special time. But if you look at data from the bottoms  

71:14

of ponds where you can measure the fluctuations of  temperatures using isotopic signatures, apparently  

71:19

we're in a period where it's fluctuating a  lot less year to year, 10 years to 10 years,  

71:25

and 100 years to 100 years. It's a period of relative  

71:27

stability that we are miraculously living in. When this period of relative stability happens,  

71:35

it follows that multiple groups independently  turn to agriculture, even though they  

71:42

all have the same genetic complement that arose  50,000, 100,000, 200,000, 300,000 years ago. 

71:47

It's a crazy observation that people  just accept, but it's unbelievable. 

71:53

Oh, so you increased the range there. You said 100,000, 200,000, 300,000 years ago. 

72:00

Based on the genetic differences between modern  people and people from 300,000 years ago. 

72:06

Do you basically think they're  modern 300,000 years ago? 

72:10

I don't know. This is actively what I'm  thinking about all the time right now. 

72:17

There's a big transformation in terms of the  culture of humans 300,000 or 400,000 years ago:  

72:24

this invention of Levallois technology, the  ability to make stone tools out of cores. 

72:28

The Middle Stone Age Revolution, or the  Middle Paleolithic Revolution depending  

72:32

on what you call it in Africa or Eurasia,  is a new way of making stone tools that's  

72:39

shared by Neanderthals and by modern humans,  but is not shared in East or South Asia. 

72:45

It's a big change, and it presumably  involves a cognitive change in order  

72:49

to make this sort of technology. Then there's a further change to  

72:52

the Upper Paleolithic Later Stone Age, maybe  100,000 to 50,000 years ago, when there's a  

72:58

second transition with a new type of tool making,  but it’s not as revolutionary as the earlier one. 

73:04

So when the cognitive leap happens is unclear. The diversification of the lineages leading to  

73:09

people living today, like Khoisan Southern  Africans and rainforest hunter-gatherers,  

73:15

all occurs more on the timescale  of 300,000 or 200,000 years. 

73:20

All of these people are capable of  going to college and doing everything. 

73:24

So it's not obvious that the cognitive toolkit,  the behavioral toolkit, and the genetic abilities  

73:32

were not all in place 200,000 or 300,000 years  ago, and that even Neanderthals had them. 

73:36

It’s not obvious that this was not the case.  I just don't know. You distribute these people  

73:44

descended from this diversification that  happened 200,000 or 300,000 years ago to  

73:48

different parts of the world, and then  after 12,000 years ago, you start having  

73:53

agriculture popping up in different places. It's an outstanding mystery of human history. 

74:00

I find it unbelievable that we live in a  time period that climatologically is so  

74:06

unique on a scale of two million years,  but my colleagues tell me it's true. 

74:10

The climate thing seems surprising given there  are so many different environments in which  

74:16

agriculture was independently developed. I understand that across environments  

74:20

the variance could have gone down. If it had only happened in one place  

74:25

at one time, I could have bought that explanation. But the fact that they're making maize in the New  

74:30

World and they've got cereals in the Old World in  very different environments makes it surprising. 

74:39

It's very, very surprising. We accept  it, but it's a crazy observation that  

74:45

most normal people don't realize. The thing that basically everybody  

74:50

accepts is that the common ancestral  population of almost everybody in the world,  

74:54

except for rainforest hunter-gatherers  and Khoisan, is around 70,000 years ago. 

74:59

Everybody accepts that these people all have in  place the cognitive, behavioral, and intellectual  

75:04

ingredients that are necessary for the farming  revolution and building state societies. 

75:08

Because when these descendants get distributed  to West Africa, East Africa, the Americas,  

75:14

Europe, South Asia, East Asia, New Guinea,  and so on, their descendants all do this. 

75:19

They do it independently, semi-independently,  or demonstrably completely independently in  

75:25

all these different parts of the world. The cognitive resources for doing this must  

75:29

have all been in place, but it's a very long fuse. It delays for 40,000 or 60,000 years in all these  

75:37

different places after the common ancestral  population splits up, and then ignites into  

75:43

agriculture and all these other things after  that point. It's a crazy claim. Then you could  

75:48

argue about whether the actual fuse is 300,000  years, from when Neanderthals separated and  

75:54

from when different lineages of extant modern  humans separate, and that's also plausible. 

75:59

It's a crazy set of things that  we're being asked to believe. 

76:04

Is it possible that agriculture existed, but  you didn't have modern metallurgy or whatever  

76:10

it was that allowed populations to explode  starting in 5000 BC with the Bronze Age? 

76:16

Population-wise, it doesn't seem  like much is happening from 10,000  

76:18

BC to 5000 BC in the early Neolithic. Is it possible that they had farming  

76:24

but they didn't have copper or tin, which  you needed to go to the Middle East for,  

76:31

to develop a civilization that could make use  of bronze at a large scale, and so they just  

76:36

disappeared from the historical record? I think we would see their archaeology. 

76:41

There are extraordinary developments in  the Americas which are entirely Stone Age. 

76:47

You would see them today if  they had completely vanished? 

76:50

Oh, yeah. We should go for a trip  to Teotihuacán in Mexico. It's so  

76:58

impressive. When I went there when I was 20,  it was totally as impressive as ancient Egypt.  

77:06

It's huge. It's massive. It's without metal. It's even more impressive because it's not  

77:11

only without metal, but without animals  and without wheels, which is crazy. 

77:16

The marble is just hauled without wheels. Right. Take any person who has an old  

77:21

world superiority and take them to these  places, and they will not have it anymore. 

77:26

It's just extraordinary what's in these places. These are people who separated 20,000 years ago at  

77:31

least from the ancestors of East Asians and 40,000  years ago from the ancestors of West Eurasians. 

77:38

They just had the same biological and cultural  shared toolkit from then, but there's a long  

77:47

fuse delay until all this stuff happens. It's an amazing thing, and we don't question it. 

77:52

What are other questions you are  either investigating right now  

77:59

or want to investigate, these kinds of  big picture questions of human history? 

78:04

I'm perplexed. I don't know if we talked about  it before, but I remain very confused about the  

78:13

relationships between archaic and modern humans. We have genome sequences now from archaic humans  

78:18

who lived in Europe, West Eurasia, and  Central Eurasia, and the Neanderthals. 

78:23

We have archaic sequences from these  enigmatic Denisovans, who we now have  

78:27

a skeleton for since we last talked. There's now a skull that's  

78:32

been shown to be a Denisovan. We have data from lots of modern humans,  

78:37

and there are really big mysteries about  the relationships amongst these groups. 

78:43

Genetically, the Denisovans and  the Neanderthals are sisters. 

78:49

They descend from a common ancestral  population 500,000 or 600,000 years ago. 

78:53

That group descends 700,000 or 800,000 years  ago from the common ancestors of modern humans. 

79:01

Genetically, the whole genome data says that  Neanderthals and Denisovans are archaic humans  

79:05

from a common ancestral archaic population. But there are so many things shared between  

79:10

Neanderthals and modern humans that  don't seem to be shared with East Asians. 

79:15

They both share Middle Stone Age  stone tools, Levallois technology,  

79:20

this cognitively unique way of making  stone tools that wasn't used in East Asia. 

79:25

They both have the same mitochondrial  DNA and Y chromosome sequence. 

79:30

The Y chromosome sequence of Neanderthals  and the mitochondrial DNA of Neanderthals,  

79:34

is actually modern human that came through  interbreeding 200,000 or 300,000 years  

79:38

ago and then shot up to 100% frequency. Neanderthals and modern humans are both the  

79:43

product of mixture events that happened between  archaic and modern humans 300,000 or 200,000  

79:49

years ago, demonstrably through patterns  of variation in ancient and modern DNA. 

79:54

It feels that there's something shared  between Neanderthals and modern humans  

79:58

that's not shared with Denisovans, even  though the vote of the whole genome says  

80:01

that Denisovans and Neanderthals are related. One wonders whether there's something connecting  

80:08

Neanderthals and modern humans that's different  from Denisovans, even though genome-wide,  

80:13

Denisovans and Neanderthals cluster. I'm thinking about that all the time now. 

80:18

Connecting them would be interbreeding  events or being in the same place  

80:22

at the same time that we missed? There's a known interbreeding event  

80:25

from the lineage leading to modern humans into  Neanderthals, but it's supposed to be only 5%. 

80:32

I'm interested in the possibility that that 5% is  actually a sign of something much more impactful,  

80:37

that somehow Neanderthals are in some  sense deeply modern in some ways,  

80:42

and even though they get swamped by  archaic genes, they actually have  

80:47

more of a modern impact than one would think. The Middle Stone Age and Middle Paleolithic  

80:53

Revolution that they share with modern humans  is more fundamentally a part of who they are,  

80:57

in some sense, than we think. Interesting. Sorry, when was  

81:01

this interbreeding event? 300,000 to 200,000 years ago. 

81:04

So the common ancestor between Neanderthals  and most humans alive today is potentially  

81:10

more recent than the common ancestor  between all humans alive today. 

81:14

Oh, for sure. Which is crazy. 

81:16

Well, the divergence to all the archaic humans,  including Denisovans, is within human variation. 

81:24

Wait, what? Yes. The average time  

81:26

to the common ancestor of any two human  genes is one or two million years ago. 

81:33

If you look at the copy of chromosome 3 you get  from your mother and the copy of chromosome 3 you  

81:43

get from your father, the typical time they share  a common ancestor is one or two million years ago. 

81:46

That's before the split from  Neanderthals and Denisovans. 

81:49

So there are many places in your DNA where  you're more closely related to a Neanderthal  

81:53

on your mother's side than you are to your father. I'm sure there's a simple explanation, but how? 

82:00

It's the same reason that if you  have a sister, in some places in  

82:04

your DNA you're more closely related to her  than you are to me because you share a parent. 

82:08

But in other places you're more closely related to  me than you are to your sister because you happen  

82:12

not to share the same DNA from your parents. It's just that the DNA we get from our common  

82:19

ancestral population was already  quite variable 500,000 years ago,  

82:25

700,000 years ago, a million years ago, and some  of us descend from some of those ancestors and  

82:29

others descend from other of those ancestors. Neanderthals split from our lineage really close  

82:35

in time on human evolutionary timescales, such  that in some places in our DNA we're more closely  

82:40

related to Neanderthals than to each other. Interesting. What are the other big questions? 

82:44

That's the main thing I'm  thinking about a lot these days. 

82:48

I continue to be very obsessed with  questions about the spread of human  

82:56

populations around the world and trying  to reconstruct that with ancient DNA. 

82:59

After the recording ended, David started  spontaneously explaining a new theory he's working  

83:03

on about Neanderthal genetics on a whiteboard in  the room, which I ended up capturing on my iPhone. 

83:08

The thing I've been thinking about a lot  recently is the possibility that maybe  

83:12

we're not thinking in the right way about the  relationship between archaic and modern humans. 

83:19

The standard model is one where Denisovans—these  archaic humans that were found from ancient  

83:31

DNA—and Neanderthals descend from a common  ancestral population 500,000 or 600,000 years ago,  

83:40

and these two separate earlier, maybe 700,000 to  800,000 years ago, from the ancestors of modern  

83:48

humans, people like us. That's the big result  

83:52

of a lot of studies since 2010. But there's also evidence of an  

83:59

interbreeding event that happened maybe 200,000 to  300,000 years ago that resulted in modern humans  

84:15

contributing DNA to the ancestors of Neanderthals. So maybe 5% of the DNA of Neanderthals comes  

84:24

from this interbreeding event, and  a lot of studies have shown this. 

84:29

I'm very interested in this because, from the  archaeological record, Neanderthals and modern  

84:36

humans look quite similar to each other, much more  similar to each other than a lot of them do to  

84:41

Denisovans, these archaic humans in East Asia. For a lot of history, people have thought that  

84:48

Neanderthals are our sisters. But in 2010, the sequencing of  

84:53

the Denisovan genome made it very clear  that on average, Denisovans are closer to  

84:57

Neanderthals than to modern humans. This was a very confusing result. 

85:03

Most people now think that Neanderthals and  Denisovans descend from a common ancestral  

85:07

population that separated earlier  from the ancestors of modern humans. 

85:13

I'm interested in the possibility that  the right way to think about Neanderthals  

85:17

is actually as somehow culturally modern humans,  even though genetically they're mostly Denisovans. 

85:25

The model I'm thinking about is motivated  by this archaeological phenomenon known  

85:31

as the Middle Stone Age Revolution. If this is Africa and this is Europe, we know  

85:44

that the new way of making stone tools—with cores  that were very carefully mined far away from the  

85:52

locations they were used, made out of high-quality  stone like flint—starts being used 300,000 or  

85:57

400,000 years ago, first in the Caucasus,  places like Georgia today, or East Africa. 

86:02

This way of making stone  tools is quite revolutionary. 

86:06

It is known in Europe as the Middle  Paleolithic and in Africa as the  

86:09

Middle Stone Age, and is associated with much  more widespread use of fire and moving stone  

86:16

around at much further distances than before. I'm interested in the idea that this is something  

86:21

shared between modern humans and Neanderthals. There's somehow some shared cultural feature  

86:26

that's absent in East Asia, and that  might have a relationship in the genetic  

86:30

data and is somehow related to this 5% DNA. The idea I’m interested in is the possibility  

86:35

that there is a population here that invents  the Middle Stone Age and the Middle Paleolithic,  

86:40

sometimes called Levallois technology, and that  people from this population expand into Europe and  

86:48

mix with the local archaic humans who are there. That is what this 5% interbreeding event is. 

86:53

It happens 200,000 to 300,000 years ago. It produces a group that, as it expands across  

86:59

this landscape in Europe, mostly picks up the  local DNA and becomes mostly archaic genetically,  

87:06

but retains its modern human culture, the way of  making stone tools and some of its traditions. 

87:12

One of the things that's super interesting  about this is that if you actually look at  

87:16

the genetics, across the whole genome,  Neanderthals and Denisovans cluster. 

87:20

But if you look at the mitochondrial DNA—which  humans and Neanderthals get from their  

87:24

moms—Neanderthals and modern humans cluster. If you look at the mitochondrial DNA,  

87:30

Denisovans and modern humans share an  ancestor well more than 700,000 or 800,000  

87:34

years ago, as you'd expect from the history. If you look at the Y chromosome that you get  

87:37

from your dad, Denisovans and modern humans  share an ancestor more than 700,000 or 800,000  

87:42

years ago, which is consistent with this history. But if you look at the Neanderthal mitochondrial  

87:48

DNA, it's only 300,000 to 450,000 years. If you look at the Y chromosome,  

87:52

it's only 300,000 to 450,000 years. What the current genetic work is asking us to  

87:57

believe is that even though this is only 5% of the  whole genome, it introduces mitochondrial DNA and  

88:03

Y chromosomes, and they jump up to 100% frequency. It's kind of a crazy claim because the probability  

88:09

of this occurring by chance is low,  maybe 5% times 5%, a very small number. 

88:15

It's what we actually all believe,  but it's a very surprising event. 

88:20

Somehow it's accreted to all the findings in the  literature so that we make ourselves believe this,  

88:24

but it seems unlikely on first principles  that somehow only 5% would introduce both  

88:29

the Y chromosome and mitochondrial DNA. And it really does look like this. 

88:33

There's amazing data from a site in Spain  that's 300,000 to 400,000 years old,  

88:40

called Sima de los Huesos. They have a nuclear genome  

88:43

that looks Neanderthal-like for most of  the genome, but their mitochondrial DNA  

88:47

and Y chromosome are Denisovan-like. So it really looks like there was a  

88:51

population related to modern humans that pushed  into this Sima de los Huesos-like population,  

88:55

displaced its mitochondrial DNA and Y  chromosome, but kept the rest of its genome. 

89:00

It really looks like something like this happened. The idea I'm playing with—and probably it's wrong,  

89:06

who knows—is that there's a landscape…  This is Europe and you can break up  

89:13

into a hundred or so demes, little areas. Modern humans get introduced at the bottom  

89:19

right corner, in the Middle East or  somewhere, and they spread into Europe. 

89:23

As this population spreads, there's a  wave front of expansion, and they're  

89:27

interacting with the local archaic humans. The theory from simulations and studies of  

89:29

different species like mammals and birds  shows that even if there's a small amount  

89:30

of interbreeding—when there's an invasion or  a movement of expansion of one group into the  

89:48

territory occupied by another—there's  massive introgression of local genes. 

89:52

The pioneers at the wave front will sometimes  interbreed with the local population. 

89:59

There are so many of them around that their  DNA will get swamped by the local group,  

90:03

so by the time they make it to the  other side, they're largely local. 

90:08

Maybe this is what we're seeing. You have a modern human population  

90:13

that's matrilineal, for example, where  the transmission of making stone tools  

90:18

this way is happening from mother to child. That's why they're retaining their mitochondrial  

90:24

DNA, but by the time they get to the other  end of Europe, they're mostly local archaic. 

90:29

You end up with a 95% population replacement. This would explain why the mitochondrial DNA  

90:35

is shared between Neanderthals and  modern humans, and it would also  

90:39

explain why the mixture proportion is only 5%. The really interesting thing is that there's  

90:44

other evidence from studies of modern humans  showing that modern humans are also admixed. 

90:51

The right way to think about this is  that modern humans are a mixture of  

90:58

two groups that diverged maybe 1.5 million  years ago, and they come together 200,000  

91:08

to 300,000 years ago with maybe 20%  ancestry from this archaic African group  

91:18

and 80% ancestry from this early modern lineage. And that same group then mixes with Neanderthals,  

91:31

and it's 5% modern and 95% local. So you actually have this key  

91:38

population that makes the Middle  Stone Age or Levallois technology. 

91:47

It appears and expands in all directions—into  Europe and into Africa 200,000 to 300,000 years  

91:53

ago—bringing this technology, new ideas,  and perhaps some genetic adaptations. 

91:59

It expands into archaic humans in Europe,  mixes with the local population, and gets  

92:05

95% replaced but still retains its cultural  features and maybe some genetic features. 

92:11

It expands in Africa too, but here it's  not 95% replaced, it's only 20% replaced. 

92:16

Probably the reason is that this  group is much more diverged. 

92:22

It’s 1.5 million years diverged  rather than 700,000 to 800,000 years. 

92:27

As a result, there are many more genetic  incompatibilities and barriers to gene flow. 

92:32

But there's still a lot of mixing, maybe 20%, and  we have evidence that this is a big mixture event. 

92:37

So what you're actually seeing is a modern human  expansion both into Europe and into Africa. 

92:43

In one place, it forms Neanderthals. In another, it forms the  

92:45

ancestors of everybody living today. But all of these groups are descended from  

92:49

this key revolutionary event that happens here. We often talk about the revolutionary events 50  

92:56

to 100,000 years ago, the more symbolic behavior  and so on, that first appears in Africa and  

93:04

the Middle East and spreads beyond. But there's also this earlier event,  

93:08

and it is contemporaneous with the breakup  of all the different groups in Africa today,  

93:14

the Khoisan Southern Africans and the  Central African rainforest hunter-gatherers. 

93:18

One wonders whether this is an  equally important formative event. 

93:23

If that's true, it makes you think of  Neanderthals as somehow our cousins. 

93:27

Their shared Y chromosome,  their shared mitochondrial DNA,  

93:31

they share the formation of this 200,000 or  300,000-year-old event, and their shared toolkit. 

93:38

Even though the genome is telling us they're  cousins of Denisovans, the correct way to think  

93:47

about them may, in an important sense,  be as close cousins of modern humans. 

93:55

I have so many questions.  Do you have 15 more minutes? 

93:58

First of all, what is going on with this group  of archaic Africans 1.5 million years ago? 

94:05

Where in Africa are they, and what happens  to the portion of them that don't form  

94:12

modern humans? Do they survive? This is not from ancient DNA,  

94:22

but from analysis of modern DNA from different  people, mostly in Africa, but also non-Africans. 

94:27

In multiple studies—there's at least  three, maybe four or five studies that  

94:30

I know about—they have looked at the patterns  of variation in people today and say the data  

94:35

in modern people today, including in Africans,  is not consistent with a homogeneous population. 

94:40

It looks like a population that split well  more than a million years ago into multiple  

94:45

groups—at least two, but maybe many—and then  came together a few hundred thousand years ago. 

94:51

The papers have different models that they fit,  but they all have this feature of a split-up more  

94:55

than a million years ago, and then on the  order of a few hundred thousand years ago,  

95:00

a coming together and a remixture event forming  the ancestors of anatomically modern humans. 

95:05

This includes the Khoisan  and whatever other groups? 

95:08

Yes. All of these groups have this,  maybe in slightly different proportions. 

95:11

So you ask, where are these people  living? Who knows. In this scenario,  

95:16

the 80% is coming from the Caucasus or Northeast  Africa, where the Middle Stone Age forms. 

95:22

It's from this population that  the Middle Stone Age comes. 

95:24

They mix with local groups, and who knows where  they are: Southern Africa, Western Africa,  

95:29

Central Africa, Eastern Africa. We don't have any ancient DNA,  

95:32

but this is a very rich environment. People have been living there for seven  

95:36

million years at least, and there would have  been different groups of people everywhere. 

95:40

Probably it's not just two  groups, it's probably more. 

95:43

The important theme here is  there's evidence of substructure  

95:46

that's well more than a million years old. This place would have been a landscape full  

95:51

of archaic humans that would have been differently  related to these expanding people and would have  

95:58

admixed with them when they came through. So with the Neanderthals, the first  

96:01

time around 300,000 years ago, our  ancestors share culture with them. 

96:06

They share the Middle Stone Age technology,  but they don't replace the population. 

96:10

The technology spreads through culture, basically. It spreads through genes too. If you look at  

96:16

Yamnaya in India, there's almost no Yamnaya  ancestry in India. It's just diluted down.  

96:25

As Yamnaya expanded into Central Asia and into  Europe, it makes the Corded Ware. There's a 25%  

96:32

dilution. It expands back across Central Asia. It goes through the Hindu Kush and  

96:37

gets into northern South Asia. It admixes more with local people. 

96:41

Today, the most Yamnaya ancestry  you see in India is 20% or 10%. 

96:46

Most people have less than 10% or 5%. There's been a lot of mixture on the way,  

96:51

but it is the tracer dye. It tracks Indo-European languages,  

96:56

and important aspects of Indo-European  culture are coming through Yamnaya. 

96:59

So if you know where to look, that tracer  dye is only 10%, 5%, or 2% in some groups. 

97:05

But it's the languages people speak,  and important shared cultural elements,  

97:08

that connect them to people on the other  side of the Indo-European-speaking world. 

97:12

So this 5%, you shouldn't sneeze at it. That's tracing something important in this model. 

97:20

I understand that if things are transmitted  more through women… Sorry, let me back up. 

97:25

I don't understand why the maternal  mitochondrial DNA and the Y chromosome  

97:31

would be especially privileged as the  spreading is happening. Can you explain that? 

97:37

The reason I'm talking about these matrilineal  or patrilineal expansions is that I'm really  

97:41

troubled, and have been troubled for many  years—especially in the last three or four  

97:47

years—by the fact that the mitochondrial  DNA and Y chromosome cluster Neanderthals  

97:52

and modern humans, but the rest of the  genome clusters Neanderthals and Denisovans. 

97:55

This is a crazy result that is  not seen in any other species. 

98:00

I'm very interested in  patterns that would explain it. 

98:03

If you assume that there was a matrilineal  or a patrilineal expansion—it could be  

98:10

either—then modern humans, when they were  expanding across the landscape of Europe,  

98:16

retained their identity along one of the lines. 

98:21

If it's matrilineal, when they incorporate  a male from the local community,  

98:24

he's brought into the community, and the kids  are raised based on the culture of the mothers. 

98:30

If it's a patrilineal expansion and they  incorporate a female from the community,  

98:33

she's raised with the culture of the fathers. If that happens, it guarantees that one of  

98:39

these two parts of the genome looks like it  does, because it's a modern human expansion. 

98:44

If it's patrilineal, it will  retain the Y chromosome. 

98:47

If it's matrilineal, it will  retain the mitochondrial DNA. 

98:50

So it will solve one of your two problems. But not both. 

98:53

It won't solve the other one, so  you need to solve the other one. 

98:56

You can solve it either by natural  selection or by social selection. 

99:03

By the way, patrilineality and matrilineality are  the rule, not the exception, in human communities. 

99:08

Usually communities have continuity  along the male or the female line. 

99:14

Usually it’s patrilineality,  sometimes it’s matrilineality. 

99:19

You can also have phenomena like social selection. It could be that once you have kids of someone  

99:28

whose father, for example, is from the outside  community… Usually in most communities,  

99:36

females all reproduce. That's typical  today. Usually women have kids if they can. 

99:41

But men in traditional societies are actually  very variable in their reproductive success. 

99:45

A large fraction of men never have kids. Then there's a subset of men who have  

99:51

many kids with many women. There's competition among men for kids. 

99:56

In this context, where males are competing for  access to females, female mate choice begins to  

100:03

be an important process. You have a phenomenon  

100:08

where if your dad is an archaic male, it  could be the case that you're not going to  

100:15

be as successful in the competition for local  females as if your dad is a non-archaic male. 

100:20

Some simple social phenomenon  like that could explain the data,  

100:23

and we actually see this in human society. For example, if I remember right, in Central  

100:27

African rainforest hunter-gatherers,  there's different treatment of boys  

100:31

and girls depending on whether their  dad or mom is one group or the other. 

100:35

I guess I don't understand how  the maternal… The group spreads,  

100:43

and it gets to the next front. They have kids.  From the humans that have just entered, the kids  

100:56

will have the mitochondrial DNA from the humans. But from the existing people, they will have the  

101:02

mitochondrial DNA of the archaic humans. Why are the people with the archaic  

101:09

mitochondrial DNA not surviving? It's a question. There are multiple  

101:16

possible explanations, but it's much easier  to explain that than both the mitochondrial  

101:20

DNA and the Y chromosome. One possibility is that the  

101:22

mitochondrial DNA was less biologically fit. Another possibility is that there's social  

101:27

discrimination against people based on whether  their parents are archaic or not, which is not  

101:33

at all surprising in a human context. It's the weakest link in this argument. 

101:38

This argument's probably wrong, but I'm  just telling you what I'm thinking about. 

101:42

Okay, the Neanderthals. So 300,000 years  ago our lineage interacts with them,  

101:49

but mostly their lineage survives, and  there's cultural and genetic diffusion. 

101:56

And then is it 70,000 years  ago that we interact again? 

102:00

Yes. And they don't survive. 

102:03

The genetic ancestry doesn't survive. The genetic ancestry doesn't survive.  

102:08

Presumably there was also other contact  between 300,000 years ago and 70,000 years ago. 

102:11

Probably. But these are the  ones we are detecting currently. 

102:15

Is it just contingent that one time  there's this kind of diffusion where  

102:23

most of the archaic genome survives, and  the other time it's total replacement? 

102:28

This is not at all surprising given the context. If you think about this model, this is 700,000 or  

102:34

800,000 years ago. This is 300,000 years  ago. So this is 400,000 years separated. 

102:40

You talked about the Bhatia paper with me earlier. That's two populations 70,000 years separated. 

102:45

There are no biological incompatibilities  between West Africans and Europeans. 

102:48

There's no natural selection against  biological incompatibilities. 

102:52

We know when Neanderthals and modern  humans met and mixed, there were biological  

102:57

incompatibilities. That was 700,000 years  ago. As populations become further apart,  

103:04

biological incompatibilities rapidly develop,  probably as the square of the separation distance,  

103:09

because you need pairs of interacting genes. Here, it would have been maybe only 400,000 years  

103:17

separated between this lineage and that lineage. But here, it's 1.2 million years. That's a lot.  

103:23

These are at the edge of not being able to produce  children. These are quite different humans. These  

103:30

are actually three times closer than these. If you look at mixtures of humans today,  

103:35

there are mixtures in Southern Africa  of people who are half this distance. 

103:40

If you look at Khoisan and Bantu people  mixing in Southern Africa, like the Xhosa,  

103:44

which is the population of Nelson Mandela,  these are groups that are separated by  

103:48

almost 200,000 years, which is half of this.  Totally compatible. What you're seeing is a  

103:56

group that's actually completely permeable  genetically, or nearly completely permeable. 

104:00

This other one almost certainly has  substantial biological incompatibilities. 

104:04

Because 200,000 or 300,000 years later, we  see interbreeding between Neanderthals and  

104:09

modern humans, or between Denisovans and  modern humans, and there's clear evidence  

104:12

of incompatibility at that point. But this would be even bigger. 

104:16

What you would expect to see is that as this  group spread, they would be moving into a  

104:21

territory full of archaic humans. There would be some interbreeding,  

104:23

but the kids would not be very fit. They would  die off. There would be a lot of infertility. 

104:29

The barriers to gene flow and to  interbreeding would be greater. 

104:34

To me, it's not at all surprising that as  this group moves into Eurasia, you have  

104:39

Eurasian archaics—the ancestors of  Denisovans—who are only 400,000 years  

104:46

diverged from these people over here. And then you have African archaics,  

104:51

and these are 1.2 million years diverged. They just don't interbreed as much,  

104:55

and you don't get as much gene flow. But the key thing is the timing. It’s the  

105:01

same time. It really feels like the signature  of an explosion of people from one place,  

105:07

interacting with people here and  interacting with people there. 

105:10

It's the same cultural or technological  revolution impacting this place and that place,  

105:19

and creating populations that are impacted by  this cultural revolution, which we know is the  

105:24

case because they share the same toolkit. Some people argue that Levallois  

105:28

technology is independently invented. But it's very similar, and this model would  

105:35

be a way that it could have the same origin. So there's a culturally shared thread,  

105:40

this shared toolkit. There's a mitochondrial  

105:43

DNA and Y chromosome thread. And then there is a shared timing thread,  

105:49

which is they both form by mixture. Because otherwise you'd have to believe  

105:53

that Neanderthals independently  developed Stone Age tools. 

105:56

Yes, which is not inconceivable. But it's a little  bit like believing that farming independently  

106:00

developed in multiple parts of the world. Right. But it did. 

106:03

It did. So as I said, this is probably wrong. I'm trying to tell you that we don't really know  

106:09

the world we live in. This is not obviously  wrong. In fact, to me, this is much more  

106:15

plausible than the model we currently write down. It's probably wrong, but it's much more plausible. 

106:24

It explains many more things,  and it's no more complicated. 

106:26

Interesting. Do you want to recapitulate  the thing you were saying about the  

106:30

analogy to Ptolemy and the epicycles? I thought that was quite interesting. 

106:34

I think the model that we've put together  collectively about the relationships between  

106:41

archaic and modern humans has accreted over time. There was this idea that modern humans are  

106:48

distinct and that Neanderthals and  Denisovans are sisters of each other. 

106:54

Over time, we detected additional mixture  events, like this modern human into Neanderthal,  

106:59

and then these other ones I didn't even talk  about, like a super-divergent lineage going  

107:03

into Denisovans and all this other stuff. We still say, "Oh, the whole genome says  

107:09

Neanderthals and Denisovans are  sisters, so that's the truth." 

107:13

We've patched it all together  and gotten it all to work. 

107:16

You look at the mitochondrial DNA and the Y  chromosome, and they have this odd pattern,  

107:19

and it's improbable, but we can get that  to work if we invoke natural selection,  

107:24

things like this. You patch it all together. It  reminds one of what happened in the ancient world,  

107:34

where there was this idea that the sun revolves  around the Earth, but it doesn't quite explain  

107:40

the movements of the planets properly. In order to get the movements of the planets  

107:46

to work right, Ptolemy and the astronomers made up  these epicycles, these special extra rotations and  

107:56

movements to make everything work about right. It was such a convoluted model. 

108:01

When Copernicus and colleagues suggested instead  that everything is revolving around the sun,  

108:09

it simplified things ever so much. What was happening is that as astronomical  

108:18

information accumulated, it kept being  contradictory to the standard model, but it could  

108:22

be made to work by proposing another complication  and another complication and another complication. 

108:28

This is not as fantastic as proposing that  everything revolves around the sun rather  

108:34

than the Earth, but it is much simpler. And it actually explains many things. 

108:39

What is counterintuitive or unexpected or  hard to accept about this alternative model? 

108:48

What is the hesitation that  people have for adopting this? 

108:51

I don't know. Nobody's thinking  about this model right now. 

108:55

It just seems obviously a  very natural model to me. 

109:02

The reason I ask is that Aristarchus, the  ancient Greek, had the heliocentric theory  

109:08

because he had deduced how far the Earth  is from the sun and noticed other things. 

109:16

But it was not adopted, because his fellow  Athenians were like, "Look, if we believe  

109:24

that the Earth revolves around the sun, for it to  be the case that we don't see relative movement  

109:30

of the stars to the Earth, the only possible  explanation is that the stars are so far away that  

109:36

it is just incomprehensible and implausible." So the heliocentric theory was dismissed. 

109:43

What I'm trying to ask is, what is the equivalent  here of "for this to work, the stars have to  

109:47

be so far away that it's inconceivable,"  where actually the stars are so far away? 

109:51

Maybe we should adopt the implausible  implication that this theory gives us. 

109:57

That's a great question. I think we have to  assume that there's a linkage between the  

110:03

cultural transformations in Africa and Eurasia at  this time, and that's not something the community  

110:09

has really put together with the genetic data. There's this thread in the genetics about  

110:17

substructure in Africans, and then there's  this whole world based on ancient DNA,  

110:21

and they've never been put together. Nobody's put together the now extensive  

110:28

work on modern human substructure with  the extensive work based on ancient DNA  

110:33

of archaic human relationships to modern humans. If you put them together, you realize they line up  

110:38

in terms of their time of substructuring. I don't know if that's improbable. It  

110:44

seems parsimonious to me. It also seems significant  

110:48

that different groups of humans at this time  were capable of adopting Stone Age technology. 

110:56

Once one group had figured it out, the  genetic difference between different human  

111:00

lineages was not so big that you could  not show people how to use stone tools. 

111:05

Who knows? It could be that  this was genetically driven. 

111:09

We talked before about the time to  the common ancestor of human genes. 

111:13

There's nothing at 100,000 years or 150,000 years,  but there's a lot at 400,000 or 500,000 years. 

111:20

If that's what happens, and you have a mutation  that occurs in the Caucasus, somewhere in the  

111:26

Middle East, or Northeast Africa, there could  be key genetic mutations that make people able  

111:30

to do this. Then this population expands. When it  moves into Europe, it's swamped by local genes,  

111:35

but there could be retention of those  genes through selection as it expands. 

111:40

Maybe what you're seeing is that  there are genetic developments. 

111:43

Most of the discussion has been focused  on the 50,000 to 100,000-year event,  

111:49

which is anatomically modern human behavior. But a lot of archaeologists think this is an  

111:55

equally—if not more—profoundly  significant event in many ways. 

112:00

Why is that not the event  we should be talking about? 

112:05

You're talking about how there are no  fixed differences between modern humans  

112:08

and the humans 50,000 years ago. Do we know if there are any fixed  

112:13

differences between the people 50,000  years ago and the people 300,000 years ago? 

112:17

I think there are. Other than obviously these interbreedings. 

112:23

If you look at the genetic variation  going back 300,000 or 400,000 years,  

112:28

there do begin to be places where all  modern humans share common ancestry. 

112:33

That's another way of saying there begin  to be fixed differences at that time depth. 

112:38

That is where you start seeing evidence  for possible fixed differences. 

112:42

What's happening, if everybody shares a  common ancestor 400,000 or 500,000 years ago,  

112:46

is that there's a single ancestor at that time. If you compared it to another population, they  

112:52

would descend from a different lineage, so any  mutation that occurred ancestral to that single  

112:56

ancestor would be a fixed difference. This is the time at which you can  

113:01

begin to see fixed differences. But anatomically modern, cognitively  

113:04

modern humans exist by the beginning of the Middle  Stone Age, before we're breeding with this ancient  

113:15

group of Africans or breeding with Neanderthals. Anatomically modern humans occur exactly here.  

113:19

It's the same moment. This is when they occur.  The people who have skeletal features like ours,  

113:24

and Neanderthals, appear exactly then. This is when it all happens. 

113:29

There is this disconnect between  anatomically modern humans in  

113:32

the skeletal record and behaviorally modern  humans, which is 50,000 to 100,000 years ago. 

113:38

Anatomically modern humans appear at  this time, and recognizable Neanderthals  

113:43

appear roughly around this time, too. Interesting. But we don't know what exactly  

113:48

happens, if anything, between 200,000 years  ago and 50,000 years ago that goes from just  

113:52

anatomical modernity to behavioral modernity. My understanding is no. They're busy making  

113:59

Levallois stone tools like Neanderthals  for 200,000 years, and they are not  

114:04

more impressive than Neanderthals in  any obvious way, as I understand it. 

114:09

Then there begins to be in the archaeological  record a quickening of behavioral traits, which  

114:16

could be not genetic at all, or could be genetic. There are lots of arguments about this. 

114:24

We were obsessed with intelligence  earlier in our conversation. 

114:28

People are obsessed with art and  these things that seem important  

114:31

to us but who knows what's important? Interesting. Cool, thanks for the digression. 

114:40

The work that I've been involved in has  consistently shown that I was wrong in  

114:45

my biases coming into the work, and I've  really been almost traumatized by this. 

114:49

Again and again, I've come into a project with  some kind of guess about what the data was  

114:54

showing, and then the data doesn't show that. For example, when I got involved in the  

115:00

Neanderthal genome project helping to analyze  data looking at how archaic Neanderthals were  

115:05

related to modern humans, I was part of  a group of scientists who had established  

115:10

that non-Africans were a simple subset of African  variation and that there was no evidence at all  

115:16

of Neanderthal interbreeding into the ancestors  of modern humans or other archaic interbreeding. 

115:23

Different analyses that I and many  other people had done made it look  

115:28

like non-African variation was just a  subset, a small sample of that in Africa,  

115:33

and that could have fully explained the data. So when I was involved in analyzing the  

115:38

Neanderthal DNA sequences, what happened  was I found this very strong evidence of  

115:44

Neanderthals being more closely related  to non-Africans than to Africans. 

115:48

It was very surprising, and I thought it  must be a mistake. I was quite incredulous.  

115:53

I thought it was unlikely to be true because  other evidence that had been found before  

116:00

seemed to point in the other direction. So I spent several years trying to make  

116:04

these results go away, as did my colleagues, and  we just couldn't make the results go away. They  

116:09

just kept getting stronger. And this experience  working on natural selection was the same. 

116:17

What we were convinced of was that natural  selection had been pretty quiescent in our species  

116:22

over the last several hundred thousand years. Therefore, if we look at patterns of variation in  

116:29

non-African people today, or in any people today,  we should see not a lot of selection going on. 

116:35

Indeed, the first ancient DNA studies, beginning  in 2015 with this paper that we were involved in  

116:41

with Ian Mathieson and colleagues, seemed  to show relatively small numbers of genetic  

116:48

positions associated with natural selection. In 2015, we analyzed data from about 200  

116:54

Europeans and Middle Easterners to try to  understand frequency changes over time. 

117:00

We compared those ancient people who were the  sources of modern Europeans to people in Europe  

117:05

today, and we looked at frequency differences  that were too extreme to be due to chance. 

117:09

We were very excited to find 12 positions  that we were convinced were highly different  

117:13

in frequency between Europeans today and what  we would expect, based on the history that we  

117:20

and others had identified as the history  relating modern to ancient Europeans. 

117:24

Some of these were known and some of these  were not known, and this was very exciting. 

117:28

We hoped that as the numbers of  samples would increase and we got  

117:32

higher resolution to be able to appreciate  differences in frequencies over time,  

117:38

it would make it possible to detect far more. What was quite disappointing over the subsequent  

117:44

decade is that that didn't happen. For example, the largest study of  

117:48

that type in 2024 by a group in Copenhagen  analyzed much better data than we had in  

117:54

2015 and found only 21 positions that were  highly different in frequency across time. 

118:00

While that was exciting—it was almost twice  as many as we had found in 2015—in a lot of  

118:05

ways it was disappointing because the sample  size and data quality had gone up so much,  

118:09

and yet this is all that was found. That suggested we might be hitting  

118:13

an asymptote and might not be able to  get beyond where we currently were. 

118:17

This approach to learning about biology,  which was very promising in theory,  

118:22

might not produce a high yield. Maybe natural selection was quiescent,  

118:27

and the reason we're seeing so  few changes is that there has  

118:31

not been a lot of adaptive directional selection. That was the situation we found ourselves in until  

118:36

just a few years ago when we carried out this  study in our research group led by Ali Akbari. 

118:44

What we did is we deployed a few  innovations to try to improve our  

118:51

power to detect natural selection. One of them was that we just pumped a  

118:55

lot of data into the system, increasing  the amount of data by about 14-fold. 

119:01

The main thing that we do in this study is  report new data from about 10,000 individuals. 

119:08

This is a very big increase in the  amount of data in the literature. 

119:13

The total dataset size of ancient individuals  distributed over the last 18,000 years is  

119:18

about 16,000 people. This is a large dataset.  It's much larger than was previously possible,  

119:23

and when you have more data, you can estimate  frequency changes with much more subtlety. 

119:28

The data comes from only one part of the  world, which is Europe and the Middle East. 

119:32

It's not a more important part  of the world than other places,  

119:35

but it's the place where maybe 70-80% of  the data in the ancient DNA literature so  

119:40

far comes from due to historical reasons. It provides us with a natural laboratory  

119:45

where we can see what happens to the genome  in one place over time as environments change. 

119:50

It's really interesting to imagine doing this  type of analysis in other parts of the world,  

119:54

and the comparative analyses are super  important and interesting, but this study  

119:58

right now is about this one place in the world  where we have particularly fantastic data. 

120:02

The other thing we did is that we  developed an entirely new methodology  

120:06

that hadn't been used in this area before. The methodology is based on a technique  

120:11

that had been developed for finding risk  factors for a disease in medical studies. 

120:19

A simple way to explain it is that we ask how  to predict the genetic type a person has based  

120:25

on their pattern of relatedness to other people. We have a dataset of about 16,000 ancient people,  

120:31

and 22,000 people if we include the modern people. Then we look at how closely related each of these  

120:37

22,000 people are to each other, and we predict  the genetic type at each position in the DNA—at  

120:43

10 million positions—based on the pattern of  relatedness to all of the other 22,000 people. 

120:48

Then we ask if natural selection blowing  the frequency of the mutation in the same  

120:53

direction in all geographic places and at  all times predicts the data a little bit  

120:58

better than just knowing the relatedness  to all the other samples in the database. 

121:02

We're simply asking if the alternative  hypothesis—that selection has been  

121:06

blowing in the same direction at  all times—explains the data better. 

121:11

That's a dumb assumption, because of  course, the truth is that natural selection  

121:16

will have changed in frequency over time. But we're just asking the simplest of questions:  

121:20

whether assuming a constant rate of selection  explains the data more than not doing so. 

121:26

To summarize to make sure I've understood,  you're trying to make a model that predicts  

121:29

allele frequency changes over time. You  have two different parts. One part is this  

121:34

genetic relatedness matrix, which captures how  similar different genomes are to each other. 

121:41

That should capture the impact of  different bottlenecks, of drift,  

121:46

of population admixtures, and all those  things which affect the entire genome. 

121:52

Then you have the separate thing, which is, if we  look at specific locations, can we just say that,  

121:58

"Oh, this location has been selected  at whatever coefficient over time"? 

122:03

And if we add some coefficient, does it become  easier to predict the allele frequency changes  

122:10

than you would have just seen from this other  artifact, which is just looking at, "Oh, if you  

122:16

look at the whole genome, are these guys in the  same, have they gone through the same bottlenecks? 

122:20

Have they gone through the same drift," etc.? That's precisely right. 

122:22

Okay, what did we learn? When we analyzed the data this way, we  

122:29

looked at 10 million positions in the DNA across  these 22,000 people, 16,000 of whom were ancient. 

122:37

We looked to see if there was more  change in this consistent direction  

122:41

over time than you would expect by chance. When we analyzed the data, we found many  

122:46

hundreds of places in the DNA that were  changing too much over time and in too  

122:51

consistent a way to be explained by chance. There's a bit of a statistical problem  

122:55

in figuring out how many there are because  they're so densely packed that they're close  

122:59

to each other and interfering with each other. But when you try to piece them out and say,  

123:05

"Let's count only one in each place in  the DNA and blank out the others," we  

123:09

find at least 479 positions that are all  independently pushing in the same way. 

123:15

We are 99% confident that  those positions are real. 

123:19

By another criteria of being more  than 50% confident that they're real,  

123:22

we think that about 3,800 positions  are all pushing in the same direction. 

123:26

This is a crazy number of results given that  in our previous work and other people's work,  

123:31

there were at most a couple of dozen  discoveries coming from a single scan. 

123:35

So when we got this result,  we were very surprised. 

123:37

We thought it must be wrong, and we spent the  next couple of years trying to make the results  

123:42

go away, but they just kept getting stronger. We were trying to look for some independent  

123:49

type of evidence to tell us  whether these positions were real. 

123:53

We stumbled on something really powerful for this  purpose that had not been used in this way before. 

123:59

It relied on the fact that we had  very large numbers of discoveries,  

124:02

many hundreds of discoveries or even thousands. We took a completely independent dataset, which  

124:08

was the corpus of genome-wide association studies. These are studies that people have carried out in  

124:14

hundreds of thousands of people, looking for  whether particular genetic mutations are more  

124:18

common in people with high blood pressure than  with low blood pressure, or something like this. 

124:24

We took the UK Biobank, which is about 500,000  people from Great Britain who have been measured  

124:30

for hundreds of traits. The whole genomes of all  

124:32

these people have been sequenced. For each of these traits, we could  

124:36

look at whether each of these 10 million positions  are connected to this trait in a convincing way. 

124:40

Out of 10 million positions, about 15%—about 1.5  million positions in the DNA—are predictive of  

124:47

at least one of these several hundred traits. Then we could ask a question: is our natural  

124:52

selection signal, our statistic, related  to whether a mutation causes high blood  

124:58

pressure or some other trait? We slid our statistic for natural  

125:02

selection upward, to a value of  one, two, three, four, or five. 

125:10

As we did that, the enrichment  for genetic mutations that  

125:14

affect traits got higher and higher. Whereas it was only 15% when we didn't  

125:18

use our selection statistic, when we required  the selection statistic to be above about five,  

125:25

there was about a five-fold enrichment  for mutations that cause traits. 

125:29

Sorry, what is a selection statistic? This is the statistic we use to measure  

125:34

whether a mutation is changing over  time significantly in a non-zero way. 

125:42

It can be approximately thought of as a normally  distributed statistic, a Gaussian statistic,  

125:48

which is the number of standard deviations  the statistical value is away from zero,  

125:53

where zero is no natural selection. It's not exactly that, but it's close to that. 

126:00

If the statistic is above five, we  see about a five-fold enrichment  

126:05

in mutations that affect a trait. Instead of 15% of the mutations that are  

126:12

at random affecting the trait, it's  60% or 70% that are affecting the trait  

126:18

when we slide our statistic upward. This provides completely independent  

126:22

evidence that these sites are real, and as you  slide above five, there's no more enrichment. 

126:27

Our interpretation of these results—which we  were able to validate and show made sense using  

126:35

computer simulations of our process—is that once  you slide the statistic above five, essentially  

126:42

all the signals of natural selection are real. Okay, just to make sure I understand. 

126:48

You're saying, in order to figure out what  alleles have been under selection, your model  

126:54

assigns a statistic saying, "In order to explain  why this allele has a specific frequency,  

126:58

we're going to give it a selection statistic." Independently, we run these studies on modern  

127:03

populations where we say, "If you look at height,  eye color, intelligence, or whatever trait,  

127:09

what are the parts of the genome  that are correlated with that trait?" 

127:14

The higher the statistic you give it in  your study to explain allele frequency  

127:19

changes over time as a result of selection,  the more probable it is that that region in  

127:24

the genome is associated with traits that  have some functional thing we can measure. 

127:30

That's exactly right. This is  a brilliant idea that Ali had. 

127:34

It abandons the traditional approach of assigning  statistical significance to mutations that cause  

127:43

a trait because we're just using an external  piece of information—the correlation to traits,  

127:51

measured in a completely different way—to  read off the probability mutations are real. 

127:55

We can ask how much enrichment for real signal  is there given a particular selection statistic. 

128:02

If it's halfway enriched to the plateau, we're  able to show the correct interpretation is that  

128:07

50% of the mutations are really selected. If it's three-quarters of the way toward  

128:12

the plateau, there's a three-quarters  probability that the mutation is real. 

128:16

If it's 99% of the way to the plateau,  there's a 99% probability that it's real. 

128:20

That gives us a calibrated estimate of the  probability that a particular position is  

128:26

really under natural selection. A major concern here is that what  

128:32

we're actually seeing is not that these mutations  are really under selection, but rather that both  

128:39

association to a disease and our selection signal  are due to some third thing that's causing both  

128:45

of them, which is a type of selection which  is not what we're after, not selection to  

128:50

adapt to new environments, but what's called  background selection: selection against newly  

128:55

arising bad mutations that are removed from the  population that tend to be concentrated in genes. 

129:00

Genes are also the parts of the genome  that tend to be associated to traits. 

129:05

This common process is causing both the  enrichment for trait signals and is also  

129:10

causing the enrichment for selection signals  that we're observing. That's the concern. We  

129:15

were super concerned about this. So what we did is we repeated this  

129:19

enrichment analysis in slices of the DNA  that all were affected to the same extent by  

129:26

background selection, by this rain of slightly bad  mutations, and we get exactly the same pattern. 

129:31

We also repeated this experiment just  using mutations of the same frequencies  

129:37

because there's different statistical power to  detect these signals at different frequencies. 

129:42

We see the same pattern where above  a value of the selection statistic  

129:46

of around five, we get this plateau. The thing that changed that allowed  

129:50

you to increase the amount of sequences  you're generating by two orders of magnitude  

129:55

is just the statistical method you're  using to identify which part is human? 

129:59

Or what exactly changed in 2014 and since then? There's been a whole series of improvements. 

130:06

The big ones have been the huge drop in  sequencing cost, which made it possible to  

130:10

generate ancient DNA in the first place. The drop in cost has been a millionfold  

130:15

since the late 2000s, and another maybe  one to two orders of magnitude from 2010  

130:20

to today. That's one big change. Another  change has been in-solution enrichment. 

130:26

It's been this way of taking a sample that  has very small percentages of human DNA,  

130:31

but then suddenly creating a process that will  mean that the great majority of the sequences  

130:36

that one's analyzing will be useful for analyses. The approach that we used was we took the DNA  

130:42

samples that we had, most of which were very  low percentages of human DNA—less than 10%,  

130:47

often less than 1%—which is such a low proportion  that it's prohibitively expensive to sequence  

130:53

them and to just brute-force sequencing them given  the technology that we had available at the time. 

130:58

We took these samples and washed them over  an artificially synthesized set of short DNA  

131:05

fragments that targeted positions of the  DNA that we were interested in analyzing. 

131:11

This is more than a million positions that are  highly variable in people, and we picked many  

131:15

of these to be biologically interesting. We had a whole set of known biological  

131:19

targets that affected traits in genome-wide  association studies, which is the way that  

131:24

people look to see if there are particular  genetic variants in modern people that have  

131:30

particular impacts in phenotypes and traits. And so, what we did is we had this artificially  

131:36

synthesized set of DNA fragments that we  washed our ancient sample over, and it  

131:42

bound the parts of the DNA that we targeted. The resulting sequence that we generated was  

131:48

very enriched for the parts of the genome  that were informative about history. 

131:52

Even though only 10% or 1% of the DNA was human,  it ended up that a very large fraction was from  

131:58

the parts of the genome that we were interested  in, and it became economically efficient to do it. 

132:03

What was the other 99% of the DNA? It's mostly microbial. It's from  

132:07

bacteria and fungi that colonize  a person's body after they die. 

132:14

Depending on how they die, there'll be  more or less of these bacteria and fungi. 

132:18

When you typically sequence DNA from a person,  it'll just be full of microbial sequence. 

132:24

Sometimes the microbial  sequence is very interesting;  

132:27

it might be pathogens that a person died of. There's amazing work, for example, about different  

132:35

plagues of malaria and Black Death and hepatitis  B and so on that have been obtained from the  

132:42

sequences of these pathogens in people's teeth  and other parts of their body when they died. 

132:47

But we're focusing here on the human DNA. This changed the amount of data that was possible  

132:54

to produce from tens per year to hundreds  per year, and then we further roboticized  

133:00

and industrialized the process so that there  were many hundreds or even thousands per year. 

133:05

Just in our laboratory, we've been  generating genome-scale data from  

133:10

more than 5,000 individuals per year. I know this is true also of several  

133:14

other laboratories in the world now. This huge jump in data, this semi-exponential  

133:21

or even super-exponential jump in some cases,  has made it possible to ask and answer questions. 

133:26

While we were only on the order of 10 genome  sequences from humans in 2010, this year it's  

133:34

passed more than 20,000 reported sequences. There are several orders of magnitude increase,  

133:39

and the questions we were able to ask in 2014 are  just not the same as the ones we can ask today. 

133:45

Awesome. Excellent. David, thanks for your time. Thank you, Dwarkesh.

Interactive Summary

The discussion revolves around David Reich's groundbreaking work in ancient DNA, particularly a new study that challenges the long-held belief that natural selection has been relatively quiescent in recent human history. By utilizing massively increased sample sizes (16,000 ancient individuals from Europe and the Middle East) and an innovative statistical methodology, the study reveals that natural selection has been rampant across the genome, leading to significant biological changes over the last 18,000 years. The Bronze Age (5,000-2,000 years ago) is identified as a period of particularly intensified selection, driven by profound cultural and environmental shifts such as increased population density and interaction with domesticated animals. The research shows a strong enrichment of selection signals in immune and metabolic traits, and surprisingly, also on genetic predictors of cognitive performance or schooling during the Bronze Age, a finding that contradicts common assumptions about hunter-gatherer intelligence. The conversation also delves into a new, speculative model proposed by Reich regarding Neanderthal and modern human relationships, suggesting early interbreeding and shared cultural traits (like Levallois technology) could make Neanderthals closer cousins to modern humans than previously thought, addressing inconsistencies in genetic data. Finally, the role of climate stability in the Holocene in enabling the widespread development of agriculture is discussed, highlighting how genetic potential was present long before its cultural manifestation.

Suggested questions

7 ready-made prompts