HomeVideos

3D NAND: The Most Scalable Semiconductor

Now Playing

3D NAND: The Most Scalable Semiconductor

Transcript

241 segments

0:02

In the beginning, 2D NAND had only one  way to scale. Making the cells smaller.

0:08

That eventually hit a technical wall. Now what?  It is all over, right? WRONG! We go UP, baby!

0:14

The results are astonishing.  3D NAND is perhaps the most  

0:18

scalable semiconductor product ever mass produced.

0:22

In today's video, we talk 3D NAND.

0:26

## Beginnings

0:38

I previously did a history  about flash memory and NAND.

0:41

And I recommend you watch  it. But if you haven't ...

0:44

The fundamental building blocks  of NAND are memory cells.

0:48

For the first two decades after  its commercialization, NAND was  

0:52

2D. 2D NAND memory cells are transistors with  a source, drain, gate, channel, all that jazz.

0:59

But NAND cells also have another  special polysilicon gate called  

1:02

the "floating gate". It is called that  because it is surrounded by insulating  

1:06

oxide with no connection to  the rest of the transistor.

1:10

For all intents and purposes isolated. Like  

1:13

a brain in a vat. Screaming  into the void. Just like me.

1:19

During programming, electrons quantum tunnel  from the channel through the tunnel oxide  

1:24

into the floating gate. They stay there until  erasure, when they quantum tunnel back out.

1:30

While the floating gate holds a charge,  

1:33

the transistor's threshold voltage  rises and we can map that to a bit.

1:38

NAND cells are networked together in strings  of 16 to 128 with each cell's source connected  

1:44

to its neighbor's drain. This lets  us pack cells very closely together,  

1:48

but also means no random access. We can  only manipulate data in blocks or pages.

1:56

## NAND's Big Business

1:56

When NAND first hit the market around 1995, a  different architecture called NOR dominated.

2:03

NOR has similar memory cells but wires  them together more like DRAM does,  

2:08

giving faster data access at the  cost of lower density. Today,  

2:12

vendors use them to store crucial  hardware boot code like BIOS.

2:16

But NAND's more compressed architecture  made it possible to create a solid-state  

2:21

mass storage product competitive with  a hard disk drive. After Apple adopted  

2:26

NAND storage for its new iPods in 2005, the  market took off - eventually eclipsing NOR.

2:34

The dynamics of the NAND industry are like DRAM’s.  It is a commodity with a short product life cycle.  

2:40

So companies must continually invest in R&D to  achieve leadership in metrics like cost per bit.

2:46

They then must spend billions more  on semiconductor fab capacity to  

2:51

harvest profits from their product  leadership while they still have it.

2:55

This dynamic creates booms and  busts. So when times are hard,  

2:59

NAND companies know that the  best way to survive is to be  

3:02

the lowest cost provider. Or have other  profitable businesses to subsidize them.

3:08

## 2D Scaling

3:08

Reaching the cheapest per-bit cost in the  2D NAND era was achieved with two levers.

3:14

The first lever was to follow  Moore's Law - shrinking the  

3:17

transistor's physical dimensions so  that we can stuff more cells together.

3:21

The second lever - commercialized in the early  

3:24

2000s - are multi-level cells  capable of storing two, three,  

3:28

or even four bits inside a single memory cell by  segmenting the threshold voltages into buckets.

3:35

In 1995, the cells were produced using a 470  

3:39

nanometer process to produce  a 32-megabit NAND flash chip.

3:45

Almost twenty years later, leading  edge NAND produced using the 16  

3:49

nanometer process node are reaching  128-gigabit capacity. Such advancements  

3:55

were made possible by increasingly  sophisticated lithography machines.

4:00

## The End of 2D Scaling

4:00

This is incredible scaling. But after twenty  years, these two major levers were petering out.

4:07

The cost of patterning these smaller NAND cells  were getting out of hand. With EUV delayed,  

4:12

fabs turned to the 193-nanometer immersion and  then double patterning to produce the 2D layers.

4:19

Moreover, the cells were getting too small.  Smaller cells hold fewer electrons inside  

4:25

their floating gates. A 43-nanometer memory cell  can store a few hundred. A 25-nanometer memory  

4:31

cell does store just a hundred. Which  is kind of wild if you think about it.

4:36

The multi-level cells have this even worse.  Their 100 electrons have to be divided amongst  

4:42

the cells' multiple bit levels. Each division  is now only a third or half the previous size.

4:49

This not only makes it more likely that  degradation via leakage occurs sooner,  

4:54

but also lowers the signal-to-noise  ratio. Now losing just ten electrons  

5:00

materially affects the cell's contents.

5:03

At the same time, these small cells are now so  densely packed together - and their insulating  

5:09

oxide layers so thin - that the cells begin  electrically interfering with each other.

5:15

This increasing technical difficulty along with  several brutal industry downturns triggered a  

5:21

cycle of consolidation that reduced the number of  NAND firms from nine in 2008 to just five in 2014.

5:30

Finally, there emerged the recognition that the  

5:32

16/15 nanometer nodes would be the end  of planar NAND scaling as we know it.

5:39

This impending deadline spurred research into new  forms of non-volatile memory like MRAM or ReRAM.

5:46

But to quote a famous historical text, NAND  wasn't dead yet. There was still another  

5:53

dimension to go. Vertical. If we can't  grow bit density by shrinking the cells,  

5:58

then let's grow it by stacking  more cells on top of each other.

6:02

## 3D Stacked Architectures

6:03

The first attempts at 3D NAND simply fabricated  

6:06

layers of 2D memory cells on top  of each other and connected them.

6:10

Each NAND layer is its own self-contained array.  

6:14

So imagine like as if you were  building a multi-story Asian mall,  

6:18

you completely finished each floor of  the mall before moving on up to the next.

6:23

One advantage of this 3D stacked architecture  is that it carries over the older 2D NAND  

6:28

processes. So fabs are familiar with it and  you don't have to buy as much new equipment.  

6:34

Moreover you can test for bad memory  cells before putting them into the stack.

6:40

But there are also serious issues.  Just ask yourself why they don't  

6:44

actually build buildings like this.  Construction affects the completed  

6:48

layers lower down. The most  concerning being heat damage.

6:53

Moreover, stacking layers of 2D NAND  means repeating expensive lithography  

6:58

steps and inserting duplicative  support transistors for each  

7:03

layer. So you are just multiplying complexity  in a way that raises costs faster than bits.

7:10

The per-bit costs do not scale down as more  

7:12

layers are added to the stack. And beyond three  levels, the per-bit costs even start to rise.

7:19

Memory-makers are not going to spend billions  retooling their lines for this if the benefits  

7:24

peter out so soon. We needed something  that grows the layers while also holding  

7:29

constant the number of expensive steps i.e.  lithography. What out there can do that?

7:36

## BiCS

7:36

In 2007, Toshiba came out with a  different approach that literally  

7:40

turns the existing paradigm on its side.

7:43

They called it Bit Cost Scalable, or BiCS,  

7:47

though they later retconned that  abbreviation to "bit column stacked".

7:51

To produce it, Toshiba presented  what they called a "stack, punch,  

7:56

and plug" process. The name  is surprisingly descriptive.

8:00

It starts with "stacking" layers of  conducting polysilicon plates with  

8:05

layers of dielectric in between  to insulate them. The polysilicon  

8:09

plates will become the cells'  wordlines and control gates.

8:14

After that, we pattern and then "punch"  deep vertical shafts through this inedible  

8:19

polysilicon lasagna. This is usually  done with reactive ion etching or RIE.

8:25

Where an RF field accelerates reactive ions at  the wafer and pound it like a sand blaster. The  

8:32

ions react with the material  at the bottom, removing it.

8:37

Once that is done, BiCS builds the memory cell  by "plugging" the shaft with deposited material  

8:43

layers. Here, Toshiba ditched  the venerable floating gate  

8:47

and brought something new to the  world of NAND: The charge trap.

8:52

## Charge Traps

8:52

Charge traps are kind of like  floating gates, but different.

8:56

So the floating gate contains  electrons inside a conductive  

8:59

polysilicon layer between two insulating oxides.

9:03

Charge traps are different in that  they use a layer of trapping oxide  

9:07

like silicon nitride between  two insulating oxide layers.

9:12

This seemingly subtle difference  in materials changes how these  

9:17

two devices store electrons. It has been  described that the floating gate carries  

9:22

electrons as like a tub of water,  with the electrons sloshing around.

9:27

A charge trap on the other hand  stores electrons like how a sponge  

9:31

stores water - trapped in discrete  areas inside the material's "pores".

9:37

The charge trap has been around for decades,  

9:39

but it wasn't used in a commercial product  until 2002 when AMD and Fujitsu - now called  

9:45

Spansion - introduced a NOR flash  memory product called MirrorBit.

9:50

Samsung made some 2D NAND stuff using charge  traps in the mid-2000s but until Toshiba used  

9:57

it for BiCS, it wasn't popular. But they  did this because it is more manufacturable.

10:03

Making a floating gate there requires fabs to  pattern and deposit alternating layers of oxide,  

10:10

polysilicon, and then oxide again at a very  specific location in the shaft. Moreover,  

10:17

those gates have to be electrically isolated  from other floating gates in the same shaft.

10:22

It is like trying to stack donuts on top  of donuts at a specific spot inside a metal  

10:27

pipe. And to do that consistently  at each layer for some 100 layers.

10:32

Because the charge trap holds  electrons at discrete locations,  

10:36

the fabs can just focus on the task of depositing  continuous layers of oxide, silicon nitride which  

10:42

is the charge trapping layer, and then  oxide along the inside of the shaft.

10:47

Which is quite formidable, by the way.  These atomically thin layers must be  

10:53

precisely consistent all through this deep  shaft. Keeping that consistency is not easy.

11:00

Anyway. The shaft is finally  finished off with a pillar  

11:03

of amorphous polysilicon. This serves as the  channel through which Electrons will travel.

11:09

Since the BiCS memory cells' layers  are structured as a sandwich of  

11:15

silicon-oxide-nitride-oxide-silicon,  they called it SONOS.

11:21

Has nothing to do with the  wireless speaker brand name.

11:24

What makes this architecture so brilliant is  that no matter how many more layers are added  

11:29

to the stack, only one expensive litho-and-etch  step is needed to pattern and make the shafts.

11:35

In fact, since bit capacity no longer depends  on the cells' sizes, we no longer need the  

11:41

most leading-edge lithography to pattern the  shafts. We can take a step back on resolution.

11:47

## Other Architectures

11:48

At the time of Toshiba's 2007 proposal, Samsung  was the leading NAND vendor with 50% market share.

11:54

Toshiba was second at 30%, and SK Hynix and Micron  both at about 15% market share. As the end of  

12:02

scaling for 2D NAND became more obvious, Toshiba  and Samsung rushed to bring 3D NAND to the market.

12:09

In 2008 and 2009, Samsung proposed a  multitude of architectures - VRAT, TCAT, VSAT,  

12:15

and VG-NAND. Toshiba announced in 2009 an evolved  version of their BiCS, called pipe BiCS or P-BiCS.

12:25

P-BiCS is called that because it takes  the existing string of 2D NAND cells,  

12:30

stretches it out at the  middle, and then folds it over.

12:33

Like a 刈包 (guabao, Taiwanese street food) or pita  

12:35

bread. The result is a U-shaped pipe  that is repeated throughout the die.

12:41

And Intel, SK Hynix and Micron tried a  few approaches with the floating gate,  

12:46

perhaps due to the difficulties of  getting the charge traps to work. I  

12:50

think everyone was just trying to explore  the space and find something that worked.

12:55

## The Polysilicon Problem

12:55

BiCS and P-BiCS were not perfect.  Several major issues cropped up.

13:01

BiCS used polysilicon plates for  its control gates. Polysilicon,  

13:06

even highly doped, has resistance that causes  signals to travel more slowly through it.  

13:11

This degraded the cells' read and write speeds.

13:14

Another issue introduced by the  polysilicon gate is a smaller  

13:18

threshold window. When the cell gets  programmed - meaning the charge traps  

13:22

receive electrons - the threshold voltage  is supposed to rise an appreciable amount.

13:28

But polysilicon gates exert rather weak control  over the channel. As the charge trap is being  

13:34

erased, the weak control causes electrons  to tunnel back into the charge traps.

13:40

This means the difference between the  threshold voltages of the programmed and  

13:46

erased states is not that high.  And that smaller margin makes  

13:50

it harder to distinguish between  the two states, leading to errors.

13:54

Metal would be a far better gate  material than polysilicon. But  

13:59

Toshiba chose polysilicon for a reason. They  needed a material through which they can  

14:04

etch both deeply and cleanly, and metal is  definitely not good for that. So what to do?

14:11

## TCAT In the end, NAND leader Samsung won the race.

14:14

Toshiba's approach begins with the  polysilicon control gate and works  

14:19

its way from the outside in. So we  can call it a "gate first" method.

14:24

Samsung figured out how to produce a metal  control gate using a "gate last" method,  

14:30

something akin to the High-K metal  gate industry transition that the  

14:35

guys over in logic semiconductors  went through in the end of the 2000s.

14:40

Like that gate last method, this  one employs the use of a sacrificial  

14:45

layer, meaning that we deposit it with the  intent of removing it later down the line.

14:51

It starts with depositing alternating layers  of silicon nitride and an oxide dielectric.  

14:57

Then we pattern and punch  holes through this thick stack.

15:02

So far, so good. This feels like  the aforementioned BiCS except  

15:07

that silicon nitride is swapped  in for polysilicon. This silicon  

15:11

nitride is purely "sacrificial"  and will be removed later on.

15:16

But here the process flow changes. Instead  of depositing the various charge trap layers,  

15:23

we fill the shaft with polysilicon to produce  the channel. If you recall, BiCS did this last.

15:30

The next step will be to create the metal  gate surrounding the channel. But if the  

15:35

shaft is already filled, then how are  we going to reach the lower levels?

15:40

This is done by digging trenches on the sides  of the layer stack. We then selectively etch  

15:46

away only the silicon nitride, leaving just the  oxides and the polysilicon channel standing.

15:53

There are basically now empty air gaps between  the stacks. Into those gaps, we can then deposit  

16:00

the nitride charge trap layers, insulating  dielectric layers, and finally the metal gates.

16:07

Samsung called this the Terabit  Cell Array Transistor, or TCAT.

16:11

TCAT is also what they call the Japanese  delivery service company over in Taiwan,  

16:16

but I reckon the Koreans didn't think about that.

16:19

TCAT requires a far more complicated process than  BiCS, but the inclusion of metal gates produced  

16:26

a far better product. So far as I know, they  use a variant called Scalable TCAT to this day.

16:33

## The 3D NAND Era

16:34

Samsung's introduction of 3D NAND, a 24-layer  product in 2013, remade the industry landscape.

16:40

The decelerating pace of 2D scaling helped  the fast-followers catch up. Samsung's lead,  

16:46

once as large as 14 months, declined to just six.

16:51

But 3D NAND reset the table with a  disruptively better product. The first  

16:56

generation was a bit too early. Typical  Samsung. But their second generation  

17:00

VNAND - released in 2014 and made with a  ~20 nanometer node - delivered the goods.

17:06

TechInsights measured it at 2.6 gigabits  per square millimeter. To compare,  

17:11

Samsung had a 2D NAND made with  a 16 nanometer node and it did  

17:15

740 megabits per square millimeter. So 3D  NAND took a step back in process node from  

17:22

16 to 20ish nanometers but still raised  density by 3.5 times. That's incredible.

17:30

This substantially grew Samsung's lead in  both market share and profit, and triggered  

17:35

a race amongst the fast-followers  to commercialize their own 3D NAND.

17:40

Micron and Intel were second in 2015,  bringing a variant that uses the floating  

17:45

gate. The two companies later split their  alliance. Micron switched to charge traps,  

17:50

and Intel sold their NAND business to SK Hynix.

17:54

Toshiba/Western Digital and SK Hynix also joined  the club in the 2015 and 2016 time period.

18:01

Technical pioneer Toshiba later hit financing  problems - Westinghouse, nuclear energy,  

18:06

long story - which I reckon affected the company's  ability to invest like the Korean giants.

18:12

## Limitations Now the path forward is simple. More layers.

18:16

Right now the leader Samsung is at 400, SK  Hynix in the 300s, and everyone else in the  

18:22

high 200s. Predictions of future scaling  have gone up to 500, 600, and even 1,000.

18:29

But more layers bring more challenges,  particularly with High aspect ratio  

18:33

contact or HARC. "Aspect ratio" refers  to the shafts' height to its width.

18:40

We want to keep that high. A 128-layer NAND might  

18:44

have a shaft 100 nanometers wide and  6 micrometers deep, so around 60:1.

18:50

But keeping such a high aspect ratio gets  challenging as the shaft gets longer. It is  

18:56

harder to get enough of the reactive particles  doing the etch down to the bottom of the shaft,  

19:02

resulting in weird shapes  like a cone or even a spiral.

19:06

A new technology developed to address  this is an etch technique called cryogenic  

19:11

etch. As the name implies,  it brings temperatures down  

19:15

to -70 to -196 degrees Celsius for higher  etch speed at lower environmental impact.

19:24

## Conclusion As I always should, I need to thank Tanj for his

19:26

invaluable advice and commentary on  NAND as well as memory in general.

19:31

NAND can never replace DRAM. DRAM is just faster  and does not wear out. But the hard disk drive?  

19:39

As of this writing, HDDs still have the best  cost-per-bit thanks to new innovations like HAMR.

19:45

But the SSD is raising density at a  far faster rate. In the last ten years,  

19:50

we have gone from 2.6 gigabits per square  millimeter to 14 to as high as 28 gigabits.

19:58

This scaling is remarkable, and  continues despite challenges.  

20:03

The rest of the semiconductor industry  - struggling with their own scaling  

20:06

challenges - must be overflowing  with envy. They must be thinking:  

20:10

How can we get that scaling trend for ourselves?  What can we do to get as vertical as NAND?

Interactive Summary

Ask follow-up questions or revisit key timestamps.

This video explains the evolution of NAND flash memory, focusing on the transition from 2D to 3D architectures. Initially, 2D NAND scaled by shrinking cell size, but this hit a technical wall due to reduced electron counts and interference. The industry then moved to 3D NAND, stacking memory cells vertically. Early 3D NAND approaches involved stacking 2D layers, which proved inefficient. Toshiba's Bit Cost Scalable (BiCS) architecture, introduced in 2007, revolutionized 3D NAND by using a "stack, punch, and plug" process with charge traps instead of floating gates. Samsung countered with its Terabit Cell Array Transistor (TCAT) technology, which utilized metal gates for better performance. The adoption of 3D NAND, particularly Samsung's VNAND in 2014, significantly increased storage density and reshaped the market. Current challenges in 3D NAND involve increasing the number of layers while managing high aspect ratio etching, with technologies like cryogenic etch emerging to address these issues. Despite ongoing challenges, 3D NAND continues to scale rapidly, offering a stark contrast to the scaling struggles in other semiconductor industries.

Suggested questions

7 ready-made prompts