High-Density Memories and High-Speed Interfaces with 192-Layer 3D-NAND Technology: A Discussion


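Session 28 is devoted to "High-Density Memories and High-Speed Interface" and covers recent advances in the semiconductor industry. The session opens at 8:30 AM with Ali Khakifirooz of Intel, Santa Clara, presenting a 1.67Tb high-density storage solution. This 5b/cell Flash memory is fabricated in 192-layer floating-gate 3D-NAND technology and achieves a bit density of 23.3Gb/mm². A single die holds 1.67Tb, and its random read time and program time are 354μs and 5500μs, respectively.

At 9:00 AM, Byungryul Kim of SK hynix presents a high-performance 1Tb 3b/cell 3D-NAND Flash memory. The device has more than 300 layers and reaches a write throughput of 194MB/s (megabytes per second). By introducing five design techniques, SK hynix achieves a 34μs random read time, a 194MB/s program throughput, and a bit density above 20Gb/mm². The memory uses a peripheral-circuit-under-cell-array architecture, which further improves storage density and efficiency.

Both results reflect continued progress in 3D-NAND Flash toward higher density and higher speed. They extend what is technically possible in computer storage and support future data storage and processing applications, pointing toward faster and more efficient devices that can meet growing data demand.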
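As a rough cross-check of the figures quoted above, the die area implied by each capacity and bit density can be estimated. This is a minimal sketch that assumes bit density means die capacity divided by die area and that the prefixes are decimal (1Tb = 1000Gb); both are assumptions made here, and the function name is hypothetical.

```python
def implied_die_area_mm2(capacity_gb: float, density_gb_per_mm2: float) -> float:
    """Die area implied by capacity / bit density (capacity in Gb, density in Gb/mm^2)."""
    if density_gb_per_mm2 <= 0:
        raise ValueError("bit density must be positive")
    return capacity_gb / density_gb_per_mm2


# Assumption: decimal prefixes, 1 Tb = 1000 Gb.
intel_area = implied_die_area_mm2(1.67 * 1000, 23.3)
# The SK hynix density is quoted as "above 20Gb/mm^2", so this is an upper bound.
hynix_area_upper_bound = implied_die_area_mm2(1.0 * 1000, 20.0)

print(f"1.67Tb at 23.3Gb/mm^2 -> about {intel_area:.1f} mm^2")
print(f"1Tb at >20Gb/mm^2    -> under {hynix_area_upper_bound:.1f} mm^2")
```

With binary prefixes (1Tb = 1024Gb) the estimates would be about 2.4% larger, so the numbers should be read as order-of-magnitude checks only.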

• 2023 IEEE International Solid-State Circuits Conference

ISSCC 2023 PAPER CONTINUATIONS

Figure 28.1.7: Die photograph and key metrics of the proposed work.


ISSCC 2023 / SESSION 28 / HIGH-DENSITY MEMORIES AND HIGH-SPEED INTERFACE / 28.2

28.2 A High-Performance 1Tb 3b/Cell 3D-NAND Flash with a 194MB/s Write Throughput on over 300 Layers

Byungryul Kim, Seungpil Lee, Beomseok Hah, Kangwoo Park, Yongsoon Park, Kangwook Jo, Yujong Noh, Hyeoncheon Seol, Hyunsoo Lee, Jaehyeon Shin, Seongjin Choi, Youngdon Jung, Sungho Ahn, Yonghun Park, Sujeong Oh, Myungsu Kim, Seonguk Kim, Hyunwook Park, Taeho Lee, Haeun Won, Minsung Kim, Cheulhee Koo, Yeonjoo Choi, Suyoung Choi, Sechun Park, Dongkyu Youn, Junyoun Lim, Wonsun Park, Hwang Hur, Kichang Kwean, Hongsok Choi, Woopyo Jeong, Sungyong Chung, Jungdal Choi, Seonyong Cha

SK hynix Semiconductor, Icheon, Korea

As data produced by multimedia explodes and demand for data storage increases, the most important topics for the NAND-Flash memory field are continuous performance improvements and cost/bit reduction. To improve performance, features to improve the quality of service (QoS) as well as the read/write performance [1] are required. To reduce the cost/bit, the number of stacked layers needs to increase, while the pitch between stacked layers decreases. It is necessary to manage the increasing WL resistance produced by a decreased stack pitch. To overcome these challenges, this paper presents techniques applied to a >300-layer 1Tb 3b/cell (TLC) 3D-NAND Flash memory: 1) A triple-verify program (TPGM) technique is used to improve program performance. 2) An adaptive unselected string pre-charge (AUSP) technique is used to reduce disturb and program time (tPROG). 3) A programmed dummy string (PDS) technique is used to reduce WL settling time. 4) An all-pass rising (APR) technique is used to reduce the read time (tR). 5) A plane-level read retry (PLRR) technique is used during erase to improve the QoS.

The TPGM scheme reduces tPROG by narrowing the cell threshold voltage (VTH) distribution. Increasing the step voltage (VSTEP) of incremental step pulse programming is one way to reduce program time, but a larger VSTEP makes the VTH distribution wider. Improving the VTH distribution is therefore essential to increasing the step voltage and reducing the program time. In a program operation, the threshold voltage difference (ΔVTH) is determined by the difference between the step voltage applied to the WL and the channel voltage (VCH). Figure 28.2.1(a) and Fig. 28.2.1(b) present the difference between the double-verify program (DPGM) and the TPGM scheme. The DPGM scheme [2] divides cells into three groups according to the program verify (PV) levels and then controls the channel voltage of each group by applying three different BL voltages (VBL). An inhibit voltage is applied to the group 1 (GR1) BLs to isolate the channels, so the cells of GR1 are not programmed. An intermediate BL voltage is applied to the group 2 (GR2) BLs, so ΔVTH is VSTEP minus that BL voltage. 0V is applied to the group 3 (GR3) BLs, so ΔVTH = VSTEP. In DPGM, the VTH distribution can thus be improved by two kinds of ΔVTH. TPGM adds one more group to the existing three groups of DPGM; its ΔVTH is VSTEP minus a second intermediate BL voltage that differs from the one used for GR2. TPGM therefore categorizes cells into four groups according to their PV levels and drives the channel voltage of each group by applying four different BL voltages. Figure 28.2.1(c) illustrates the counter driving scheme that prevents the BL coupling effect. BL1 is driven by the series connection of NMOS and is set to VREF1 – VTHN, while BL2 is initially set to a higher voltage and is discharged to VREF2 + VTHP by the series connection of PMOS and NMOS. VTHN and VTHP represent the threshold voltages of the NMOS and the PMOS. BL1 rising is affected by BL2 falling; however, the BL1 level does not exceed the target level due to inverse coupling. The counter driving scheme enhances BL settling and TPGM efficiency. Converting the VTH distribution improvement into program time reduction results in approximately a 10% program time reduction.
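The grouping idea behind DPGM and TPGM can be illustrated with a small simulation. This is a simplified sketch, not the paper's circuit: it assumes each program pulse shifts a cell by exactly ΔVTH = VSTEP minus its BL voltage, ignores noise and cell-to-cell variation, and uses hypothetical names, verify margins, and voltages chosen only for illustration.

```python
def program_to_verify(vth_init, pv, v_step, bands):
    """Pulse every cell up to the verify level `pv`; return the final VTH list.

    bands: (margin, v_bl) pairs sorted by increasing margin. A cell whose VTH
    lies within `margin` below `pv` gets BL voltage `v_bl`; cells farther away
    get 0 V. Cells at or above `pv` are inhibited (not programmed further).
    """
    if any(v_step - v_bl <= 0 for _, v_bl in bands):
        raise ValueError("every group needs a positive VTH step")
    vth = list(vth_init)
    while min(vth) < pv:
        for i, v in enumerate(vth):
            if v >= pv:
                continue  # passed verify: inhibited group
            gap = pv - v
            v_bl = 0.0  # farthest group: full step
            for margin, band_v_bl in bands:
                if gap <= margin:
                    v_bl = band_v_bl
                    break
            vth[i] = v + (v_step - v_bl)  # simplified: dVTH = VSTEP - VBL
    return vth


def width(vth):
    """Spread of the programmed distribution."""
    return max(vth) - min(vth)


# Hypothetical starting VTH values (volts) and bias settings.
cells = [i * 0.01 for i in range(100)]
dpgm = program_to_verify(cells, pv=2.0, v_step=0.4, bands=[(0.3, 0.2)])
tpgm = program_to_verify(cells, pv=2.0, v_step=0.4,
                         bands=[(0.15, 0.3), (0.3, 0.2)])

print(f"DPGM-like (3 groups) width: {width(dpgm):.2f} V")
print(f"TPGM-like (4 groups) width: {width(tpgm):.2f} V")
```

In this toy model the extra group lets cells close to the verify level take a smaller final step, so the programmed distribution comes out narrower at the same VSTEP. That margin is what the text describes being traded for a larger step voltage and a shorter program time.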

The AUSP scheme reduces tPROG by tightening the cell's VTH distribution. A program pulse is preceded by an unselected-string precharge (USP) [3] period to initialize all channels. USP prevents a lack of channel boosting during a program pulse by precharging the channels, but a hot-carrier injection (HCI) disturbance occurs, as shown in Fig. 28.2.2(a). A voltage below VPASS (VLOW) is applied to all WLs, and a selected cell with a VTH higher than VLOW is turned off. The source-selection line (SSL) side channel is pre-charged, while the drain-selection line (DSL) side channel is undriven. Due to the voltage difference between the SSL- and DSL-side channels, the HCI disturbance is produced by the high electric field. In the AUSP scheme, the SSL-side dummy WL is controlled by VDWL, and VDWL – VTH(DummyCell) is applied to the channel. HCI disturbances are reduced due to the lower electric field. Figure 28.2.2(b) illustrates the incremental channel initialization voltage, which is proportional to the number of program loops. The channel initialization voltage corresponds to the SSL-side channel voltage; a higher channel initialization voltage is required for higher program loops. The channel initialization voltage can be lowered for lower program loops, thereby reducing HCI disturb further. As shown in Fig. 28.2.2(c), the cell's VTH distribution widens after programming with a conventional USP, while programming with AUSP results in a narrower VTH distribution. This reduced VTH distribution contributes to around a 2% tPROG reduction.
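The loop-dependent channel initialization described above can be sketched as a simple schedule. This is an illustrative assumption only: the linear ramp, the clamp, and every numeric value below are hypothetical, and the function is not taken from the paper. It only encodes the stated relation that the channel receives VDWL – VTH(DummyCell) and that the level rises with the program loop count.

```python
def channel_init_voltage(loop, v_dwl_start, v_dwl_step, vth_dummy, v_max):
    """Channel initialization voltage for a given program loop (volts).

    The dummy-WL bias VDWL is ramped linearly with the loop index (assumed),
    and the channel sees VDWL - VTH(DummyCell), clamped to [0, v_max].
    """
    if loop < 0:
        raise ValueError("loop index must be non-negative")
    v_dwl = v_dwl_start + v_dwl_step * loop
    return max(0.0, min(v_dwl - vth_dummy, v_max))


# Hypothetical bias settings, chosen only to show the shape of the schedule.
schedule = [channel_init_voltage(loop, v_dwl_start=1.5, v_dwl_step=0.25,
                                 vth_dummy=1.0, v_max=2.0)
            for loop in range(11)]
print(schedule)
```

Early loops use a low initialization voltage, which is where the text says HCI disturb can be reduced further; later loops ramp up to the level needed for channel boosting.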

The PDS scheme reduces tR and tPROG by programming dummy cells of the dummy strings. DSLs are divided by the DSL cut, as shown in Fig. 28.2.3(a), which separates each DSL; meanwhile, the dummy WLs, main WLs, and SSLs are connected to several strings in the 3D-NAND cell array. A dummy string produced by the DSL cut acts as a capacitive load when a WL rises or falls, and hence delays WL settling. Figures 28.2.3(b) and 28.2.3(c) present the different channel conditions of an unprogrammed dummy string and a programmed dummy string. In an unprogrammed dummy string, all the cells are turned on, and the channel voltage becomes 0V via the source-line voltage when VPASS is applied to all WLs. The non-floating channel acts as a capacitive load and affects the WL settling time. The PDS scheme programs the VTH of the dummy string's SSL-side dummy cell above VPASS to turn off the dummy cell. As the SSL-side dummy cell is turned off, the floating channel no longer acts as a capacitive load and the WL settling time is reduced.
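The effect of the dummy-string load on WL settling can be approximated with a first-order lumped RC estimate. This is a sketch under strong assumptions: the WL is treated as a single resistor and capacitor, each grounded dummy-string channel adds a fixed capacitance, a floating channel adds none, and all component values are hypothetical.

```python
import math


def wl_settling_time(r_wl, c_wl, c_dummy_channel, grounded_dummy_strings,
                     tolerance=0.01):
    """Time for a single-pole RC node to settle within `tolerance` of its target.

    Only dummy strings whose channel is tied to 0 V (unprogrammed case) are
    counted as extra load; floating channels are assumed to add nothing.
    """
    c_total = c_wl + grounded_dummy_strings * c_dummy_channel
    return r_wl * c_total * math.log(1.0 / tolerance)


# Hypothetical values: 50 kOhm WL, 2 pF WL capacitance, 0.05 pF per dummy channel.
unprogrammed = wl_settling_time(50e3, 2e-12, 0.05e-12, grounded_dummy_strings=4)
programmed = wl_settling_time(50e3, 2e-12, 0.05e-12, grounded_dummy_strings=0)

print(f"unprogrammed dummy strings: {unprogrammed * 1e9:.0f} ns")
print(f"programmed dummy strings:   {programmed * 1e9:.0f} ns")
```

In this model the saving scales with the share of the total capacitance contributed by the grounded dummy channels, so the benefit depends entirely on the real array geometry, which the sketch does not capture.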

The APR scheme reduces tR by reducing the WL rise time. The different resistance and capacitance characteristics of each WL require different VPASS sources to be connected to each WL group, and one source is selected by the switch circuits. As depicted in Fig. 28.2.4(a), in a conventional scheme one target VPASS source is selected and applied to the dedicated WL during the VPASS rise time. As shown in Fig. 28.2.4(b), the APR scheme divides the VPASS rise time into two parts, A and B. In part A, all VPASS sources are connected to all WLs to reduce the WL rise time. In part B, one target VPASS source is applied to the dedicated WL, which is the same as the conventional VPASS rising scheme. The APR scheme reduces tR by around 2%.
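The two-part rise can be illustrated with a toy lumped-RC model. This is only a sketch under stated assumptions: each WL group is one capacitor, each VPASS source is an ideal voltage behind a resistance, part A shorts all sources and all groups together, and part B returns each group to its own source. The component values, the part A duration, and the function names are all hypothetical, and the result is not meant to reproduce the paper's 2% figure.

```python
import math


def conventional_rise_time(groups, tol=0.01):
    """Worst-case settling time when each group (r, c, v_target) is driven
    only by its own source (single-pole RC, settle within tol of target)."""
    return max(r * c * math.log(1.0 / tol) for r, c, _ in groups)


def apr_rise_time(groups, t_a, tol=0.01):
    """Worst-case settling time with part A (all sources drive all WLs for
    t_a seconds) followed by part B (each group on its own source)."""
    g_total = sum(1.0 / r for r, _, _ in groups)          # parallel conductance
    c_total = sum(c for _, c, _ in groups)                # all WL groups tied
    v_common = sum(v / r for r, _, v in groups) / g_total  # shared final level
    v_end_a = v_common * (1.0 - math.exp(-t_a * g_total / c_total))
    worst = 0.0
    for r, c, v_target in groups:
        err = abs(v_target - v_end_a)   # remaining error entering part B
        band = tol * v_target
        t_b = r * c * math.log(err / band) if err > band else 0.0
        worst = max(worst, t_a + t_b)
    return worst


# Hypothetical groups: (source resistance in ohms, WL capacitance in F, target V).
groups = [(10e3, 3e-12, 6.0), (10e3, 1e-12, 6.5)]
conventional = conventional_rise_time(groups)
apr = apr_rise_time(groups, t_a=40e-9)

print(f"conventional: {conventional * 1e9:.1f} ns, two-part: {apr * 1e9:.1f} ns")
```

The gain in this model comes from the heavily loaded group borrowing drive strength from the lightly loaded one during part A; if all groups had identical RC products, the two schemes would settle in roughly the same time.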

As program/erase (P/E) cycles increase, the number of erroneous bits also increases; adjusting the read voltage bias can reduce the number of erroneous bits. The read retry (RR) scheme with read level change is one effective method to overcome these situations. However, in a conventional RR the read level can only be changed when the read operations for all planes in the NAND device are completed. As a result, the read performance is determined by the last plane to finish. In this work, a PLRR scheme is used to alleviate read performance deterioration in the NAND controller. Figure 28.2.5 shows an example PLRR sequence: the read level is changed regardless of the operations occurring in other planes. Therefore, the read performance can be improved compared to the conventional scheme, since subsequent read commands can be issued immediately. In addition, the PLRR effect becomes greater when the number of planes increases.
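The scheduling difference between conventional RR and PLRR can be sketched with a small timing model. It assumes every read attempt (first read or retry) costs one tR, that the conventional scheme advances all planes in lockstep and waits for the slowest plane before the next command, and that PLRR lets each plane proceed independently. The attempt counts are hypothetical and times are expressed in units of tR.

```python
def conventional_rr_time(attempts_per_plane, t_r=1.0):
    """Total time when each round of commands waits for the slowest plane.

    attempts_per_plane[p][k] is the number of read attempts (1 + retries)
    that plane p needs for its k-th command; all planes have equal length.
    """
    rounds = zip(*attempts_per_plane)  # k-th command of every plane
    return sum(max(round_attempts) for round_attempts in rounds) * t_r


def plane_level_rr_time(attempts_per_plane, t_r=1.0):
    """Total time when each plane retries and takes new commands on its own."""
    return max(sum(plane) for plane in attempts_per_plane) * t_r


# Hypothetical workload: 4 planes, 3 read commands each.
attempts = [
    [1, 3, 1],
    [2, 1, 1],
    [1, 1, 3],
    [1, 1, 1],
]
conventional = conventional_rr_time(attempts)
plane_level = plane_level_rr_time(attempts)
print(f"conventional RR: {conventional:.0f} tR, plane-level RR: {plane_level:.0f} tR")
```

Because the maximum of per-plane sums can never exceed the sum of per-round maxima, the plane-level schedule is never slower in this model, and the gap tends to widen as more planes are added, which matches the trend stated in the text.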

In this work, five new techniques are introduced to achieve a high-performance 1Tb 3b/cell 3D-NAND Flash memory using a peripheral circuit under cell array architecture. The key comparison table, shown in Fig. 28.2.6, reports a 20Gb/mm² bit density, which is achieved by using over 300 stacked WLs, with an improved program throughput, tR, and bit density compared to prior work [4]. A die microphotograph of the fabricated TLC NAND chip is shown in Fig. 28.2.7.

References:

[1] A. Grossi et al., “Quality-of-service implications of enhanced program algorithms for charge-trapping NAND in future solid-state drives,” IEEE Trans. Device Mater. Rel., vol. 15, no. 3, pp. 363-369, Sept. 2015.

[2] C. Miccoli et al., “Investigation of the programming accuracy of a double-verify ISPP algorithm for nanoscale NAND Flash memories,” IEEE IRPS, pp. 5.1-5.6, 2011.

[3] R. Yamashita et al., “A 512Gb 3b/cell flash memory on 64-word-line-layer BiCS technology,” ISSCC, pp. 196-197, 2017.

[4] M. Kim et al., “A 1Tb 3b/Cell 8th-Generation 3D-NAND Flash Memory with 164MB/s Write Throughput and a 2.4Gb/s Interface,” ISSCC, pp. 136-137, 2022.
