LazyFTL: A Page-Level FTL Design Optimized for NAND Flash


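"LazyFTL is a page-level Flash Translation Layer (FTL) design optimized for NAND flash. It aims to address the performance degradation that traditional FTL schemes suffer under certain access patterns and the high cost of their merge operations."

The SSD (Solid State Drive) is an important part of modern storage technology. Built on NAND flash chips, it offers faster reads and writes, lower power consumption, and better shock resistance than a traditional mechanical hard disk. However, NAND flash cannot directly replace a traditional disk: pages cannot be overwritten in place, each block endures only a limited number of erase cycles, and data is programmed in pages but erased in whole blocks. The FTL was introduced to deal with these constraints.

The FTL translates logical addresses to physical ones and carries out garbage collection and wear leveling, hiding the peculiarities of NAND flash and presenting a disk-like block device interface to the file system above it. FTL design and implementation are critical to SSD performance because they directly affect read and write speed, lifetime, and stability. Most existing FTL schemes are optimized for particular access patterns. In other cases, such as frequent small writes, the cost of merge operations rises sharply and SSD performance drops. The problem is most visible under highly concurrent or random-write-intensive workloads.

LazyFTL's contribution is a smarter page-level management strategy. By delaying updates and merge operations, the design reduces unnecessary write amplification while preserving data consistency as far as possible. Without affecting normal reads, LazyFTL postpones writes to flash as long as it can, performing the physical write only when a threshold is reached or space must be freed. This reduces the extra overhead caused by frequent updates to the logical-to-physical address mapping. LazyFTL may also include an optimized wear-leveling algorithm that spreads erase counts evenly across the flash with little performance impact, extending the overall lifetime of the SSD. Through this fine-grained management, LazyFTL adapts better to a variety of workloads and delivers more stable and efficient storage performance.

In short, LazyFTL is a new FTL design optimized for NAND flash. It uses a lazy strategy to reduce the cost of merge operations, improve SSD performance under complex access patterns, and extend flash lifetime. The work offers a new direction for FTL design and matters for improving SSD efficiency and reliability.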

Figure 1: Three Types of Merge Operations

DBA which occupies most of the flash memory while each valid page in the LBA is traced by another page-level mapping. The LBA is very small and generally takes less than 5 percent of the entire flash memory. In hybrid FTLs (except HFTL [17]), the LBA is used to store overwriting data, and different schemes adopt different strategies to merge data in the LBA into the DBA to generate new space for the LBA. There are three types of merge operations, as illustrated in Figure 1. A full merge is a general but expensive operation in which all up-to-date pages need to be copied to a newly allocated block, after which the old blocks are erased and put back into the free block pool. The partial and switch merges are efficient but apply only in special cases: the pages in the log block or the replacement block must all be either free or valid, and each valid page must be written in its own place. Although many hybrid FTL schemes try to do partial or switch merges whenever possible, full merges are difficult to avoid across different access patterns. This creates an insurmountable bottleneck for all hybrid FTL schemes.
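The condition that separates the three merge types can be sketched as a small classifier. This is a minimal illustration, not code from the paper: the `Slot` representation and the name `classify_merge` are assumptions, and the rule encoded is only the one stated above (every page free or valid, every valid page at its own offset).

```python
from typing import List, Optional, Tuple

# Hypothetical model of one log block slot:
# None if the page is still free, else (logical_offset, is_valid).
Slot = Optional[Tuple[int, bool]]

def classify_merge(log_block: List[Slot]) -> str:
    """Return which merge a log block qualifies for: 'switch', 'partial' or 'full'."""
    for position, slot in enumerate(log_block):
        if slot is None:
            continue  # free pages never rule out a cheap merge
        logical_offset, is_valid = slot
        # An invalidated page, or a page stored away from its own offset,
        # means the up-to-date data must be gathered into a new block.
        if not is_valid or logical_offset != position:
            return "full"
    # Every programmed page is valid and in place.
    if all(slot is not None for slot in log_block):
        return "switch"   # the log block can simply become the data block
    return "partial"      # the remaining pages are copied in first

print(classify_merge([(0, True), (1, True), (2, True), (3, True)]))
print(classify_merge([(0, True), (1, True), None, None]))
print(classify_merge([(2, True), (0, True), None, None]))
```

Random overwrites almost always produce the third shape, which is why full merges are hard to avoid.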

It is also possible to map variable-length runs of continuous logical pages to continuous physical pages in flash memory. In this case, the granularity can be adjusted dynamically when the access pattern changes. However, since the sizes of different mapping units are not identical and keep changing, mapping entries can only be stored in some type of search tree. As a result, the table look-up overhead of variable-length mappings is higher than that of other schemes, whose mapping table is nothing more than a simple address array.
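The difference in look-up cost can be illustrated with a small sketch. The class `ExtentMap` and its methods are hypothetical names, extents are assumed not to overlap, and a sorted array with binary search stands in for the search tree mentioned above. A fixed-granularity table would answer the same query with a single array index.

```python
import bisect

class ExtentMap:
    """Variable-length mapping: each entry covers a run of consecutive pages."""

    def __init__(self):
        self._starts = []   # sorted logical start pages
        self._extents = []  # parallel list of (logical_start, length, physical_start)

    def insert(self, logical_start, length, physical_start):
        # Assumption: the new extent does not overlap an existing one.
        i = bisect.bisect_left(self._starts, logical_start)
        self._starts.insert(i, logical_start)
        self._extents.insert(i, (logical_start, length, physical_start))

    def lookup(self, logical_page):
        # Binary search for the last extent starting at or before the page.
        i = bisect.bisect_right(self._starts, logical_page) - 1
        if i < 0:
            return None
        start, length, physical_start = self._extents[i]
        if logical_page < start + length:
            return physical_start + (logical_page - start)
        return None  # the page falls in a gap between extents

m = ExtentMap()
m.insert(0, 4, 100)    # logical pages 0..3  -> physical 100..103
m.insert(10, 2, 200)   # logical pages 10..11 -> physical 200..201
print(m.lookup(2), m.lookup(11), m.lookup(5))
```

Each look-up here costs a logarithmic search rather than one array access, which is the overhead the paragraph refers to.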

2.3 Page-level FTL Schemes

The first FTL scheme was patented by Ban in 1995 [3] and was adopted by the PCMCIA as a standard for NOR-based flash memories several years later [12]. There is one issue that NOR-based FTLs must handle first. When a page is overwritten, the relevant entry in flash memory needs to be updated to keep the operation atomic and reliable. (Remember that page-level FTL schemes keep an entire mirror of the mapping table in flash memory to reduce the SRAM overhead.) This presents no difficulty to the NOR-based FTL, since NOR-type flash memories can be programmed in bytes. By assigning a replacement page list to the relevant mapping page when necessary, the mapping page can be updated (each update is written to the first free entry at the same offset in the replacement page list) as many times as the list is long without rewriting the entire mapping page [12, 9].
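The replacement page list can be pictured with the following sketch. It is an illustration under assumptions, not the patented design: `MappingPage`, `update` and `lookup` are hypothetical names, and Python lists stand in for byte-programmable NOR pages.

```python
FREE = None  # marks an entry that has not been programmed yet

class MappingPage:
    """One mapping page plus a fixed-length list of replacement pages."""

    def __init__(self, entries, list_length):
        self.base = list(entries)
        # Every replacement page has the same number of entries as the base page.
        self.replacements = [[FREE] * len(entries) for _ in range(list_length)]

    def update(self, offset, new_physical):
        # Write into the first free entry at the same offset in the list.
        for page in self.replacements:
            if page[offset] is FREE:
                page[offset] = new_physical
                return True
        # List exhausted at this offset: the whole mapping page must be rewritten.
        return False

    def lookup(self, offset):
        # The newest value is the last programmed entry at this offset.
        current = self.base[offset]
        for page in self.replacements:
            if page[offset] is FREE:
                break
            current = page[offset]
        return current

mp = MappingPage([10, 11, 12], list_length=2)
mp.update(1, 50)
mp.update(1, 60)
print(mp.lookup(1), mp.update(1, 70))
```

With a list of length 2, an entry absorbs two updates in place; the third forces a rewrite, matching the "as many times as the list is long" limit.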

DFTL (Demand-based FTL) [10], another page-level FTL scheme, makes the first attempt to transfer the former NOR-based FTL to NAND-type flash memories, omitting the replacement page part. This scheme, though efficient, faces a serious reliability problem, since all modified information in the SRAM will be lost if a system failure occurs. In this case, spare areas of all data pages need to be scanned until the system recovers to a consistent state. Therefore, DFTL is not suitable, we believe, for circumstances where flash memory is regarded as a permanent and reliable storage device.

2.4 Block-level FTL Schemes

Ban patented two other FTL schemes in 1999 [4, 8, 9]. These schemes are designed for NAND-type flash memories and are also known as the NFTLs. In this paper, they will be cited as NFTL-1 and NFTL-N. NFTL-1 is designed for flash memories that have a spare area for each page, and NFTL-N is for devices without such storage.

When a page is overwritten, NFTL-1 first allocates a replacement block for the relevant logical block if there is none and writes overwriting pages one after another from the beginning of the replacement block. Since pages are written in an out-of-place manner in replacement blocks, NFTL-1 needs to scan all the spare areas in the replacement block in reverse order to find the most up-to-date version of a requested page. Fortunately, the spare areas in NAND-type flash memory use a different addressing algorithm that is optimized for fast reference, and the overhead of this search process is relatively low.
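The reverse scan can be sketched in a few lines. The function name `find_latest` and the modelling of a replacement block as a list of logical offsets read from the spare areas are assumptions made for illustration.

```python
def find_latest(spare_offsets, requested_offset):
    """Return the physical index of the newest copy of a logical page.

    spare_offsets[i] is the logical offset recorded in the spare area of
    physical page i of the replacement block (hypothetical layout).
    """
    # Pages are appended in write order, so the newest copy is the last match.
    for physical_index in range(len(spare_offsets) - 1, -1, -1):
        if spare_offsets[physical_index] == requested_offset:
            return physical_index
    return None  # never overwritten: the data block still holds the page

# Logical page 3 was overwritten twice; its newest copy is physical page 2.
print(find_latest([3, 1, 3, 0], 3))
```

Scanning backwards lets the search stop at the first match instead of reading every spare area.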

On the other hand, since some models of NAND flash memories have no spare areas to support fast search, NFTL-N keeps a replacement block list for some of the logical blocks when necessary. Write requests for each logical page are first handled by the first block in the list and then the next one, keeping the in-block offset identical to that of the logical address. If all pages in the list at the requested offset have been programmed, a new block is allocated and appended to the back of the list.
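A minimal sketch of this write path follows. The class name `NftlNLogicalBlock` is hypothetical, the primary data block is left out for brevity, and `None` is assumed to mark an unprogrammed page (so it cannot be stored as data).

```python
class NftlNLogicalBlock:
    """Replacement block list for one logical block, in the style of NFTL-N."""

    def __init__(self, pages_per_block):
        self.pages_per_block = pages_per_block
        self.replacement_list = []  # each block is a list of page slots

    def write(self, offset, data):
        # Try each block in list order, always at the same in-block offset.
        for block in self.replacement_list:
            if block[offset] is None:
                block[offset] = data
                return
        # Every block already has this offset programmed: append a new block.
        new_block = [None] * self.pages_per_block
        new_block[offset] = data
        self.replacement_list.append(new_block)

    def read(self, offset):
        # The newest copy is the last programmed slot at this offset.
        latest = None
        for block in self.replacement_list:
            if block[offset] is None:
                break
            latest = block[offset]
        return latest

lb = NftlNLogicalBlock(pages_per_block=4)
lb.write(2, "a")
lb.write(2, "b")   # offset 2 is taken in the first block, so a second is appended
lb.write(0, "c")   # offset 0 is still free in the first block
print(len(lb.replacement_list), lb.read(2), lb.read(0))
```

Because the offset is fixed, no spare-area search is needed, but repeated overwrites of one page grow the list quickly.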

2.5 Hybrid FTL Schemes

BAST (Block-Associative Sector Translation), proposed in 2002 [15], is the first hybrid FTL scheme and is essentially an altered version of NFTL-1. As mentioned earlier, hybrid FTL schemes build a page-level mapping for the LBA. To keep this table small enough to reside in the SRAM, BAST limits the total number of replacement blocks (also known as log blocks). Obviously, the read performance of BAST is better than that of NFTL-1 because the SRAM is several orders of magnitude faster than flash memories. However, BAST does not work well with random overwrite patterns, which may result in a block thrashing problem [20]. Since each replacement block can accommodate pages from only one logical block, BAST can easily run out of free replacement blocks and be forced to reclaim replacement blocks that have not been filled. Therefore, the utilization ratio of replacement blocks in BAST is low both theoretically and experimentally.
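Block thrashing can be reproduced with a toy model of a bounded log block pool. Everything here is an assumption made for illustration: the class `BastLogPool`, the oldest-allocated victim policy, and the bookkeeping that records how many pages each reclaimed log block had used. The merge itself is not modelled.

```python
from collections import OrderedDict

class BastLogPool:
    """Bounded pool of log blocks, each dedicated to one logical block."""

    def __init__(self, max_log_blocks, pages_per_block):
        self.max_log_blocks = max_log_blocks
        self.pages_per_block = pages_per_block
        self.log_blocks = OrderedDict()  # logical block number -> pages used
        self.reclaimed_fill = []         # pages used by each reclaimed log block

    def write(self, logical_page):
        lbn = logical_page // self.pages_per_block
        if self.log_blocks.get(lbn) == self.pages_per_block:
            self._reclaim(lbn)  # this block's own log block is full
        if lbn not in self.log_blocks:
            if len(self.log_blocks) == self.max_log_blocks:
                # Assumed victim policy: the log block allocated earliest.
                self._reclaim(next(iter(self.log_blocks)))
            self.log_blocks[lbn] = 0
        self.log_blocks[lbn] += 1

    def _reclaim(self, lbn):
        self.reclaimed_fill.append(self.log_blocks.pop(lbn))

# Random-like overwrites that touch a different logical block each time.
pool = BastLogPool(max_log_blocks=2, pages_per_block=4)
for page in (0, 4, 8, 12):
    pool.write(page)
print(pool.reclaimed_fill)
```

In this run each reclaimed log block holds one page out of four, whereas writing pages 0, 1, 2, 3 and then 0 again reclaims a single, completely filled block.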

To solve the block thrashing problem, another hybrid FTL scheme named FAST (Fully Associative Sector Translation) was put forward [20]. FAST goes to the other extreme by allowing a log block to hold updates from any data block. Although FAST successfully delays garbage collections as much as possible, the system-wide latency for reclaiming a single log block may turn out to be longer than in BAST, since the associativity of log blocks is limited only by the number of pages in a block. The associativity of a log block is defined as the number of different data blocks whose most up-to-date pages are located in the log block. The higher the associativity of a log block is, the more expensive it is to

