Coping with Complex Failures: IRON File Systems and the Fail-Partial Failure Model
"IRON File Systems" (iron-sosp05) is a computer-science research paper that examines the complex failure modes of modern disks and how to build more robust file systems to cope with them. The abstract observes that commodity file systems have traditionally assumed a disk either works or fails completely, whereas modern disks in fact exhibit more complex partial failures, such as latent sector errors and block corruption. The authors propose a fail-partial failure model that accounts for these localized failures common in the real world. To study how current commodity file systems handle such realistic disk faults, they develop and apply a novel failure-policy fingerprinting framework, which probes a file system's reaction to disk failures under a range of realistic scenarios. They classify the resulting failure policies under a new taxonomy, Internal RObustNess (IRON), which covers both failure-detection and recovery techniques. Through this classification they find that the failure policies of existing commodity file systems are often inconsistent, sometimes buggy, and generally inadequate in their recovery capabilities. In response, the authors design, implement, and evaluate a prototype IRON file system aimed at greater fault tolerance and data integrity. The paper's contributions are a new model of disk failure, an exposure of the limitations of existing file systems in handling partial failures, and proposed improvements. For system developers and storage researchers, this work may well shape future file-system designs to better accommodate the complex failure behavior of modern disks.
Electrical: A power spike or surge can damage in-drive circuits
and hence lead to drive failure [68]. Thus, electrical problems can
lead to entire disk failure.
Drive firmware: Interesting errors arise in the drive controller,
which consists of many thousands of lines of real-time, concurrent
firmware. For example, disks have been known to return correct
data but circularly shifted by a byte [37] or have memory leaks
that lead to intermittent failures [68]. Other firmware problems
can lead to poor drive performance [54]. Some firmware bugs are
well-enough known in the field that they have specific names; for
example, “misdirected” writes are writes that place the correct data
on the disk but in the wrong location, and “phantom” writes are
writes that the drive reports as completed but that never reach the
media [73]. Phantom writes can be caused by a buggy or even mis-
configured cache (i.e., write-back caching is enabled). In summary,
drive firmware errors often lead to sticky or transient block corrup-
tion but can also lead to performance problems.
Transport: The transport connecting the drive and host can also be
problematic. For example, a study of a large disk farm [67] reveals
that most of the systems tested had interconnect problems, such
as bus timeouts. Parity errors also occurred with some frequency,
either causing requests to succeed (slowly) or fail altogether. Thus,
the transport often causes transient errors for the entire drive.
Bus controller: The main bus controller can also be problematic.
For example, the EIDE controller on a particular series of moth-
erboards incorrectly indicates completion of a disk request before
the data has reached the main memory of the host, leading to data
corruption [72]. A similar problem causes some other controllers to
return status bits as data if the floppy drive is in use at the same time
as the hard drive [26]. Others have also observed IDE protocol ver-
sion problems that yield corrupt data [23]. In summary, controller
problems can lead to transient block failure and data corruption.
Low-level drivers: Recent research has shown that device driver
code is more likely to contain bugs than the rest of the operating
system [15, 22, 66]. While some of these bugs will likely crash the
operating system, others can issue disk requests with bad parame-
ters, data, or both, resulting in data corruption.
2.3 The Fail-Partial Failure Model
From our discussion of the many root causes for failure, we are
now ready to put forth a more realistic model of disk failure. In our
model, failures manifest themselves in three ways:
• Entire disk failure: The entire disk is no longer accessible. If
permanent, this is the classic “fail-stop” failure.
• Block failure: One or more blocks are not accessible; often re-
ferred to as “latent sector errors” [33, 34].
• Block corruption: The data within individual blocks is altered.
Corruption is particularly insidious because it is silent – the storage
subsystem simply returns “bad” data upon a read.
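The three failure manifestations above can be sketched in code; the checksum check below also illustrates why block corruption is the insidious case: without an integrity check, the read "succeeds" and the bad data propagates. This is a minimal illustrative sketch (all names are hypothetical, not from the paper):

```python
import hashlib
from enum import Enum, auto

class DiskFailure(Enum):
    ENTIRE_DISK = auto()       # classic fail-stop: nothing is accessible
    BLOCK_FAILURE = auto()     # latent sector error: the request returns an error
    BLOCK_CORRUPTION = auto()  # silent: "bad" data is returned without any error

def checksum(block: bytes) -> bytes:
    """Per-block checksum stored out-of-band alongside the data."""
    return hashlib.sha256(block).digest()

def verified_read(block: bytes, stored_sum: bytes):
    """Return the block if its checksum matches; otherwise surface
    the otherwise-silent corruption as an explicit failure."""
    if checksum(block) != stored_sum:
        return None, DiskFailure.BLOCK_CORRUPTION
    return block, None

good = b"file data"
s = checksum(good)
assert verified_read(good, s) == (good, None)          # clean read passes
assert verified_read(b"bitrot!!!", s)[1] is DiskFailure.BLOCK_CORRUPTION
```

The key point the sketch makes: block failure announces itself via an error code, but corruption must be actively detected, e.g. by a checksum kept separately from the block.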
We term this model the Fail-Partial Failure Model, to empha-
size that pieces of the storage subsystem can fail. We now discuss
some other key elements of the fail-partial model, including the
transience, locality, and frequency of failures, and then discuss how
technology and market trends will impact disk failures over time.
2.3.1 Transience of Failures
In our model, failures can be “sticky” (permanent) or “transient”
(temporary). Which behavior manifests itself depends upon the
root cause of the problem. For example, a low-level media problem
portends the failure of subsequent requests. In contrast, a transport
or higher-level software issue might at first cause block failure or
corruption; however, the operation could succeed if retried.
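The transient/sticky distinction directly motivates a bounded retry loop: a retry recovers a transport glitch but merely wastes time on a media failure. A minimal sketch, with all names hypothetical:

```python
def read_with_retry(read_fn, retries=3):
    """Retry a failing read a bounded number of times.
    A transient fault (e.g. a bus timeout) may succeed on retry;
    a sticky fault (e.g. damaged media) fails every attempt, and
    the final error is surfaced to the caller."""
    last_err = None
    for _ in range(retries):
        try:
            return read_fn()
        except IOError as e:
            last_err = e      # possibly transient: try again
    raise last_err            # still failing: treat as sticky

# A simulated transient fault: the first two attempts hit a
# transport problem, the third succeeds.
attempts = {"n": 0}
def flaky_read():
    attempts["n"] += 1
    if attempts["n"] < 3:
        raise IOError("bus timeout")
    return b"sector data"

assert read_with_retry(flaky_read) == b"sector data"
```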
2.3.2 Locality of Failures
Because multiple blocks of a disk can fail, one must consider
whether such block failures are dependent. The root causes of
block failure suggest that some forms of block failure do indeed
exhibit spatial locality [34]. For example, a scratched surface can
render a number of contiguous blocks inaccessible. However, all
failures do not exhibit locality; for example, a corruption due to a
misdirected write may impact only a single block.
2.3.3 Frequency of Failures
Block failures and corruptions do occur – as one commercial
storage system developer succinctly stated, “Disks break a lot – all
guarantees are fiction” [29]. However, one must also consider how
frequently such errors occur, particularly when modeling overall re-
liability and deciding which failures are most important to handle.
Unfortunately, as Talagala and Patterson point out [67], disk drive
manufacturers are loath to provide information on disk failures;
indeed, people within the industry refer to an implicit industry-wide
agreement to not publicize such details [4]. Not surprisingly, the
actual frequency of drive errors, especially errors that do not cause
the whole disk to fail, is not well-known in the literature. Previous
work on latent sector errors indicates that such errors occur more
commonly than absolute disk failure [34], and more recent research
estimates that such errors may occur five times more often than ab-
solute disk failures [57].
In terms of relative frequency, block failures are more likely to
occur on reads than writes, due to internal error handling common
in most disk drives. For example, failed writes to a given sector
are often remapped to another (distant) sector, allowing the drive
to transparently handle such problems [31]. However, remapping
does not imply that writes cannot fail. A failure in a component
above the media (e.g., a stuttering transport), can lead to an unsuc-
cessful write attempt; the move to network-attached storage [24]
serves to increase the frequency of this class of failures. Also, for
remapping to succeed, free blocks must be available; a large scratch
could render many blocks unwritable and quickly use up reserved
space. Reads are more problematic: if the media is unreadable, the
drive has no choice but to return an error.
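The remapping behavior described above, including the limit imposed by a finite spare pool, can be sketched as follows (an illustrative model, not the paper's implementation; all names are hypothetical):

```python
class Drive:
    """Toy model of in-drive write remapping: a write to a damaged
    sector is transparently redirected to a reserved spare sector;
    once the spare pool is exhausted, further failed writes become
    visible to the host."""

    def __init__(self, bad_sectors, spares):
        self.bad = set(bad_sectors)   # sectors with damaged media
        self.spares = list(spares)    # reserved replacement sectors
        self.remap = {}               # logical sector -> spare sector
        self.data = {}

    def write(self, sector, block):
        if sector in self.bad and sector not in self.remap:
            if not self.spares:
                return False          # spare pool exhausted: write fails
            self.remap[sector] = self.spares.pop()
        self.data[self.remap.get(sector, sector)] = block
        return True

# One spare, two bad sectors: the first failed write is hidden by
# remapping, the second surfaces as an error.
d = Drive(bad_sectors={7, 8}, spares=[100])
assert d.write(7, b"a")       # remapped transparently
assert not d.write(8, b"b")   # no spares left: failure is visible
```

The sketch also shows why a large scratch is dangerous: each newly damaged sector consumes a spare, so a run of contiguous failures can exhaust the pool and turn previously invisible write failures into visible ones.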
2.3.4 Trends
In many other areas (e.g., processor performance), technology
and market trends combine to improve different aspects of com-
puter systems. In contrast, we believe that technology trends and
market forces may combine to make storage system failures occur
more frequently over time, for the following three reasons.
First, reliability is a greater challenge when drives are made in-
creasingly more dense; as more bits are packed into smaller spaces,
drive logic (and hence complexity) increases [5].
Second, at the low-end of the drive market, cost-per-byte domi-
nates, and hence many corners are cut to save pennies in IDE/ATA
drives [5]. Low-cost “PC class” drives tend to be tested less and
have less internal machinery to prevent failures from occurring [31].
The result, in the field, is that ATA drives are observably less reli-
able [67]; however, cost pressures serve to increase their usage,
even in server environments [23].
Finally, the amount of software is increasing in storage systems
and, as others have noted, software is often the root cause of er-
rors [25]. In the storage system, hundreds of thousands of lines of
software are present in the lower-level drivers and firmware. This
low-level code is generally the type of code that is difficult to write
and debug [22, 66] – hence a likely source of increased errors in
the storage stack.