USENIX Association 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI ’13) 387
memcache get requests. For example, loading one of our
popular pages results in an average of 521 distinct items
fetched from memcache.¹
We provision hundreds of memcached servers in a
cluster to reduce load on databases and other services.
Items are distributed across the memcached servers
through consistent hashing [22]. Thus web servers have
to routinely communicate with many memcached servers
to satisfy a user request. As a result, all web servers
communicate with every memcached server in a short
period of time. This all-to-all communication pattern
can cause incast congestion [30] or allow a single server
to become the bottleneck for many web servers. Data
replication often alleviates the single-server bottleneck
but leads to significant memory inefficiencies in the
common case.
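The consistent-hashing scheme described above can be sketched as a hash ring: each server is hashed to several points on the ring, and a key is owned by the first server point clockwise from the key's hash. This is an illustrative sketch of the general technique cited in [22], not Facebook's implementation; server names and the replica count are made up.

```python
import bisect
import hashlib

class ConsistentHashRing:
    """Minimal consistent-hash ring (illustrative sketch only).
    Each server is hashed to `replicas` points on the ring; a key
    maps to the first server point clockwise from the key's hash,
    so adding or removing a server only remaps nearby keys."""

    def __init__(self, servers, replicas=100):
        self.replicas = replicas
        self.ring = []  # sorted list of (hash, server) points
        for server in servers:
            self.add(server)

    def _hash(self, value):
        return int(hashlib.md5(value.encode()).hexdigest(), 16)

    def add(self, server):
        for i in range(self.replicas):
            point = self._hash(f"{server}#{i}")
            bisect.insort(self.ring, (point, server))

    def lookup(self, key):
        h = self._hash(key)
        i = bisect.bisect(self.ring, (h, ""))
        if i == len(self.ring):
            i = 0  # wrap around the ring
        return self.ring[i][1]

# Hypothetical cluster of three memcached servers.
ring = ConsistentHashRing(["mc1", "mc2", "mc3"])
server = ring.lookup("user:42:profile")
```

Because each web server computes the same mapping locally, no coordination is needed to decide which memcached server holds a given item.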
We reduce latency mainly by focusing on the
memcache client, which runs on each web server. This
client serves a range of functions, including serializa-
tion, compression, request routing, error handling, and
request batching. Clients maintain a map of all available
servers, which is updated through an auxiliary configu-
ration system.
Parallel requests and batching: We structure our web-
application code to minimize the number of network
round trips necessary to respond to page requests. We
construct a directed acyclic graph (DAG) representing
the dependencies between data. A web server uses this
DAG to maximize the number of items that can be
fetched concurrently. On average these batches consist
of 24 keys per request.²
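The DAG-driven batching above amounts to a topological scheduling problem: keys whose dependencies are all satisfied can be fetched together in one round trip. A minimal sketch of that idea (not the production scheduler; the dependency map is a made-up example) is:

```python
from collections import defaultdict, deque

def batch_schedule(deps):
    """Group keys into batches that can be fetched concurrently.
    `deps` maps each key to the set of keys that must be fetched
    first (a DAG). All keys with no unmet dependencies form one
    batch, i.e. one round of parallel memcache gets."""
    indegree = {k: len(parents) for k, parents in deps.items()}
    dependents = defaultdict(list)
    for k, parents in deps.items():
        for p in parents:
            dependents[p].append(k)

    ready = deque(k for k, d in indegree.items() if d == 0)
    batches = []
    while ready:
        batch = list(ready)
        ready.clear()
        batches.append(batch)
        for k in batch:  # fetching k may unblock its dependents
            for child in dependents[k]:
                indegree[child] -= 1
                if indegree[child] == 0:
                    ready.append(child)
    return batches

# Hypothetical page: profile and friend list depend on the user
# record; the feed depends on the friend list.
deps = {"user": set(), "profile": {"user"},
        "friends": {"user"}, "feed": {"friends"}}
# batch_schedule(deps) -> [["user"], ["profile", "friends"], ["feed"]]
```

Three batches instead of four serial fetches: the wider the DAG's levels, the fewer round trips a page load needs.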
Client-server communication: Memcached servers do
not communicate with each other. When appropriate,
we embed the complexity of the system into a stateless
client rather than in the memcached servers. This greatly
simplifies memcached and allows us to focus on making
it highly performant for a more limited use case. Keep-
ing the clients stateless enables rapid iteration in the
software and simplifies our deployment process. Client
logic is provided as two components: a library that can
be embedded into applications, or a standalone proxy
named mcrouter. This proxy presents a memcached
server interface and routes the requests/replies to/from
other servers.
Clients use UDP and TCP to communicate with
memcached servers. We rely on UDP for get requests to
reduce latency and overhead. Since UDP is connectionless,
each thread in the web server can communicate with
memcached servers directly, bypassing mcrouter,
without establishing and maintaining a
¹ The 95th percentile of fetches for that page is 1,740 items.
² The 95th percentile is 95 keys per request.
[Figure 3: Get latency for UDP direct vs. TCP via mcrouter, in microseconds; bars show the average of medians and the average of 95th percentiles.]
connection, thereby reducing overhead. The UDP
implementation detects packets that are dropped or re-
ceived out of order (using sequence numbers) and treats
them as errors on the client side. It does not provide
any mechanism to try to recover from them. In our in-
frastructure, we find this decision to be practical. Un-
der peak load, memcache clients observe that 0.25% of
get requests are discarded. About 80% of these drops
are due to late or dropped packets, while the remainder
are due to out of order delivery. Clients treat get er-
rors as cache misses, but web servers will skip insert-
ing entries into memcached after querying for data to
avoid putting additional load on a possibly overloaded
network or server.
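The client-side error handling described above can be sketched as follows. The wire format here (a bare 4-byte sequence number prefix) is invented for illustration and is not memcached's actual UDP frame; what the sketch shows is the policy: any missing, late, or reordered packet fails the whole get, and the failure is surfaced as a miss with no retry.

```python
import struct

SEQ_HDR = struct.Struct("!I")  # hypothetical 4-byte sequence header

def decode_udp_replies(datagrams, expected_seq):
    """Reassemble a multi-packet UDP get reply. A gap or
    out-of-order sequence number marks the whole get as an
    error; there is no recovery mechanism."""
    chunks = []
    for data in datagrams:
        if len(data) < SEQ_HDR.size:
            return None                 # malformed -> error
        (seq,) = SEQ_HDR.unpack_from(data)
        if seq != expected_seq:
            return None                 # dropped or reordered -> error
        chunks.append(data[SEQ_HDR.size:])
        expected_seq += 1
    return b"".join(chunks)

def get(key, datagrams):
    payload = decode_udp_replies(datagrams, expected_seq=0)
    if payload is None:
        # Treat the error as a cache miss, but do NOT insert the
        # value into memcached afterwards, to avoid adding load to
        # a possibly overloaded network or server.
        return None
    return payload
```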
For reliability, clients perform set and delete opera-
tions over TCP through an instance of mcrouter run-
ning on the same machine as the web server. For opera-
tions where we need to confirm a state change (updates
and deletes) TCP alleviates the need to add a retry mech-
anism to our UDP implementation.
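The transport split above can be sketched with plain sockets: gets ride a connectionless UDP socket where any error is treated as a miss, while state-changing operations go over TCP to a local mcrouter so the client receives a confirmed reply. The addresses, port, and commands below are hypothetical placeholders, not the production protocol.

```python
import socket

MCROUTER_ADDR = ("127.0.0.1", 11211)  # assumed local mcrouter port

def send_get_udp(server_addr, key, timeout=0.05):
    """Fire a get over UDP directly to a memcached server; any
    socket error or timeout is surfaced as None, i.e. a miss."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    sock.settimeout(timeout)
    try:
        sock.sendto(b"get " + key + b"\r\n", server_addr)
        data, _ = sock.recvfrom(65536)
        return data
    except OSError:
        return None  # treat as a cache miss; no retry
    finally:
        sock.close()

def send_set_tcp(key, value):
    """State-changing operations use TCP through the local mcrouter
    so the client gets a confirmed reply without needing a UDP
    retry mechanism."""
    with socket.create_connection(MCROUTER_ADDR) as sock:
        cmd = b"set %s 0 0 %d\r\n%s\r\n" % (key, len(value), value)
        sock.sendall(cmd)
        return sock.recv(1024)  # confirmation line from the server
```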
Web servers rely on a high degree of parallelism and
over-subscription to achieve high throughput. The high
memory demands of open TCP connections make it
prohibitively expensive to have an open connection be-
tween every web thread and memcached server without
some form of connection coalescing via mcrouter. Co-
alescing these connections improves the efficiency of
the server by reducing the network, CPU and memory
resources needed by high throughput TCP connections.
Figure 3 shows the average, median, and 95th percentile
latencies of web servers in production getting keys over
UDP and through mcrouter via TCP. In all cases, the
standard deviation from these averages was less than
1%. As the data show, relying on UDP can lead to a
20% reduction in latency to serve requests.
Incast congestion: Memcache clients implement flow-
control mechanisms to limit incast congestion. When a