Apache Kafka生产系统运维指南

需积分: 6 183 浏览量更新于2024-09-08 收藏 232KB PDF 举报

"Apache Kafka 的运维指南" Apache Kafka 是一个分布式流处理平台，常用于构建实时数据管道和流应用。这份运维指南提供了在生产环境中运行 Kafka 的关键信息，特别是基于 LinkedIn 的使用经验和最佳实践。 6.1 数据中心在多数据中心部署时，Kafka 的策略是每个数据中心内部署一个本地 Kafka 集群。每个位置的机器只与其本地集群交互，以减少跨数据中心的网络延迟和依赖。这样可以确保高可用性和低延迟的性能。对于需要全局数据视图的应用，Kafka 提供了镜像制作工具（MirrorMaker），它可以将所有数据中心的数据聚合在一个或多个镜像集群中。这些聚合的镜像集群被需要全局数据视图的应用用来读取数据。同时，为了支持将数据加载到位于不同设施的 Hadoop，Kafka 会提供本地只读集群，这些集群镜像生产数据中心的数据。这种方式使得每个设施都能独立运作，即使跨数据中心的链接不可用，也能保证数据的正常处理。当链接恢复后，镜像过程会迅速追赶，确保数据的完整性。此外，文档还可能涵盖其他运维主题，如： 1. **配置管理**：包括如何设置 Kafka 集群的配置参数，以优化性能、容错性和可扩展性。 2. **监控与日志**：如何实施有效的监控系统，收集和分析 Kafka 雖群的日志，以便及时发现和解决问题。 3. **故障恢复**：详细说明如何设计容错机制，例如副本策略，以及在节点故障时如何恢复服务。 4. **扩展性**：如何添加或删除服务器来适应数据量的变化，以及水平扩展 Kafka 集群的方法。 5. **安全性**：讨论如何使用 SSL/TLS 加密通信，设置权限控制和认证机制，确保数据的安全传输。 6. **备份与恢复**：如何定期备份数据，并在必要时进行数据恢复。 7. **性能调优**：提供关于如何提高 Kafka 生产者和消费者性能的建议，如批处理大小、压缩策略等。 8. **客户端管理**：如何管理和配置与 Kafka 集群交互的生产者和消费者应用。这些内容将帮助管理员更好地理解和操作 Kafka 集群，确保在生产环境中的稳定和高效运行。

4/16/2014 https://kafka.apache.org/08/ops.html

https://kafka.apache.org/08/ops.html 1/8

Here is some information on actually running Kafka as a production system based on usage and experience at

LinkedIn. Please send us any additional tips you know of.

6.1 Datacenters

Some deployments will need to manage a data pipeline that spans multiple datacenters. Our approach to this is to

deploy a local Kafka cluster in each datacenter and machines in each location interact only with their local cluster.

For applications that need a global view of all data we use the mirror maker tool to provide clusters which have

aggregate data mirrored from all datacenters. These aggregator clusters are used for reads by applications that

require this.

Likewise in order to support data load into Hadoop which resides in separate facilities we provide local read-only

clusters that mirror the production data centers in the facilities where this data load occurs.

This allows each facility to stand alone and operate even if the inter-datacenter links are unavailable: when this

occurs the mirroring falls behind until the link is restored at which time it catches up.

This deployment pattern allows datacenters to act as independent entities and allows us to manage and tune inter-

datacenter replication centrally.

This is not the only possible deployment pattern. It is possible to read from or write to a remote Kafka cluster over

the WAN though TCP tuning will be necessary for high-latency links.

It is generally not advisable to run a single Kafka cluster that spans multiple datacenters as this will incur very high

replication latency both for Kafka writes and Zookeeper writes and neither Kafka nor Zookeeper will remain

available if the network partitions.

6.2 Kafka Configuration

Kafka 0.8 is the version we currently run. We are currently running with replication but with producers acks = 1.

Important Server Configurations

The most important server configurations for performance are those that control the disk flush rate. The more often

data is flushed to disk, the more "seek-bound" Kafka will be and the lower the throughput. However very low

application flush rates can lead to high latency when the flush finally does occur (because of the volume of data that

must be flushed). See the section below on application versus OS flush.

Important Client Configurations

The most important producer configurations control

compression

sync vs async production>

batch size (for async producers)

The most important consumer configuration is the fetch size.

All configurations are documented in the configuration section.

A Production Server Config

Here is our server production server configuration:

# Replication configurations

下载后可阅读完整内容，剩余6页未读，立即下载

曹宇

粉丝: 14
资源: 7

Apache Kafka生产系统运维指南

Python Kafka库：mypy_boto3_kafka-*.**.**.*下载指南

Python库aws_custom_ews_kafka_resources-0.0.4.tar.gz深度解析

Python官方库mypy_boto3_kafka 1.19.12版本使用指南

GetOffsetShell_kafka_SSL:具有SSL的kafka.tools.GetOffsetShell

org.apache.flink.kafka.shaded.org.apache.kafka.common.KafkaException: Failed to construct kafka consumer

PyPI 官网下载 | mypy_boto3_kafka-1.17.88.post1-py3-none-any.whl

Python库 | aws_custom_ews_kafka_resources-0.0.4.tar.gz

debj_SpringBoot_ELK+Kafka_resources.zip

Python库 | mypy_boto3_kafka-1.19.12-py3-none-any.whl

Python库 | mypy_boto3_kafka-1.17.97-py3-none-any.whl

最新资源

Python Kafka库：mypy_boto3_kafka-...下载指南