netmap: a novel framework for fast packet I/O
netmap is a novel framework proposed by Luigi Rizzo at the Università di Pisa, designed for applications that must move large volumes of traffic over high-speed (1 Gbps to 10 Gbps) links, such as routers, traffic monitors, and firewalls. These applications need to process millions of packets per second without sacrificing performance, which conventional systems struggle to deliver because of bottlenecks such as frequent dynamic memory allocation, per-packet system overhead, and memory copies. netmap's core contribution is to reduce packet-processing cost in three ways:

1. Optimized memory management: resources are preallocated, so no dynamic allocation takes place when packets are received or sent, reducing both the cost and the complexity of memory management.
2. Minimized system overhead: operations are batched, so system-level costs are amortized over large numbers of packets instead of being paid on each one.
3. Shared memory: buffers and metadata are shared between the kernel and user space, eliminating unnecessary copies while keeping device registers and other critical kernel memory protected.

Compared with previous work, netmap not only outperforms most comparable solutions but also integrates tightly with existing architectures, so developers can exploit this fast packet handling without major changes to the underlying hardware or to their applications. This flexibility makes netmap a practical platform for building high-performance applications suited to modern networks, and its techniques may also inform improvements in network device drivers and operating systems.
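The amortization idea in point 2 can be illustrated with a toy cost model. The numbers below are purely illustrative (they are not measurements from the paper): the point is only that a fixed per-call cost divided over a batch shrinks the effective per-packet cost.

```python
def per_packet_cost(syscall_ns, work_ns, batch):
    """Effective cost per packet when one system call covers `batch` packets."""
    return work_ns + syscall_ns / batch

# Illustrative figures: 1000 ns of fixed syscall overhead, 100 ns of real
# per-packet work. With no batching the syscall dominates; with a batch of
# 256 it almost disappears.
single = per_packet_cost(1000, 100, 1)      # 1100 ns per packet
batched = per_packet_cost(1000, 100, 256)   # ~104 ns per packet
print(f"batch=1: {single:.0f} ns/pkt, batch=256: {batched:.0f} ns/pkt")
```

At 104 ns per packet a single core could in principle sustain roughly 10 Mpps, versus under 1 Mpps at 1100 ns, which is why batching is central to every fast packet I/O design discussed below.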
[…]ware architectures is that most systems barely reach 0.5–1 Mpps per core from userspace, and even remaining in the kernel yields only modest speed improvements, usually within a factor of 2.
3 Related (and unrelated) work
It is useful at this point to present some techniques proposed in the literature, or used in commercial systems, to improve packet processing speeds. This will be instrumental in understanding their advantages and limitations, and how our netmap framework can make use of them.
Socket APIs: The Berkeley Packet Filter, or BPF [12], is one of the most popular systems for direct access to raw packet data. BPF taps into the data path of a network device driver, and dispatches a copy of each sent or received packet to a file descriptor, from which userspace processes can read or write. Linux has a similar mechanism through the AF_PACKET socket family. BPF can coexist with regular traffic from/to the system, but usually BPF clients need to put the card in promiscuous mode, causing large amounts of traffic to be delivered to the host stack (and immediately dropped).
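A BPF filter is a small program run by an in-kernel virtual machine against each packet; it returns the number of bytes to accept (0 to drop). The sketch below is a deliberately simplified userspace interpreter for a BPF-style instruction set, assumed for illustration only (it does not use the real BPF opcode encoding); the example program accepts only frames whose EtherType is 0x0800 (IPv4).

```python
# Toy BPF-style instruction set (illustrative, not the real encoding).
# Each instruction is (opcode, k, jt, jf).
LD_ABS_H, JEQ_K, RET_K = range(3)

def run_filter(prog, pkt):
    """Interpret a filter program over a raw frame; return accept length."""
    acc, pc = 0, 0
    while pc < len(prog):
        op, k, jt, jf = prog[pc]
        if op == LD_ABS_H:                       # load big-endian halfword at offset k
            acc = (pkt[k] << 8) | pkt[k + 1]
            pc += 1
        elif op == JEQ_K:                        # conditional relative jump
            pc += 1 + (jt if acc == k else jf)
        elif op == RET_K:                        # return: bytes to accept, 0 = drop
            return k
    return 0

# "Accept IPv4 only": load the EtherType (offset 12 in an Ethernet header),
# compare against 0x0800, accept or drop.
ipv4_only = [
    (LD_ABS_H, 12, 0, 0),
    (JEQ_K, 0x0800, 0, 1),
    (RET_K, 0xFFFF, 0, 0),   # accept up to 0xFFFF bytes
    (RET_K, 0, 0, 0),        # drop
]
```

A real BPF program has the same shape (loads, conditional jumps, returns), which is what makes it safe to run inside the kernel: no loops, bounded execution.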
Packet filter hooks: Netgraph (FreeBSD), Netfilter (Linux), and NDIS Miniport drivers (Windows) are in-kernel mechanisms used when packet duplication (as in BPF) is not necessary, or when the application (e.g. a firewall, or an IDS) manipulates traffic as part of a packet processing chain. These hooks make it possible to intercept traffic from/to the driver and pass it to processing modules without additional data copies. Note however that even the packet filter hooks rely on the standard mbuf/skbuf based packet representation, so the cost of metadata management (Section 2.2) still remains. Netslice [11] is an example of a system that uses the netfilter hooks to export traffic to userspace processes through a suitable kernel module.
Direct buffer access: One easy way to remove the data copies involved in the kernel-userland transition is to run the application code directly within the kernel. Systems based on kernel-mode Click [8, 4] follow this approach. Click permits easy construction of packet processing chains through the composition of modules, some of which support fast access to the NIC (even though they retain an skbuf-based packet representation).
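Click's central idea, composing a processing pipeline out of small reusable elements, can be sketched in a few lines. This is a hypothetical toy model, not Click's actual API: each element is a function that returns the (possibly modified) packet, or None to drop it.

```python
def run_chain(elements, pkt):
    """Push one packet through a chain of elements; any element may drop it."""
    for elem in elements:
        pkt = elem(pkt)
        if pkt is None:          # element dropped the packet
            return None
    return pkt

def count(stats):
    """Element factory: count every packet that reaches this element."""
    def elem(pkt):
        stats["seen"] += 1
        return pkt
    return elem

def drop_short(min_len):
    """Element factory: drop frames shorter than min_len bytes."""
    def elem(pkt):
        return pkt if len(pkt) >= min_len else None
    return elem

stats = {"seen": 0}
chain = [count(stats), drop_short(60)]
passed = [p for p in (b"x" * 64, b"y" * 10) if run_chain(chain, p) is not None]
```

The appeal of this style is that elements are independently testable and reorderable; the cost Click pays in the kernel is that each element still handles full skbuf metadata per packet.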
The kernel environment is much more constrained than the one available in user space, so a number of recent proposals try a different approach: instead of running the application in the kernel, they remove the system call overhead by exposing NIC registers and data structures to user space. This approach generally requires modified device drivers, and poses some risks at runtime, because the NIC's DMA engine can write to arbitrary memory addresses (unless limited by hardware mechanisms such as IOMMUs), and a misbehaving client can potentially trash data anywhere in the system.
UIO-IXGBE [9] implements exactly what we have described above: buffers, hardware rings and NIC registers (see Figure 1) are directly accessible to user programs, with obvious risks for the stability of the system.
PF_RING [2] exports to userspace clients a shared memory region containing a ring of pre-allocated packet buffers. The kernel is in charge of copying data between skbufs and the shared buffers, so the system is safe and no driver modifications are needed. This approach amortizes the system call costs over batches of packets, but retains the data copy and skbuf management overhead. An evolution of PF_RING called DNA [3] avoids the copy because the memory-mapped ring buffers are directly accessed by the NIC. As with UIO-IXGBE, DNA clients have direct access to the NIC's registers and rings.
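The shared structure at the heart of this family of designs is a ring of fixed-size, pre-allocated buffers with producer and consumer indices. Below is a minimal single-producer/single-consumer sketch (a toy model of the concept, not PF_RING's or netmap's actual layout): the "kernel" side copies packet data into a pre-allocated slot, the "user" side reads it out, and no per-packet allocation ever happens.

```python
class PacketRing:
    """Toy ring of pre-allocated packet buffers (one slot always left empty
    to distinguish a full ring from an empty one)."""

    def __init__(self, slots=8, slot_size=2048):
        self.bufs = [bytearray(slot_size) for _ in range(slots)]  # preallocated
        self.lens = [0] * slots
        self.slots = slots
        self.head = 0    # next slot the producer writes
        self.tail = 0    # next slot the consumer reads

    def produce(self, data):
        """'Kernel' side: copy a packet into the next free slot, or drop."""
        nxt = (self.head + 1) % self.slots
        if nxt == self.tail:          # ring full: drop the packet
            return False
        buf = self.bufs[self.head]
        n = min(len(data), len(buf))  # truncate to the slot size
        buf[:n] = data[:n]
        self.lens[self.head] = n
        self.head = nxt
        return True

    def consume(self):
        """'User' side: read the oldest packet, or None if the ring is empty."""
        if self.tail == self.head:
            return None
        data = bytes(self.bufs[self.tail][:self.lens[self.tail]])
        self.tail = (self.tail + 1) % self.slots
        return data
```

PF_RING keeps the copy into the slot (safe, no driver changes); DNA and netmap instead let the NIC DMA directly into the slots, which removes the copy and leaves only the index updates on the fast path.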
The PacketShader [5] I/O engine (PSIOE) is one of the closest relatives to our proposals. PSIOE uses a custom device driver that replaces the skbuf-based API with a simpler one, using preallocated buffers. Custom ioctl()s are used to synchronize the kernel with userspace applications, and multiple packets are passed up and down through a memory area shared between the kernel and the application. The kernel is in charge of copying packet data between the shared memory and packet buffers. Unlike netmap, PSIOE only supports one specific network card, and does not support select()/poll(), requiring modifications to applications in order to let them use the new API.
Hardware solutions: Some hardware has been designed specifically to support high speed packet capture, or possibly generation, together with special features such as timestamping, filtering, and forwarding. Usually these cards come with custom device drivers and user libraries to access the hardware. As an example, DAG [1, 7] cards are FPGA-based devices for wire-rate packet capture and precise timestamping, using fast on-board memory for the capture buffers (at the time, the I/O bus was unable to sustain line rate). NetFPGA [10] is another example of an FPGA-based card whose firmware can be programmed to implement specific functions directly in the NIC, offloading some work from the main CPU.
3.1 Unrelated work
Much of the commercial interest in high-speed networking goes to TCP acceleration and hardware virtualization, so it is important to clarify where netmap stands in this respect. netmap is a framework to reduce the cost of moving traffic between the hardware and the host stack. Popular hardware features related to TCP acceleration, such as hardware checksumming or even […]