点云Mamba：状态空间模型驱动的高效学习

需积分: 5 117 浏览量更新于2024-08-03 2 收藏 2.88MB PDF 举报

"Point Could Mamba 是一种利用状态空间模型进行点云学习的新方法，旨在超越基于点的方法，如 PointNet、PointNet++ 和 PointNeXt。该方法结合了局部和全局建模，具有线性的计算复杂性，使得处理三维点云数据更加高效。通过提出一致遍历序列化技术，点云被转换为一维点序列表，保持空间相邻性，同时通过六种坐标置换变体增强观察角度。此外，引入点提示以适应不同阶次的点序列，位置编码方法则优化了位置信息的注入。Point Cloud Mamba 在 ScanObjectNN、ModelNet40 和 ShapeNetPart 数据集上实现了最先进的性能。" 点云学习是一种处理三维数据的关键技术，尤其在计算机视觉、机器人和自动驾驶等领域。传统的点云处理方法，如基于点的方法，通常面临着全局理解的挑战和高计算复杂度。PointCouldMamba 引入了基于曼巴（Mamba）的状态空间模型，这是一种创新的处理方式，它克服了这些局限。曼巴方法的核心在于其强大的全局建模能力，这得益于其线性计算复杂性。在图1中，(a)部分展示了基于点的方法，如PointNet，它们主要依赖于局部感知；(b)部分是基于变换器的方法，如PointTransformer，它们在全局感知方面有所提升，但计算复杂度较高；而(c)部分的Mamba-based方法则兼顾了全局感知和较低的计算复杂性。为了解决点云数据的处理问题，论文中提出了一致遍历序列化，它将点云数据转化为一维序列，确保相邻点在空间上的连续性。这个过程通过改变x、y和z坐标的顺序创建了六个不同的序列变体，这些变体的组合使用使得Mamba可以从多个角度全面理解点云数据。此外，为了处理不同阶次的点序列，作者引入了“点提示”机制。点提示可以告知网络序列的排列规则，从而帮助Mamba更灵活地适应不同的点云结构。最后，位置编码方法的提出，旨在通过映射空间坐标，更精确地将点的位置信息整合到序列中，进一步增强了模型对点云细节的理解和表示能力。通过这些改进，Point Cloud Mamba 构建了一个综合了局部和全局建模的网络架构。实验证明，它在一系列基准数据集上，包括ScanObjectNN、ModelNet40和ShapeNetPart，表现出了超越现有最新技术（SOTA）的性能。这表明，Point Cloud Mamba 提供了一种有效且高效的点云学习新途径，对于未来点云处理的研究和应用具有重要意义。

𝑁

Stage 1

Geometry Affine

Copy

⊖

Mamba Block

Serialization

Forward

Conv1d

Backward

Conv1d

Forward

SSM

Backward

SSM

𝑁

Stage 2

𝑁

Stage 3

𝑁

Stage 4

Figure 2: The architecture of our proposed Point Cloud Mamba. PCM consists of four stages,

each comprising a geometric afﬁne module and several mamba layers. Point downsampling is

performed between stages.

that Mamba architecture can achieve comparable or even better results than transformer-based models

in 3D point clouds.

3D Visual Transformers. With the rise of the transformer in 2D version [

], several works [

] also explore transformer architectures in the point cloud. Earlier works [

]

have focused on the point cloud process. PCT [

] performs global attention directly to each

point, following the ViT [

]. However, it has memory consumption and computational complexity

issues. Point Transformer [

] solves this issue by introducing local attention. Then, the updated

versions [

] explore the different architectures to improve performance and efﬁciency. Inspired

by these studies, our works combine local point processing and a new traverse serialization strategy,

which leads to better results than direct SSM traverse.

State Space Models. Inspired by continuous state space models in control systems, recently, state

space models [

] have been proven to model long-range dependency. In particular, S4 [

]

proposes to normalize the parameter into the diagonal structure, which results in less computation

cost and memory usage. After that, Mamba [

] presents a selection mechanism that leads to better

results than transformers. Recently, several works have explored such architecture in different tasks,

including image classiﬁcation [

], graph modeling [

], medical segmentation [

and low-level version tasks [

]. As a concurrent work, we further prove the potential of SSMs in the

3D point clouds, where we can achieve even better results than previous architectures.

3 Method

The SSM-based architecture, Mamba [

], is attractive for point cloud representation learning due to

its global modeling capability and linear computational complexity. However, Mamba is designed

for the causal modeling of 1-D sequences, making it difﬁcult to directly apply it to the modeling of

non-causal 3-D point cloud data, posing many challenges to be addressed.

This section explores how to effectively integrate Mamba into architectures based on local modeling to

capture global features. We ﬁrst review PointMLP [

], a straightforward local modeling architecture,

in Sec. 3.1, and then review Mamba, a global modeling architecture with linear complexity, in Sec. 3.2.

Next, in Sec. 3.3, we introduce how to combine Mamba with PointMLP to obtain a point cloud

network based on Mamba architecture. Finally, we introduce improving this naive Mamba-based

network to Point Cloud Mamba in Sec. 3.4. We propose consistent traverse serialization, order

prompt, new positional embedding, and more reasonable architectures to assist Mamba in modelling

point cloud data better.

剩余12页未读，继续阅读

人工智能_SYBH

粉丝: 4w+
资源: 222

点云Mamba：状态空间模型驱动的高效学习

Mamba: Linear-Time Modeling With Selective State Space.pdf

LLM+Mamba具有选择性状态空间的线性时间序列建模

mamba：:snake:Mamba编程语言，因为我们关心安全性

mamba:曼巴模糊器重构

SCALE-MAMBA:SCALE-MAMBA MPC系统的存储库

BlackMamba：C2开发后框架

setup-mamba:用于设置Mamba软件包管理器的GitHub操作

mamba:快速跨平台软件包管理器

black-mamba:“足够好”的蛇玩家

mamba:Python的权威测试工具。 生于行为驱动开发（BDD）的旗帜下

最新资源

mamba:Python的权威测试工具。生于行为驱动开发（BDD）的旗帜下