ICLR 2018 Best Paper: Spherical CNNs: A Revolutionary Approach to Spherical Image Analysis


Published as a conference paper at ICLR 2018

As mentioned before, the output of the spherical correlation is a function on SO(3). This is perhaps somewhat counterintuitive, and indeed the conventional definition of spherical convolution gives as output a function on the sphere. However, as shown in Appendix B, the conventional definition effectively restricts the filter to be circularly symmetric about the Z axis, which would greatly limit the expressive capacity of the network.

Rotation of SO(3) Signals

We defined the rotation operator $L_R$ for spherical signals (eq. 1), and used it to define spherical cross-correlation (eq. 4). To define the SO(3) correlation, we need to generalize the rotation operator so that it can act on signals defined on SO(3). As we will show, naively reusing eq. 1 is the way to go. That is, for $f : \mathrm{SO}(3) \to \mathbb{R}^K$, and $R, Q \in \mathrm{SO}(3)$:

$$[L_R f](Q) = f(R^{-1} Q). \tag{5}$$

Note that while the argument $R^{-1}x$ in Eq. 1 denotes the rotation of $x \in S^2$ by $R^{-1} \in \mathrm{SO}(3)$, the analogous term $R^{-1}Q$ in Eq. 5 denotes the composition of rotations (i.e. matrix multiplication).
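The action in Eq. 5 can be illustrated with a minimal NumPy sketch, assuming rotations are stored as 3x3 matrices and a signal on SO(3) is an ordinary Python function of such a matrix. The helper names (`rot_z`, `rot_y`, `rotate_signal`) and the test signal `f` are hypothetical and chosen only for illustration.

```python
import numpy as np

def rot_z(angle):
    """Rotation matrix about the Z axis."""
    c, s = np.cos(angle), np.sin(angle)
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])

def rot_y(angle):
    """Rotation matrix about the Y axis."""
    c, s = np.cos(angle), np.sin(angle)
    return np.array([[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]])

def rotate_signal(R, f):
    """Return L_R f, where [L_R f](Q) = f(R^{-1} Q) as in Eq. 5."""
    R_inv = R.T  # the inverse of a rotation matrix is its transpose
    return lambda Q: f(R_inv @ Q)

def f(Q):
    """Hypothetical scalar test signal on SO(3) (K = 1)."""
    return Q[0, 0] + 2.0 * Q[1, 2] - 0.5 * Q[2, 1] ** 2

R = rot_z(0.3) @ rot_y(1.1)
S = rot_y(-0.7) @ rot_z(2.0)
Q = rot_z(0.9) @ rot_y(0.4) @ rot_z(-1.3)

# Applying L_S and then L_R should agree with applying L_{RS} once.
lhs = rotate_signal(R, rotate_signal(S, f))(Q)
rhs = rotate_signal(R @ S, f)(Q)
print(np.isclose(lhs, rhs))
```

The check relies only on $f(S^{-1} R^{-1} Q) = f((RS)^{-1} Q)$, which is why the argument of $f$ is multiplied by the inverse rotation rather than by $R$ itself.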

Rotation Group Correlation

Using the same analogy as before, we can define the correlation of two signals on the rotation group, $f, \psi : \mathrm{SO}(3) \to \mathbb{R}^K$, as follows:

$$[\psi \star f](R) = \langle L_R \psi, f \rangle = \int_{\mathrm{SO}(3)} \sum_{k=1}^{K} \psi_k(R^{-1} Q) \, f_k(Q) \, dQ. \tag{6}$$

The integration measure $dQ$ is the invariant measure on SO(3), which may be expressed in ZYZ-Euler angles as $d\alpha \sin(\beta) \, d\beta \, d\gamma / (8\pi^2)$ (see Appendix A).
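As a rough numerical illustration of Eq. 6, the sketch below approximates the integral with a simple quadrature in ZYZ Euler angles. The choice of quadrature (uniform grids in $\alpha$ and $\gamma$, Gauss-Legendre nodes in $\cos\beta$) is an assumption made here for illustration and is not the scheme used in the paper; the function names and test signals are likewise hypothetical.

```python
import numpy as np

def zyz_matrix(alpha, beta, gamma):
    """Rotation matrix Rz(alpha) @ Ry(beta) @ Rz(gamma)."""
    def rz(t):
        c, s = np.cos(t), np.sin(t)
        return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])
    def ry(t):
        c, s = np.cos(t), np.sin(t)
        return np.array([[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]])
    return rz(alpha) @ ry(beta) @ rz(gamma)

def haar_quadrature(n_alpha=8, n_beta=8, n_gamma=8):
    """Nodes and weights for dQ = d(alpha) sin(beta) d(beta) d(gamma) / (8 pi^2).

    Returns flat arrays (alpha, beta, gamma, weights); the weights sum to 1.
    """
    alpha = 2 * np.pi * np.arange(n_alpha) / n_alpha
    gamma = 2 * np.pi * np.arange(n_gamma) / n_gamma
    # Substituting x = cos(beta) turns sin(beta) d(beta) into dx on [-1, 1].
    x, wx = np.polynomial.legendre.leggauss(n_beta)
    beta = np.arccos(x)
    A, B, G = np.meshgrid(alpha, beta, gamma, indexing="ij")
    W = (2 * np.pi / n_alpha) * wx[None, :, None] * (2 * np.pi / n_gamma)
    W = np.broadcast_to(W / (8 * np.pi ** 2), A.shape)
    return A.ravel(), B.ravel(), G.ravel(), W.ravel()

def so3_correlation(psi, f, R, nodes):
    """Approximate [psi * f](R) = integral of sum_k psi_k(R^{-1} Q) f_k(Q) dQ."""
    A, B, G, W = nodes
    total = 0.0
    for a, b, g, w in zip(A, B, G, W):
        Q = zyz_matrix(a, b, g)
        total += w * np.dot(psi(R.T @ Q), f(Q))  # dot product sums over channels k
    return total

nodes = haar_quadrature()
psi = lambda Q: np.array([Q[2, 2]])  # hypothetical single-channel filter
f = lambda Q: np.array([Q[2, 2]])    # hypothetical single-channel signal
value = so3_correlation(psi, f, np.eye(3), nodes)
print(nodes[3].sum(), value)
```

With these test signals and $R = I$ the integrand is $\cos^2\beta$, whose integral under the normalized measure is $1/3$, so the printed value gives a quick sanity check on both the weights and the normalization constant $8\pi^2$.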

Equivariance

As we have seen, correlation is defined in terms of the rotation operator $L_R$. This operator acts naturally on the input space of the network, but what justification do we have for using it in the second layer and beyond?

The justification is provided by an important property, shared by all kinds of convolution and correlation, called equivariance. A layer $\Phi$ is equivariant if $\Phi \circ L_R = T_R \circ \Phi$, for some operator $T_R$. Using the definition of correlation and the unitarity of $L_R$, showing equivariance is a one-liner:

$$[\psi \star [L_Q f]](R) = \langle L_R \psi, L_Q f \rangle = \langle L_{Q^{-1}R} \psi, f \rangle = [\psi \star f](Q^{-1}R) = [L_Q [\psi \star f]](R). \tag{7}$$

The derivation is valid for spherical correlation as well as rotation group correlation.
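The identity in Eq. 7 can be checked numerically in a simplified setting. The sketch below swaps SO(3) for a finite subgroup, the 24 rotations of the cube, so that the invariant integral becomes an exact average over group elements. This substitution is an assumption made purely for illustration; the signals `f` and `psi` and all helper names are hypothetical.

```python
import itertools
import numpy as np

def cube_rotations():
    """The 24 signed permutation matrices with determinant +1."""
    mats = []
    for perm in itertools.permutations(range(3)):
        for signs in itertools.product([1.0, -1.0], repeat=3):
            M = np.zeros((3, 3))
            for row, (col, s) in enumerate(zip(perm, signs)):
                M[row, col] = s
            if np.isclose(np.linalg.det(M), 1.0):
                mats.append(M)
    return mats

GROUP = cube_rotations()

def correlate(psi, f, R):
    """Discrete stand-in for Eq. 6: average of psi(R^{-1} Q) f(Q) over the group."""
    return np.mean([psi(R.T @ Q) * f(Q) for Q in GROUP])

def rotate_signal(Q_rot, f):
    """[L_Q f](X) = f(Q^{-1} X), as in Eq. 5."""
    return lambda X: f(Q_rot.T @ X)

# Hypothetical scalar signals, defined through arbitrary fixed weights.
w_f = np.array([[0.3, -1.2, 0.5], [2.0, 0.1, -0.7], [0.9, 1.4, -0.2]])
w_psi = np.array([[-0.6, 0.8, 1.1], [0.4, -1.5, 0.2], [1.3, 0.7, -0.9]])
f = lambda X: np.sin(np.sum(w_f * X))
psi = lambda X: np.cos(np.sum(w_psi * X))

Q_rot, R = GROUP[5], GROUP[17]
lhs = correlate(psi, rotate_signal(Q_rot, f), R)  # [psi * [L_Q f]](R)
rhs = correlate(psi, f, Q_rot.T @ R)              # [psi * f](Q^{-1} R)
print(np.isclose(lhs, rhs))
```

The two sides agree because the substitution $Q \mapsto Q_{\text{rot}}^{-1} Q$ merely permutes the group elements, which is the discrete counterpart of the invariance of the measure $dQ$ used in the derivation.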

4 FAST SPHERICAL CORRELATION WITH G-FFT

It is well known that correlations and convolutions can be computed efficiently using the Fast Fourier Transform (FFT). This is a result of the Fourier theorem, which states that $\widehat{f * \psi} = \hat{f} \cdot \hat{\psi}$. Since the FFT can be computed in $O(n \log n)$ time and the product $\cdot$ has linear complexity, implementing the correlation using FFTs is asymptotically faster than the naive $O(n^2)$ spatial implementation.
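For the familiar planar case on a circle, the speed-up can be sketched with NumPy's FFT. The example below compares a naive $O(n^2)$ circular cross-correlation with an FFT-based one; the function names are hypothetical, and the conjugate in the Fourier domain is the correlation counterpart of the convolution form of the theorem for real-valued filters.

```python
import numpy as np

def circular_correlation_naive(psi, f):
    """O(n^2): c[r] = sum over q of psi[(q - r) mod n] * f[q]."""
    n = len(f)
    return np.array(
        [sum(psi[(q - r) % n] * f[q] for q in range(n)) for r in range(n)]
    )

def circular_correlation_fft(psi, f):
    """O(n log n): for real psi, the DFT of c is conj(psi_hat) * f_hat."""
    psi_hat = np.fft.fft(psi)
    f_hat = np.fft.fft(f)
    return np.real(np.fft.ifft(np.conj(psi_hat) * f_hat))

rng = np.random.default_rng(0)
psi = rng.standard_normal(64)  # hypothetical real-valued filter
f = rng.standard_normal(64)    # hypothetical real-valued signal

slow = circular_correlation_naive(psi, f)
fast = circular_correlation_fft(psi, f)
print(np.allclose(slow, fast))
```

The index shift `(q - r) % n` plays the role of the rotation $R^{-1}Q$ on the circle, so the same structure carries over once the FFT is replaced by the generalized transform.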

For functions on the sphere and rotation group, there is an analogous transform, which we will refer to as the generalized Fourier transform (GFT), and a corresponding fast algorithm (GFFT). This transform finds its roots in the representation theory of groups, but due to space constraints we will not go into details here and instead refer the interested reader to Sugiura (1990) and Folland (1995).

Conceptually, the GFT is nothing more than the linear projection of a function onto a set of orthogonal basis functions called "matrix elements of irreducible unitary representations". For the circle ($S^1$) or line ($\mathbb{R}$), these are the familiar complex exponentials $\exp(in\theta)$. For SO(3), we have the Wigner D-functions $D^l_{mn}(R)$ indexed by $l \geq 0$ and $-l \leq m, n \leq l$. For $S^2$, these are the spherical harmonics $Y^l_m(x)$ indexed by $l \geq 0$ and $-l \leq m \leq l$.

Denoting the manifold ($S^2$ or SO(3)) by $X$ and the corresponding basis functions by $U^l$ (which is either vector-valued ($Y^l$) or matrix-valued ($D^l$)), we can write the GFT of a function $f : X \to \mathbb{R}$ as

$$\hat{f}^l = \int_X f(x) \, \overline{U^l(x)} \, dx. \tag{8}$$
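The projection in Eq. 8 is easiest to see in the simplest case mentioned above, the circle with basis functions $\exp(in\theta)$. The sketch below is an illustration for $X = S^1$ only, assuming the normalized measure $d\theta / 2\pi$ and a uniform sampling grid; it is not the spherical or SO(3) transform, and `gft_circle` is a hypothetical name.

```python
import numpy as np

def gft_circle(f_samples, n):
    """Project samples of f on a uniform grid over [0, 2*pi) onto exp(i*n*theta).

    Approximates (1 / 2*pi) * integral of f(theta) * conj(exp(i*n*theta))
    by an average over the grid points.
    """
    N = len(f_samples)
    theta = 2 * np.pi * np.arange(N) / N
    return np.mean(f_samples * np.conj(np.exp(1j * n * theta)))

N = 32
theta = 2 * np.pi * np.arange(N) / N
# Test signal 1 + 2 cos(3 theta) = 1 + exp(3i theta) + exp(-3i theta).
f_samples = 1.0 + 2.0 * np.cos(3 * theta)

coeffs = {n: gft_circle(f_samples, n) for n in range(-4, 5)}
for n in sorted(coeffs):
    print(n, np.round(coeffs[n], 6))

# For non-negative n the same numbers come out of the FFT, up to the factor 1/N.
fft_coeffs = np.fft.fft(f_samples) / N
```

Only the coefficients at $n = 0$ and $n = \pm 3$ are non-zero for this signal, each equal to one, which reflects the orthogonality of the basis functions that the GFT relies on.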

Technically, $S^2$ is not a group and therefore does not have irreducible representations, but it is a quotient of groups SO(3)/SO(2) and we have the relation $Y^l_m = D^l_{m0}|_{S^2}$.
