球面卷积神经网络 (2018)
Spherical CNNs (2018)

原始链接: https://arxiv.org/abs/1801.10130

本文介绍了球面卷积神经网络(Spherical CNNs),这是一种分析球面图像的新方法,它解决了将标准卷积神经网络应用于平面投影时因失真导致效率低下的问题。其动机源于全向视觉、分子回归和全球气候建模等应用。核心创新在于定义了一种既具有表达能力又具有旋转等变性的球面互相关运算,这对于在球面上有效地共享权重至关重要。这种球面相关性受益于广义傅里叶定理,可以通过非交换快速傅里叶变换 (FFT) 算法进行高效计算。作者通过三维模型识别和原子化能回归的实验,证明了球面卷积神经网络的效率、准确性和有效性,突出了其在各种球面数据分析任务中的潜力。

Here's a short summary of the Hacker News thread about "Spherical CNNs (2018)": The article "Spherical CNNs (2018)" sparked a discussion on Hacker News. User voxleone shared SpinStep, a tool for visualizing SCNN computations layer-by-layer, hoping to demystify the "black box" nature of CNNs. Rkp8000 highlighted a key point: convolutions on a sphere are more complex than in Cartesian spaces. Unlike the straightforward "sliding" of filters in R^N, there isn't a single, clear way to shift a filter on a sphere due to varying orientations during rotations. This means spherical convolution returns a function over the 3D rotation group SO(3), not just a 2D sphere, complicating the convolution theorem. Smath noted the relevance of this research to 3D molecule modeling in drug discovery, referencing a related paper by some of the same authors.
相关文章

原文

View a PDF of the paper titled Spherical CNNs, by Taco S. Cohen and 3 other authors

View PDF
Abstract:Convolutional Neural Networks (CNNs) have become the method of choice for learning problems involving 2D planar images. However, a number of problems of recent interest have created a demand for models that can analyze spherical images. Examples include omnidirectional vision for drones, robots, and autonomous cars, molecular regression problems, and global weather and climate modelling. A naive application of convolutional networks to a planar projection of the spherical signal is destined to fail, because the space-varying distortions introduced by such a projection will make translational weight sharing ineffective.
In this paper we introduce the building blocks for constructing spherical CNNs. We propose a definition for the spherical cross-correlation that is both expressive and rotation-equivariant. The spherical correlation satisfies a generalized Fourier theorem, which allows us to compute it efficiently using a generalized (non-commutative) Fast Fourier Transform (FFT) algorithm. We demonstrate the computational efficiency, numerical accuracy, and effectiveness of spherical CNNs applied to 3D model recognition and atomization energy regression.
From: Taco Cohen [view email]
[v1] Tue, 30 Jan 2018 18:28:30 UTC (1,942 KB)
[v2] Thu, 8 Feb 2018 08:06:34 UTC (1,942 KB)
[v3] Sun, 25 Feb 2018 13:43:49 UTC (1,942 KB)
联系我们 contact @ memedata.com