基于深度估计的多视角分解与重建方法

需积分: 8 129 浏览量更新于2024-08-06 收藏 580KB PDF 举报

"该文提出了一种基于迭代深度估计的多视角分解方法，旨在通过匹配多个图像来重建相机运动和场景形状，特别是当相机捕捉到透视视图时。该方法从仿射投影相机模型出发，逐步估计投影深度，直至测量矩阵达到秩4。接着，通过分解得到的测量矩阵，恢复场景在投影空间中的三维信息。这种方法避免了传统透视投影图像分解过程中对噪声敏感的步骤，如计算基础矩阵，从而实现更稳定的重建。此外，还扩展了常规仿射模型中的度量约束，并推导出透视投影条件下的度量约束。证明了在满足内部参数条件下，可以在欧几里得空间中实现重建。" 在多视角几何领域，本文的核心贡献是一种新的分解技术，它特别关注于处理具有透视效果的图像序列。传统的多视图几何方法通常依赖于线性或非线性的优化算法来估计相机运动参数和场景结构。然而，这些方法在处理复杂场景和噪声数据时可能会遇到挑战，因为它们需要计算如本质矩阵或基础矩阵等高度敏感的几何量。本文提出的迭代深度估计方法，首先假设一个仿射投影相机模型，这个模型简化了相机的投影过程，但保留了透视效应的关键特征。通过迭代地估计每个视图中的深度值，该方法逐渐改进了场景的三维表示，同时减少了噪声对结果的影响。当测量矩阵的秩达到4时，意味着可以解析地解出四个自由度的相机运动和三个自由度的场景点。接下来，作者扩展了仿射模型的度量约束，将透视投影条件考虑进来。在仿射模型中，虽然可以捕获大部分图像的几何关系，但它不保持距离的比例，这可能导致重建的精度降低。通过引入透视约束，文章能够更准确地重建场景的欧几里得结构，这对于实际应用，如虚拟现实、增强现实或机器人导航等，至关重要。这项工作提供了一种稳健且高效的多视图几何方法，适用于包含透视效果的图像序列。通过迭代深度估计和改进的度量约束，它能够处理噪声数据，实现对相机运动和场景结构的精确重建。这种方法对于计算机视觉领域的研究和实践有着重要的理论与应用价值。

A Factorization Method for Multiple Perspective Views via

Iterative Depth Estimation

Toshio Ueshiba and Fumiaki Tomita

Electrotechnical Laboratory, Tsukuba, Japan 305-8568

SUMMARY

This paper proposes a factorization method that re-

constructs camera motion and scene shape based on the

matching of multiple images under the condition that the

camera captures a perspective view. Starting from the affine

projection camera model, the projection depth is iteratively

estimated until the measurement matrix has rank 4. Then,

the obtained measurement matrix is factorized to restore the

three-dimensional information of the scene in the projec-

tion space. This approach eliminates noise sensitive proc-

esses, such as the calculation of the fundamental matrix,

that are required in the factorization for the conventional

perspective projection image, and a stable reconstruction is

realized. Furthermore, the metric constraint in the conven-

tional affine model is extended, and the metric constraint in

the perspective projection condition is derived. It is shown

that the reconstruction in Euclidean space is realized if the

Technica, Syst Comp Jpn, 31(13): 8795, 2000

Key words:

Affine projection; perspective projec-

tion; factorization; reconstruction of three-dimensional in-

formation; metric constraint.

1. Instruction

The problem in which the relative locations of the

cameras and the three-dimensional information of the scene

are simultaneously reconstructed based on multiple images

obtained from various viewpoints is called the structure

from motion and is one of the essential problems in

computer vision. A large number of algorithms for this

problem have been proposed. Among them, the factoriza-

tion method proposed by Tomasi and Kanade [1] is an

excellent method that is simple and highly stable.

Their method is based on the property that the meas-

urement matrix composed of the two-dimensional coordi-

nates of the feature points observed in the image can be

decomposed into the product of two matrices representing

the camera motion and the three-dimensional positions of

the feature points, respectively, under the assumption that

the camera executes the affine projection. Tomasi and

Kanade used orthographic projection as the camera model.

Several extensions of the method were subsequently made

to the weak perspective camera model and the paraperspec-

tive camera model [2].

Recently, several methods were proposed to extend

the factorization method to the case of the perspective

projection camera model [38]. A difficulty in applying the

factorization method to the perspective projection images

is that the structural parameter of the scene called projective

depth is unknown. Thus, the measurement matrix is also

unknown, and the factorization cannot directly be applied.

The crucial point is therefore to determine projective depth

by some means.

Christy and Horaud presented the shape reconstruc-

tion method [3, 4], which starts from the paraperspective

camera model and makes the perspective projection model

approach the measurement matrix by iteratively applying

linear factorization. In this method, a constraint called

metric constraint is handled in each step of the factorization

Systems and Computers in Japan, Vol. 31, No. 13, 2000

Translated from Denshi Joho Tsushin Gakkai Ronbunshi, Vol. J81-D-I, No. 8, August 1998, pp. 17181726

下载后可阅读完整内容，剩余8页未读，立即下载

weixin_38697557

粉丝: 8

基于深度估计的多视角分解与重建方法

laser-kinect-pointcloud-register-icp.zip_kinect点云提取_点云_点云提取_点云滤波

投影多视图结构与运动的元素明智分解新方法

增量多分辨矩阵分解：揭示对称矩阵的层次结构

预测模型深度对比：ARIMA vs 季节性分解，哪一种更胜一筹？

数值分析的迭代方法：收敛性与效率的实战探讨

动态张量分解：时间序列分析的未来视角

【频率域分析】：多通道信号处理的新视角与方法

【控制理论深度探索】：直流电机模型分解的必备知识与实践要点

【VMD变分模态分解深度解析】：算法数学原理与实现细节探究

深度学习与最小二乘法：系统辨识新视角

最新资源