无视点图像缝合网络：全局共变法的深度学习解决方案

需积分: 6 163 浏览量更新于2024-08-05 收藏 9.58MB PDF 举报

本文介绍了一种基于全局仿射变换的无视图图像缝合网络，旨在解决传统计算机视觉任务——图像拼接中的灵活性问题。近年来，深度学习在图像缝合领域取得了显著进展，但现有的学习方法通常假设图像拍摄时视角相对固定，这限制了它们在处理灵活视角场景时的泛化能力。作者针对这一问题，提出了一种分阶段的、无视图约束的图像缝合网络。首先，该网络的核心是利用全局仿射变换（global homography）来估计两个输入图像之间的关系。全局仿射变换是一种线性变换模型，它能够在保持物体形状基本不变的情况下，描述两个视角下图像之间的相对位置。通过这种方法，网络能够适应不同视角下的图像对，突破了传统方法对视角的限制。整个网络设计分为三个阶段：首先，通过深度学习模块对输入图像进行特征提取，提取出关键的图像内容和结构信息。这些特征对于后续的仿射变换估计至关重要，因为它们提供了足够的上下文来确定图像间的映射关系。然后，在特征空间中，网络计算两个图像之间的全局仿射变换矩阵，这一步骤涉及到优化过程，例如使用梯度下降或相似的方法求解仿射变换参数。接下来，网络利用估计的全局仿射变换来对两张图像进行融合，确保在缝合区域的边界处图像之间有平滑的过渡，减少视觉断点。这一步可能涉及到图像插值技术，如最近邻插值、双线性插值或者更高级的卷积神经网络（CNN）进行无缝融合。最后，网络通过一个后处理阶段来进一步优化结果，可能包括局部平滑、细节增强或者去除可能出现的残余错误。这个阶段可以使用循环神经网络（RNN）或者其他深度学习模型，以提高缝合图像的质量。这种无视图图像缝合网络通过引入全局仿射变换并结合深度学习，克服了传统方法对视角限制的问题，展现出更强的适应性和泛化能力。它为图像拼接任务提供了一个新的解决方案，特别是在需要处理多样视角场景时，能够生成更为自然、无缝的全景图像。该研究为计算机视觉领域的图像处理和深度学习相结合的应用开辟了新的方向。

J. Vis. Commun. Image R. 73 (2020) 102950

Available online 4 November 2020

A view-free image stitching network based on global homography

☆

Lang Nie, Chunyu Lin

, Kang Liao, Meiqin Liu, Yao Zhao

Institute of Information Science, Beijing Jiaotong University, Beijing Key Laboratory of Advanced Information Science and Network, Beijing 100044, China

ARTICLE INFO

Keywords:

41A05

41A10

65D05

65D17

ABSTRACT

Image stitching is a traditional but challenging computer vision task, aiming to obtain a seamless panoramic

image. Recently, researchers begin to study the image stitching task using deep learning. However, the existing

learning methods assume a relatively xed view during the image capturing, thus show a poor generalization

ability to exible view cases. To address the above problem, we present a cascaded view-free image stitching

network based on a global homography. This novel image stitching network does not have any restriction on the

view of images and it can be implemented in three stages. In particular, we rst estimate a global homography

between two input images from different views. And then we propose a structure stitching layer to obtain the

coarse stitching result using the global homography. In the last stage, we design a content revision network to

eliminate ghosting effects and rene the content of the stitching result. To enable efcient learning on various

views, we also present a method to generate synthetic datasets for network training. Experimental results

demonstrate that our method can achieve almost 100% elimination of artifacts in overlapping areas at the cost of

acceptable slight distortions in non-overlapping areas, compared with traditional methods. In addition, the

proposed method is view-free and more robust especially in a scene where feature points are difcult to detect.

1. Introduction

Image stitching is a technology that can create a seamless panorama

or high-resolution image by stitching images with overlapping parts.

The images may be obtained from different moments, different per-

spectives or different sensors. In recent years, it has received increasing

attention and has become a popular topic in photographic graphics,

surveillance videos [1], and VR [2], etc.

The classical image stitching follows these steps. First, a 3 × 3

homography matrix including translation, rotation, scaling and van-

ishing point transformation is estimated after the feature extraction and

feature matching between a pair of images. Then the homography is

utilized to warp the original image into alignment with the other one.

Finally the original image and the warped image are fused to get the

stitching result. However, this basic algorithm needs to satisfy a basic

assumption: the scene of the picture should be near planar [3]. In fact,

the depth of image contents always differs, which does not satisfy the

prior hypothesis. Therefore, it is easy to cause ghosting effects or mis-

alignments for overlapping parts in the stitching image. In order to

mitigate ghosting effects and improve stitching quality, some existing

image stitching algorithms calculate multiple content-aware local

warpings [4–11] to align the overlapping parts of images, and some

reduce the artifacts generated using projection transformation by

nding the optimal seams [12–15] around objects. As for deep stitching

methods, some methods [16–20] are qualied for stitching images from

arbitrary views, but there is only some steps of its frameworks, such as

feature extraction or feature matching, is achieved by deep learning,

which cannot be called a complete deep image stitching model. Some

other methods [21–23] are all implemented using deep learning, but

they are only specially designed for some specic conditions, such as

xed views.

Different from these deep stitching methods, we aim to establish a

complete deep learning model that can handle images captured from

arbitrary views. In this paper, we present a cascaded view-free image

stitching network based on the global homography, which can eliminate

the ghosting effects as much as possible.

The overview of our approach is illustrated in Fig. 1(e). Specically,

the rst stage is the homography estimation. Different from the existing

deep homography estimation [24–26], the proportion of overlapping

parts between two images in our image stitching is much lower, which

brings great challenges to the stitching performance. To address this

problem, we introduce a global correlation layer [27,28] into this stage

☆

This paper has been recommended for acceptance by Zicheng Liu.

* Corresponding author.

E-mail address: cylin@bjtu.edu.cn (C. Lin).

Contents lists available at ScienceDirect

Journal of Visual Communication and Image Representation

journal homepage: www.elsevier.com/locate/jvci

https://doi.org/10.1016/j.jvcir.2020.102950

Received 19 April 2020; Received in revised form 17 July 2020; Accepted 10 October 2020

下载后可阅读完整内容，剩余8页未读，立即下载

godbei233

粉丝: 0

无视点图像缝合网络：全局共变法的深度学习解决方案

homography

A view-free image stitching network based on global homography

parallax-tolerant image stitching based on robust elastic warping

matlab的欧拉方法代码-A-video-stitching-system-based-on-mirror-pyramids-and-non

Python-Multiple-Image-Stitching-master.zip_image stitching_pytho

As-Projective-As-Possible Image Stitching with Moving DLT

matlab影像镶嵌代码-SIFT-Based-Image-Stitching:基于SIFT的图像拼接

Learning edge-preserved image stitching from

Parallax-Tolerant Image Stitching with Epipolar Displacement

Content-Preserving Image Stitching with Piecewise Rectangular Boundary Constraints

最新资源