无监督图像拼接中的像素级对齐学习

论文

142 浏览量更新于2024-08-03 收藏 3.77MB PDF 举报

身份认证购VIP最低享 7 折!

30元优惠券

"Learning pixel-wise alignment for unsupervised image stitching" 这篇论文主要探讨的是无监督图像拼接中的像素级对齐技术。图像拼接是将两个视角相同的图像进行对齐合并，以便形成一个更大的视野。在实际非共面场景中，由于缺乏更广阔的视场作为参考，图像拼接在处理自然结构的精确对准时尤为具有挑战性，特别是存在大视差的情况下。作者们提出了一种无监督的图像拼接框架，突破了传统 Homography（单应性）估计中的共面约束，能够在有限的重叠区域实现像素级别的准确对齐。这一框架主要包含两个关键步骤：首先，他们通过迭代密集特征匹配生成全局变换。这个过程结合了误差控制策略，旨在减轻由大视差引起的差异。在图像特征匹配中，通常使用关键点检测器（如SIFT或ORB）来找出图像间的对应点，然后通过这些对应点估计出初始的变换模型。迭代过程可以逐步优化这个模型，确保更准确地对齐图像的相似部分。其次，论文提出了一种像素级扭曲网络，该网络内嵌在一个大规模特征提取框架内。这个网络能够对每个像素进行个体化的处理，以适应局部的变形。通常，这种网络会基于深度学习，如使用卷积神经网络（CNN），以端到端的方式学习从源图像到目标图像的映射函数。网络的训练无需标注数据，即为无监督学习，这使得模型能在没有配对的地面实况信息下自我优化。此外，由于在非共面场景中，简单的单应性假设不足以捕捉复杂的几何变换，该方法可能采用了多平面模型或其他高级几何模型来更准确地描述图像间的关系。同时，为了处理有限重叠区域的问题，网络可能会学习到如何在没有直接对应点的情况下估计合理的像素位置，这可能涉及到上下文信息的利用和边缘保持策略，以避免图像拼接时出现明显的失真或不连续。这篇论文贡献了一种新的无监督图像拼接方法，通过像素级的对齐策略提高了在复杂场景下的拼接效果，特别是在大视差和重叠区域有限的情况下。这种方法对于增强现实、全景图像生成和遥感图像处理等领域有着重要的应用价值。

资源详情

资源推荐

Learning Pixel-wise Alignment for Unsupervised Image Stitching

Qi Jia

Dalian University of Technology

jiaqi@dlut.edu.cn

Xiaomei Feng

Dalian University of Technology

xiaomeifeng19@gmail.com

Yu Liu

∗

Dalian University of Technology

liuyu8824@dlut.edu.cn

Xin Fan

Dalian University of Technology

xin.fan@dlut.edu.cn

Longin Jan Latecki

Temple University

latecki@temple.edu

ABSTRACT

Image stitching aims to align a pair of images in the same view.

Generating precise alignment with natural structures is challeng-

ing for image stitching, as there is no wider eld-of-view image

as a reference, especially in non-coplanar practical scenarios. In

this paper, we propose an unsupervised image stitching frame-

work, breaking through the coplanar constraints in homography

estimation, yielding accurate pixel-wise alignment under limited

overlapping regions. First, we generate a global transformation by

an iterative dense feature matching combined with an error control

strategy to alleviate the dierence introduced by large parallax. Sec-

ond, we propose a pixel-wise warping network embedded within a

large-scale feature extractor and a correlative feature enhancement

module to explicitly learn correspondences between the inputs,

and generate accurate pixel-level osets upon novel constraints

on both overlapping and non-overlapping regions. Notably, we

leverage the pixel-level osets in the overlapping area to guide the

adjustment in the non-overlapping area upon content and structure

consistency constraints, rendering a natural transition between two

regions and distortions suppression over the entire stitched image.

The proposed method achieves state-of-the-art performance that

surpasses both traditional and deep learning approaches by a large

margin. It also achieves the shortest execution time and has the

best generalization ability on the traditional dataset.

CCS CONCEPTS

• Computing methodologies → Computer vision;

KEYWORDS

image stitching, pixel-wise alignment, homography estimation

ACM Reference Format:

Qi Jia, Xiaomei Feng, Yu Liu, Xin Fan, and Longin Jan Latecki. 2023. Learning

Pixel-wise Alignment for Unsupervised Image Stitching. In Proceedings of

∗

Corresponding author: Yu Liu.

Permission to make digital or hard copies of all or part of this work for personal or

classroom use is granted without fee provided that copies are not made or distributed

for prot or commercial advantage and that copies bear this notice and the full citation

on the rst page. Copyrights for components of this work owned by others than the

author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or

republish, to post on servers or to redistribute to lists, requires prior specic permission

and/or a fee. Request permissions from permissions@acm.org.

MM ’23, October 29-November 3, 2023, Ottawa, ON, Canada

ACM ISBN 979-8-4007-0108-5/23/10.. . $15.00

https://doi.org/10.1145/3581783.3612298

(b) Stitched image of global alignment

Network

Global alignment

(a) Global alignment vs. Pixel-wise alignment

Pixel-wise alignment

...

x H x



DLT

Network

1 1 1

2 2 2

...

x x x

  

Warp

Inputs

Offsets of four points

Pixel-wise offset

Figure 1: Global alignment vs. Pixel-wise alignment. (a) Il-

lustration of the dierence between global and pixel-wise

alignment in principle. Global alignment applies a direct

linear transformation (DLT) to approximate the oset for

all the pixels, while our pixel-wise alignment executes un-

uniform transformations for each pixel. (b) and (c) compare

two kinds of stitching results. Pixel-wise alignment achieves

superior results in image and artifact suppression compared

to global alignment, as shown in the zoomed-in regions.

the 31st ACM International Conference on Multimedia (MM ’23), October 29-

November 3, 2023, Ottawa, ON, Canada. ACM, New York, NY, USA, 9 pages.

https://doi.org/10.1145/3581783.3612298

1 INTRODUCTION

Image stitching aims to estimate an accurate transformation be-

tween a pair of images and align them in the same view. It has

been a well-studied topic with widespread applications [

] such

as panorama on smartphones [

], robot navigation [

], and virtual

reality [

]. However, generating high-quality stitched images

in various practical scenarios is still challenging, especially when

there is no wider eld-of-view image as a reference.

Homography transformation [

] is the most widely used

image stitching model, that leverages the feature correlation in

overlapping regions as constraints to estimate a global homogra-

phy matrix [

], and transform the whole target image to the view

of the reference image (see the global warping part in Fig. 1 (a)).

Most existing methods estimate the global homography by assum-

ing the whole scene is coplanar, leading to severe misalignment

下载后可阅读完整内容，剩余8页未读，立即下载

Seung-YimYau

粉丝: 302
资源: 16

无监督图像拼接中的像素级对齐学习

Deep Visual-Semantic Alignments for Generating Image Descriptions

image--Alignment.rar_alignment_vc 88955.com

Deep Cross-Modality Alignment for Multi-Shot Person Re-IDentification

Image-Alignment-Algorithms.rar_ image alignment_Image Alignment_

(TMI) Unsupervised Bidirectional Cross-Modality Adaptation via Deeply Synergistic Image and Feature Alignment for Medical Image Segmentation 01.pdf

Rebuild_Strong-Weak-Distribution-Alignment-for-Adaptive-Object-Detection:这是纸的个人重建

Umeyama-Similiar-Transfrom-for-face-alignment:Shinji Umeyama Similiar Transfrom的opencv实现

keras-ACG-face-alignment:【ACG-face-alignment】ACG脸部对齐

LYRICS-TO-AUDIO ALIGNMENT AND PHRASE-LEVEL SEGMENTATION

gtk+-2.0之界面布局控件示例--alignment/fixed/table/box

jsantarc/Dynamic-Time-Alignment-K-Means-Kernel-Clustering-For-Time-Sequence-Clustering:用于时间序列聚类的动态时间对齐 (DTA) K-Means 内核聚类-matlab开发

提高磁盘性能-4k化工具-Paragon Alignment

CUSHAW3: Accurate Short-read Alignment-开源

精品--Alignment成为GPT类大模型微调的必须环节，深度强化学习是Alignment的核心。本项目是一个.zip

M-File Alignment:清理m-code对齐的函数-matlab开发

Focus conditioning effects on molecular field-free alignment observed with high-order harmonic generation

论文研究-利用Alignment空间理论分析蛋白质的结构.pdf

Image_Alignment_and_Stitching_A_Tutorial.pdf

最新资源