Efficient GPU-Accelerated Image Mosaicing for Large Images


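This paper studies GPU acceleration of image mosaicing, a technique widely used in computer science whose feature matching, warping, and blending steps are computationally intensive. Some applications need these steps to run in real time, but on a conventional central processing unit (CPU) the heavy computational load makes them too slow. As graphics processing units (GPUs) have matured, more and more parallel operations have been developed to speed up mosaicing. The authors use CUDA (Compute Unified Device Architecture), a GPU programming model, to design an efficient parallel image mosaicing algorithm. CUDA's strength is parallelism: it can process large amounts of data simultaneously, which substantially increases processing speed. In experiments on an integrated NVIDIA GeForce GTX 745 GPU, the GPU implementation ran up to 27.6 times faster than the CPU implementation, with the largest gains on large input images. This indicates that GPU-based mosaicing can meet real-time and performance requirements, which is of practical value for image processing, computer vision, and virtual reality.

The key points of the work are: 1) applying CUDA programming so that the algorithm runs efficiently on the GPU's many-core architecture; 2) parallelizing feature matching using the GPU's SIMT (Single Instruction, Multiple Threads) model; 3) parallelizing warping and blending across many GPU threads to reduce the execution time of each task; and 4) experimental validation that quantifies the speedup on real hardware. The study provides a practical framework showing how the GPU's parallelism can be used to optimize image mosaicing for real-time computer vision tasks, and it is of both academic and engineering interest.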

Fast Implementation of Image Mosaicing on GPU

Yixiang Lu, Qingwei Gao 1,∗, Shuai Chen
School of Electrical Engineering and Automation, Anhui University, Hefei 230601, China

Dong Sun, Yi Xia, Xueming Peng 1,2
Shanghai Huawei Technology Co., Ltd, Shanghai 200120, China

Abstract—Image mosaicing has been studied and widely used in many fields of computer science, but its feature matching, warping and blending steps involve a huge amount of computation, so it cannot meet the real-time demands of some applications. Fortunately, related parallel operations that speed up mosaicing have been developed and implemented on the Graphics Processing Unit (GPU). In this paper, we present a parallel implementation of image mosaicing on the GPU using the Compute Unified Device Architecture (CUDA). It achieves better execution times than the implementation on the central processing unit (CPU). With an integrated GTX745 GPU in the experiment, we achieved a speedup of up to 27.6 times for large input images.

Index Terms—Image mosaicing; Matching; Parallel; Graphics Processing Unit (GPU).

I. INTRODUCTION

Image mosaicing is an active area of research in the fields of photogrammetry, computer vision, image processing and computer graphics. It can be defined as the process of constructing panoramic image mosaics from a sequence of partial images obtained from different views [1]. Early applications of image mosaicing focused mainly on constructing large aerial and satellite photographs from collections of images [2]. Nowadays, a variety of new applications have emerged, including scene stabilization and change detection [3], increasing the field of view and resolution [4], video compression [5], wide-area video surveillance [6], the construction of virtual environments [7] and image-based rendering [8]. A typical mosaicing process consists of three image processing steps: registration, warping and interpolation, and blending. Image registration is the key task of image mosaicing [9]. Registration refers to establishing a geometric transformation between a pair of images depicting the same scene; the transformation is determined by a planar homography with 8 degrees of freedom. Errors in the homography cause image misalignment and make the subsequent blending difficult. To make the elements of the homography more accurate, we must search for the best-matching feature points, which are used to estimate the homography. However, this search is computationally very expensive, especially for large images. Moreover, when mosaicing is applied to video processing (e.g. video indexing and wide-area video surveillance), which involves a very large number of images, mosaicing speed is critical in practice.
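To make the 8-degree-of-freedom homography concrete, here is a minimal Python sketch of how a 3x3 matrix H maps a pixel from one image into the other. It is an illustration only, not the authors' code: the function names `normalize_homography` and `apply_homography` are hypothetical, and the normalization assumes the bottom-right entry of H is nonzero. H has nine entries but is defined only up to scale, which is why fixing that entry to 1 leaves eight free parameters.

```python
def normalize_homography(H):
    """Scale H so that H[2][2] == 1, leaving 8 free parameters.

    Assumption for this sketch: H[2][2] is nonzero.
    """
    s = H[2][2]
    if abs(s) < 1e-12:
        raise ValueError("H[2][2] is zero; this normalization does not apply")
    return [[v / s for v in row] for row in H]


def apply_homography(H, x, y):
    """Map pixel (x, y) through the 3x3 homography H (nested lists)."""
    # Homogeneous coordinates: [x', y', w]^T = H [x, y, 1]^T
    xp = H[0][0] * x + H[0][1] * y + H[0][2]
    yp = H[1][0] * x + H[1][1] * y + H[1][2]
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    if abs(w) < 1e-12:
        raise ValueError("point maps to infinity under H")
    # Divide by w to return to ordinary pixel coordinates.
    return xp / w, yp / w
```

Each pixel is mapped independently of every other pixel, so a warping step built on this mapping is naturally data-parallel; in a GPU version one would typically assign one thread per output pixel.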

In recent years, the Graphics Processing Unit (GPU) has attracted researchers' attention in many fields for its massive parallel computational power. Using the GPU as a coprocessor to accelerate computationally heavy algorithms has become common practice, and many image processing algorithms have already been successfully implemented on the GPU. For example, Luo and Duraiswami [10] implemented a complete version (including all stages of the algorithm) of the Canny edge detector under CUDA and achieved a speedup of more than 3 times over a straightforward CPU implementation. Their work included the hysteresis labeling connected-component stage, which was not included in previous GPU versions; this is the main reason they could not achieve a larger speedup. For image matching and mosaicing, many related applications are also available on the GPU. In [11], Schatz and Trapnell implemented a string-matching program that runs on the GPU and achieved a speedup of as much as 35x over the equivalent CPU-bound version. They presented a string-matching kernel for CUDA that performs a parallelized search of a suffix tree to find exact matches for a set of query strings. M. Adam et al. [12] presented a novel approach to local image alignment for a real-time video stitching application on the GPU. To achieve a nearly double-sized panorama, they focused mainly on stitching the margin regions of high-definition stereo images. To accelerate the assembly of large mosaics of electron microscope images, K. U. Venkataraju [13] proposed using texture memory lookups to speed up access to microscopy image tiles, together with data-parallel computing, which leads to the root of complexity of the calculation. Because unsigned char is used as the image data type, the pixel values computed for the mosaic are slightly inaccurate. Although the papers mentioned above achieved good results, none of them considered two extremely time-consuming steps: feature matching and random sample consensus (RANSAC). As two key processes in image registration, they should be considered in any proposed GPU-accelerated parallel algorithm.
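The two steps singled out above, feature matching and RANSAC, can be sketched in a few lines of Python to show where the cost comes from. This is a simplified illustration under stated assumptions, not the method of this paper: the function names are hypothetical, matching is plain brute force with a ratio test, and RANSAC fits a pure translation (one correspondence per sample) rather than the 8-degree-of-freedom homography, which would need four correspondences and a linear solve.

```python
import random


def match_brute_force(desc_a, desc_b, ratio=0.8):
    """Match each descriptor in desc_a to its nearest neighbour in desc_b.

    A match (i, j) is kept only if the nearest distance is clearly smaller
    than the second nearest (ratio test). Cost is O(len(desc_a) * len(desc_b)).
    """
    matches = []
    for i, da in enumerate(desc_a):
        # Squared Euclidean distance from da to every descriptor in desc_b.
        dists = sorted(
            (sum((p - q) ** 2 for p, q in zip(da, db)), j)
            for j, db in enumerate(desc_b)
        )
        if len(dists) == 1:
            matches.append((i, dists[0][1]))
        elif len(dists) > 1 and dists[0][0] < (ratio ** 2) * dists[1][0]:
            matches.append((i, dists[0][1]))
    return matches


def ransac_translation(pts_a, pts_b, matches, threshold=2.0,
                       iterations=100, seed=0):
    """RANSAC for a translation model: pts_b[j] ~ pts_a[i] + (dx, dy).

    Simplification for this sketch: a real mosaicing pipeline would fit a
    homography here. Returns (best_model, inlier_matches).
    """
    rng = random.Random(seed)
    best_model, best_inliers = None, []
    if not matches:
        return best_model, best_inliers
    for _ in range(iterations):
        # Minimal sample: one correspondence fixes a translation.
        i, j = rng.choice(matches)
        dx = pts_b[j][0] - pts_a[i][0]
        dy = pts_b[j][1] - pts_a[i][1]
        # Consensus set: matches whose residual is within the threshold.
        inliers = [
            (a, b) for a, b in matches
            if (pts_a[a][0] + dx - pts_b[b][0]) ** 2
            + (pts_a[a][1] + dy - pts_b[b][1]) ** 2 <= threshold ** 2
        ]
        if len(inliers) > len(best_inliers):
            best_model, best_inliers = (dx, dy), inliers
    return best_model, best_inliers
```

Both loops are dominated by independent work items: every descriptor comparison in the matcher and every hypothesis evaluation in RANSAC can be computed without reference to the others, which is the property a GPU implementation would exploit.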

In this paper, a parallel image mosaicing method implemented on the GPU using the Compute Unified Device Architecture (CUDA) programming model is presented. To reduce computation time efficiently, this paper focuses on the most time-consuming part of mosaicing. In fact, for most precision mosaicing, the execution time depends mainly on the number of matched point pairs in the overlapping images, not on the image size. Thus, our method starts with feature matching and

2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI 2017)
