Depth Information Fused Salient Object Detection
Fangfang Chen, Congyan Lang, Songhe Feng, Zehai Song
School of Computer and Information Technology, Beijing Jiaotong University, China
{12120401, cylang, shfeng, zhsong}@bjtu.edu.cn
ABSTRACT
Saliency detection has emerged as a hot topic due to its potential applications in image and video understanding. Most existing saliency detection algorithms focus on two-dimensional information, while depth information is often ignored. In this paper, we first create the salient object ground truth for a specific image dataset that contains 600 RGB-D (color and depth) images taken from different surroundings with different angles and intensities of illumination. The depth image describes the depth of each object in the scene from the perspective of the viewer: the intensity value of every pixel in the depth image denotes its depth. With the help of depth information, a more precise object description can be acquired. Furthermore, several state-of-the-art saliency detection models can be utilized to generate 2D saliency maps, which are then fused with the depth map to detect the salient object in a given image. Experimental results demonstrate the effectiveness of the proposed method.
Categories and Subject Descriptors
I.4.8 [Image Processing And Computer Vision]: Scene Analysis
– color, depth cues.
General Terms
Theory
Keywords
salient object detection, depth information, RGB-D image, visual
attention.
1. INTRODUCTION
The rapid popularization of digital cameras and mobile phone cameras has led to an explosive growth of social image sharing web sites, such as Flickr. How to organize and analyze these large-scale image collections has recently become a hot topic. Visual saliency is deemed a fundamental issue in the fields of psychology, neuroscience, neural systems, and computer vision. It can be regarded as the ability of a visual system (human or machine) to select a certain subset of visual information for further processing [1]. The goal of salient object detection is to detect and extract the most salient and attention-grabbing object in a scene. The output is usually called a “saliency map”, in which the intensity of each pixel represents the probability of that pixel belonging to the salient object [1]. Visual saliency and saliency detection can be applied in many fields, including object detection and recognition [3], image indexing [2], image compression [4], multimedia question answering [5], movie2comics [6], tagging technology [7], and so on. Studies of the human visual system suggest that saliency is related to the uniqueness, rarity, and surprise of a scene, characterized by primitive features such as color, texture, and shape [8]. Recently, various efforts have been made to compute the salient object of a given image.
In this paper, we introduce the depth information of an image to assist salient object detection. Depth information is derived from the depth image, also known as a distance image, which encodes the distance between the viewer and the objects in the scene: the intensity of each pixel in the depth image corresponds to its depth. The larger the gray value, the farther the object, as shown in Figure 1. The proposed approach consists of three main steps. First, we create the salient object ground truth for an image dataset containing 600 RGB-D images introduced in [27]. Then, image segmentation is applied to decompose the image into multiple segments. Finally, an adaptive fusion strategy incorporates the depth map with 2D saliency maps computed by state-of-the-art saliency detection algorithms. The novelty lies in that, unlike existing algorithms that compute saliency maps only from two-dimensional visual features, our method combines the depth information with the saliency maps, which improves object edge detection performance and generates a more precise salient object.
(a) original image (b) depth image
Figure 1. Example of a depth image: the intensity of each pixel in image (b) corresponds to the depth information. The greater the gray value, the farther the object.
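The adaptive fusion step described above could be sketched, for instance, as a simple pixel-wise convex combination in which pixels nearer to the viewer receive more weight; the function name, the linear weighting, and the parameter `alpha` are illustrative assumptions here, not the paper's actual formulation:

```python
import numpy as np

def fuse_saliency_with_depth(saliency_map, depth_map, alpha=0.5):
    """Hypothetical fusion of a 2D saliency map with a depth map.

    Larger gray values in the depth map mean farther objects, so nearer
    pixels (smaller depth values) are assumed more likely to be salient.
    """
    eps = 1e-8
    # Normalize both maps to [0, 1].
    s = (saliency_map - saliency_map.min()) / (saliency_map.max() - saliency_map.min() + eps)
    d = (depth_map - depth_map.min()) / (depth_map.max() - depth_map.min() + eps)
    # Invert depth so that near objects get high weight.
    nearness = 1.0 - d
    # Blend the raw saliency with its depth-weighted version.
    fused = alpha * s + (1.0 - alpha) * s * nearness
    # Renormalize the fused map to [0, 1].
    return (fused - fused.min()) / (fused.max() - fused.min() + eps)
```

In this sketch, `alpha` trades off the original 2D saliency map against its depth-weighted version; the paper's actual fusion is adaptive rather than a fixed linear blend.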
The paper is organized as follows. Related work on salient object detection is reviewed in Section 2. We present the proposed algorithm in Section 3. Experimental results are shown in Section 4, and a conclusion is given in Section 5.
2. RELATED WORKS
We briefly introduce related work on image salient object detection in this section. Recently, many efforts have been made to propose various computational models for calculating salient objects or regions. According to whether prior knowledge is required, existing saliency detection algorithms can be
Permission to make digital or hard copies of all or part of this work for
personal or classroom use is granted without fee provided that copies are
not made or distributed for profit or commercial advantage and that
copies bear this notice and the full citation on the first page. To copy
otherwise, or republish, to post on servers or to redistribute to lists,
requires prior specific permission and/or a fee.
ICIMCS’14, July 10–12, 2014, Xiamen, Fujian, China.
Copyright 2014 ACM 978-1-4503-2810-4/14/07 …$15.00.