Quaternion-Type Moments Fusing Color and Depth Information for RGB-D Object Recognition


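This paper studies a new kind of quaternion-type moments (QTMs) that combine color and depth information, applied to object recognition in RGB-D images, that is, images carrying both a color image and depth data. The existing quaternion representation (QR) is normally used for color images, but representing data that has only three color channels with a four-dimensional quaternion introduces redundancy. To address this, the researchers propose an improved QR that combines the color information of the RGB image with depth information.

The improved representation has clear advantages. By fusing color and depth data, the new method resists lighting and color variations better, which makes the feature description more robust. In object recognition, lighting conditions and color changes often degrade the performance of traditional methods, whereas depth information supplies additional shape and spatial cues that help identify objects more accurately.

In practice, the authors may have adopted a specific fusion strategy, such as using depth to adjust or enhance the color information, or building a new quaternion structure in which one part represents color and another part represents depth. The goal is a richer feature vector that captures both the geometric and the texture properties of an object. The paper may introduce a new way of computing with quaternions, for example how the RGB components are mapped into quaternion space and how they are extended with depth information. It may also include experimental results showing the performance of the new QTMs on RGB-D object recognition, such as classification accuracy on standard datasets, recognition speed, and adaptability to different lighting and color variations.

To validate the new method, the researchers may have designed rigorous experiments, including comparisons with other color-based and/or depth-based feature methods. The authors report that their method keeps recognition accuracy high while reducing redundancy and improving computational efficiency. Overall, the paper examines how an improved quaternion representation can integrate the color and depth information of RGB-D images to make object recognition more accurate and robust, offering theoretical support and practical techniques for computer vision, and for RGB-D applications in particular.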

Quaternion-type moments combining both color and depth information for RGB-D object recognition

Beijing Chen (1,2), Jianhao Yang, Mengru Ding, Tianliang Liu, Xinpeng Zhang

Jiangsu Engineering Center of Network Monitoring, Nanjing University of Information Science & Technology, Nanjing, China

School of Computer & Software, Nanjing University of Information Science & Technology, Nanjing, China

College of Telecommunications & Information Engineering, Nanjing University of Posts & Telecommunications, Nanjing, China

School of Communication & Information Engineering, Shanghai University, Shanghai, China

nbutimage@126.com, {651419675, 906287196}@qq.com, liutl@njupt.edu.cn, xzhang@shu.edu.cn

Abstract—The existing quaternion-type moments (QTMs) are based on the quaternion representation (QR) of color images. However, this representation creates redundancy, because it uses four-dimensional quaternions to represent color images with only three components. In this paper, the QR is improved for RGB-D images by combining color information with depth information, the latter being invariant to lighting and color variations. The improved QR fully utilizes the four-dimensional quaternion domain. The new QTMs (NQTMs) are defined using the improved QR and are combined with the quaternion back-propagation neural network (QBPNN) for RGB-D object recognition. The experimental results demonstrate that the NQTMs outperform our previous QTMs, which consider only color information.

Keywords—RGB-D object recognition; quaternion moment; color image; depth information

I. INTRODUCTION

Quaternion numbers are generalizations of complex numbers. Over the past two decades, they have been successfully applied to color images by encoding the three color components into the imaginary parts of quaternion numbers [1-7]. The main advantage of quaternion-based color image processing is that a color image can be treated holistically as a vector field [1, 4-7].
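The encoding described above can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' code: the function name `rgb_to_pure_quaternions`, the `(real, i, j, k)` tuple layout, and the assignment of R, G, B to the units i, j, k are all assumptions made here for the example.

```python
from typing import List, Tuple

# Assumed layout for this sketch: (real, i, j, k)
Quaternion = Tuple[float, float, float, float]
RGBImage = List[List[Tuple[int, int, int]]]


def rgb_to_pure_quaternions(image: RGBImage) -> List[List[Quaternion]]:
    """Encode each (R, G, B) pixel as the pure quaternion 0 + R*i + G*j + B*k."""
    return [
        [(0.0, float(r), float(g), float(b)) for (r, g, b) in row]
        for row in image
    ]


# Tiny 2x2 color image used only for illustration
image = [
    [(255, 0, 0), (0, 255, 0)],
    [(0, 0, 255), (128, 128, 128)],
]
q_image = rgb_to_pure_quaternions(image)

# Collect the real part of every pixel quaternion
real_parts = [q[0] for row in q_image for q in row]
print(q_image[0][0])                       # (0.0, 255.0, 0.0, 0.0)
print(all(a == 0.0 for a in real_parts))   # True
```

In this layout every pixel carries a real part of zero, so one of the four quaternion dimensions holds no image information.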

However, because most color images have three components, the extra fourth dimension of the quaternion is redundant when representing such images, and the associated computational cost is high. To circumvent these disadvantages, Assefa et al. [8] introduced a new representation scheme in three-dimensional space using trinions, each of which has one real and two imaginary components, and then defined the trinion Fourier transform based on this representation. However, few color image processing works use this representation, while a growing number of published works still use the quaternion representation (QR). The main reasons are: (a) the theory of trinions remains to be perfected, whereas the theory of quaternions, which is the theoretical basis of quaternion-based color image processing, is well established; (b) the QR has been successfully used in many fields of color image processing [1-7]. This paper therefore also adopts the QR, while resolving the redundancy problem.

Recently, owing to the popularity of the Kinect device, it has become easy to acquire RGB-D images carrying both color and depth information. It is well known that depth information has several additional advantages: it is invariant to lighting and color variations, it allows better separation from the background, and it provides pure geometry and shape cues [9]. Combining color and depth information can therefore dramatically improve performance on many vision problems, e.g., object recognition, detection, tracking, and human activity analysis.

Moments are scalar quantities used to characterize a function and to capture its significant features [5,6]. They have been extensively considered for pattern recognition, scene matching, image registration, watermarking, and so on, owing to their image description and invariance properties. Our previous work [5] proposed the quaternion-type moments (QTMs) using the QR and achieved good performance. However, that work also suffers from the redundancy problem. In this paper, we therefore define the new QTMs (NQTMs) for RGB-D images using an improved QR that considers the important depth information as well as the color information.
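This excerpt does not give the formula for the improved QR. One natural reading of "fully utilizes the four-dimensional quaternion domain" is that the depth value fills the otherwise unused real part; the Python sketch below illustrates that reading only. The function name `rgbd_to_quaternions`, the placement of depth in the real part, and the absence of any depth scaling are assumptions, not details confirmed by the paper.

```python
from typing import List, Tuple

# Assumed layout for this sketch: (real, i, j, k)
Quaternion = Tuple[float, float, float, float]


def rgbd_to_quaternions(
    rgb: List[List[Tuple[int, int, int]]],
    depth: List[List[float]],
) -> List[List[Quaternion]]:
    """Hypothetical layout: depth in the real part, (R, G, B) in the imaginary parts."""
    # The color image and the depth map must be pixel-aligned
    if len(rgb) != len(depth) or any(len(r) != len(d) for r, d in zip(rgb, depth)):
        raise ValueError("rgb and depth must have the same height and width")
    return [
        [(float(d), float(r), float(g), float(b))
         for (r, g, b), d in zip(rgb_row, depth_row)]
        for rgb_row, depth_row in zip(rgb, depth)
    ]


# 1x2 color image with a matching 1x2 depth map (illustrative values)
rgb = [[(10, 20, 30), (40, 50, 60)]]
depth = [[1.5, 2.0]]
q_rgbd = rgbd_to_quaternions(rgb, depth)
print(q_rgbd[0][0])  # (1.5, 10.0, 20.0, 30.0)
```

Under this assumed layout all four quaternion components carry image information. How depth and color should be scaled relative to each other is a separate design question that the sketch leaves open.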

II. PRELIMINARIES

A. Quaternion Number and Quaternion Representation of Color Images

Quaternions are generalizations of complex numbers. A quaternion has one real part and three imaginary parts given by

q = a + bi + cj + dk, (1)

where a, b, c, d ∈ R, and i, j, k are three imaginary units obeying the following rules

i^2 = j^2 = k^2 = -1, ij = -ji = k, jk = -kj = i, ki = -ik = j. (2)

If the real part a = 0, q is called a pure quaternion.
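The rules in (2) fully determine the product of two quaternions (the Hamilton product). The Python sketch below expands that product component by component and checks it against the rules in (2); the names `qmul` and `is_pure` and the `(a, b, c, d)` tuple layout are choices made for this example.

```python
from typing import Tuple

# q = a + bi + cj + dk stored as (a, b, c, d)
Quaternion = Tuple[float, float, float, float]


def qmul(p: Quaternion, q: Quaternion) -> Quaternion:
    """Hamilton product p*q, obtained by expanding with the rules in (2)."""
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return (
        a1 * a2 - b1 * b2 - c1 * c2 - d1 * d2,  # real part
        a1 * b2 + b1 * a2 + c1 * d2 - d1 * c2,  # i part
        a1 * c2 - b1 * d2 + c1 * a2 + d1 * b2,  # j part
        a1 * d2 + b1 * c2 - c1 * b2 + d1 * a2,  # k part
    )


def is_pure(q: Quaternion) -> bool:
    """A quaternion is pure when its real part is zero."""
    return q[0] == 0


# The three imaginary units as pure quaternions
I = (0, 1, 0, 0)
J = (0, 0, 1, 0)
K = (0, 0, 0, 1)

print(qmul(I, I))  # (-1, 0, 0, 0), i.e. i^2 = -1
print(qmul(I, J))  # (0, 0, 0, 1),  i.e. ij = k
print(qmul(J, I))  # (0, 0, 0, -1), i.e. ji = -k
print(is_pure(I))  # True
```

The sign flip between `qmul(I, J)` and `qmul(J, I)` shows that quaternion multiplication is not commutative, so the order of the factors matters.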

The conjugate of a quaternion is defined as

q^* = a - bi - cj - dk. (3)
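As a quick check of definition (3), the Python sketch below computes a conjugate and multiplies it with the original quaternion. The identity q q^* = a^2 + b^2 + c^2 + d^2 used in the check is a standard property of quaternions and is not stated in this excerpt; the names `qconj` and `qmul` are choices made for this example.

```python
from typing import Tuple

# q = a + bi + cj + dk stored as (a, b, c, d)
Quaternion = Tuple[float, float, float, float]


def qconj(q: Quaternion) -> Quaternion:
    """Conjugate per (3): keep the real part, negate the three imaginary parts."""
    a, b, c, d = q
    return (a, -b, -c, -d)


def qmul(p: Quaternion, q: Quaternion) -> Quaternion:
    """Hamilton product, following the multiplication rules in (2)."""
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return (
        a1 * a2 - b1 * b2 - c1 * c2 - d1 * d2,
        a1 * b2 + b1 * a2 + c1 * d2 - d1 * c2,
        a1 * c2 - b1 * d2 + c1 * a2 + d1 * b2,
        a1 * d2 + b1 * c2 - c1 * b2 + d1 * a2,
    )


q = (1, 2, 3, 4)
q_star = qconj(q)
print(q_star)           # (1, -2, -3, -4)
print(qmul(q, q_star))  # (30, 0, 0, 0), since 1 + 4 + 9 + 16 = 30
print(qconj(q_star))    # (1, 2, 3, 4), conjugating twice returns q
```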

This work was supported by the Natural Science Foundation of China under Grants 61572258, 61232016, and 61572257, the Natural Science Foundation of Jiangsu Province of China under Grants BK20151530 and BK20150925, the PAPD fund, and the CICAEET fund.

2016 23rd International Conference on Pattern Recognition (ICPR)

Cancún Center, Cancún, México, December 4-8, 2016
