January 10, 2010 / Vol. 8, No. 1 / CHINESE OPTICS LETTERS 59
Eye location under different eye poses, scales, and
illuminations
Jinghe Yuan
Key Laboratory of Molecular Nanostructure and Nanotechnology, Institute of Chemistry,
Chinese Academy of Sciences, Beijing 100190, China
E-mail: jacobyuan@yahoo.com.cn
Received March 10, 2009
Robust non-intrusive eye location plays an important role in vision-based man-machine interaction. A
modified Hausdorff distance based measure to localize the eyes is proposed, which can tolerate various
changes in eye pose, shape, and scale. To eliminate the effects of illumination variations, an 8-
neighbour-based transformation of the gray images is proposed. The transformed image is less sensitive
to illumination changes while preserving the appearance information of eyes. All the localized eye
candidates are identified by back-propagation neural networks. Experiments demonstrate that the robust
method for eye location is able to localize eyes with different sizes, shapes, and poses under different
illuminations.
OCIS codes: 150.1135, 100.2000, 100.3008.
doi: 10.3788/COL20100801.0059.
Robust non-intrusive eye location plays an important role
in vision-based man-machine interaction, including auto-
motive applications such as driver inspection, face recog-
nition, etc. In past years, much work has addressed
this area. There are two major approaches to auto-
matic eye detection. The first, the holistic approach,
conceptually relates to template matching and attempts
to locate the eye using global representations. Character-
istic of this approach are connectionist methods such as
principal component analysis (PCA) using eigen-
representations[1]. Although location by matching raw
images has been successful under limited circumstances,
it suffers from the usual shortcomings of straightforward
correlation-based approaches, such as sensitivity to eye
orientation, size, variable lighting conditions, noise, etc.
The second approach to eye detection extracts and mea-
sures local facial features, and standard pattern recog-
nition techniques are then employed to locate the eyes
using these measurements. Yuille et al. described a com-
plex but generic strategy[2]. The characteristic of this
approach is the concept of deformable templates. Lam et
al. extended Yuille's method to extract eye features by
using corner locations inside the eye windows, which are
obtained by means of average anthropometric measures
after the head boundary is located[3]. The deformable tem-
plate is an interesting concept, but it is difficult to use
in terms of learning and implementation.
The Hausdorff distance was originally defined as a dissimi-
larity measure on data sets. It later gained wide acceptance
in image comparison. Huttenlocher et al. proposed a
partial Hausdorff distance (PHD) method for object de-
tection and recognition[4]. This method computes the distance
between the most closely matching portions of
the images being compared, which in turn reduces the
effect of occlusion in object matching. Guo et al. pro-
posed the spatially weighted Hausdorff distance (SWHD) as
an improvement to the conventional Hausdorff distance be-
tween edge images[5].
All the above-mentioned methods for feature recogni-
tion use edge images for computing the Hausdorff distance or
its variants. However, the appearance is more impor-
tant than the edge maps, and the intensity distribution of
pixels captures this appearance information. Yet
direct comparison of gray images is unsuitable because
the performance is affected by illumination vari-
ations. To overcome this shortcoming, the infrared imag-
ing technique[6], the method of a single training image per
person[7], and local binary patterns (LBPs)[8] were pro-
posed.
In this letter, by using the average instead of the maxi-
mum in the directed Hausdorff distance, a modified Haus-
dorff distance (MHD) based measure[9] for comparing
the appearance of eyes is proposed, which is able to tol-
erate changes in eye shape, pose, and size. To elimi-
nate the effects of illumination variations, we pro-
pose an 8-neighbour-based transformation of the gray
images. The transformed eye image is less sensitive to
illumination changes while preserving the appearance in-
formation of eyes. All the located eyes are identified by a
back-propagation neural network (BPNN) identifier.
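The modification named above replaces the maximum in the directed distance with the average over all points of the first set. A minimal sketch under that reading (function names are ours, not the authors'):

```python
import math

def directed_mhd(A, B):
    """Directed modified Hausdorff distance from A to B: the average,
    rather than the maximum, of the nearest-neighbour distances."""
    return sum(min(math.dist(a, b) for b in B) for a in A) / len(A)

def mhd(A, B):
    """Symmetric MHD, taking the larger of the two directed distances."""
    return max(directed_mhd(A, B), directed_mhd(B, A))
```

Averaging makes the measure depend on every point of A rather than on its single worst match, which is why it tolerates small deformations in eye shape, pose, and size better than the conventional maximum.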
Primarily, the Hausdorff distance is defined as a dis-
tance measure between two data sets, giving a measure
of dissimilarity between them. The conventional
Hausdorff distance between the two sets
A = {a_1, a_2, ..., a_m} and B = {b_1, b_2, ..., b_n} is given by
H(A, B) = max[h(A, B), h(B, A)], (1)

where

h(A, B) = max_{a∈A} min_{b∈B} ||a − b|| (2)

and

h(B, A) = max_{b∈B} min_{a∈A} ||b − a|| (3)

are the directed Hausdorff distances from A to B and
from B to A, respectively, and ||·|| is the norm of a vector.
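Equations (1)–(3) translate directly into code. The following is a minimal illustration using Euclidean point sets, not the authors' implementation:

```python
import math

def directed_hausdorff(A, B):
    """h(A, B) of Eqs. (2)/(3): the largest distance from any point of A
    to its nearest point in B."""
    return max(min(math.dist(a, b) for b in B) for a in A)

def hausdorff(A, B):
    """H(A, B) of Eq. (1): the larger of the two directed distances."""
    return max(directed_hausdorff(A, B), directed_hausdorff(B, A))
```

Note the asymmetry of the directed distance: h(A, B) and h(B, A) generally differ, which is why Eq. (1) takes their maximum.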
H(A, B) takes the maximum of the directed distances
from A to B and from B to A. When the Hausdorff
distance is measured between two images, the data sets
1671-7694/2010/010059-04 © 2010 Chinese Optics Letters