IEEE JOURNAL OF ROBOTICS AND AUTOMATION, VOL. 4, NO. 5, OCTOBER 1988

Structured Light Patterns for Robot Mobility

JACQUELINE J. LE MOIGNE AND ALLAN M. WAXMAN, MEMBER, IEEE
Abstract-In order to assess the feasibility of using a structured-light range sensor for mobile outdoor and indoor robots, we discuss a number of operational considerations and image processing tools relevant to this task domain. In particular, we address the issues of operating in ambient lighting, smoothing of range texture, grid pattern selection, albedo normalization, grid extraction, and coarse registration of image to projected grid. Once a range map of the immediate environment is obtained, short-range path planning can be attempted.

Manuscript received February 2, 1987; revised February 22, 1988. Part of the material in this paper was presented at the Seventh International Conference on Pattern Recognition, Montreal, Que., Canada, July 30-August 2, 1984. This work was supported by the Defense Advanced Research Projects Agency and the U.S. Army Night Vision Laboratory under Contract DAAK70-83-K-0018 (DARPA Order 3206).

J. J. Le Moigne was with the Computer Vision Laboratory, Center for Automation Research, University of Maryland, College Park, MD 20742. She is now with Martin Marietta Laboratories, Baltimore, MD 21227.

A. M. Waxman was with the Computer Vision Laboratory, Center for Automation Research, University of Maryland, College Park, MD 20742. He is now with the Laboratory for Sensory Robotics, Department of Electrical, Computer and Systems Engineering, Boston University, Boston, MA 02215.

IEEE Log Number 8822827.
I. INTRODUCTION
A GOAL OF computer vision is to endow machines with a sensory capability so that they may perform their assigned tasks with some degree of autonomy. Among the many visual skills considered desirable, one of the most useful is the ability to determine the ranges of objects in a scene. Range information can be exploited in a number of different applications such as object recognition, object acquisition by a manipulator, and robot mobility. As these tasks are rather different, one should expect that the type of range data required will also differ with regard to sampling frequencies (in space and time) and resolution of range texture. Moreover, the different task environments demand different tools in order to acquire the data. A variety of such ranging techniques have been described in a recent review article [1]. Most methods are targeted for implementation on industrial robot arms; for example, the one in operation at the National Bureau of Standards [2].
Our work deals with the development of an inexpensive ranging sensor which could be used in the “short-range navigation” task of a mobile robot. The purpose of the sensor is to enable the robot to construct a topographic map of its immediate environment to be used for planning a path over the terrain while avoiding obstacles. The desire for an “inexpensive” system requires a minimal dependency on sophisticated hardware; thus our approach is “software-intensive.” One method often invoked to accomplish this task is stereo vision, which is a passive, bi-static triangulation mechanism.
By comparing features in two images of the same scene taken by two cameras separated by a known baseline, one may determine the ranges to these features from their measured disparities in a simple fashion. The major difficulty encountered, however, is the so-called “correspondence problem”: identifying features in the “left image” which correspond to those in the “right image.” Sorting this out computationally can be extremely time-consuming, rendering it unfit for the mobility task.
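To make the contrast concrete: once a correspondence has been found, the triangulation step itself is trivial. The following minimal sketch (in Python, with illustrative variable names of our choosing, not from the paper) shows the standard range recovery Z = fB/d for a rectified camera pair, where f is the focal length in pixels, B the baseline, and d the measured disparity; it is the search for matched features feeding d, not this arithmetic, that makes passive stereo costly.

```python
# Minimal sketch of passive stereo triangulation (illustrative only).
# Assumes a rectified camera pair: focal length f in pixels, baseline B
# in meters, disparity d in pixels for one matched feature pair.

def range_from_disparity(f_pixels: float, baseline_m: float,
                         disparity_pixels: float) -> float:
    """Range Z = f * B / d for one matched feature pair."""
    if disparity_pixels <= 0.0:
        raise ValueError("disparity must be positive for a finite range")
    return f_pixels * baseline_m / disparity_pixels

# Example: f = 800 px, B = 0.3 m, d = 16 px  ->  Z = 15.0 m.
print(range_from_disparity(800.0, 0.3, 16.0))
```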
One can alleviate this problem to a great extent by going to an active, bi-static triangulation system based on the concept of “structured light.” Here, one of the stereo cameras is replaced by a light source which projects a known pattern of light on the scene. The remaining camera then images the illuminated scene from a different vantage point. The range information manifests itself in the apparent distortions of the projected pattern. The vision system must then extract the pattern from the scene, compare it to the known projected pattern in order to assign disparity measures, and thus recover the range information. Depending on the choice of projected pattern, one may still have to solve a correspondence problem between the projected and perceived patterns. If one projects a single spot or line of light onto the scene, then no correspondence problem arises; however, it is then necessary to scan the projected light over the scene to build up a range map [3].
Such scanning devices can make a system expensive, particularly if they are to be sufficiently rugged and reliable for the mobility task. The alternative is to project a grid of points or lines on a scene in order to cover the entire field of view of the camera. One is then faced with a much simpler correspondence problem to solve, essentially to label the grid points in the imaged pattern according to their coordinates in the projected pattern. A particularly clever method of doing this for a pattern composed of an array of points is to encode the labels by modulating the individual beams over time [4]. This, however, assumes that the images remain sufficiently registered over the required timescales. In our case, we wish to use temporal modulation to enhance the signal-to-noise ratio (as discussed in the following section).
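As a concrete illustration of how temporal modulation can suppress ambient illumination, the sketch below differences a projector-on frame against a projector-off frame so that light common to both frames cancels and (approximately) only the projected pattern survives. This is a minimal sketch of one common realization, assuming the two frames stay registered between exposures; the scheme actually used here is developed in the following section, and this function is our own illustration, not the paper's.

```python
import numpy as np

# Minimal sketch of ambient-light suppression by temporal modulation
# (illustrative only). Assumes two registered 8-bit grayscale frames,
# one taken with the projector on and one with it off.

def pattern_image(frame_on: np.ndarray, frame_off: np.ndarray) -> np.ndarray:
    """Difference a projector-on frame against a projector-off frame.

    Ambient light contributes (nearly) equally to both frames and
    cancels in the difference; the projected pattern remains.
    """
    diff = frame_on.astype(np.int32) - frame_off.astype(np.int32)
    return np.clip(diff, 0, 255).astype(np.uint8)
```

Averaging several such on/off pairs trades acquisition time for a further gain in signal-to-noise ratio, which is the sense in which we exploit temporal modulation.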
We have chosen to project a grid of horizontal and vertical lines, along with several dots, onto the scene, which is then imaged by a camera separated from the projector by a vertical baseline (cf. Fig. 1). The projection of a grid pattern has been described previously [4], [7], but we added dots to this pattern which are used as landmarks to initiate the labeling process. In the following sections we shall describe the considerations necessary for operating such a system in ambient lighting (indoors and outdoors) and selecting the geometry of the projected grid. We utilize the stereo ranging formulas derived in [7] but extend them to derive a formula relating the smoothing of range texture to the thickness of the grid lines.
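To make the role of the landmark dots concrete, the sketch below propagates projector-grid labels outward from a dot whose projected coordinates are known, assuming the grid intersections have already been extracted and linked to their neighbors along the grid lines. The data structures (a neighbors map keyed by direction) are hypothetical, chosen only for illustration; the paper's own extraction and registration steps are described in the sections that follow.

```python
from collections import deque

# Minimal sketch of label propagation from a landmark dot (illustrative
# data structures, not from the paper). Each extracted grid intersection
# is a node; neighbors[node] maps 'up'/'down'/'left'/'right' to the
# adjacent intersection reached by tracing the extracted grid lines
# (or None where the grid line leaves the image).

STEP = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}

def label_grid(neighbors, landmark_node, landmark_label):
    """Assign (row, col) projector coordinates by breadth-first search."""
    labels = {landmark_node: landmark_label}
    queue = deque([landmark_node])
    while queue:
        node = queue.popleft()
        r, c = labels[node]
        for direction, nbr in neighbors[node].items():
            if nbr is not None and nbr not in labels:
                dr, dc = STEP[direction]
                labels[nbr] = (r + dr, c + dc)
                queue.append(nbr)
    return labels
```

Because every labeled intersection labels its neighbors in turn, a single correctly identified dot can seed coordinates for the whole connected portion of the imaged grid.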
0882-4967/88/1000-0541$01.00 © 1988 IEEE