数字图像处理核心算法原理详解

Image

Processing

5星 · 超过95%的资源需积分: 9 20 浏览量更新于2024-07-21 收藏 17.65MB PDF 举报

身份认证购VIP最低享 7 折!

30元优惠券

资源详情

资源推荐

1.2 Image Analysis 3

1.2 Image Analysis

Although image analysis is not the central theme of this book, most methods

described here exhibit a certain “analytical ﬂavor” that adds to the elemen-

tary “pixel crunching” techniques described in the preceding volume [14]. This

intersection becomes evident in tasks like segmenting image regions (Ch. 2),

detecting simple curves and corners (Chs. 3–4), or comparing images (Ch. 11)

at the pixel level. All these methods work directly on the pixel data in a bottom-

up way without recourse to any domain-speciﬁc or “semantic” knowledge. In

some sense, one could describe all these methods as “dumb and blind”, which

diﬀerentiates them from the approach pursued in pattern recognition and com-

puter vision. Although these two disciplines are ﬁrmly grounded in, and rely

heavily on, image processing, their ultimate goals are much loftier.

Pattern recognition is primarily a mathematical discipline and has been

responsible for techniques such as probabilistic modeling, clustering, decision

trees, or principal component analysis (PCA), which are used to discover pat-

terns in data and signals. Methods from pattern recognition have been ap-

plied extensively to problems arising in computer vision and image analysis.

A good example of their successful application is optical character recognition

(OCR), where robust, highly accurate turnkey solutions are available for recog-

nizing scanned text. Pattern recognition methods are truly universal and have

been successfully applied not only to images but also speech and audio sig-

nals, text documents, stock trades, and for ﬁnding trends in large databases,

where it is often called “data mining”. Dimensionality reduction, statistical,

and syntactical methods play important roles in pattern recognition (see, for

example, [21,55,72]).

Computer vision tackles the problem of engineering artiﬁcial visual sys-

tems capable of somehow comprehending and interpreting our real, three-

dimensional world. Popular topics in this ﬁeld include scene understanding,

object recognition, motion interpretation (tracking), autonomous navigation,

and the robotic manipulation of objects in a scene. Since computer vision has

its roots in artiﬁcial intelligence (AI), many AI methods were originally de-

veloped to either tackle or represent a problem in computer vision (see, for

example, [19, Ch. 13]). The ﬁelds still have much in common today, espe-

cially in terms of adaptive methods and machine learning. Further literature

on computer vision includes [2,24,35, 65,69,73].

Ultimately, you will ﬁnd image processing to be both intellectually challeng-

ing and professionally rewarding, as the ﬁeld is ripe with problems that were

originally thought to be relatively simple to solve but have, to this day, refused

to give up their secrets. With the background and techniques presented in this

text, you will not only be able to develop complete image processing solutions

6 2. Regions in Binary Images

to consider each pixel in isolation, we will not be able to determine how many

objects there are overall in the image, where they are located, and which pixels

belong to which objects. Therefore our ﬁrst step is to ﬁnd each object by

grouping together all the pixels that belong to it. In the simplest case, an

object is a group of touching foreground pixels; that is, a connected binary

region.

2.1 Finding Image Regions

In the search for binary regions, the most important tasks are to ﬁnd out which

pixels belong to which regions, how many regions are in the image, and where

these regions are located. These steps usually take place as part of a process

called region labeling or region coloring. During this process, neighboring pixels

are pieced together in a stepwise manner to build regions in which all pixels

within that region are assigned a unique number (“label”) for identiﬁcation.

In the following sections, we describe two variations on this idea. In the ﬁrst

method, region marking through ﬂood ﬁlling, a region is ﬁlled in all directions

starting from a single point or “seed” within the region. In the second method,

sequential region marking, the image is traversed from top to bottom, marking

regions as they are encountered. In Sec. 2.2.2, we describe a third method that

combines two useful processes, region labeling and contour ﬁnding, in a single

algorithm.

Independent of which of the methods above we use, we must ﬁrst settle on

either the 4- or 8-connected deﬁnition of neighboring (see Vol. 1 [14, Fig. 7.5])

for determining when two pixels are “connected” to each other, since under

each deﬁnition we can end up with diﬀerent results. In the following region-

marking algorithms, we use the following convention: the original binary image

I(u, v) contains the values 0 and 1 to mark the background and foreground,

respectively; any other value is used for numbering (labeling) the regions, i. e.,

the pixel values are

I(u, v)=

⎧

⎨

⎩

0 a background pixel

1 a foreground pixel

2, 3,... aregionlabel.

2.1.1 Region Labeling with Flood Filling

The underlying algorithm for region marking by ﬂood ﬁlling is simple: search

for an unmarked foreground pixel and then ﬁll (visit and mark) all the rest of the

neighboring pixels in its region (Alg. 2.1). This operation is called a “ﬂood ﬁll”

because it is as if a ﬂood of water erupts at the start pixel and ﬂows out across

a ﬂat region. There are various methods for carrying out the ﬁll operation that

2.1 Finding Image Regions 7

Algorithm 2.1 Region marking using ﬂood ﬁlling (Part 1). The binary input image I uses

the value 0 for background pixels and 1 for foreground pixels. Unmarked foreground pixels

are searched for, and then the region to which they belong is ﬁlled. The actual FloodFill()

procedure is described in Alg. 2.2.

1: RegionLabeling(I)

I: binary image; I(u, v)=0: background, I(u, v)=1: foreground

The image I is labeled (destructively modiﬁed) and returned.

2: Let m ← 2  value of the next label to be assigned

3: for all image coordinates (u, v) do

4: if I(u, v)=1then

5: FloodFill(I,u,v,m)  use any version from Alg. 2.2

6: m ← m +1.

7: return the labeled image I.

ultimately diﬀer in how to select the coordinates of the next pixel to be visited

during the ﬁll. We present three diﬀerent ways of performing the Fl oodFill()

procedure: a recursive version, an iterative depth-ﬁrst version,andaniterative

breadth-ﬁrst version (see Alg. 2.2):

(A) Recursive Flood Filling: The recursive version (Alg. 2.2, lines 1–8)

does not make use of explicit data structures to keep track of the image

coordinates but uses the local variables that are implicitly allocated by

recursive procedure calls.

Within each region, a tree structure, rooted at

the starting point, is deﬁned by the neighborhood relation between pixels.

The recursive step corresponds to a depth-ﬁrst traversal [20] of this tree

and results in very short and elegant program code. Unfortunately, since

the maximum depth of the recursion—and thus the size of the required

stack memory—is proportional to the size of the region, stack memory is

quickly exhausted. Therefore this method is risky and really only practical

for very small images.

(B) Iterative Flood Filling (depth-ﬁrst): Every recursive algorithm can

also be reformulated as an iterative algorithm (Alg. 2.2, lines 9–20) by

implementing and managing its own stacks. In this case, the stack records

the “open” (that is, the adjacent but not yet visited) elements. As in the

recursive version (A), the corresponding tree of pixels is traversed in depth-

ﬁrst order. By making use of its own dedicated stack (which is created in

the much larger heap memory), the depth of the tree is no longer limited

In Java, and similar imperative programming languages such as C and C++, local

variables are automatically stored on the cal l stack at each procedure call and

restored from the stack when the procedure returns.

剩余336页未读，继续阅读

rcasio

粉丝: 0
资源: 9

数字图像处理核心算法原理详解

Principles of Digital Image Processing Core Algorithms

digital signal processing principles, algorithms and application

principles of robot motion: theory algorithms and implementation. mit pres

matlab图像处理英文文献,matlab图像处理中英文翻译文献

digital signal processing——principles下载

推荐几本学习椭圆滤波器的书籍，该书籍有相关代码

有没有针对初学者的阵列信号处理入门教材？

fft 重叠加法_信号分析之：FFT计算中的“重叠”处理 （Overlap Processing）的文献有哪些

rudin principles of mathematical analysis 下载

principles of neural science

levinson h j.overlay//principles of lithography,spie,2005.

语音编码相关的参考书籍

principles of lithography pdf

principles of lasers 下载

关于dsp的外国书籍

principles of mathematical analysis 3rd

the principles of quantum mechanics pdf

principles of cmos vlsi design 2nd edition

principles of planar near-field antenna measurements

最新资源

fft 重叠加法_信号分析之：FFT计算中的“重叠”处理（Overlap Processing）的文献有哪些