没有合适的资源?快使用搜索试试~ 我知道了~
首页【2017年计算机视觉新作 】Foundations of Computer Vision
Book name: Foundations of Computer Vision. Computational Geometry, Visual Image Structures and Object Shape Detection Author: Peters, James F. Year: 2017 ISBN 978-3-319-52483-2 Discusses computer vision with a focus on extracting useful information from images and on the detection of the basic content of digital images
资源详情
资源评论
资源推荐


James F. Peters
Foundations of Computer
Vision
Computational Geometry, Visual Image
Structures and Object Shape Detection
123

James F. Peters
Electrical and Computer Engineering
University of Manitoba
Winnipeg, MB
Canada
ISSN 1868-4394 ISSN 1868-4408 (electronic)
Intelligent Systems Reference Library
ISBN 978-3-319-52481-8 ISBN 978-3-319-52483-2 (eBook)
DOI 10.1007/978-3-319-52483-2
Library of Congress Control Number: 2016963747
© Springer International Publishing AG 2017
This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part
of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations,
recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission
or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar
methodology now known or hereafter developed.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this
publication does not imply, even in the absence of a specific statement, that such names are exempt from
the relevant protective laws and regulations and therefore free for general use.
The publisher, the authors and the editors are safe to assume that the advice and information in this
book are believed to be true and accurate at the date of publication. Neither the publisher nor the
authors or the editors give a warranty, express or implied, with respect to the material contained herein or
for any errors or omissions that may have been made. The publisher remains neutral with regard to
jurisdictional claims in published maps and institutional affiliations.
Printed on acid-free paper
This Springer imprint is published by Springer Nature
The registered company is Springer International Publishing AG
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland

Preface
This book introduces the foundations of computer vision. The principal aim of
computer vision (also, called machine vision) is to reconstruct and interpret natural
scenes based on the content of images captured by various cameras (see, e.g.,
R. Szeliski [191]). Comput er vision systems include such things as survey satellites,
robotic navigation systems, smart scanners, and remote sensing systems. In this
study of computer vision, the focus is on extracting useful information from images
(see, e.g., S. Prince [162]). Computer vision systems typically emulate human
visual perception. The hardware of choice in computer vision systems is some form
of digital camera, programmed to approximate visual perception. Hence, there are
close ties between computer vision, digital image processing, optics, photometry
and photonics (see, e.g., E. Stijns and H. Thienpont [188]).
From a computer vision perspective, photonics is the science of light in the
capture of visual scenes. Image processing is the study of digital image formation
(e.g., conversion of analog ue optical sensor signals to digital signals), manipulation
(e.g., image filtering, denoising, cropping), feature extraction (e.g., pixel intensity,
gradient orientation, gradient magnitude, edge strength), description (e.g., image
edges and texture) and visualization (e.g., pixel intensity histograms). See, e.g., the
mathematical frameworks for image processing by B. Jähne [87] and S.G. Hoggar
[82], extending to a number of practitioner views of image processing provided,
for example, by M. Sonka and V. Hlavac and R. Boyle [186], W. Burger and
M.J. Burge [21], R.C. Gonzalez and R.E. Woods [58], R.C. Gonzalez and R.E.
Woods and S.L. Eddins [59], V. Hlavac [81], and C. Solomon and T. Breckon
[184]. This useful information provides the bedrock for the focal points of computer
visionists, namely, image object shapes and patterns that can be detected, analyzed
and classified (see, e.g., [142]). In effect, computer vision is the study of digital
image structures and patterns, which is a layer of image analysis above that of
image processing and photonics. Computer vision includes image processing and
photonics in its bag of tricks in its pursuit of image geometry and image region
patterns.
In addition, it is helpful to cultivate an intelligent systems view of digital images
with an eye to discovering hidden patterns such as repet itions of convex enclosures
vii

of image regions and embedded image structures such as clusters of points in image
regions of interest. The discovery of such structures is made possible by quantizers.
A quantizer restricts a set of values (usually continuous) to a discrete value. In its
simplest form in computer vision, a quantizer observes a particula r target pixel
intensity and selects the nearest approximating values in the neighbourhood of the
target. The output of a quantizer is called a codebook by A. Gersho and R.M. Gray
[55, §5.1, p. 133] (see, also, S. Ramakrishnan, K. Rose and A. Gersho [164]).
In the context of image mesh overlays, the Gersho–Gray quantizer is replaced by
geometry-based quantizers. A geometry-based quantizer restricts an image region
to its shape contour and observes in an image a particular target object shape
contour, which is compared with other shape contours that have approximately the
same shape as the target. In the foundations of computer vision, geometry-based
quantizers observe and compare image regions with approximately the same
regions such as mesh maximal nucleus clusters (MNCs) compared with other
nucleus clusters. A maximal nucleus cluster (MNCs) is a collection of image mesh
polygons surrounding a mesh polygon called the nucleus (see, e.g., J.F. Peters and
E. İnan on Edelsbrunner nerves in Voronoï tessellations of images [150]). An
image mesh nucleus is a mesh polygon that is the centre of a collection of adjacent
polygons. In effect, every mesh polygon is a nucleus of a cluster of polygons.
However, only one or more mesh nuclei are maximal.
A maximal image mesh nucleus is a mesh nucleus with the highest number of
adjacent polygons. MNCs are important in computer vision, since what we will call
a MNC contour approximates the shape of an underlying image object. A Voronoï
tessellation of an image is a tiling of the image with polygons. A Voronoï tessel-
lation of an image is also called a Voronoï mesh. A sample tiling of a musician
image in Fig. 0.1.1 is shown in Fig. 0.1.2. A sample nucleus of the musician image
tiling is shown in Fig. 0.2.1. The red
dots inside each of the tiling polygons are
examples of Voronoï region (polygon) generating points. For more about this, see
Sect. 1.22.1. This musician mesh nucleus is the centre of a maximal nucleus cluster
shown in Fig. 0.2 .2. This is the only MNC in the musician image mesh in Fig. 0.1.2.
This MNC is also an example of a Voronoï mesh nerve. The study of image MNCs
takes us to the threshold of image geometry and image object shape detection. For
more about this, see Sect. 1.22.2.
Each image tiling polygon is a convex hull of the interior and vertex pixels.
A convex hull of a set of image points is the smallest convex set of the set of points.
A set of image points A is a convex set, provided all of the points on every straight
line segment between any two points in the set A is contained in the set. In other
words, knowledge discovery is at the heart of computer vision. Both knowledge and
understanding of digital images can be used in the desig n of computer vision
systems. In vision system designs, there is a need to understand the composition
and stru cture of digital images as well as the methods used to analyze captured
images.
The focus of this volume is on the study of raster images. The sequel to this
volume will focus on vector images, which are compo sed of points (vectors), lines
and curves. The basic content of every raster image consists of pixels
viii Preface
剩余435页未读,继续阅读













安全验证
文档复制为VIP权益,开通VIP直接复制

评论2