8 1 Geometric Approaches to Three-dimensional Scene Reconstruction
points in the images of the N cameras are denoted by the sensor coordinates
S
i
x
k
,
where i = 1,...,N and k = 1,...,K. The image coordinates inferred from the extrin-
sic camera parameters
C
i
W
T, the intrinsic camera parameters {c
j
}
i
, and the K scene
point coordinates
W
x
k
are given by Eq. (1.6). Bundle adjustment corresponds to a
minimisation of the reprojection error
E
B
=
N
∑
i=1
K
∑
k=1
S
i
I
i
T
−1
P
C
i
W
T,{c
j
}
i
,
W
x
k
−
S
i
x
k
2
. (1.11)
The transformation by
S
i
I
i
T
−1
in Eq. (1.11) ensures that the backprojection error is
measured in Cartesian image coordinates. It can be omitted if a film is used for
image acquisition, on which Euclidean distances are measured in a Cartesian coor-
dinate system, or as long as the pixel raster of the digital camera sensor is orthogonal
(
θ
= 90
◦
) and the pixels are quadratic (
α
u
=
α
v
). This special case corresponds to
S
i
I
i
T in Eq. (1.5) describing a similarity transform.
The bundle adjustment approach can be used for calibration of the intrinsic and
extrinsic camera parameters, reconstruction of the three-dimensional scene struc-
ture, or estimation of object pose. Depending on the scenario, some or all of the
parameters
C
i
W
T, {c
j
}
i
, and
W
x
k
may be unknown and are obtained by a minimisa-
tion of the reprojection error E
B
with respect to the unknown parameters. As long as
the scene is static, utilising N simultaneously acquired images (stereo image analy-
sis, cf. Section 1.3) is equivalent to evaluating a sequence of N images acquired by
a single moving camera (structure from motion).
Minimisation of Eq. (1.11) involves nonlinear optimisation techniques such as
the Gauss-Newton or the Levenberg-Marquardt approach (Press et al., 1992). The
reprojection error of scene point
W
x
k
in image i influences the values of
C
i
W
T and
{c
j
}
i
only for images in which this scene point is also detected, leading to a sparse
set of nonlinear equations. The sparsity of the optimisation problem is exploited
in the algorithm by Lourakis and Argyros (2004). The error function defined by
Eq. (1.11) may have a large number of local minima, such that reasonable initial
guesses for the parameters to be estimated have to be provided. As long as no a-
priori knowledge about the camera positions is available, a general property of the
bundle adjustment method is that it only recovers the scene structure up to an un-
known constant scale factor, since an increase of the mutual distances between the
scene points by a constant factor can be compensated by accordingly increasing the
mutual distances between the cameras and their distances to the scene. However,
this scale factor can be obtained if additional information about the scene, such as
the distance between two scene points, is known.
Difficulties may occur in the presence of false correspondences or gross errors
of the determined point positions in the images, corresponding to strong deviations
of the distribution of reprojection errors from the assumed Gaussian distribution.
Lourakis and Argyros (2004) point out that in realistic scenarios the assumption of
a Gaussian distribution of the measurement errors systematically underestimates the
fraction of large errors. Searching for outliers in the established correspondences can
be performed e.g. using the random sample consensus (RANSAC) method (Fischler