MAES et al.: MULTIMODALITY IMAGE REGISTRATION BY MAXIMIZATION OF MUTUAL INFORMATION 189
(a) (b)
Fig. 1. Joint histogram of the overlapping volume of the CT and MR
brain images of dataset A in Tables II and III: (a) Initial position:
I
(
CT; MR
)=0
:
46
, (b) registered position:
I
(
CT;M R
)=0
:
89
.
Misregistration was about 20 mm and 10
(see the parameters in Table III).
If both marginal distributions and can be
considered to be independent of the registration parameters
, the MI criterion reduces to minimizing the joint entropy
[6]. If either or is independent of ,
which is the case if one of the images is always completely
contained in the other, the MI criterion reduces to minimizing
the conditional entropy
or . However, if
both images only partially overlap, which is very likely during
optimization, the volume of overlap will change when
is
varied and
and and also and will
generally depend on
. The MI criterion takes this into account
explicitly, as becomes clear in (2), which can be interpreted
as follows [27]: “maximizing MI will tend to find as much
as possible of the complexity that is in the separate datasets
(maximizing the first two terms) so that at the same time they
explain each other well (minimizing the last term).”
For
to be useful as a registration criterion and
well behaved with respect to optimization,
should
vary smoothly as a function of misregistration
. This
requires
and to change smoothly when
is varied, which will be the case if the image intensity
values are spatially correlated. This is illustrated by the graphs
in Fig. 2, showing the behavior of
as a function of
misregistration between an image and itself rotated around the
image center. The trace on the left is obtained from an original
MR image and shows a single sharp optimum with a rather
broad attraction basin. The trace on the right is obtained from
the same image after having reduced the spatial correlation of
the image intensity by repeatedly swapping pairs of randomly
selected pixels. This curve shows many local maxima and
the attraction basin of the global maximum is also much
smaller, which deteriorates the optimization robustness. Thus,
although the formulation of the MI criterion suggests that
spatial dependence of image intensity values is not taken into
account, such dependence is in fact essential for the criterion
to be well behaved around the registration solution.
III. A
LGORITHM
A. Transformation
Each of the images is associated an image coordinate frame
with its origin positioned in a corner of the image, with the
(a) (b)
Fig. 2. Spatial correlation of image intensity values increases MI registration
robustness. Top: (a) original 256
2
256 2-D MR image and (b) image of (a)
shuffled by swapping 30 000 randomly selected pixel pairs. Both images have
the same image content. Bottom: MI registration traces obtained using partial
volume distribution (PV) interpolation for in-plane rotation of each image
over itself. Local maxima are marked with “*”.
axis along the row direction, the axis along the column
direction, and the
axis along the plane direction.
One of the images is selected to be the floating image,
,
from which samples
are taken and transformed into
the reference image,
. can be the set of grid points of
or a sub- or superset thereof. Subsampling of the floating
image might be used to increase speed performance, while
supersampling aims at increasing accuracy. For each value
of the registration parameter
only those values
are retained for which falls inside the volume of
.
In this paper, we have restricted the transformation
to
rigid-body transformations only, although it is clear that the
MI criterion can be applied to more general transformations as
well. The rigid-body transformation is a superposition of a 3-
D rotation and a 3-D translation and the registration parameter
is a six-component vector consisting of three rotation
angles
(measured in degrees) and three translation
distances
(measured in millimeters). Transformation
of image coordinates
to from the image to image
is given by
(8)
with
and being 3 3 diagonal matrixes representing the
voxel sizes of images
and , respectively (in millimeters),
and the image coordinates of the centers of the images,
the 3 3 rotation matrix, with the matrixes
and representing rotations around the -, -, and
-axis, respectively, and the translation vector.