theory tells us that, if
f(t )
actually has very low band-
width, then a small number of (uniform) samples will suf-
fice for recovery. As we will see in the remainder of this
article, signal recovery can actually be made possible for a
much broader class of signal models.
INCOHERENCE AND THE SENSING OF SPARSE SIGNALS
This section presents the two fundamental premises underlying
CS: sparsity and incoherence.
SPARSITY
Many natural signals have concise representations when
expressed in a convenient basis. Consider, for example, the
image in Figure 1(a) and its wavelet transform in (b).
Although nearly all the image pixels have nonzero values, the
wavelet coefficients offer a concise summary: most coeffi-
cients are small, and the relatively few large coefficients cap-
ture most of the information.
Mathematically speaking, we have a vector
f ∈
R
n
(such as
the
n
-pixel image in Figure 1) which we expand in an orthonor-
mal basis (such as a wavelet basis)
= [ψ
1
ψ
2
···ψ
n
]
as follows:
f(t ) =
n
i=1
x
i
ψ
i
(t), (2)
where
x
is the coefficient sequence of
f
,
x
i
=f,ψ
i
. It will be
convenient to express
f
as
x
(where
is the
n × n
matrix
with
ψ
1
,... ,ψ
n
as columns). The implication of sparsity is
now clear: when a signal has a sparse expansion, one can dis-
card the small coefficients without much perceptual loss.
Formally, consider
f
S
(t)
obtained by keeping only the terms
corresponding to the
S
largest values of
(x
i
)
in the expansion
(2). By definition,
f
S
:= x
S
, where here and below,
x
S
is the
vector of coefficients
(x
i
)
with all but the largest
S
set to zero.
This vector is sparse in a strict sense since all but a few of its
entries are zero; we will call
S
-sparse
such objects with at most
S
nonzero
entries. Since
is an orthonormal
basis (or “orthobasis”), we have
f − f
S
2
=x − x
S
2
,
and if
x
is
sparse or compressible in the sense
that the sorted magnitudes of the
(x
i
)
decay quickly, then
x
is well approxi-
mated by
x
S
and, therefore, the error
f − f
S
2
is small. In plain terms,
one can “throw away” a large fraction
of the coefficients without much loss.
Figure 1(c) shows an example where
the perceptual loss is hardly noticeable
from a megapixel image to its approxi-
mation obtained by throwing away
97.5% of the coefficients.
This principle is, of course, what
underlies most modern lossy coders
such as JPEG-2000 [4] and many
others, since a simple method for data compression would be to
compute
x
from
f
and then (adaptively) encode the locations
and values of the
S
significant coefficients. Such a process
requires knowledge of all the
n
coefficients
x
, as the locations
of the significant pieces of information may not be known in
advance (they are signal dependent); in our example, they tend
to be clustered around edges in the image. More generally,
sparsity is a fundamental modeling tool which permits efficient
fundamental signal processing; e.g., accurate statistical estima-
tion and classification, efficient data compression, and so on.
This article is about a more surprising and far-reaching impli-
cation, however, which is that sparsity has significant bearings
on the acquisition process itself. Sparsity determines how effi-
ciently one can acquire signals nonadaptively.
INCOHERENT SAMPLING
Suppose we are given a pair
(, )
of orthobases of
R
n
. The first
basis
is used for sensing the object
f
as in (1) and the second is
used to represent
f
. The restriction to pairs of orthobases is not
essential and will merely simplify our treatment.
DEFINITION 1
The coherence between the sensing basis
and the representa-
tion basis
is
μ(, ) =
√
n · max
1≤k, j≤n
|ϕ
k
,ψ
j
|.(3)
In plain English, the coherence measures the largest correlation
between any two elements of
and
; see also [5]. If
and
contain correlated elements, the coherence is large. Otherwise,
it is small. As for how large and how small, it follows from linear
algebra that
μ(, ) ∈ [1,
√
n]
.
Compressive sampling is mainly concerned with low coher-
ence pairs, and we now give examples of such pairs. In our first
example,
is the canonical or spike basis
ϕ
k
(t) = δ(t − k )
and
[FIG1] (a) Original megapixel image with pixel values in the range [0,255] and (b) its
wavelet transform coefficients (arranged in random order for enhanced visibility).
Relatively few wavelet coefficients capture most of the signal energy; many such images
are highly compressible. (c) The reconstruction obtained by zeroing out all the coefficients
in the wavelet expansion but the 25,000 largest (pixel values are thresholded to the range
[0,255]). The difference with the original picture is hardly noticeable. As we describe in
“Undersampling and Sparse Signal Recovery,” this image can be perfectly recovered from
just 96,000 incoherent measurements.
visibility). Relatively few wavelet coefficients capture most of the signal energy; many