Holub et al. EURASIP Journal on Information Security 2014, 2014:1 Page 3 of 13
http://jis.eurasipjournals.com/content/2014/1/1
quantized coefficients in this structure with X and Y to
obtain the cover and stego images, respectively. This way,
we guarantee that both images were created using the
same JPEG compressor and that all that we will be detect-
ing are the embedding changes rather than compressor
artifacts.
3 Universal distortion function UNIWARD
In this section, we provide a general description of the
proposed universal distortion function UNIWARD and
explain how it can be used to embe d in the JPEG
and the side-informed JPEG domains. The distortion
depends on the choice of a directional filter bank and one
scalar parameter whose purpose is stabilizing the numer-
ical computations. The distortion design is finished in
Se ction 5, which investigates the effect of the filter bank
and the stabilizing constant on empirical security.
Since r ich models [18,20-22] currently used in ste-
ganalysis are capable of detecting changes along ‘clean
edges’ that can be well fitted using lo cally polynomial
models, whenever possible the embedding algorithm
should embed into textured/noisy areas that are not eas-
ily modellable in any direction. We quantify this u sing
outputs of a directional filter bank and construct the
distortion function in this manner.
3.1 Directional filter bank
By a directional filter bank, we underst and a set of three
linear shift-invariant filters represented with their ker-
nels
B ={K
(1)
, K
(2)
, K
(3)
}. They are used to evaluate the
smoothness of a given image X along the horizontal, ver-
tical, and diagonal directions by computing the so-called
directional residuals W
(k)
= K
(k)
X,where‘’isa
mirror-padded convolution so that W
(k)
has again n
1
×
n
2
elements. The mirror padding prevents introducing
embe dding artifact s at the image boundary.
While it is possible to use arbitrary filter banks, we
will exclusively use kernels built from one-dimensional
low-pass (and high-pass) wavelet decomposition filters h
(and g):
K
(1)
= h · g
T
, K
(2)
= g · h
T
, K
(3)
= g · g
T
.(2)
In this case, the filters correspond, respectively, to two-
dimensional LH, HL, and HH wavelet directional hig h-
pass filters, and the residuals coincide with the first-level
undecimated wavelet LH, HL, and HH directional decom-
position of X. We constrained ourselves to wavelet filter
banks b ecause wavelet representations are known to pro-
vide good decorrelation and energy compactification for
images of natural scenes (see, e.g., Chapter 7 in [23]).
3.2 Distortion function (non-side-informed embedding)
We are now ready to describe the universal distortion
function. We do so first for embedding that does not use
any precover. Given a pair of cover and stego images,
X and Y, represented in the spatial (pixel) domain, we
will denote with W
(k)
uv
(X) and W
(k)
uv
(Y), k = 1, 2, 3, u ∈
{1, ..., n
1
}, v ∈{1, ..., n
2
}, their corresponding uvth
wavelet coefficient in the kth subband of the first decom-
position level. The UNIWARD distortion function is the
sum of relative changes of all wavelet coefficients with
respect to the cover image:
D(X, Y)
3
k=1
n
1
u=1
n
2
v=1
|W
(k)
uv
(X) − W
(k)
uv
(Y)|
σ +|W
(k)
uv
(X)|
,(3)
where σ>0 is a constant stabilizing the numerical
calculations .
The ratio in (3) is smaller when a large cover wavelet
coefficient is changed (where texture and e dges appear).
Embe dding changes are discouraged in regions where
|W
(k)
uv
(X)| is small for at least one k, which corresponds to
a direction along which the content is modellable.
F or JPEG images, the distortion between the two arrays
of quantized DCT coefficients, X and Y,iscomputedby
first decompressing the JPEG files to the spatial domain,
and evaluating the distor tion betwe en the decompressed
images, J
−1
(X) and J
−1
(Y), in the same manner as in (3):
D(X, Y) D
J
−1
(X), J
−1
(Y)
.(4)
Note that the distortion (3) is non-additive because
changing pixel X
ij
will affect s × s wavelet coefficients,
where s × s is the size of the 2D wavelet support. Also,
changing a JPEG coefficient X
ij
will affect a block of
8 × 8 pixels and therefore a block of (8 + s − 1) × (8 +
s − 1) wavelet coefficients. It is thus apparent that when
changing neighboring pixels (or DCT coefficients), the
embedding changes ‘interact,’ hence the non-additivity of
D.
3.3 Distortion function (JPEG side-informed embedding)
By side-informed embedding in JPEG domain, we under-
stand the following general principle. Given the raw DCT
coefficient D
ij
obtained from the precover P, the embed-
der has the choice of rounding D
ij
up or down to modulate
its parity (usually the least significant bit of the rounded
value). We denote with e
ij
=|D
ij
− X
ij
|, e
ij
∈[0, 0.5], the
rounding error for the ijth co efficient when compressing
the precover P to the cover image X. Rounding ‘to the
other side’ leads to an embedding change, Y
ij
= X
ij
+
sign(D
ij
− X
ij
), which corresponds to a ‘rounding error ’
1−e
ij
. Thus, every embedding change increases the distor-
tion with respect to the precover by the difference between
both rounding errors: |D
ij
− Y
ij
|−|D
ij
− X
ij
|=1 − 2e
ij
.