To achieve reversible data hiding, Zhao et al. [41] proposed a novel three-dimensional
histogram shifting (3D HS) scheme that utilizes stereo H.264 video as covers media. The
proposed 3D HS algorithm is used to embed data in quantized DCT (QDCT) coefficients of
MVC video. Three coefficients chosen from each embeddable block are used for hiding two
bits of information, where just one coefficient may be changed by adding 1 (at most) in most
cases. Compared with the conventional 3D HS algorithm, the proposed scheme could achieve
superior payload-distortion performance. As information is embedded on the inter-MBs that
are based on inter and inter-view predictions, the algorithm could be applied to P and B frames
in the MVC sequence. For the selected inter-MBs, the intra-frame distortion drift caused by
data hiding can be avoided. However, the inter-MBs of P/B frames other than B4 frames will
still lead to inter-frame distortion drift.
To address the drawbacks of the above algorithms, we developed a novel fragile
watermarking algorithm for H.264/AVC multiview coding in the compressed domain. In the
proposed algorithm, the hierarchical B picture (HBP) structure of MVC videos is explored and
the algorithm is lossless. To ensure the integrity of the video content, the watermark is
extracted from the features of the I/P-frame and embedded in the B-frame, which is the
primary frame type in the HBP coding structure. The algorithm is characterized by low
complexity, high payload, and low video bitstream overhead. An experimental implementation
confirmed its compatibility with the H.264/AVC video compression standard, and applicability
to both motion and still video contents with acceptable embedding capacities.
The remainder of this paper is organized as follows. Section 2 reviews the inter-prediction
modes of H.264/AVC and MVC with the HBP coding structure, and describes the proposed
watermarking scheme. Section 3 describes the experiments that were conducted to demon-
strate the performance of the proposed scheme, and Section 4 concludes the paper.
2 Motivation
The high compression efficiency of the H.264/AVC video coding standard constitutes a major
obstacle to the application of video watermarking in the compressed domain. The present
study was motivated by some restrictions that were highlighted by previous works on
watermarking for the MVC video stream. Firstly, based on the specific H.264/AVC codec
architecture, many skipped macroblocks that are widely distributed in the inter-prediction
mode may not carry any watermark data when using the current watermarking technique.
Secondly, the coded video sequence of an MVC video with a hierarchical B picture structure
has a high proportion of B-frames that are exploited for temporal and inter-view predictions,
which is required for developing a novel watermarking scheme with a high embedding
capacity instead of applying a direct extension of a 2D field scheme. In this section, we briefly
introduce the inter-prediction mode of H.264/AVC and the HBP coding structure of MVC,
with a view toward laying a foundation for the introduction of the proposed watermarking
scheme.
2.1 B_SKIP and B_DIRECT modes
As an efficient coding standard, H.264/AVC affords more flexibility for the selection of motion
compensation block sizes and shapes compared to previous standards, with the minimum size
of the luma motion compensation blocks being as small as 4 × 4 [33]. Whereas a small block is
Multimedia Tools and Applications