ZHU et al.: BINARY AND MULTI-CLASS LEARNING BASED LOW COMPLEXITY OPTIMIZATION FOR HEVC ENCODING 549
Fig. 2. Example of the CUs and PUs with the minimum RD cost. (a) CUs and PUs with the minimum RD cost in “BasketballPass” sequence. (b) PUs with
the minimum RD cost. (c) CUs with the minimum RD cost. (d) Pruned quad-tree structure of (c).
TABLE I
U
PPER BOUND OF POTENTIAL TIME SAVI NG (TS) RATIO UNDER
DIFFERENT QUANTIZATION PARAMETERS (QPS)[UNIT:%](TRUE
CUS/PUS CHECKING VS ALL CANDIDATE CUS/PUS CHECKING)
complexity. The proposed learning based fast HEVC encod-
ing algorithm is presented in Section III. Section IV provides
the optimal parameters determination for flexible complexity
allocation. Experimental results and analysis are discussed in
Section V. Finally, conclusions are drawn in Section VI.
II. M
OTIVATION AND STATISTICAL ANALYSIS
In HEVC, the recursive CU decision and multiple PUs
selection are adopted, as shown in Fig. 1. The flexible
CUs can be recursively selected from 64×64 to 8×8, i.e.,
noted as Depth 0 to Depth 3. Fig. 1(a) shows the pro-
cess of recursive splitting, namely, the current CU can be
split into four sub-CUs, and then every sub-CU can also
be further split until reaches the smallest size of 8×8.
This recursive CU decision can be represented as a quad-
tree, as shown in Fig. 1(b). After checking all CU can-
didates, i.e., each node of the quad-tree, the CU or CU
combinations will be selected with the minimum RD cost.
Additionally, PU selection is implemented for each CU, in
which the PU with the minimum RD cost will be selected
from 11 mode candidates, i.e., SKIP/Merge, Inter_2N×2N,
Inter_N×2N, Inter_N×N, Asymmetric Motion Partitions
(AMP, including Inter_2N×nU, Inter_2N×nD, Inter_nL×2N,
and Inter_nR×2N), Intra_2N×2N and Intra_N×N, as shown
in Fig. 1(c). In the recursive CU decision and PU selection
process, HEVC checks dozens of mode candidates one by one
then selects the best one with the minimum RD cost, which
is extremely time-consuming [2].
Fig. 2(a) shows an example of the CUs and PUs with the
minimum RD cost in BasketballPass (416 × 240) sequence.
One CU of 64×64 is marked as red boundary, and its PU
and CU partitions with the minimum RD cost are shown in
Fig. 2(b) and Fig. 2(c). The pruned quad-tree structure of
Fig. 2(c) is also shown in Fig. 2(d). Compared with the full
candidates in Fig. 1, only several CUs and PUs are selected
ultimately. Therefore, we consider that if these CUs and PUs
with the minimum RD cost can be precisely predicted without
full RD cost calculation and comparison, the computational
complexity would be significantly reduced. Then, we conduct
statistical experiments to explore the complexity redundancy
in HEVC. The experimental procedures are as follows. Firstly,
the ground truths (the CUs and PUs with the minimum RD
cost) are recorded when encoding the video with the HEVC
test model. Secondly, the sequences are encoded again with
ground truths. Thirdly, the encoding complexity is compared
with that of the HEVC test model. Table I shows the upper
bound of potential Time Saving (TS). CU level indicates that
CUs are predicted directly instead of RD cost calculation and
comparison. PU level represents that the PUs will be predicted
directly without checking all the candidates. CU + PU means
that CU and PU levels are both activated. In Table I, it can be
found that if these CUs and PUs are precisely predicted with-
out RD cost calculation, the average upper bound of TS in each
level can reach 66.4%, 44.8% and 82.3%, respectively. There
is a great potential of complexity reduction by predicting the
CUs/PUs directly.
Thus, these CU decision and PU selection in video cod-
ing can be regarded as classification problems: (1) In the CU
level, the recursive CU decision is modeled as a three-level
hierarchical binary classification issue, i.e.,
whether the current
CU being further split or not will be determined by a binary
classifier directly. (2) In the PU level, selecting the best PU
from multiple candidates is modeled as a multi-class classifi-
cation issue, i.e., the best one can be predicted by a multi-class
classifier directly without full candidates checking.
III. P
ROPOSED LEARNING BASED FAST
HEVC ENCODING ALGORITHM
In this section, we propose a fast CU decision and PU selec-
tion algorithm. Firstly, the decision structure for CU and PU