UNCORRECT
ED
PROOF
68
which is proposed first by Crammer and Singer [
16]. In
69
their paper three learning problems have been arisen. They
70
focus on the second problem which is given a set of binary
71
classifier, find a good matrix. They prove that this problem
72
is NP-complete which underscores the difficulty of coding
73
matrix design. The difficulty is due to that there is no
74
function mapping a coding matrix to the empirical loss. In
75
other word, it is difficult to know what kind of coding
76
matrix can lead to have small empirical loss. For this
77
purpose, Masulli and Valentini [
17] experimentally analyze
78
some of the main factors affecting the effectiveness of
79
ECOC methods and the analysis shows that all these fac-
80
tors concur to the effectiveness of ECOC methods in a not
81
straightforward way, very likely dependent on the distri-
82
bution and complexity of the data. Garcia-Pedrajas and
83
Fyfe [
18] propose an evolutionary approach to the design
84
of output codes with a fitness function, which is made up of
85
five terms that are relevan t for a coding matrix to achieve a
86
good performance. Their paper obtains a better perfor-
87
mance, but more detailed works are also needed to
88
understand how these terms affect the performance of a
89
coding matrix clearly.
90
On the othe r hand, since the factors affecting the
91
effectiveness of ECOC methods are diverse and interact
92
between them, simpl y, we can focus on only one aspect to
93
check the impact of the performance of a coding matrix.
94
Ali-Bagheri et al. [
14, 19] propose an efficient way to
95
improve independency among binary classifiers. Angel-
96
Bautista et al. [
15] utilize a novel Genetic strategy to find
97
a set of dichotomizers with better performance, then,
98
ensemble them as an optimal coding matrix. Moreover,
99
Valentini [
20] proposed that the empirical loss depends on
100
the complexity of the dichotomies induced by the selected
101
decomposition method and on the accuracy of the
102
dichotomizers. Kong and Dietterich [
21] present that the
103
dichotomizers must learn more separating surfaces
104
between classes. From this point, Pujol et al. [
12] present a
105
heuristic method (discriminant ECOC, DECOC) to obtain
106
maximum class discrimination in the partitions. With
107
maximum class discrimination in each partition, DECOC
108
can reduce the complexity of the dichotomies efficiently
109
and have more separating surfaces, which will lead to a
110
smaller empirical loss. Furthermore, with the aid of a
111
binary tree, DECOC can also have compact codewords.
112
So, we can state that DECOC is a promising method for
113
ECOC design.
114
However, finding the binary partitions with maximum
115
class discrimination in DECOC is still a difficul t problem,
116
which affects the application of DECOC in practice. The
117
reason is that we need an exhaustive search among all
118
possible partitions to find the optimal binary partitions,
119
which will spend an impractical computation cost. It is
120
worth noting that in Escalera’s work [
28] they give a
121
simplified version for DECOC, which has a random per-
122
mutation among two given partitions and cannot guarantee
123
to obtain the optimal binary partitions. This has proven that
124
it is difficult to apply DECOC with the SFFS algorithm and
125
the fast quadratic mutual information in practice. This also
126
proves that it is necessary to find an alternative method to
127
improve the original DECOC. The motivation of this paper
128
is to make DECOC be applied in practice easily, by
129
proposing an alternative and efficient approach to obtain
130
the optimal binary partitions.
131
It is our purpose in this paper to make a certain con-
132
tribution to improve DECOC using spectral clustering to
133
obtain the optimal binary partitions. In our algorithm, each
134
class is seen as a vertex in an undirected graph and the
135
weight for each edge which joints two classes is measured
136
by the confusion matrix with a pre-classifier. Here, the
137
weight is related to the discrimination. The smaller weight
138
means the larger discrimination. In this case, finding a
139
binary partition with maximum class discrimination is
140
changed to solve a mincut problem in spectral clustering.
141
Fortunately, spectral clustering provides the way to solve
142
the relaxed version of this problem. Avoiding the exhaus-
143
tive search process, our algorithm can reduce the compu-
144
tational complexity of DECO C significantly. Furthermore,
145
normalized cut in spectral clustering also bring a new
146
property (balanced column [
22]), which is helpful to avoid
147
imbalanced problems to a certain degree. The final DECOC
148
design is obta ined by implementing a recursive process
149
with the aid of binary tree.
150
The paper is organized as follows: Sect.
2 provides a
151
simple description for ECOC framework. Section
3 pre-
152
sents the Spectral DECOC approach and the computational
153
complexity analysis. Section
4 shows the experimental
154
results on several datasets from different environments.
155
Finally, Sect.
5 concludes the paper.
156
2 ECOC
157
ECOC provides an efficient way to solve the complex
158
multi-class classification by decomposing the complex
159
multi-class classification into a series of binary classifi-
160
cation. There are two decomposing frameworks: Binary
161
ECOC and Ternary ECOC. In the Binary ECOC frame-
162
work, there are two symbols þ1; 1
fg
which stand for a
163
negative class and a positive class in a binary problem. In
164
the Ternary ECOC framework, symbol 0
fg
is added,
165
which stands for the ignored class in a binary problem. In
166
Fig.
1 four classical ECOCs are shown: (a) one-versus-all,
167
(b) one-versus-one, (c) dense random, (d) sparse random.
168
Each row of the ECOC matrix represents a codeword for
169
class C
i
i ¼ 1; 2; 3; 4ðÞand each column represents a par-
170
tition on the data set for a binary classifier. The code
Pattern Anal Applic
123
Journal : Large 10044 Dispatch : 23-10-2015 Pages : 19
Article No. : 523
h LE h TYPESET
MS Code : PAAA-D-15-00264 h CP h DISK
44
Author Proof