A:6 Li and Hoi
both buying and selling. At the beginning of the t
th
period, the portfolio manager intends to rebal-
ance the portfolio from closing price adjusted portfolio
ˆ
b
t−1
to a new portfolio b
t
. Here
ˆ
b
t−1
is
calculated as,
ˆ
b
t−1,i
=
b
t−1,i
x
t−1,i
b
t−1
·x
t−1
, i = 1, . . . , m. Assuming two transaction cost rates γ
b
∈ (0, 1 )
and γ
s
∈ (0, 1), where γ
b
denotes the transaction costs rate incurred during buying and γ
s
denotes
the transaction costs rate incurred during selling. After rebalancing, S
t−1
will be decomposed into
two parts, that is, the net wealth N
t−1
in the new portfolio b
t
and the transaction costs incurred
during the buying and selling. If the wealth on asset i before rebalancing is higher than that after
reblancing, that is,
b
t−1,i
x
t−1,i
b
t−1
·x
t−1
S
t−1
≥ b
t,i
N
t−1
, then there will be a selling rebalancing. Otherwise,
then a buying rebalancing is required. Formally,
S
t−1
= N
t−1
+γ
s
m
X
i=1
b
t−1,i
x
t−1,i
b
t−1
· x
t−1
S
t−1
− b
t,i
N
t−1
+
+γ
b
m
X
i=1
b
t,i
N
t−1
−
b
t−1,i
x
t−1,i
b
t−1
·x
t−1
S
t−1
+
.
Let use denote transaction costs factor [Gy¨orfi and Vajda 2008] as the ratio of net wealth after
rebalancing to wealth before rebalancing, that is, c
t−1
=
N
t−1
S
t−1
∈ (0, 1). Dividing above equation
by S
t−1
, we can get,
1 = c
t−1
+ γ
s
m
X
i=1
b
t−1,i
x
t−1,i
b
t−1
· x
t−1
− b
t,i
c
t−1
+
+ γ
b
m
X
i=1
b
t,i
c
t−1
−
b
t−1,i
x
t−1,i
b
t−1
·x
t−1
+
. (1)
Clearly, given b
t−1
, x
t−1
, and b
t
, there exists a unique transaction costs factor for each rebalancing.
Thus, we can denote c
t−1
as a function, c
t−1
= c (b
t
, b
t−1
, x
t−1
). Moreover, considering the
portfolio is in the simplex domain, then the factor ranges between
1−γ
s
1+γ
b
≤ c
t−1
≤ 1.
Finally, for each period t, the wealth grows by a factor as,
S
t
= S
t−1
× c
t−1
× (b
t
·x
t
) ,
and the final cumulative wealth after n periods equals,
S
n
= S
0
n
Y
t=1
c
t−1
× (b
t
·x
t
) ,
where c
t−1
is calculated as Eq. (1).
3. ONLINE PORTFOLIO SELECTION APPROACHES
In this section, we survey the area of online portfolio selection. Algorithms in this area formulate
the online portfolio selection task as in Section 2 and derive explicit portfolio update schemes for
each period. Basically, the routine is to implicitly assume various price relative predictions and learn
optimal portfolios.
In the subsequent sections, we mainly list the algorithms following Table I. In particular, we first
introduce several benchmark algorithms in Section 3.1. Then, we introduce the algorithms with ex-
plicit update schemes in the subsequent three sections. We classifies them based on the direction
of the weight transfer. The first approach, Follow-the-Winner approach, tries to increase the rela-
tive weights of more successful experts/stocks, often based on their historical performance. On the
contrary, the second approach, Follow-the-Loser approach, tries to increase the relative weights of
less successful experts/stocks, or transfer the weights from winners to losers. The third approach,
Pattern-Matching based approach, tries to build a portfolio based on some sampled similar his-
torical patterns with no explicit weights transfer directions. After that, we survey Meta-Learning
Algorithms, which can be applied to higher level experts equipped with any existing algorithm.
3.1. Benchmarks
ACM Computing Surveys, Vol. V, No. N, Article A, Publication date: December YEAR.