2 M. Lei, C. Baehr / Physica D 246 (2013) 1–14
Unscented Kalman Filter (EnUKF). The EnUKF is then able to treat
some high dimensional systems with a variable sample size, which
depend on the possible rank reduction and a pre-computed sam-
ple covariance. Although there exists a debate on the combination
method [20,21], the numerical results are convincing.
Our work takes place at this sensitive point. For all the KF-like
filters the estimate is assumed to be a linear regression with the
observation (see Section 2.2.1 for details). This is the core of KF
but this is hard to accept for general situations especially for a
dynamics with strong nonlinearities and non-Gaussian perturba-
tions. In a certain sense it becomes a bottleneck for some complex
and large dimension applications. Therefore we develop an assimi-
lation method with an update step that is based on variational min-
imization instead of the Kalman regression. Naturally the linear
regression is replaced by quadratic terms of observation. First a UT
is applied for the sample generation and the statistical mean es-
timation. Then an Ensemble Transform (ET) is used to empirically
compute the error covariance. To deal with the high dimensional-
ity, we incorporate a modified rank reduction into the UT scheme
in order to maintain a size-reduced ensemble generation. There-
fore the new scheme is called the Unscented/Ensemble transform-
based Variational Filter (UEVF). This filter is able to deal with the
nonlinear and chaotic systems with high dimensionality as the nu-
merical experiments show in Section 5.
It is clear that the suggested method, outlined in Section 3, is at
the junction of different technologies: The UT approach in order to
give an evolution to the error covariance matrices, the ensemble
transform to implement the covariance update, the variational
minimization to get a state update for a nonlinear dynamics, the
rank reduction to achieve a significant dimension reduction with
an adaptive fashion.
This paper is divided into 6 sections. In Section 2, we introduce
the background methodologies, including three update schemes.
In Section 3, we derive the UEVF and introduce the variational
minimization update, and the sigma-points generation with a size-
reduced dynamics. In Section 4, three implementation issues about
the UEVF are discussed. First we present the rank-reduced co-
variance approximation, then the conjugate-gradient method to
minimize the cost-function, and finally the computation of the
background error matrix. The Section 5 concerns the numerical ex-
periments. There are two examples, the chaotic high dimensional
Lorenz-95 model and a simulation based on the 2D shallow water
equation. Finally the conclusion and a discussion are provided in
Section 6.
2. Background methods
We assume that we have an n-dimensional discrete dynamical
system
x
k+1
= M
k,k+1
(x
k
) + u
k
(1)
and to observe the state x
k
we could (partly) obtain a measurement
sequence y
k
using the observer H
k
: R
n
× R
m
→ R
m
, i.e.,
y
k
= H
k
(x
k
) + v
k
, k = 1, 2, . . . (2)
where k ∈ [0, T] is the discrete-time index and T the total time
step, x
k
∈ R
n
and y
k
∈ R
m
are the state and the noisy observa-
tion at the time k, M
k,k+1
is the nonlinear transition operator with
M
k,k+1
: R
n
× R
n
→ R
n
. u
k
∈ R
n
is the additive dynamical noise
with a zero mean and non null covariance Q
k
, which includes both
the ordinary process noise and mismatch errors of the mathemat-
ical model. We assume that u
k
is independent of the additive ob-
servation noise v
k
∈ R
m
with a zero mean and known covariance
R
k
. u
k
is also supposed to be independent of the initial state x
0
.
The problem of data assimilation is equivalent to estimating the
Probability Density Function (PDF) η
k
= pdf(x
k
|y
1:k
). Two PDF’s
are used to describe it completely: the first is the predictor PDF
using the Markov transition given by the model, η
b
k
= pdf(x
b
k
|x
a
k−1
),
the second is the update PDF η
a
k
= pdf(x
a
k
|x
b
k
, y
1:k
). Finding these
two PDF’s solves the assimilation or the filtering problem.
Actually the update process does not have a unique representa-
tion as a Markov transition. The transformation η
b
k
may be repre-
sented by different transportation processes that are equivalent in
mean. This can be seen with the Feynman–Kac formulas [22].
2.1. Unscented transform
The scheme of unscented transform (UT) is designed to solve
the following estimation problem [19,23,24]: at time k − 1, we
have a Gaussian random variable x
k−1
∈ R
n
with mean x
a
k−1
and
covariance P
a
xx,k−1
, and Gaussian perturbations u
k−1
with zero-
mean and covariance Q
k−1
, which are assumed to be independent
of each other. Without loss of generality, we can apply a nonlin-
ear transition M on the x
k−1
to obtain a new random variable
x
k
= M(x
k−1
).
1
The interesting thing is to estimate the mean and
covariance of the transformed random variable x
k
.
The UT generates a set of 2n +1 state points {X
a
k−1,i
}
2n
i=0
, which
are called sigma-points and defined by
{X
a
k−1,i
}
2n
i=0
=
x
a
k−1
, x
a
k−1
±
(n + λ)P
a
xx,k−1
i
,
i = 1, . . . , n
, (3)
where (
(n + λ)P
a
xx,k−1
)
i
denotes the i-th column of the square
root matrix
(n + λ)P
a
xx,k−1
. Incidentally in the implementation,
by using P
a
xx,k−1
= S
a
x,k−1
(S
a
x,k−1
)
T
, a n ×n square root matrix S
a
x,k−1
can be obtained with a numerical decomposition algorithm. There-
fore (
(n + λ)P
a
xx,k−1
)
i
is rewritten as
√
n + λ
S
a
x,k−1
i
where λ
is a constant used for scaling adjusting and n is the dimension of
state x
k
.
Associated to the sigma-points, a set of weights {W
k−1,i
}
2n
i=0
is
allocated by
W
k−1,0
=
λ
n + λ
,
W
k−1,i
=
1
2(n + λ)
, i = 1, . . . , 2n.
(4)
It can be proved that the weighted sample mean
¯
x
k−1
and
sample covariance
¯
P
xx,k−1
of the finite set {X
i
}
2n
i=0
perfectly match
the mean x
a
k−1
and the covariance P
a
xx,k−1
of x
k−1
respectively,
¯
x
k−1
=
2n
i=0
W
k−1,i
X
a
k−1,i
= x
a
k−1
, (5a)
¯
P
xx,k−1
=
2n
i=0
W
k−1,i
(X
a
k−1,i
− x
a
k−1
)(X
a
k−1,i
− x
a
k−1
)
T
= P
a
xx,k−1
. (5b)
The above identities are independent of the choice of parameters
λ and n [23] and this feature will be employed to implement
a strategy of PDF re-approximation in construction of UEVF in
Section 3.
1
Being different from the dynamical equations (1)(2), we can assume that x
k−1
and u
k−1
are coupled together by x
k
= M(x
k−1
, u
k−1
). This expression can always
be converted into the simpler form x
k
= M(z
k−1
) introducing the augmented state
z
k−1
= [x
T
k−1
, u
T
k−1
]
T
, where T denotes the transpose operation. Consequently the
covariance of augmented state z
k−1
is denoted P
zz,k−1
= diag
P
xx,k−1
, Q
k−1
.