Markov Chain Monte Carlo Analysis of
Correlated Count Data
Siddhartha Chib
John M. Olin School of Business, Washington University, St. Louis, MO 63130 (chib@olin.wustl.edu)
Rainer Winkelmann
IZA Bonn, 53072 Bonn, Germany (winkelmann@iza.org)
This article is concerned with the analysis of correlated count data. A class of models is proposed in
which the correlation among the counts is represented by correlated latent effects. Special cases of the
model are discussed and a tuned and ef cient Markov chain Monte Carlo algorithm is developed to
estimate the model under both multivariate normal an d multivariate-
t
assumptions on the latent effects.
The methods are illustrated with two real data examples of six and sixteen variate correlated counts.
KEY WORDS: Latent effects; Metropolis–Hastings algorithm; Multivariate count data; Poisson–
lognormal distribution.
A large literature on the an alysis of count data is now
available (Cameron and Trivedi 1998, Winkelmann 2000),
but only a small portion of it deals with correlated counts.
Correlated counts typically arise in three varieties—as gen-
uine “multivariate” data on several related counted outcomes,
as longitudinal measurements o n a large number of su bjects
over a sh ort period of time, or as measurements on a small
set of subjects over a long period of time (the seemingly
unrelated case). Although the longitudinal situation has been
actively studied (e.g., see Hausman, Hall, and Griliches 1 984;
Blundell, Grif th, and Van Reenen 1995; Wooldridge 1997;
Chib, Greenberg, and Winkelmann 1998, henceforth CGW)
and a number of useful models and approaches are avail-
able, the other cases have been analyzed only under simpli-
fying assumptions (King 1989; Jung and Winkelmann 1993;
Gurmu and Elder 1998; Munkin and Trivedi 1999). The latter
approaches either do not allow a general correlation structure
or are dif cult to extend beyond the case of a few outcomes.
This article is an effort to deal with both problems. To
model the correlation among a large number of counts in a
exible fashion, we introduce a set of correlated latent effects,
one for each subject and outcome. Conditioned on the latent
effects, t he counts are assumed to be independent Poisson with
a conditional mean function that depends on the latent effects
and a set of covariates. To complete the model we assume
that the latent effects follow a multivariate Gaussian distribu-
tion with a zero mean vector and full unrestricted covariance
matrix. As an extension of this model, we also consider the
case in which the latent effects follow a multivariate-
t
distri-
bution. To estimate this model, we develop a Markov ch ain
Monte Carlo (MCMC) simulation method that is based on the
work of CGW. Under t his framework, we are able to sample
the posterior distribution of the parameters and latent effects
without computing the likelihood function of the model.
The methods that we develop in this article can be applied to
datasets with large numbers of correlated counts. We demon-
strate this feature by tting our model to a problem with 16
response variables. In our view this is an important illustration
that highlights what is possible from a Bayesian simulation-
based perspective.
The rest of the article is organized as follows. In Section 1
we present the basic model and some special cases and exten-
sions. The tting algo rithm is developed in Section 2, while
Section 3 gives two real data examples. Section 4 concludes.
1. MODEL
Following the usual notation for multivariate d ata, let
y
i
D
4y
i
1
1 : : : 1 y
iJ
5
denote the collection of
J
counts on the
i
th sub-
ject in the sample,
i µ n
. Let
b
i
D
4b
i
1
1 : : : 1 b
iJ
5
denote a set
of
J
subject and outcome-speci c latent effects, and suppose
that, conditioned on
b
i
and parameters
‚
j
2
R
k
j
1
the coun ts
y
ij
,
j µ J
, are independent Poisson:
y
ij
—
b
i
1 ‚
j
Poisson
4Œ
ij
5
Œ
ij
D
exp
4x
0
ij
‚
j
C
b
ij
5
for
j µ J
and
i µ n1
(1)
where
x
ij
are covariates. To model the correlation among the
counts, let
b
i
—
D
N
J
4
0
1 D51 i µ n1
(2)
where
D
is an unrestricted covariance ma trix.
To understand some of the features of this model, let
v
ij
D
exp
4b
ij
5
and
v
i
D
4v
i
1
1 : : : 1 v
iJ
5
. Then
v
i
LN
J
4Œ1 è5
, a multi-
variate lognormal distribution with mean
Œ
D
exp
4
0
0
5 diag
4D55
and dispersion matrix
è
D
4
diag
4Œ556
exp
4D5
ƒ
11
0
74
diag
4Œ55
.
Hence,
y
ij
—
‹
ij
1 v
ij
Poisson
4‹
ij
v
ij
5
, where
‹
ij
D
exp
4x
0
ij
‚
j
5
.
This is, therefore, in the form of a Poisson–lognormal distribu-
tion as discussed by Aitchison and Ho (1989).
In this setup, the expectation and va riance of t he marginal
joint distribution of
y
i
can be derived without integration. Let
Q
‹
ij
D
‹
ij
Œ
(i.e.,
Q
‹
ij
and
‹
ij
differ only b y a constant fac-
tor),
Q
‹
i
D
4
Q
‹
i
1
1 : : : 1
Q
‹
iJ
5
, and
e
å
i
D
diag
4
Q
‹
i
5
. Applying the
© 2001 American Statistical Association
Journal of Business & Economic Statistics
October 2001, Vol. 19, No. 4
428