every particle can belong to every jet with some probability.
3
This can be seen explicitly
in figure 1 where the densities of all three clusters are everywhere nonzero, so q
ij
> 0 for all
j. The idea of probabilistic membership was recently studied in the context of the Q-jets
algorithm [18] in which the same event is interpreted many times by injecting randomness
into the clustering procedure. Unlike Q-jets, fuzzy jets allocates the soft membership
functions deterministically throughout the clustering procedure. However, like Q-jets, there
is an ambiguity in how to assign kinematic properties to the clustered jets. Fuzzy jets are
defined by their shape (and location), not their constituents. This is in contrast to anti-k
t
jets, which are defined by their constituents without an explicit shape determined from the
clustering procedure. One simple assignment scheme is to define the momentum of a fuzzy
jet j as
p
jet j
=
n
X
i=1
p
i
(
1 j = argmax
k
q
ik
0 else
)
. (2.2)
In other words, this procedure assigns every particle to its most probable associated jet.
This scheme will be known as the hard maximum likelihood (HML) scheme, but is not
the only possible assignment algorithm. The dual problem in sequential recombination
is the jet area, which must be defined [19], whereas the jet kinematics are the ‘natural’
coordinates.
We now specialize the likelihood in eq. (2.1) to the case of clustering particles into jets
at a collider like the LHC. Consider a mixture model in two dimensions
4
with x
i
= ρ
i
. The
resulting mixture model (MM) jets are inherently not IR safe: particle p
T
does not appear
in the likelihood and therefore arbitrarily low energy particles can influence the clustering
procedure. Therefore, we add a modification to the log likelihood:
log L({p
T,i
, ρ
i
}|θ) =
n
X
i=1
p
α
T,i
log
k
X
j=1
π
j
f(ρ
i
|θ
j
)
, (2.3)
where α is a weighting factor. Equation (2.3) is the log of eq. (2.1) with the term p
α
T,i
inserted in the outer sum. For α > 0, the resulting modified mixture model (mMM) jets
are IR safe, and when α = 1, the jets are C safe. Therefore, for α = 1, the jets are IRC safe.
Different choices of component densities f in eq. (2.3) give rise to different IRC safe MM jet
algorithms. We have studied several possibilities for f, but for the remainder of this paper
will specialize to (wrapped
4
) Gaussian
5
f = Φ. The resulting fuzzy jets are called modified
Gaussian Mixture Model jets (mGMM) and are parameterized by the locations µ
j
, the
covariance matrices Σ
i
, and the cluster weights π
j
. We initialize π
j
= 1/k and Σ
j
= I.
Since practical procedures for maximizing the modified likelihood in eq. (2.3) may
converge to stationary points that are not globally optimal, the output of a fuzzy jet
3
Soft assignments for jets during clustering was studied in the context of the “optimal jet finder” [17]
which maximizes a function of the soft assignments.
4
One must take care in selecting a class of densities appropriate for the angular quantity φ. For more
details on the wrapped Gaussian distribution and motivation for its use in this context, see appendix A.
5
When f is a circular step function, the algorithm is related to the Snowmass iterative cone algorithm [20]
via the ‘Snowmass Potential’ [21].
– 4 –