
6 The CMS Collaboration / Physics Letters B 798 (2019) 134992
m
A
–tanβ values. The function F
sig
used to parametrize the signal
mass shape [21]is defined as:
F
sig
= w
h
F
h
+ w
H
F
H
+ w
A
F
A
. (1)
In Eq. (1), the terms F
h
, F
H
, and F
A
describe the mass shape of
the h, H, and A signals, respectively. Each term is a convolution
of a Breit–Wigner (BW) function to describe the resonance, with a
Gaussian function to account for the detector resolution. The two
parameters of the BW function, as well as the variance of each
Gaussian function, are free parameters of the fit used to determine
the signal model, while the quantities w
h
, w
H
, and w
A
are the
numbers of expected events for each boson passing the event se-
lection.
For the m
A
–tanβ points for which the signal samples were
not generated, the parameters are interpolated from the nearby
generated points. In order to correct for differences of the order
of a fewGeV between the pythia prediction of m
H
with respect to
the value calculated by FeynHiggs in the m
mod+
h
or the value used
in the hMSSM, especially for m
A
200 GeV, the invariant mass dis-
tribution
of the H boson is shifted by the corresponding amount.
For the model-independent analysis the signal shape is described
using one single resonance in Eq. (1).
The
analysis does not use background estimation from simula-
tion
due to the limited size of simulated events compared to data
in the region of interest, as well as due to the large theoretical un-
certainties
in the background description at high invariant masses.
Therefore, given the smooth dependence of the background shape
on the dimuon invariant mass, it is estimated from the data, by as-
suming
a functional form to describe its dependence as a function
of the reconstructed dimuon invariant mass, m
μμ
, and by fitting it
to the observed distribution.
The
functional form used to describe the background shape is
defined as:
F
bkg
= exp(λm
μμ
)
×
⎡
⎣
f
N
1
1
(m
μμ
−m
Z
)
2
+
2
Z
4
+
(
1 − f )
N
2
1
m
μμ
2
⎤
⎦
.
(2)
The quantity exp(λm
μμ
) parametrizes the exponential part of the
mass distribution, and f represents the weight of the BW term
with respect to DY photon exchange, while N
1
and N
2
correspond
to the integral of each term in F
bkg
. The quantities λ and f are free
parameters of the fit. The parameters
Z
and m
Z
are separately
determined for the two event categories by fitting the dimuon
mass distribution close to the Z boson mass. The fit provides the
effective values of such quantities, which include detector and res-
olution
effects. Their values are then kept constant when using
F
bkg
in the final fit. The systematic uncertainty that stems from
the choice of the functional form in Eq. (2), which was used in
earlier searches [21], is assessed as described below.
A
linear combination of the functions describing the expected
signal and the background is then used to perform a binned maxi-
mum
likelihood fit to the data, where the uncertainties are treated
as nuisance parameters:
F
fit
= (1 − f
bkg
)F
sig
+ f
bkg
F
bkg
. (3)
The fit is performed for each m
A
and tan β hypothesis, as the yield
of the signal events and the shape of F
sig
depend on these quan-
tities.
The parameters that describe the signal are determined by
fitting the simulated samples that pass the event selection with
Eq. (1), for each m
A
and tan β pair, as explained above. Subse-
quently
they are assigned as constant terms in F
fit
. The quantity
f
bkg
is a free parameter in the fit, and the fraction of signal events
is defined as f
sig
= (1 − f
bkg
). The overall normalization is also a
free parameter and is profiled in the fit.
For
each m
A
assumption, the function F
fit
is used to fit the data
over an m
μμ
range centered on m
A
. The range has to be large
enough to account for the signal width, including the experimental
resolution, and it is ±50 GeV for m
A
≤ 290 GeV, ±75 GeV for 290 <
m
A
≤ 390 GeV, and ±100 GeV for 390 < m
A
≤ 500 GeV. For values
of m
A
smaller than 165 GeV the lower bound of the mass window
is set to 115 GeV. For m
A
> 500 GeV, the entire range from 400
to 1200 GeV is used. The h boson is used to constrain the results
when its mass is included in the fitted mass range.
The
uncertainty introduced by the choice of the analytical func-
tion
used to parametrize the background is estimated by using a
method similar to that used in Refs. [3,21,89]. The method is based
on the determination of the number of spurious signal events that
are introduced by the choice of the background function F
bkg
,
when the background is fit by the function F
fit
. The invariant
mass spectrum is fitted by the function F
a
bkg
, chosen among var-
ious
functional forms: Eq. (2)or other similar expressions that
include a BW plus exponentials, and sum of exponentials. All these
functional forms adequately describe the background distribution
observed in data. The fit is performed in the proper mass range
centered around the assumed value of m
A
, and the parameters of
F
a
bkg
are determined. Then, thousands of MC pseudo-experiments
are generated, each one containing the same number of events
as observed in the data, distributed according to the functional
form F
a
bkg
. For each pseudo-experiment, the invariant mass dis-
tribution
is then fit with the function F
fit
of Eq. (3), once using
F
a
bkg
, and then using a different function F
b
bkg
, given by Eq. (2). For
each pseudo-experiment, the spurious signal yield, expressed by
the number of events N
a
bias
and N
b
bias
, is determined. The quantity
N
a
bias
is on average consistent with zero within statistical fluctua-
tions.
The quantity N
b
bias
represents the number of spurious signal
events that are found in the signal yield if the function F
b
bkg
is used
to describe the background, when the background itself is actually
distributed according to F
a
bkg
. The median of the distribution of
the difference N
a
bias
− N
b
bias
obtained from the pseudo-experiments
is defined as the bias introduced by using the function F
b
bkg
, rel-
ative
to the tested mass m
A
. This procedure is repeated for each
function F
a
bkg
among the functional forms mentioned above, and
the largest bias is taken as the systematic uncertainty in the num-
ber
of signal events obtained from the maximum likelihood fit, due
to the choice of Eq. (2)to parametrize the background distribution.
Choosing a different function F
b
bkg
, instead of Eq. (2), was shown
to lead to similar biases over the whole mass range. The num-
ber
of spurious signal events varies between a few units and a
few hundred depending on the mass of the signal and the event
category. Although the bias is due to the modeling of the back-
ground,
its impact on the result depends on the expected signal
strength and shape, both varying according to m
A
and tan β in the
model-dependent analysis, and according to the mass of a generic
resonance φ for the model-independent case. More details about
the effect of the bias on the final results are discussed in Section 7.
An
example of fits to the data with Eq. (3), for the model-
independent
case, is shown in Fig. 4. Two mass hypotheses, 400
and 980 GeV, are assumed for a single narrow-width resonance φ
decaying to two muons. The two event categories are combined
according to their sensitivity, S/(S + B), where S and B are the
number of events in the expected signal and observed background,
respectively. The uncertainties in the integrated luminosity, in the
signal efficiency, and in the background parametrization are taken
into account as nuisance parameters.