2329-9290 (c) 2015 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TASLP.2015.2506269, IEEE/ACM
Transactions on Audio, Speech, and Language Processing
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING 4
Note that for small microphone gain errors, we have that
ξ
24
≈ 2 and ξ
31
≈ 2, which implies that ρ ≈ 2 and λ ≈ 4.
Consequently, (11) can be further reduced to
Q
(g)
(Ω, α) ≈
24Ω
2
+ 3 (1 − α)
2
ζ
24Ω
2
α
2
+ 8Ω
2
(1 − α)
2
+ 3(1 − α)
2
ζ
. (15)
Specially, as expected, when there are no microphone gain
errors, it follows that (15) will degenerate to the DF of ideal
FOSDA, i.e., (7).
Regarding the effects of microphone gain errors on the DF
of the FOSDA, we have the following properties.
Property 3: The DF of the FOSDA with microphone gain
errors will always be greater than or at least equal to 1.
Moreover, the DF is equal to 1 if and only if α = 1.
Proof: See Appendix A.
Note from (3) that the FOSDA will degenerate into an
omnidirectional monopole when α = 1. Moreover, by (6),
it follows that the DF of an omnidirectional monopole or
omnidirectional microphone is equal to 1. Thus from Property
3, we know that the DF of the FOSDA will be no worse
than an omnidirectional microphone even in the presence of
microphone gain errors, i.e., we can always benefit from using
the FOSDA instead of an omnidirectional microphone.
Property 4: The DF of the FOSDA with microphone gain
errors will decrease with frequency or array size decreasing.
Proof: It follows from inspection of (37) in Appendix A
that Q
(g)
(Ω, α) is an increasing function of Ω.
According to Property 4, the FOSDA is more sensitive to
microphone gain errors in low frequencies or for small-sized
arrays. This is similar to the effect of microphone mismatches
on the additive microphone arrays [22], [23].
Property 5: For the maximum value of the DF, it holds that:
1) The maximum achievable DF remains equal to 4 when
ζ = 0, i.e., when the gain errors of microphones M
1
and
M
3
are the same, and those of microphones M
2
and M
4
are the same.
2) For the general case ζ = 0, the maximum value of the
DF is always less than 4. In particular, when
α =
3ζ − 2
2Ω
2
(9ζ + 32Ω
2
) + 16Ω
2
3ζ
, (16)
the DF attains its maximum value given by
max
α∈[0,1]
Q
(g)
(Ω, α)
≈
54Ω
2
+ 3Υ
2
ζ
6Ω
2
(2Υ − 3)
2
+ 8Ω
2
Υ
2
+ 3Υ
2
ζ
(17)
where Υ =
√
2Ω
2
(9ζ+32Ω
2
)−8Ω
2
ζ
.
Proof: See Appendix B.
Recall that the maximum value of the DF of ideal FOSDA
is equal to 4, which is achieved at α = 0.25 (hypercardioid
response). To keep the maximum achievable DF unaffected
by microphone gain errors regardless of mainlobe steering
direction, Property 5 indicates that the only means is to choose
sensors wisely such that each pair of diagonal microphones
have the same characteristics, i.e., η
1
= η
3
and η
2
= η
4
.
Otherwise, the maximum achievable DF will always degrade
due to microphone gain errors.
0 0.2 0.4 0.6 0.8 1
1
1.5
2
2.5
3
3.5
4
α
DF
Case 1, simulation results
Case 1, theoretical results
Case 2, simulation results
Case 2, theoretical results
Case 2
Case 1
Fig. 3. DF vs. α with microphone gain errors, where φ
s
= 60
◦
, and
Ω = π/16. Case 1: η
1
= 0.9, η
2
= 0.93, η
3
= 1.1, η
4
= 1.04; and Case
2: η
1
= η
3
= 0.9, η
2
= η
4
= 1.04.
0.2
0.4
0.6
0.8
1
30
210
60
240
90
270
120
300
150
330
180
0
α=0.25
α=0.5
(a)
0.2
0.4
0.6
0.8
1
30
210
60
240
90
270
120
300
150
330
180
0
α=0.25
α=0.5
(b)
Fig. 4. Normalized array responses of the FOSDA with/without microphone
gain errors, where φ
s
= 60
◦
and Ω = π/16. (a) Ideal FOSDA. (b) With
microphone gain errors same as Case 1 in Fig. 3.
Now we present several numerical examples to support the
above deterministic analysis. Consider two cases of micro-
phone gain errors: 1) η
1
= 0.9, η
2
= 0.93, η
3
= 1.1,
η
4
= 1.04; and 2) η
1
= η
3
= 0.9, η
2
= η
4
= 1.04, i.e.,
the special case where the gain errors of microphones M
1
and M
3
are the same, and those of microphones M
2
and M
4
are the same. Herein, the mainlobe steering direction is set to
φ
s
= 60
◦
, and Ω = π/16.
Fig. 3 shows the DF of the FOSDA versus α for the two
cases of microphone gain errors. Although some approxima-
tions have been used in our theoretical analysis, the theoretical
results are well-consistent with the simulation results, as
shown in Fig. 3. We can see from the simulations results
that the DF is always greater than 1 for arbitrary α ∈ [0, 1)
and equal to 1 only when α = 1, i.e., when the FOSDA
degenerates into a monopole, which is in agreement with
Property 3. Moreover, for Case 1, by Property 5.2 when
α = 0.280 the DF achieves its maximum value of 3.52.
This is very close to the simulation results where the DF
achieves the maximum value of 3.55 when α = 0.281. While
for Case 2, comparing with Fig. 2(a), i.e., the DF of ideal
FOSDA without microphone mismatches, we can see that the
microphone gain errors in such a case have little effect on
the DF of the FOSDA, which agrees well with our finding in
Property 5.1. Fig. 4(b) shows the normalized array responses
of the FOSDA with microphone gain errors as Case 1 in Fig. 3.
Note that the normalized array responses of the FOSDA with
microphone gain errors as Case 2 in Fig. 3 are actually same
as those of ideal FOSDA, i.e., Fig. 4(a). Comparing Fig. 4(b)