In many situations, we seek to check consistency of the
observations with some preconceived model. Alternative models
are usually difficult to describe parametrically since
inconsistency may be beyond our modeling capabilities. We need
a test that accepts consistency of observations with a model or
rejects the model without pronouncing a more favored
alternative. Assuming we know (or presume to know) the
probability distribution of the observations under
ℳ
0
ℳ
0
, the models are
-
ℳ
0
ℳ
0
:
r∼pr|
ℳ
0
r
r
p
r
ℳ
0
r
-
ℳ
1
ℳ
1
:
r≁pr|
ℳ
0
r
≁
r
p
r
ℳ
0
r
Null hypothesis testing seeks to determine
if the observations are consistent with this description. The
best procedure for consistency testing amounts to determining
whether the observations lie in a highly probable region as
defined by the null probability distribution. However, no one
region defines a probability that is less than unity. We must
restrict the size of the region so that it best represents those
observations maximally consistent with the model while
satisfying a performance criterion. Letting
P
F
P
F
be a false-alarm probability established by us, we
define the decision region
ℜ
0
ℜ
0
to satisfy
Prr∈
ℜ
0
|
ℳ
0
=∫
ℜ
0
pr|
ℳ
0
rdr=1-
P
F
ℳ
0
r
ℜ
0
r
ℜ
0
p
r
ℳ
0
r
1
P
F
and
min
ℜ
0
{∫
ℜ
0
dr}
ℜ
0
r
ℜ
0
Usually, this region is located about the mean, but
may not be symmetrically centered if the probability density is
skewed. Our null hypothesis test for model consistency becomes
r∈
ℜ
0
⇒"say observations are consistent"
r
ℜ
0
"say observations are consistent"
r∉
ℜ
0
⇒"say observations are not consistent"
r
ℜ
0
"say observations are not consistent"
Consider the problem of determining whether the sequence
r
l
r
l
,
l∈1…L
l
1
…
L
, is white and Gaussian with zero mean and unit
variance. Stated this way, the alternative model is not
provided: is this model correct or not? We could estimate the
probability density function of the observations and test the
estimate for consistency. Here we take the null-hypothesis
testing approach of converting this problem into a
one-dimensional one by considering the statistic
r=∑l
r
l
2
r
l
l
r
l
2
, which has a
χ
L
2
χ
L
2
. Because this probability distribution is unimodal,
the decision region can be safely assumed to be an interval
r
′
r
′′
r
′
r
′′
. In this case, we can find an analytic
solution to the problem of determining the decision region.
Letting
R=
r
′′
-
r
′
R
r
′′
r
′
denote the width of the interval, we seek the
solution of the constrained optimization problem
min
r
′
{R}
subject to
P
r
r
′
+R-
P
r
r
′
=1-
P
F
r
′
R
subject to
P
r
r
′
R
P
r
r
′
1
P
F
We convert the constrained problem into an
unconstrained one using Lagrange multipliers.
min
r
′
{R+λ
P
r
r
′
+R-
P
r
r
′
-1-
P
F
}
r
′
R
λ
P
r
r
′
R
P
r
r
′
1
P
F
Evaluation of the derivative of this quantity with respect to
r
′
r
′
yields the result
p
r
r
′
+R=
p
r
r
′
p
r
r
′
R
p
r
r
′
: to minimize the interval's width, the probability
density function's values at the interval's endpoints must be
equal. Finding these endpoints to satisfy the constraints
amounts to searching the probability distribution at such
points for increasing values of
R
R until the required probability is contained within.
For
L=100
L
100
and
P
F
=0.05
P
F
0.05
, the optimal decision region for the
χ
L
2
χ
L
2
distribution is
78.82128.5
78.82
128.5
. Figure 1 demonstrates ten
testing trials for observations that fit the model and for
observations that don't.