Suppose we observe
x=y+w
x
y
w
which are all
N×1
N
1
vectors and where
w∼𝒩0σ2I
w
0
σ
2
I
. Given xx
we wish to estimate yy. Think of yy as a signal in additive white
noise ww. xx is a noisy observation of the
signal.
Taking a Bayesian approach, put a prior on the
signal yy:
y∼𝒩0
R
yy
y
0
R
yy
which is independent of noise ww. The minimum MSE (MMSE) estimator is
y
^=
R
yx
R
xx
-1x
y
R
yx
R
xx
x
Under the modeling assumptions above
R
yx
=Ey(y+w)T=EyyT+EywT=EyyT=
R
yy
R
yx
y
y
w
y
y
y
w
y
y
R
yy
(1)
since
EywT=0
y
w
0
and since
yy
and
ww are
zero-mean and independent.
R
xx
=ExxT=E(y+w)(y+w)T=EyyT+EywT+EwyT+EwwT=
R
yy
+
R
ww
R
xx
x
x
y
w
y
w
y
y
y
w
w
y
w
w
R
yy
R
ww
(2)
since
EwwT=
R
ww
w
w
R
ww
.
Hence
y
^=
R
yy
R
yy
+
R
ww
-1x=
H
opt
x
y
R
yy
R
yy
R
ww
x
H
opt
x
Where
H
opt
H
opt
is the Wiener filter. Recall the frequency domain case
H
opt
f=
S
yy
f
S
yy
f+
S
ww
f
H
opt
f
S
yy
f
S
yy
f
S
ww
f
Now let's look at an actual problem scenario. Suppose that we
know
a priori that the signal
yy is
smooth or
lowpass. We can incorporate this prior knowledge
by carefully choosing the prior covariance
R
yy
R
yy
.
Recall the DFT
∀
k
,k=
0
,
…
,
N
-
1
:
𝒴
k
=1N∑
n
=0N
Y
k
e−(i2πknN)
k
k
0
,
…
,
N
-
1
𝒴
k
1
N
n
0
N
Y
k
2
k
n
N
or in vector notation
∀
k
,k=
0
,
…
,
N
-
1
:
𝒴
k
=〈y,
u
k
〉
k
k
0
,
…
,
N
-
1
𝒴
k
y
u
k
where
u
k
=(
1ei2πkNei2π2kN…ei2π(N−1)kN
)HN
u
k
1
2
k
N
2
2
k
N
…
2
N
1
k
N
N
(H dehotes Hermitian transpose)
〈
u
k
,
u
k
〉=
u
k
H
u
k
=1
u
k
u
k
u
k
u
k
1
,
∀
k∧l
,k≠l:〈
u
k
,
u
l
〉=
u
k
H
u
l
=0
k
l
k
l
u
k
u
l
u
k
u
l
0
,
i.e.,
∀
k
,k=
0
,
…
,
N
-
1
:
u
k
k
k
0
,
…
,
N
-
1
u
k
is an orthonormal basis.
The vector
u
k
u
k
spans the subspace corresponding to a frequency
band centered at frequency
f
k
=2πkN
f
k
2
k
N
("digital" frequency on
0
1
0
1
). If we know that
yy is lowpass, then
E∥〈y,
u
k
〉∥2=E∥
𝒴
k
∥2
y
u
k
2
𝒴
k
2
should be relatively small (compared to
E∥〈y,
u
0
〉∥2
y
u
0
2
) for high frequencies.
Let
σ
k
2=E∥〈y,
u
k
〉∥2
σ
k
2
y
u
k
2
A lowpass model implies
σ
0
2>
σ
1
2>…>
σ
N
2
2
σ
0
2
σ
1
2
…
σ
N
2
2
, assuming NN even, and
conjugate symmetry implies
∀
j
,j=
1
,
…
,
N
2
:
σ
N
-
j
2=
σ
j
2
j
j
1
,
…
,
N
2
σ
N
-
j
2
σ
j
2
Furthermore, let's model the DFT coefficients as zero-mean and
independent
E
𝒴
k
=0
𝒴
k
0
E
𝒴
k
𝒴
l
¯={
σ
k
2 if l=k0 if l≠k
𝒴
k
𝒴
l
σ
k
2
l
k
0
l
k
This completely specifies our prior
y∼𝒩0
R
yy
y
0
R
yy
R
yy
=UDU¯T
R
yy
U
D
U
where
D=(
σ
0
20…0
0
σ
1
2…0
⋮⋮⋱⋮
00…
σ
N
-
1
2
)
D
σ
0
2
0
…
0
0
σ
1
2
…
0
⋮
⋮
⋱
⋮
0
0
…
σ
N
-
1
2
and
U=(
u
0
u
1
…
u
N
-
1
)
U
u
0
u
1
…
u
N
-
1
𝒴=UHy
𝒴
U
y
is the DFT and
y=U𝒴
y
U
𝒴
is the inverse DFT.
With this prior on
yy the Wiener filter is
y
^=UDUHUDUH+σ2I-1x
y
U
D
U
U
D
U
σ
2
I
x
Since
UU is a
unitary matrix
UUH=I
U
U
I
and therefore
y
^=UDUHU(D+σ2I)UH-1x=UDUHUD+σ2I-1UHx=UDD+σ2I-1UHx
y
U
D
U
U
D
σ
2
I
U
x
U
D
U
U
D
σ
2
I
U
x
U
D
D
σ
2
I
U
x
(3)
Now take the DFT of both sides
𝒴
^=UH
y
^=DD+σ2I-1𝒳
𝒴
U
y
D
D
σ
2
I
𝒳
where
𝒳=UHx
𝒳
U
x
and is the DFT of
xx. Both
DD and
D+σ2I
D
σ
2
I
are diagonal so
𝒴^k=dk,kdk,k+σ2
𝒳
k
=
σ
k
2
σ
k
2+σ2
𝒳
k
𝒴
k
d
k
k
d
k
k
σ
2
𝒳
k
σ
k
2
σ
k
2
σ
2
𝒳
k
Hence the Wiener filter is a frequency (DFT) domain filter
𝒴^k=
H
k
𝒳
k
𝒴
k
H
k
𝒳
k
where
𝒳
k
𝒳
k
is the
k
th
k
th
DFT coefficient of
x
x and the filter response at digital
frequency
2πkN
2
k
N
is
H
k
=
σ
k
2
σ
k
2+σ2
H
k
σ
k
2
σ
k
2
σ
2
Assuming
σ
0
2>
σ
1
2>…>
σ
N
2
2
σ
0
2
σ
1
2
…
σ
N
2
2
and
∀
j
,j=
1
,
…
,
N
2
:
σ
N
-
j
2=
σ
j
2
j
j
1
,
…
,
N
2
σ
N
-
j
2
σ
j
2
.
The filter's response is a
digital lowpass
filter!