Proof of Stein's example

From HandWiki
Short description: Mathematical proof

Stein's example is an important result in decision theory which can be stated as

The ordinary decision rule for estimating the mean of a multivariate Gaussian distribution is inadmissible under mean squared error risk in dimension at least 3.

The following is an outline of its proof.[1] The reader is referred to the main article for more information.

Sketched proof

The risk function of the decision rule d(𝐱)=𝐱 is

R(θ,d)=Eθ[|θ𝐗|2]
=(θ𝐱)T(θ𝐱)(12π)n/2e(1/2)(θ𝐱)T(θ𝐱)m(dx)
=n.

Now consider the decision rule

d(𝐱)=𝐱α|𝐱|2𝐱

where α=n2. We will show that d is a better decision rule than d. The risk function is

R(θ,d)=Eθ[|θ𝐗+α|𝐗|2𝐗|2]
=Eθ[|θ𝐗|2+2(θ𝐗)Tα|𝐗|2𝐗+α2|𝐗|4|𝐗|2]
=Eθ[|θ𝐗|2]+2αEθ[(θ𝐗)𝐓𝐗|𝐗|2]+α2Eθ[1|𝐗|2]

— a quadratic in α. We may simplify the middle term by considering a general "well-behaved" function h:𝐱h(𝐱) and using integration by parts. For 1in, for any continuously differentiable h growing sufficiently slowly for large xi we have:

Eθ[(θiXi)h(𝐗)|Xj=xj(ji)]=(θixi)h(𝐱)(12π)n/2e(1/2)(𝐱θ)T(𝐱θ)m(dxi)
=[h(𝐱)(12π)n/2e(1/2)(𝐱θ)T(𝐱θ)]xi=hxi(𝐱)(12π)n/2e(1/2)(𝐱θ)T(𝐱θ)m(dxi)
=Eθ[hxi(𝐗)|Xj=xj(ji)].

Therefore,

Eθ[(θiXi)h(𝐗)]=Eθ[hxi(𝐗)].

(This result is known as Stein's lemma.)

Now, we choose

h(𝐱)=xi|𝐱|2.

If h met the "well-behaved" condition (it doesn't, but this can be remedied—see below), we would have

hxi=1|𝐱|22xi2|𝐱|4

and so

Eθ[(θ𝐗)𝐓𝐗|𝐗|2]=i=1nEθ[(θiXi)Xi|𝐗|2]
=i=1nEθ[1|𝐗|22Xi2|𝐗|4]
=(n2)Eθ[1|𝐗|2].

Then returning to the risk function of d:

R(θ,d)=n2α(n2)Eθ[1|𝐗|2]+α2Eθ[1|𝐗|2].

This quadratic in α is minimized at

α=n2,

giving

R(θ,d)=R(θ,d)(n2)2Eθ[1|𝐗|2]

which of course satisfies

R(θ,d)<R(θ,d).

making d an inadmissible decision rule.

It remains to justify the use of

h(𝐗)=𝐗|𝐗|2.

This function is not continuously differentiable, since it is singular at 𝐱=0. However, the function

h(𝐗)=𝐗ε+|𝐗|2

is continuously differentiable, and after following the algebra through and letting ε0, one obtains the same result.


References

  1. Samworth, Richard (December 2012). "Stein's Paradox". Eureka 62: 38–41. http://www.statslab.cam.ac.uk/~rjs57/SteinParadox.pdf.