METROPOLIS-HASTINGS FOR CONTINUOUS STATE SPACES 1 05
This is a situation where multishift coupling could b e applied, since the goal is to
couple normally distributed random variables with means over an interval from the
smallest value of a(i) to the largest value of a(i).
6.5 Metropolis-Hastings for continuous state spaces
Gibbs sampler Markov chains can often be made monotonic, but require special ap-
proaches such as multigamma coupling or multishift coupling. Metropolis-Hastings
Markov chains are usually not monotonic, but can make continuous spaces coalesce
down to a single point. To get the best from these two approaches, create an update
function that consists of several steps in a Gibbs chain to b ring the upper and lower
states close together, and then use a Metropolis-Hastings step to completely reduce
the image down to a single state.
6.5.1 Example: truncated autonormal
The multishift coupler was used previously to deal with the unrestricted autonormal
model, but in fact for image processing usually values are conditioned to lie in the
real interval from 0 (solid black) to 1 (solid white).
This is easy to deal with for the Gibbs sampler: when generating the normal with
mean a(i) and variance b(i), just condition on the normal lying between 0 and 1 .
Allison Gibbs [41] showed that this Markov chain was rapidly mixing. In [41] the
Wasserstein metric was used to assess convergence because it was easier to work
with; in [54] the method was improved to handle total variation distance. Therefore
the details of the Wasserstein me tric are omitted here.
The truncation complicates matters: the distributions at each node are no longer
shifts from the same family of distributions. For example, a normal with mean 0.5
and variance 1 conditioned to lie in [0,1] is not a shifted version of a normal of mean
0.7 and variance 1 conditioned to lie in [0,1]. So the multishift coupling used in the
previous section no longer works here. On the other hand, using the inverse transform
method to generate the two normals does give a monotonic update function.
A solution lies in the Metropolis-Hastings approach from Section 1.8.2. In this
approach a state is proposed to be moved to, an acceptance ratio formed, and if a
Unif([0,1]) falls below the acceptance ratio, the state is m oved to the proposed state.
Otherwise it stays where it is.
Suppose our proposed state is found by taking each node in turn, then indepen-
dently adding U
i
, which is a uniform [−
ε
,
ε
] to the value at that node, where
ε
> 0
is a small value. When
ε
is very small, it is likely that such a proposed move will be
accepted, regardless of th e initial state. Let U =(U
1
,...,U
#V
).Then
f
X
(x + U)
f
X
(x)
≥ exp
−
#V(2
ε
+
ε
2
)
2
σ
2
−
γ
2
2
#E(2
ε
+
ε
2
)
.
Making
ε
= Θ(min{
σ
2
/#V,1/[
γ
2
#E]}) gives f
X
(x + Y )/ f
X
(x) ≥ 1/2, and so
there is a 1/2 chance that the Metropolis-Hastings move will be accepted starting
from any state.