MULTISHIFT COUPLING 101
coupler [130] that accomplishes this for a wide variety of target distributions. More-
over, the multishift coupler is monotone, thus allowing the use of monotonic CFTP.
To illustrate how multishift coupling works, consider the Markov process that is
simple symmetric random walk on the real line, where if the current state is x,the
next state is chosen uniformly from [x −1,x + 1].
One update function is
φ
1
(x,U)=x + 2U −1. For U ∼ Unif([0, 1]),2U −1 ∼
Unif([−1,1]), and so this update function has the correct kernel. It is also monotone,
since f or any u ∈ [0, 1] and x ≤ y,
φ
1
(x,u) ≤
φ
1
(y,u).
It is, however, extraordinarily bad at coalescence. If x = y,then
φ
1
(x,u) =
φ
1
(y,u)
for all u ∈ [0,1]. A different update is needed in order to make states comes together.
Multishift coupling does the following. Consider the set of numbers S =
{...,−6, −4, −2, 0,2,4,6,...}. For any real
α
,letS+
α
denote the set {s+
α
: s ∈S}.
Hence S +0.5 = {...,−5.5, −3.5, −1.5,0.5,...}. Aga in let U ∼Unif([0,1]), and con-
sider the set S + 2U.
Then for any x ∈ R, exactly one point of S +2U falls in the interval [x −1,x + 1).
(Note that the point x+1 h as been removed from the interval, but this does not change
the kern el since this only occurs with probability 0 anyway.)
So the new update function is
φ
multishift
(x,U) is the unique element of (S + 2U) ∩[x −1,x + 1). (6.5)
Some pertinent facts about this update function:
Fact 6.1. Let
φ
multishift
(x,U)=min((S + 2U ) ∩ [x − 1,x + 1)). Then for U ∼
Unif([0,1]),
1. The update function has
φ
multishift
(x,U) ∼ Unif([x −1, x + 1)).
2. The update function is monotonic.
(∀x)(∀y ≥ x)(∀u ∈ [0 , 1])(
φ
multishift
(x,u) ≤
φ
multishift
(y,u)). (6.6)
Proof. Let a ∈ [x −1,x + 1).Thens = 2a/2 is the largest element of S that is at
most a. Then the event that
φ
multishift
(x,U) ≤a is just the event that s+2U ∈[x−1,a),
which occurs with probability (a −(x −1))/(s + 2 −s)=(a −(x −1))/2. Hence
φ
multishift
(x,U) ∼ Unif([x −1, x + 1)).
Let x < y and u ∈ [0,1]. Then if an element of S + 2u falls in [x −1,x + 1) ∩[y −
1,y + 1),then
φ
multishift
(x,u)=
φ
multishift
(y,u).Otherwise
φ
multishift
(x,u) < y −1and
φ
multishift
(y,u) > x + 1, so monotonicity is trivially obtained.
Suppose that the current state of the chain is bounded between x
min
and x
max
.
Then after taking one step using
φ
multishift
, the number of points to be coupled will
have been reduced to (x
max
−x
min
)/2.
There was nothing special about the width 2 on the interval. In general, given a
width w between elements of S and upper and lower bounds x
min
and x
max
, after one
step in the Markov chain there will remain at most (x
max
−x
min
)/w points.