Chapter 5
Advanced Techniques Using
Coalescence
Time has been transformed, and we have changed; it has advanced and set us in motion; it has unveiled its face, inspiring us with bewilderment and exhilaration.
Khalil Gibran
In the previous chapter, coupling from the past was introduced, which remains the most widely applicable protocol for creating perfect simulation algorithms. However, CFTP has two important drawbacks: it requires that the random variables generated be used twice, and it is noninterruptible. There are two variants of the algorithm that each deal with one of these issues.
5.1 Read-once coupling from the past
First consider acceptance/rejection. Recall that AR generates X_i ∼ ν(B) until first encountering X_T ∈ A. Denote the event that X_i ∈ A by S (for success) and X_i ∉ A by F (for failure).
Then a run of AR consists of a sequence of failures followed by a single success. So for instance, FFFFFFFS, or FS, or FFFS, or S are all valid possible sequences of blocks. That is, a valid sequence consists of a finite number of F blocks followed by an S block. The total number of blocks is a geometric random variable with parameter equal to the probability of an S block.
Suppose that multiple samples X ∼ ν(A) are desired. Then simply consider the infinite stream of blocks:

FFFSFSSFFFFFSFFFSFSSFFFS...

Every state at the end of an S block represents a sample exactly from the target distribution.
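To make the block picture concrete, here is a minimal Python sketch (all choices here, such as ν uniform on {0,...,9} and A = {0,...,4}, are illustrative assumptions rather than anything from the text). Each draw from ν is one block; a draw landing in A is an S block, and the state at the end of each S block is an exact draw from the target.

```python
import random

def ar_block_stream(k, seed=0):
    """Acceptance/rejection viewed as a stream of F and S blocks.

    nu is uniform on {0,...,9}; the target is nu conditioned on
    A = {0,...,4}.  Each draw is one block: S if it lands in A,
    F otherwise.  The state ending each S block is an exact sample.
    """
    rng = random.Random(seed)
    A = set(range(5))
    samples, blocks = [], []
    while len(samples) < k:
        x = rng.randrange(10)      # one block: a single draw from nu
        if x in A:                 # S block: record the sample
            blocks.append("S")
            samples.append(x)
        else:                      # F block: move on to the next block
            blocks.append("F")
    return samples, "".join(blocks)

samples, blocks = ar_block_stream(5)
```

Since each block here succeeds independently with probability 1/2, the number of blocks up to and including each S block is geometric, matching the description above.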
Now consider homogeneous CFTP. Let S denote the event that U ∈ A, so it is possible to determine if φ(Ω,U) = {y} for some y. For instance, when the update function is monotonic, then S is the event that φ(x_max,U) = φ(x_min,U).
Then CFTP can be summarized as follows. Generate U randomly. If S occurs,
then report the state at the end of the S block. Otherwise, the value of U chosen
results in an F block, and recursion occurs to generate the stationary sample, which
is then run through the F block.
Suppose, for instance, three levels of recursion are necessary. Then the structure of the output is S_3 F_2 F_1 F_0. The zeroth level (the original level) is an F, the first is an F, and the second is an F; these have been labeled with subscripts for their appropriate levels. Only the third level of recursion gives an S, which is then fed through the second level F_2 block, which is fed through the first level F_1 block, and finally the F_0 block.
So in general the form of homogeneous CFTP is SFF···F, where the final answer is the state at the end of the final F block, and where the total number of blocks is a geometric random variable with parameter equal to the probability of an S block.
Now suppose blocks are just run forward in time. Then the result looks very
similar to an AR run:
FSFFFSFFFSSFFSFSFFFFFFFSFFF...
Notice that after the first S block, there is a geometric number of blocks until the next S block occurs. Hence the first S block is followed by a geometric number (minus 1) of F blocks, exactly as in the original homogeneous CFTP!
Therefore, the state at the end of the final F block preceding the second S block must have the target distribution. In fact, the states preceding the third S block, the fourth S block, and so on, must all have the target distribution as well.
This is the essence of read-once coupling from the past (ROCFTP) as created by Wilson [129]. Run the blocks forward until the first S block. Then keep running forward, and before each subsequent S block, the state will come from the target distribution.
As before, let A be a set for an update function φ_t such that if U ∈ A, then φ_t(Ω,U) contains only a single element.
Read once coupling from the past
Input: k, t   Output: y_1,...,y_k ∼ π iid
1) i ← 0, x ← an arbitrary element of Ω
2) Repeat
3)    y_i ← x
4)    Draw U ← Unif([0,1]^t)
5)    x ← φ_t(x,U)
6)    i ← i + 1(U ∈ A)
7) Until i > k
Unlike the earlier formulations of perfect simulation, there is no explicit recursion. As with AR, the above code could be reformulated as a recursive function, since repeat loops can always be written recursively. From a practical perspective, using a repeat loop rather than recursion speeds up the process.
When i = 0, the repeat loop draws from φ_t(Ω,U) until a success is obtained. This is wasted effort, as these draws are then thrown away by the algorithm, and y_0 is not part of the output. So to generate k iid draws from π, ROCFTP requires k + 1 different success blocks, while CFTP only requires k of them.
However, CFTP needs to evaluate each step twice; therefore, to generate k samples requires 2k evaluations of the φ_t function, while ROCFTP only requires k + 1. For k = 1 there is no gain, but for larger numbers of samples this is a tremendous speedup.
This improvement requires that lines 4, 5, and 6 be written in such a way that the state updates and the test that U ∈ A can be accomplished by looking at U_1,...,U_t as a stream of choices. For instance, when the update is monotonic, this can be accomplished by applying the update function to the upper, lower, and middle states.
Monotonic Read once coupling from the past
Input: k, t   Output: y_1,...,y_k ∼ π iid
1) i ← 0, x ← an arbitrary element of Ω
2) Repeat
3)    y_i ← x
4)    x_max ← largest element of Ω, x_min ← smallest element of Ω
5)    For t′ from 1 to t
6)       Draw U_{t′} ← Unif([0,1])
7)       x ← φ(x,U_{t′}), x_max ← φ(x_max,U_{t′}), x_min ← φ(x_min,U_{t′})
8)    i ← i + 1(x_max = x_min)
9) Until i > k
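The monotone version can be sketched in Python for a toy monotone chain: a biased walk on {0,1,2,3} that steps down with probability 2/3 and up with probability 1/3 (the chain, the parameter choices, and all function names here are illustrative assumptions, not from the text). Since the update is monotone in the state, tracking only the largest and smallest states detects coalescence of the whole space.

```python
import random

def phi(x, u):
    """Monotone update: step down with prob 2/3, up with prob 1/3 (clamped)."""
    if u <= 2/3:
        return max(x - 1, 0)
    return min(x + 1, 3)

def monotonic_rocftp(k, t, seed=0):
    """Monotonic read-once CFTP: returns k iid draws from the
    stationary distribution of the chain driven by phi."""
    rng = random.Random(seed)
    i, x = 0, 0                    # x is an arbitrary element of Omega
    samples = []
    while True:
        y_i = x                    # state preceding this block
        x_max, x_min = 3, 0        # top and bottom of Omega
        for _ in range(t):         # run one block of t steps
            u = rng.random()
            x = phi(x, u)
            x_max = phi(x_max, u)
            x_min = phi(x_min, u)
        if x_max == x_min:         # S block: bounds coalesced
            if i >= 1:             # y_0 is discarded
                samples.append(y_i)
            i += 1
        if i > k:
            return samples
```

Note that, exactly as in the pseudocode, the recorded state is the one at the top of the loop, so each kept sample is the state just before an S block.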
The downside is that in the repeat loop, the state x needs to be saved at each step before φ(Ω,U) is calculated. Therefore, the memory requirements to hold the configuration have doubled.
Since this extra memory requirement is usually small compared to the storage
of the U variable, generally ROCFTP comes out ahead of CFTP in the memory
game. Furthermore, it can be written without the overhead of a random number of
recursions, which in many computer languages are very slow compared to repeat
loops. Therefore ROCFTP is typically preferred in practice to CFTP.
Lemma 5.1. Read-once coupling from the past generates Y_1,...,Y_k ∼ π that are iid.

Proof. The fact that a geometric number of blocks, where the first is an S block and the remainder are F blocks, gives a state in the target distribution is just basic CFTP where t is the same at each level of recursion. The correctness of this then follows as a corollary of Theorem 3.1.
So the Y_i are identically distributed. Are they independent? Well, the block that follows a state Y_i is an S block. The output of this S block is independent of the state Y_i that precedes it, since this is a Markov chain. Hence all Y_{i′} with i′ > i are independent of Y_i.
5.1.1 Example: ROCFTP for the Ising model
Now look at how this method works for the Ising model. The easiest way to use ROCFTP for monotonic systems is to first create an update function that takes as input the current state and all randomness needed to complete the step.
Ising Gibbs update function
Input: x ∈ Ω, v ∈ V, U ∈ [0,1]   Output: x(v) ∈ {−1,1}
1) Let n_1 be the number of neighbors of v labeled 1
2) Let n_{−1} be the number of neighbors of v labeled −1
3) x(v) ← −1 + 2·1(U ≤ exp(β n_1)/[exp(β n_1) + exp(β n_{−1})])
This pseudocode actually takes into account how computers work: given that only the label on node v changes at each step, it does not make sense for the update function to return the entire new state (although that is the formal definition). Instead, only the new value of x(v) is returned. The calling function, of course, needs to be aware of this behavior in order to utilize this function properly.
The next thing needed for monotonic CFTP is code that runs a single block forward, reports the new state of the chain, and also reports whether the block coalesced. The output variable B will be true if the block coalesced, and false otherwise.
Ising Gibbs block
Input: x ∈ Ω, t   Output: x ∈ Ω, B ∈ {TRUE, FALSE}
1) x_max ← (1,1,...,1), x_min ← (−1,−1,...,−1)
2) For t′ from 1 to t
3)    Draw v uniformly from V, U uniformly from [0,1]
4)    x(v) ← Ising Gibbs update function(x,v,U)
5)    x_min(v) ← Ising Gibbs update function(x_min,v,U)
6)    x_max(v) ← Ising Gibbs update function(x_max,v,U)
7) B ← (∀v ∈ V)(x_min(v) = x_max(v))
Note that as soon as the call to this routine is over, the random variables used are gone: as the name says, the random variables in read-once coupling from the past do not need to be stored. With the block algorithm, the general algorithm can be built.
ROCFTP Ising Gibbs
Input: k, t   Output: y_1,...,y_k
1) i ← 0, x ← (1,...,1)
2) Repeat
3)    y_i ← x
4)    (x,B) ← Ising Gibbs block(x,t)
5)    i ← i + 1(B)
6) Until i > k
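A hedged, self-contained Python sketch of the three routines for the Ising example, specialized to an n × n grid with periodic boundary (the grid size, the value of β, and the function names are illustrative assumptions, not from the text):

```python
import math
import random

def neighbors(v, n):
    """4-neighbors of site v = (r, c) on an n x n torus."""
    r, c = v
    return [((r-1) % n, c), ((r+1) % n, c), (r, (c-1) % n), (r, (c+1) % n)]

def ising_gibbs_update(x, v, u, beta, n):
    """Return the new label of site v given uniform u (heat-bath step)."""
    n1 = sum(1 for w in neighbors(v, n) if x[w] == 1)
    nm1 = sum(1 for w in neighbors(v, n) if x[w] == -1)
    p1 = math.exp(beta * n1) / (math.exp(beta * n1) + math.exp(beta * nm1))
    return -1 + 2 * (u <= p1)

def ising_gibbs_block(x, t, beta, n, rng):
    """Run one block of t steps; also report whether the bounds coalesced."""
    sites = [(r, c) for r in range(n) for c in range(n)]
    x_max = {v: 1 for v in sites}
    x_min = {v: -1 for v in sites}
    for _ in range(t):
        v = rng.choice(sites)
        u = rng.random()
        x[v] = ising_gibbs_update(x, v, u, beta, n)
        x_min[v] = ising_gibbs_update(x_min, v, u, beta, n)
        x_max[v] = ising_gibbs_update(x_max, v, u, beta, n)
    coalesced = all(x_min[v] == x_max[v] for v in sites)
    return x, coalesced

def rocftp_ising_gibbs(k, t, beta=0.3, n=3, seed=0):
    """Read-once CFTP for the Ising model: k iid perfect samples."""
    rng = random.Random(seed)
    sites = [(r, c) for r in range(n) for c in range(n)]
    i, x = 0, {v: 1 for v in sites}
    samples = []
    while True:
        y_i = dict(x)                  # state preceding this block
        x, coalesced = ising_gibbs_block(x, t, beta, n, rng)
        if coalesced:
            if i >= 1:                 # y_0 is discarded
                samples.append(y_i)
            i += 1
        if i > k:
            return samples
```

As in the text, the randomness is generated inside the block and discarded as soon as the block finishes; only the current state and the two bounding states are carried along.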
This example illustrates a strength of ROCFTP: the random variable generation can occur inside the block structure instead of outside it. This can greatly simplify the code for problems where the randomness used to generate the Markov chain step is complex and uses a random number of uniform random variables.
5.2 Fill, Machida, Murdoch, and Rosenthal’s method
So the two problems with basic CFTP are that it is read twice and noninterruptible. ROCFTP is read once but still noninterruptible.
Fill [35] was the first to introduce a variant of CFTP that was interruptible, but sadly is still read twice. Møller and Schladitz [103] extended his method to antimonotone chains, and then in [37], Fill, Machida, Murdoch, and Rosenthal generalized Fill's algorithm to general update functions. This last algorithm will be referred to here as FMMR.
To use ROCFTP, first you need an S block, followed by a number of F blocks, then followed by a second S block. The stationary state is the state at the beginning of the second S block. An S block is an update such that φ(Ω,U) = {x} for some state x in Ω.
Suppose we first fix an x ∈ Ω. Now call a block an S_x block if φ(Ω,U) = {x}; otherwise call the block an F block. Then the same argument for why the output of ROCFTP is stationary means that if we generate an S_x block, followed by some number of F blocks, then a second S_x block, the state at the beginning of the second S_x block will be stationary.
Suppose that we could just generate an S_x block, and it was possible to run the chain backwards in time, to figure out the state y at the beginning of the block given that it ended at state x. Then y ∼ π.
Here is the central FMMR idea: start with state X_t = x. Run X_t backwards in time to get X_{t−1}, X_{t−2},...,X_0. Now run the chain forward in time conditioned on X_0, X_1,...,X_t. If φ(Ω,U) = {x}, then accept this as a draw from an S_x block. Otherwise reject and start over.
This is similar to CFTP, but in AR form! So unlike CFTP it is interruptible. This makes FMMR the first widely applicable interruptible perfect simulation algorithm. (It was not the last, however; see Chapters 8 and 9.)
To see what is meant by running the Markov chain backwards in time, it helps to have an example. Consider the biased random walk on {0,1,2}, with update function

φ(x,U) = x + 1(x < 2, U > 2/3) − 1(x > 0, U ≤ 2/3).

Hence the chain adds one to the state with probability 1/3, and subtracts one from the state with probability 2/3 (unless the move would take the state out of {0,1,2}).
The stationary distribution of this chain is π({0}) = 4/7, π({1}) = 2/7, π({2}) = 1/7. Suppose X_4 ∼ π, X_5 ∼ π, and the goal is to figure out how to simulate [X_4|X_5].
If X_5 = 1, then X_4 is either 0 or 2. By Bayes' rule,

P(X_4 = 0|X_5 = 1) = P(X_4 = 0)P(X_5 = 1|X_4 = 0)/P(X_5 = 1)
                   = π({0})(1/3)/π({1}) = (4/7)(1/3)/(2/7) = 2/3.
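These numbers can be checked mechanically. The sketch below (illustrative, not from the text) builds the transition matrix of the walk with exact rational arithmetic, verifies that (4/7, 2/7, 1/7) is stationary, and recomputes P(X_4 = 0 | X_5 = 1) by Bayes' rule:

```python
from fractions import Fraction as F

# Transition matrix of the biased walk on {0, 1, 2}:
# up with prob 1/3 (if x < 2), down with prob 2/3 (if x > 0).
P = [
    [F(2, 3), F(1, 3), F(0)],     # from 0: stay, up
    [F(2, 3), F(0),    F(1, 3)],  # from 1: down, up
    [F(0),    F(2, 3), F(1, 3)],  # from 2: down, stay
]
pi = [F(4, 7), F(2, 7), F(1, 7)]

# Check stationarity: pi P = pi.
pi_P = [sum(pi[i] * P[i][j] for i in range(3)) for j in range(3)]
assert pi_P == pi

# Bayes' rule: P(X_4 = 0 | X_5 = 1) = pi(0) P(0,1) / pi(1).
backward = pi[0] * P[0][1] / pi[1]
print(backward)   # 2/3
```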
In fact, it turns out that P(X_4 = i|X_5 = j) = P(X_5 = i|X_4 = j): the transition probabilities for the Markov chain look exactly the same whether the chain is being run forward or backward in time. In general, this situation holds when π(dx)P(X_{t+1} ∈ dy|X_t = x) = π(dy)P(X_{t+1} ∈ dx|X_t = y), that is, when the chain is reversible with respect to π.
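Putting the pieces together, here is a hedged Python sketch of FMMR for this walk (the function names and the choices x = 0, t = 10 are illustrative assumptions). Since the chain is reversible, the backward step uses the same kernel as the forward step. The forward pass then redraws each U conditioned on the recorded transition, applies it to the extreme states 0 and 2, and accepts X_0 exactly when the whole space has coalesced.

```python
import random

def phi(x, u):
    """Update function for the biased walk on {0, 1, 2}."""
    up = 1 if (x < 2 and u > 2/3) else 0
    down = 1 if (x > 0 and u <= 2/3) else 0
    return x + up - down

def conditional_u(s, s_next, rng):
    """Draw U uniformly, conditioned on phi(s, U) = s_next.

    For this walk each transition corresponds to exactly one of the
    intervals [0, 2/3] (the "down" draw) or (2/3, 1] (the "up" draw).
    """
    goes_up = s_next > s or (s == 2 and s_next == 2)
    if goes_up:
        return 2/3 + (1.0 - rng.random()) / 3   # uniform on (2/3, 1]
    return rng.random() * (2/3)                 # uniform on [0, 2/3)

def fmmr_attempt(x, t, rng):
    """One FMMR attempt; returns y ~ pi on acceptance, None on rejection."""
    # 1) Run backward from X_t = x; by reversibility the backward
    #    kernel is the same as the forward kernel.
    path = [x]
    for _ in range(t):
        path.append(phi(path[-1], rng.random()))
    path.reverse()                 # now path[i] = X_i and path[t] = x
    # 2) Redraw each U conditioned on the recorded transition, and
    #    apply it to the smallest and largest states (monotone update).
    lo, hi = 0, 2
    for i in range(t):
        u = conditional_u(path[i], path[i + 1], rng)
        lo, hi = phi(lo, u), phi(hi, u)
    # 3) If the bounds coalesced, the whole space has mapped to x,
    #    so this was an S_x block: accept X_0.
    return path[0] if lo == hi else None

def fmmr_sample(t=10, x=0, seed=0):
    rng = random.Random(seed)
    while True:
        y = fmmr_attempt(x, t, rng)
        if y is not None:
            return y
```

Because the accept/reject structure is AR rather than recursion, aborting a rejected attempt and starting over introduces no bias, which is exactly the interruptibility property discussed above.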