Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 5

Solvability of Colorless Tasks in Different Models

Abstract

This chapter explores the circumstances under which colorless tasks can be solved in different communication models, satisfying different fault-tolerance requirements. We consider both shared memory and message-passing models, wait-free and -resilient protocols, and protocols that work against adversaries.

Keywords

-resilient; Adversaries; Layered snapshot protocols; Message-passing protocols; Wait-free

In Chapter 4 we considered colorless layered immediate snapshot protocols and identified the colorless tasks that such protocols can solve while tolerating crash failures by any number of processes. This chapter explores the circumstances under which colorless tasks can be solved using other computational models.

We consider models with different communication mechanisms and different fault-tolerance requirements. We show that the ideas of the previous chapter can be extended to characterize the colorless tasks that can be solved when up to out of processes may crash, when the processes communicate by shared objects that solve -set agreement, or when the processes communicate by message passing.

Once we have established necessary and sufficient conditions for a task to have a protocol in a particular model, it is natural to ask whether it is decidable whether a given task satisfies those conditions. We will see that the answer to that question depends on the model.

5.1 Overview of models

Recall from Chapter 4 that a colorless task is one where only the sets of input or output values matter, not which process has which. For such tasks, an initial configuration remains an initial configuration if one participating process exchanges its own input value for another’s, and the same holds true for final configurations. Consensus and -set agreement are examples of colorless tasks.

In a model where the processes communicate by layered snapshots and any number of processes can fail by crashing, a protocol must be wait-free. A process cannot wait for another process to take a step, because it cannot tell whether that process has crashed or is merely slow. We have seen that a colorless task has a wait-free -process layered immediate snapshot protocol if and only if there is a continuous map

(5.1.1)

carried by (Theorem 4.3.1). Informally, this characterization says that wait-free layered snapshot protocols transform (sets of at most different) inputs to outputs in a continuous way.

In this chapter we consider several other models for which the computational power can be measured by a parameter . The colorless tasks solvable in a model with parameter are exactly those for which there is a continuous map

carried by . Thus, the wait-free layered snapshot model is the weakest, having , whereas a model with can solve any colorless task.

Sometimes the wait-free condition may be too demanding. Instead of tolerating failures by an arbitrary subset of processes, we may be willing to tolerate fewer failures. A protocol is -resilient if it tolerates halting failures by as many as processes. (A wait-free protocol is -resilient.) We say that a colorless task has a t-resilient protocol in a model if, for all , there is a -resilient -process protocol for that task. In Section 5.2 we will see that a colorless task has a -resilient layered snapshot protocol if and only if there is a continuous map

(5.1.2)

carried by . Not surprisingly, the -resilient Condition 5.1.2 is strictly weaker than its wait-free counterpart, Condition 5.1.1, since the map needs to be defined only over the -skeleton of the input complex. The lower the dimension , the easier it is to satisfy this condition and the more tasks that can be solved. In a sense, these two conditions capture the cost of fault tolerance. For colorless tasks, solvability is determined by the number of processes that can fail, whereas the total number of processes is irrelevant.

We also show (Section 5.3) that if we augment layered snapshot protocols by also allowing processes to communicate through -set agreement objects, then a colorless task has a wait-free layered protocol if and only if there is a continuous map

carried by . Adding -set agreement objects, , increases the computational power of layered snapshot protocols by lowering the dimension of the skeleton on which a map must exist.

It follows that fault tolerance and communication power are, in a sense, interchangeable for colorless computability. A -resilient layered colorless protocol and a wait-free layered protocol augmented by -set agreement objects, are equivalent: They can solve the same colorless tasks. Notice that in the extreme case, where , any colorless task is solvable, either because there are no failures or because the processes can reach consensus (Exercise 5.2). More generally, let be an integer, . Then, for any such that , there is a -resilient -set agreement layered snapshot protocol for a task if and only if there is a continuous map

carried by (see Exercise 5.4).

The previous chapter’s techniques extend even to the case where process failures are not independent. In Section 5.4, we show how to exploit knowledge of which potential failures are correlated and which are not. A parameter captures the power of such a model for solving colorless tasks. This parameter is the size of the smallest core in the system, a minimal set of processes that will not all fail in any execution. The result for -resilient solvability readily generalizes to dependent failures. A colorless task has a layered protocol with minimal core size if and only if there is a continuous map

carried by .

Next, in Section 5.5 we consider message-passing protocols. The layered snapshot model might appear to be stronger; once a process writes a value to shared memory, that value is there for all to see, whereas a value sent in a message is visible only to the process that received the message. Perhaps surprisingly, as long as a majority of processes is nonfaulty (that is, ), the two models are equivalent: Any task that has a -resilient layered immediate snapshot protocol has a -resilient message-passing protocol, and vice versa.

Once we have established necessary and sufficient conditions for a task to have a protocol in a particular model, it is natural to ask whether it is decidable whether a given task satisfies those conditions. We will see in Section 5.6 that the answer depends on the model. Essentially, for any model in which solvable tasks are exactly those for which there is a continuous map

carried by , then solvability is decidable if and only if .

5.2 t-Resilient layered snapshot protocols

Recall that wait-free protocols tolerate crash failures by all processes but one (that is, out of ). Sometimes this level of resilience is excessive, especially if there are many processes. Instead, it may be enough to tolerate only failures among processes, where , a property called -resilience.

A colorless -resilient layered immediate snapshot protocol (-resilient layered protocol when clear from context) is structured as shown in Figure 5.1. As in the wait-free case, the processes share a two-dimensional memory array mem[][i], where row is shared only by the processes participating in layer , and column is written only by . During layer writes its current view to mem[][i], waits for views (including its own) to be written to that layer’s row, and then takes a snapshot of that row. The waiting step introduces no danger of deadlock because at least nonfaulty processes will eventually reach each level and write their views.

Figure 5.1 -Resilient layered immediate snapshot protocol: Pseudo-code for .

Notice that the wait-free layered snapshot protocol of Figure 4.1, where , is a degenerate form of the -resilient protocol of Figure 5.1. In the wait-free protocol, once has written to mem[][i], it can proceed immediately because , and one view (its own) has already been written.

Right away we can see that even an -resilient protocol can solve colorless tasks that cannot be solved by a wait-free protocol (and in a single layer). The pseudo-code in Figure 5.2 solves -set agreement if at most processes may fail. In contrast, we know from Theorem 4.3.6 that there is no -set agreement protocol if processes can fail when . More generally, this impossibility holds for any value of (Theorem 5.2.9), so each additional level of resilience allows us to solve a harder instance of set agreement.

Lemma 5.2.1

There exists a -resilient layered snapshot protocol for -set agreement.

Proof

As shown in Figure 5.2, each process writes its input, waits until inputs have been written, and then chooses the least value read. Because there are at least nonfaulty processes, the waiting step has no danger of deadlock. Because each process can “miss” values from at most processes, each value chosen will be among the least input values, so at most distinct values can be chosen.

In Exercise 5.22, we ask you to show that this protocol does not actually require immediate snapshots.

Figure 5.2 -Resilient single-layer snapshot protocol for -set agreement.

The following lemma will be useful for characterizing the colorless tasks that can be solved, tolerating failures, by a layered colorless protocol. It is similar to Theorem 4.2.8 for wait-free single-layer colorless immediate snapshot protocol complex, and indeed the proof is similar as well.

By Definition 4.2.2 we can consider the triple for the protocol of Figure 5.1, where is the input complex of a task, is the protocol complex where each simplex is a colorless final configuration, and is the strict execution carrier map.

Lemma 5.2.2

For any colorless single-layer -process -resilient snapshot protocol , we have , and the restriction of the execution map to this skeleton is the composition of the -skeleton and barycentric subdivision operators.

Proof

Consider all executions of the -resilient protocol of Figure 5.1 on the input subcomplex . Assume all processes start with vertices from a simplex in . The sets of views assembled by the processes form a chain of faces

The inclusion follows because these views are snapshots, and snapshots are atomic: If assembles face and assembles face , then , or vice versa.

These chains can have length at most , because , so indeed the complex consisting of such simplices is contained in the -skeleton of the barycentric subdivision .

Moreover, any simplex in can be produced by such a chain. Consider an execution where processes start with input vertices from and at least one starts with each of the other vertices of . (There are enough processes because the chain has length at most .) Suppose all the processes with inputs from concurrently write to the array and immediately take a snapshot, ending up with views equal to . Similarly, all processes with input from write and immediately take a snapshot, and so on.

The complex consisting of such simplices is precisely the barycentric subdivision of the -skeleton of . Taking the complex over all possible inputs, we have contains , and is the restriction of .

A simple induction, with Lemma 5.2.2 as the base, yields the following.

Lemma 5.2.3

Any colorless -layer -process -resilient snapshot protocol is the composition of single-layer -resilient protocols, where , and the restriction of the execution map to this skeleton is the composition of the barycentric subdivision and -skeleton operators.

If a protocol solves a colorless task , then we are free to add a preprocessing step to the protocol, where first the processes agree on at most of their inputs, where , using the protocol of Figure 5.2. The following lemma states this formally using the protocol composition Definition 4.2.5.

Lemma 5.2.4

Skeleton Lemma

Assume that for any input complex there is an -process protocol, , that solves the -set agreement task for some fixed .

Assume furthermore that the protocol solves the colorless task with decision map . Then the composition of the -set agreement task with the protocol also solves using the same decision map .

Proof

Recall that by Definition 4.2.5 the task can be composed with the protocol , since . The result of the composition is a new protocol , where .

We check that is a correct decision map for the task. Pick an arbitrary . We have

where the last inclusion is a corollary of the fact that the protocol solves the task . It follows that is a decision map for the composite protocol.

We may now combine the previous results to show that, for -resilient colorless task solvability, we may assume without loss of generality that a protocol complex is a barycentric subdivision of the -skeleton of the input complex.

Lemma 5.2.5

If there is a -resilient layered protocol that solves the colorless task , then there is a -resilient layered protocol solving that task whose protocol complex is , and

Proof

By Lemma 5.2.1, there exists a -resilient layered snapshot protocol for -set agreement. By the Skeleton Lemma (5.2.4), we can assume without loss of generality that any -resilient colorless protocol’s input complex is . Starting on a simplex in , after the first layer each process’s view is a vertex of , and all their views form a simplex of . After layers, their views form a simplex of . It follows that .

The other direction follows from Lemma 5.2.3. It follows that .

Corollary 5.2.6

For any input complex , and , there is an -process -resilient layered protocol that solves the barycentric agreement task .

Theorem 5.2.7

The colorless task has a -resilient layered snapshot protocol if and only if there is a continuous map

(5.2.1)

carried by .

Proof

By Lemma 5.2.5, for any -resilient layered snapshot protocol we may assume the protocol complex is . Because layered snapshot protocols solve any barycentric agreement task, we can apply the Protocol Complex Lemma (4.2.6), which states that the protocol solves the task if and only if there is a continuous map

carried by . The claim follows because .

Applying the Discrete Protocol Complex Lemma (4.2.7),

Corollary 5.2.8

The colorless task has a -resilient layered snapshot protocol if and only if there is a subdivision of and a simplicial map

carried by .

Without loss of generality, we can assume that any -resilient layered protocol consists of one -set agreement layer followed by any number of immediate snapshot layers. Moreover, only the first -set agreements layer requires waiting; the remaining layers can be wait-free.

Theorem 5.2.9

There is no -resilient layered snapshot protocol for -set agreement.

Proof

See Exercise 5.3.

An important special case of the previous theorem occurs when , implying that consensus is not solvable by a layered protocol even if only a single process can fail.

5.3 Layered snapshots with k-set agreement

Practically all modern multiprocessor architectures provide synchronization primitives more powerful than simple read or write instructions. For example, the test-and-set instruction atomically swaps the value true for the contents of a memory location. If we augment layered snapshots with test-and-set, for example, it is possible to solve wait-free -set agreements for (see Exercise 5.5). In this section, we consider protocols constructed by composing layered snapshot protocols with -set agreement protocols.

In more detail, we consider protocols in the form of Figure 5.3. The protocol is similar to the colorless wait-free snapshot protocol of Figure 4.1 except that in addition to sharing memory, the objects share an array of -set agreement objects (Line 3). In each layer , the processes first join in a -set agreement protocol with the other processes in that layer (Line 8), and then they run an -layer immediate snapshot protocol (Line 11) for some .

Figure 5.3 Colorless layered set agreement protocol: Pseudo-code for .

Recall that the -set agreement protocol with input complex is , where the skeleton operator is considered as a strict carrier map (see Exercise 4.8).

Recall also that if and are protocols where the protocol complex for the first is contained in the input complex for the second, then their composition is the protocol , where (Definition 4.2.3).

Definition 5.3.1

A -set layered snapshot protocol is one composed from layered snapshot and -set agreement protocols.

Lemma 5.3.2

Without loss of generality, we can assume the that the first protocol in any such composition is a -set agreement protocol. (That is, .)

Proof

This claim follows directly from the Skeleton Lemma (5.2.4).

Lemma 5.3.3

If is a -set layered snapshot protocol, then is equal to for some .

Proof

We argue by induction on , the number of -set and layered snapshot protocols composed to construct . For the base case, when , the protocol is just a -set agreement protocol by Lemma 5.3.2, so the protocol complex is just .

For the induction step, assume that is the composition of and , where the first protocol is the result of composing -set or layered snapshot protocols, and . By the induction hypothesis, is for some .

There are two cases. First, if is a -set protocol, then

Second, if it is an -layer snapshot protocol, then

Theorem 5.3.4

The colorless task has a -set layered snapshot protocol if and only if there is a continuous map

(5.3.1)

carried by .

Proof

By Lemma 5.2.5, any -set layered snapshot protocol has . By the Protocol Complex Lemma (4.2.6), the protocol solves the task if and only if there is a continuous map

carried by . The claim follows because .

Applying the Discrete Protocol Complex Lemma (4.2.7):

Corollary 5.3.5

The colorless task has a -set layered snapshot protocol if and only if there is a subdivision of and a simplicial map

carried by .

Theorem 5.3.6

There is no -set layered snapshot protocol for -set agreement.

Proof

See Exercise 5.7.

The next corollary follows because Theorem 5.3.4 is independent of the order in which -set agreement layers are composed with immediate snapshot layers.

Corollary 5.3.7

We can assume without loss of generality that any set agreement protocol consists of a single -set agreement layer followed by some number of layered immediate snapshot protocols.

5.4 Adversaries

A -resilient protocol is designed under the assumption that failures are uniform: Any out of processes can fail. Often, however, failures are correlated. In a distributed system, processes running on the same node, in the same network partition, or managed by the same provider may be more likely to fail together. In a multiprocessor, processes running on the same core, on the same processor, or on the same card may be likely to fail together. It is often possible to design more effective fault-tolerant algorithms if we can exploit knowledge of which potential failures are correlated and which are not.

One way to think about such failure models is to assume that failures are controlled by an adversary who can cause certain subsets of processes to fail, but not others. There are several ways to characterize adversaries. The most straightforward is to enumerate the faulty sets: all sets of processes that fail in some execution. We will assume that faulty sets are closed under inclusion; if is a maximal set of processes that fail in some execution, then for any there is an execution in which is the actual set of processes that fail. There is a common-sense justification for this assumption: We want to respect the principle that fault-tolerant algorithms should continue to be correct if run in systems that display fewer failures than in the worst-case scenario. A model that permits algorithms that are correct only if certain failures occur is unlikely to be useful in practice.

Faulty sets can be described as a simplicial complex , called the faulty set complex, the vertices of which are process names and the simplices of which are sets of process names such that exactly those processes fail in some execution.

Faulty sets can be cumbersome, so we use a more succinct and flexible way to characterize adversaries. A core is a minimal set of processes that will not all fail in any execution. A core is a simplex that is not itself in the faulty set complex, but all of its proper faces are in . The following dual notion is also useful. A survivor set is a minimal set of processes that intersects every core (such a set is sometimes called a hitting set). In every execution, the set of nonfaulty processes includes a survivor set.

Here are some examples of cores and survivor sets.

The Wait-Free Adversary. The entire set of processes is the only core, and the singleton sets are the survivor sets.

The -Faulty Adversary. The cores are the sets of cardinality , and the survivor sets are the sets of cardinality .

An Irregular Adversary. Consider a system of four processes, , , and , where any individual process may fail, or and may both fail. Here is a core, since they cannot both fail, yet there is an execution in which each one fails. In all, there are five cores:

and three survivor sets:

The set is a survivor set, since there is an execution where only these processes are nonfaulty. This adversary is illustrated in Figure 5.4.

Figure 5.4 An irregular adversary: , and can each fail individually, or and may both fail. The faulty set complex consists of an edge linking and , shown as a solid line, and two isolated vertices, and . There are five cores, shown as dotted lines.

Here is how to use cores and survivor sets in designing a protocol. Given a fixed core , it is safe for a process to wait until it hears from some member of , because they cannot all fail. It is also safe for a process to wait until it hears from all members of some survivor set, because the set of nonfaulty processes always contains a survivor set. See Exercise 5.14.

Let be an adversary with minimum core size . We say that a protocol is -resilient if it tolerates any failure permitted by . As illustrated in Figure 5.5, an -resilient layered snapshot protocol differs from a -resilient protocol as follows. At each layer, after writing its own value, each process waits until all the processes in a survivor set (possibly including itself) have written their views to that layer’s memory. As noted, there is no danger of deadlock waiting until a survivor set has written.

Figure 5.5 -resilient layered snapshot protocol: Pseudo-code for .

Figure 5.6 -resilient layered snapshot protocol for -set agreement.

Notice that the -resilient layered snapshot protocol of Figure 5.1 is a degenerate form of the -resilient protocol of Figure 5.5 because, for the -resilient protocol, any set of processes is a survivor set.

Lemma 5.4.1

Let be an adversary with minimum core size . There is an -resilient layered snapshot protocol for -set agreement.

Proof

It is a little easier to explain this protocol using writes and snapshots instead of immediate snapshots (see Exercise 5.23). Pick a core of of minimal size . Figure 5.6 shows a single-layer protocol. Each process in writes its input to mem[0][i], while each process not in repeatedly takes snapshots until it sees a value written (by a process in ). It then replaces its own input value with the value it found. At most distinct values can be chosen. This protocol must terminate because is a core, and the adversary cannot fail every process in .

Lemma 5.4.2

Without loss of generality, for any -layer -resilient colorless protocol ,

Proof

By Lemma 5.4.1, there exists an -resilient layered snapshot protocol for -set agreement. By the Skeleton Lemma (5.2.4), we can assume without loss of generality that any -resilient colorless protocol’s input complex is . From that point on the rest of the proof is virtually identical to the proof of Lemma 5.2.5.

Theorem 5.4.3

The colorless task has an -resilient layered snapshot protocol if and only if there is a continuous map

(5.4.1)

carried by .

Proof

By Lemma 5.4.2, any -resilient layered snapshot protocol has . The Protocol Complex Lemma (4.2.6) states that the protocol solves the task if and only if there is a continuous

carried by . The claim follows because .

Applying the Discrete Protocol Complex Lemma (4.2.7):

Corollary 5.4.4

The colorless task has an -resilient layered snapshot protocol if and only if there is a subdivision of and a simplicial map

carried by .

Theorem 5.4.5

There is no -resilient -set agreement layered snapshot protocol.

Proof

See Exercise 5.15.

5.5 Message-passing protocols

So far we have focused on models in which processes communicate through shared memory. We now turn our attention to another common model of distributed computing, where processes communicate by message passing.

There are asynchronous processes that communicate by sending and receiving messages via a communication network. The network is fully connected; any process can send a message to any other. Message delivery is reliable; every message sent is delivered exactly once to its target process after a finite but potentially unbounded delay. Message delivery is first-in, first-out (FIFO); messages are delivered in the order in which they were sent.

The operational model is essentially unchanged from the layered snapshot model. The principal difference is that communication is now one-to-one rather than one-to-many. In Exercise 5.11, we ask you to show that barycentric agreement is impossible in a message-passing model if a majority of the process can fail. For this reason, we restrict our attention to -resilient protocols where , the number of processes that can fail, is less than half: .

We will see that as long as a majority of processes are nonfaulty, there is a -resilient message-passing protocol if and only if there is a -resilient layered snapshot protocol. We will see, however, that message-passing protocols look quite different from their shared-memory counterparts.

For shared-memory protocols, we focused on layered protocols because it is convenient to have a “clean” shared memory for each layer. For message-passing protocols, where there is no shared memory, we will not need to use layered protocols. Later, in Chapter 13, it will be convenient impose a layered structure on asynchronous message-passing executions.

In our examples we use the following notation. A process sends a message containing values to as follows:

send to

We say that a process broadcasts a message if it sends that message to all processes, including itself:

send to all

Here is how receives a message from :

upon receive do

… // do something with the values received

Some message-passing protocols require that each time a process receives a message from another, the receiver forwards that message to all processes. Each process must continue to forward messages even after it has chosen its output value. Without such a guarantee, a nonfaulty process that chooses an output and falls silent is indistinguishable from a crashed process, implying that tasks requiring a majority of processes to be nonfaulty become impossible. We think of this continual forwarding as a kind of operating system service running in the background, interleaved with steps of the protocol itself. In our examples, such loops are marked with the background keyword:

background // forward messages forever

upon receive do

send to all

We start with two useful protocols, one for -set agreement and one for barycentric agreement.

5.5.1 Set agreement

As a first step, each process assembles values from as many other processes as possible. The getQuorum() method shown in Figure 5.7 collects values until it has received messages from all but processes. It is safe to wait for that many messages because there are at least nonfaulty processes. It is not safe to wait for more, because the remaining processes may have crashed.

Figure 5.7 Return values from at least processes.

Figure 5.8 shows a simple protocol for -set agreement. Each process broadcasts its input value, waits to receive values from a quorum of messages, and chooses the least value among them. A proof of this protocol’s correctness is left as Exercise 5.9. Note that this protocol works for any value of .

Figure 5.8 -resilient message-passing protocol for -set agreement.

5.5.2 Barycentric agreement

Recall that in the barycentric agreement task, each process is assigned as input a vertex of a simplex , and after exchanging messages with the others, chooses a face containing , such that for any two participating processes and the faces they choose are ordered by inclusion: , or vice versa. This task is essentially equivalent to an immediate snapshot, which it is convenient (but not necessary) to assume as a shared-memory primitive operation. In message-passing models, however, we assume send and receive as primitives, and we must build barycentric agreement from them.

Figure 5.9 shows a message-passing protocol for barycentric agreement. Each maintains a set of messages it has received, initially only ’s input value (Line 2). repeatedly broadcasts , and waits to receive sets from other processes. If it receives such that (Line 7), then it increments its count of the number of times it has received . If it receives such that (Line 9). It sets to and starts over. When has received identical copies of from distinct processes, the protocol terminates, and decides . As usual, after the protocol terminates, must continue to forward messages to the others (Lines 15–17).

Lemma 5.5.1

The protocol in Figure 5.9 terminates.

Proof

Suppose, by way of contradiction, that runs this protocol forever. Because changes at most times, there is some time at which ’s assumes its final value . For every set that received earlier, , and for every received later, .

When updates to , it broadcasts to the others. Suppose a nonfaulty receives from , where . must have sent to when it first set to . Since henceforth does not change , either , or . If , then will send back to , increasing its count. If , then already sent to . Either way, receives a copy of from at least nonfaulty processes and terminates the protocol.

Lemma 5.5.2

In the protocol in Figure 5.9 , if decides and decides , then either , or vice versa.

Proof

Note that the sequence of sets broadcast by any process is strictly increasing: . To decide, received from a set of at least processes, and received from a set at least processes. Because cannot exceed , and must both contain a process that sent both and , implying they are ordered, a contradiction.

Figure 5.9 Barycentric agreement message-passing protocol.

5.5.3 Solvability condition

We can now characterize which tasks have protocols in the -resilient message-passing model.

Theorem 5.5.3

For , has a -resilient message-passing protocol if and only if there is a continuous map

carried by ,

Proof

Protocol Implies Map

If a task has an -process -resilient message-passing protocol, then it has an -process -resilient layered snapshot protocol (see Exercise 5.10). The claim then follows from Theorem 5.2.7.

Map Implies Protocol. The map

has a simplicial approximation,

also carried by . We construct a two-step protocol. In the first step, the processes use the -set agreement protocol of Figure 5.8 to converge to a simplex in , In the second step, they repeat the barycentric agreement protocol of Figure 5.9 to converge to a simplex in . Composing these protocols and using as a decision map yields the desired protocol.

Theorem 5.5.4

For has a -resilient message-passing protocol if and only if there is a subdivision of and a simplicial map

carried by .

Proof

See Exercise 5.16.

Theorem 5.5.5

There is no -resilient message-passing protocol for -set agreement.

Proof

See Exercise 5.17.

5.6 Decidability

This section uses more advanced mathematical techniques than the earlier sections.

Now that we have necessary and sufficient conditions for a task to have a protocol in various models, it is natural to ask whether we can automate the process of deciding whether a given task has a protocol in a particular model. Can we write a program (that is, a Turing machine) that takes a task description as input and returns a Boolean value indicating whether a protocol exists?

Not surprisingly, the answer depends on the model of computation. For wait-free layered snapshot protocols or wait-free -set layered snapshot protocols for , the answer is no: There exists a family of tasks for which it is undecidable whether a protocol exists. We will construct one such family: the loop agreement tasks, discussed in Chapter 15. On the other hand, for wait-free -set layered snapshot protocols for or 2, the answer is yes: For every task, it is decidable whether a protocol exists. For any model where the solvability question depends only on the 1-skeleton of the input complex, solvability is decidable (see Exercise 5.19).

5.6.1 Paths and loops

Let be a finite 2-dimensional complex. Recall from Chapter 3 that an edge path between vertices and in is a sequence of vertices such that each pair is an edge of for . A path is simple if the vertices are distinct.

Definition 5.6.1

An edge path is an edge loop if its first and last vertices are the same. An edge loop is simple if all the other vertices are distinct. An edge loop’s first vertex is called its base point.

All edge loops considered here are assumed to be simple.

Informally, we would like to distinguish between edge loops that circumscribe “solid regions” and edge loops that circumscribe holes. To make this notion precise, we must introduce some continuous concepts.

Definition 5.6.2

Fix a point on the unit circle . A continuous loop in with base point is a continuous map such that . A continuous loop is simple if it has no self-intersections: only if .

All continuous loops considered here are assumed to be simple.

As illustrated in Figure 5.10, a continuous loop in is contractible if it can be continuously deformed to its base point in finite “time,” leaving the base point fixed. Formally, we capture this notion as follows.

Definition 5.6.3

A continuous loop in is contractible if it can be extended to a continuous map , where denotes the -disk for which the boundary is the circle , the input domain for .

Figure 5.10 Noncontractible (left) and contractible (right) continuous loops.

A simple continuous loop is a representative of a simple edge loop if their geometric images are the same: .

Definition 5.6.4

A simple edge loop is contractible if it has a contractible representative.

Although any particular simple edge loop has an infinite number of representatives, it does not matter which one we pick.

Fact 5.6.5

Either all of an edge loop’s representatives are contractible, or none are.

In Exercise 5.18, we ask you to construct an explicit representative of an edge path.

Fact 5.6.6

The question whether an arbitrary simple edge loop in an arbitrary finite simplicial complex is contractible is undecidable.

Remarkably, the question remains undecidable even for complexes of dimension two (see Section 5.7, “Chapter notes”).

Mathematical Note 5.6.7

The notion of contractibility is a special case of a more general notion called loop homotopy. Given two continuous loops with the same base point, we would like to treat them as equivalent if one loop can be continuously deformed to the other in finite “time,” leaving their common base point fixed. Formally, two loops with common base point are homotopic if there is a continuous map , such that , for all . If we think of the second coordinate in as time, then is is , and is the intermediate loop at time , for . Note that the base point does not move during the deformation.

The trivial loop never leaves its base point. It is given by , where for all . It is a standard fact that a loop is contractible if and only if it is homotopic to the trivial loop at its base point.

The homotopy classes of loops for a topological space are used to define that space’s fundamental group, usually denoted . These groups are extensively studied in algebraic topology.

5.6.2 Loop agreement

Let denote the -simplex for which the vertices are labeled , and , and let denote an arbitrary -dimensional complex. We are given three distinct vertices , and in , along with three edge paths , and , such that each path goes from to . We let denote the corresponding -dimensional simplicial subcomplex as well, in which case we let . We assume that the paths are chosen to be nonself-intersecting and that they intersect each other only at corresponding end vertices.

Definition 5.6.8

These edge paths , and form a simple edge loop with base point , which we call a triangle loop, denoted by the -tuple .

In the loop agreement task, the processes start on vertices of and converge on a simplex in , subject to the following conditions. If all processes start on a single vertex , they converge on the corresponding vertex . If they start on two distinct input vertices, and , they converge on some simplex (vertex or edge) along the path linking and . Finally, if the processes start on all three input vertices , they converge to some simplex (vertex, edge, or triangle) of . See Figure 5.11 for an illustration. More precisely:

Definition 5.6.9

The loop agreement task associated with a triangle loop in a simplicial complex is a triple , where the carrier map is given by

Since the loop agreement task is completely determined by the complex and the triangle loop , we also denote it by .

Figure 5.11 Loop agreement.

5.6.3 Examples of loop agreement tasks

Here are some examples of interesting loop agreement tasks:

• A -set agreement task can be formulated as the loop agreement task , where .

• Let be an arbitrary subdivision of . In the -dimensional simplex agreement task, each process starts with a vertex in . If is the face composed of the starting vertices, then the processes converge on a simplex in . This task is the loop agreement task , where , with denoting the unique simple edge path from to in the subdivision of the edge .

• The -dimensional -th barycentric simplex agreement task is simplex agreement for , the -th iterated barycentric subdivision of . Notice that -barycentric agreement is just the trivial loop agreement task , where , since a process with input can directly decide .

• In the -dimensional -agreement task, input values are vertices of a face of , and output values are points of that lie within of one another in the convex hull of the input values. This task can be solved by a protocol for -barycentric simplex agreement for suitably large .

• In the -dimensional approximate agreement task input values are taken from the set , and output values are real numbers that lie within of one another in the convex hull of the input values. This task can be solved by a -dimensional -agreement protocol.

Of course, not all tasks can be cast as loop agreement tasks.

5.6.4 Decidability for layered snapshot protocols

We now show that a loop agreement task has layered snapshot protocol for if and only if the triangle loop is contractible in . Loop contractibility, however, is undecidable, and therefore so is the question whether an arbitrary loop agreement task has a protocol in this model.

We will need the following standard fact.

Fact 5.6.10

There is a homeomorphism from the 2-disk to ,

that carries boundary to boundary: .

Theorem 5.6.11

For , the loop agreement task has a -resilient layered snapshot protocol if and only if the triangle loop is contractible.

Proof

Note that because has dimension 2, for .

Protocol Implies Contractible. By Theorem 4.3.1, if the task has a wait-free layered snapshot protocol, then there exists a continuous map carried by . Because is carried by , satisfies , for , and , for . Composing with the homeomorphism of Fact 5.6.10, we see that the map , restricted to the 1-sphere , is a simple continuous loop . Moreover, this continuous loop is a representative of . Since the map can be extended to all of , it is contractible, and so is the triangle loop .

Contractible Implies Protocol. Let be the homeomorphism of Fact 5.6.10.

The edge map induces a continuous map

carried by : for , and for . The composition of followed by is a simple loop:

also carried by . Because is contractible, Fact 5.6.5 implies that can be extended to

also carried by . It is easy to check that the composition

is also carried by . Theorem 5.2.7 implies that there is a -resilient layered snapshot protocol for this loop agreement task.

Corollary 5.6.12

It is undecidable whether a loop agreement task has a -resilient layered snapshot protocol for .

5.6.5 Decidability with k-set agreement

Essentially the same argument shows that the existence of a wait-free loop agreement protocol is also undecidable for -set layered snapshot protocols for .

Corollary 5.6.13

A loop agreement task has a wait-free -set layered snapshot protocol for if and only if the triangle loop is contractible.

It follows from Fact 5.6.6 that it is undecidable whether a loop agreement task has a protocol for three processes in this model.

The situation is different in models capable of solving 1-set or 2-set agreement, such as 1-resilient layered snapshot or message-passing protocols, or wait-free -set layered snapshot protocols for or 2.

Theorem 5.6.14

In any model capable of solving -set agreement for , it is decidable whether a task has a protocol.

Proof

In each of these models, a task has a protocol if and only if there exists a continuous map carried by .

When , this map exists if and only if is nonempty for each , which is certainly decidable. When , this map exists if and only if, in addition to the nonemptiness condition, for every pair of vertices in there is a path from a vertex of to a vertex of contained in . This graph-theoretic question is decidable.

5.7 Chapter notes

The layered approach used in this chapter was employed by Herlihy, Rajsbaum, and Tuttle [88,89] for message-passing systems. It was used to prove that connectivity is conserved across layers, something we will do later on. In this chapter we used the more direct approach of showing that subdivisions are created in each layer. Earlier work by Herlihy and Rajsbaum [79] and Herlihy and Shavit [91] was based on the “critical state” approach, a style of argument by contradiction pioneered by Fischer, Lynch, and Paterson [55]. This last paper proved that consensus is not solvable in a message-passing system, even if only one process may fail by crashing, a special case of Theorem 5.5.5. Our message-passing impossibility result is simplified by using layering.

In shared-memory systems the wait-free layered approach used in this chapter was introduced as an “iterated model” of computation by Borowsky and Gafni [26]; see the survey by Rajsbaum [128] for additional references. Algorithms in this model can be presented in a recursive form as described by Gafni and Rajsbaum [68] and in the tutorial by Herlihy, Rajsbaum, and Raynal [87]. Fault-tolerant versions of the model were studied by Rajsbaum, Raynal, and Travers [132]. In Chapter 14 we study the relationship of this model with a more standard model in which processes can write and read the same shared array any number of times.

The BG-simulation [27] provides a way to transform colorless tasks wait-free impossibilities bounds to -resilient impossibilities. As we shall see in Chapter 7, the -resilient impossibility theorems proved directly in this chapter can be obtained by reduction to the wait-free case using this simulation. The BG simulation and layered models are discussed by Rajsbaum and Raynal [129]. Lubitch and Moran [111] provide a direct model-independent -resilient impossibility proof of consensus.

Early applications of Sperner’s lemma to set agreement are due to Chaudhuri [38] and to Chaudhuri, Herlihy, Lynch, and Tuttle [40]. Herlihy and Rajsbaum [79] present critical state arguments to prove results about the solvability of set agreement using set agreement objects. We explore in Chapter 9 why renaming is weaker than -set agreement, as shown by Gafni, Rajsbaum, and Herlihy [69].

Junqueira and Marzullo [99,98] introduced the core/survivor-set formalism for characterizing general adversaries used here, and they derived the first lower bounds for synchronous consensus against such an adversary. Delporte-Gallet et al. [46] investigate the computational power of more general adversaries in asynchronous shared memory using simulation. By contrast, the analogous impossibility results proved here use direct combinatorial arguments. The colorless task solvability characterization theorem for adversaries was proved by Herlihy and Rajsbaum [84] (and extended in [86], as discussed in Chapter 13).

Biran, Moran, and Zaks [19] showed that task solvability is decidable in a message-passing system where at most one process can fail by crashing, providing a characterization of solvable tasks in terms of graph connectivity, extending earlier work by Moran and Wolfstahl [118]. They further present a setting where the decision problem is NP-hard [20]. Gafni and Koutsoupias [63] were the first to note that three-process tasks are undecidable for wait-free layered snapshot protocols. This observation was generalized to other models by Herlihy and Rajsbaum [80].

The message-passing barycentric agreement protocol of Figure 5.9 is adapted from the stable vectors algorithm of Attiya et al. [9]. Attiya et al. [8] showed that it is possible to simulate shared memory using message-passing when a majority of processes are nonfaulty. One could use this simulation to show that our message-passing characterization follows from the shared-memory characterization.

The hierarchy of loop agreement tasks defined by Herlihy and Rajsbaum [83] will be presented in Chapter 15. Several variants and extensions have been studied. Degenerate loop agreement was defined in terms of two vertices of the output complex instead of three, by Liu, Pu, and Pan [108]. More general rendezvous task were studied by Liu, Xu, and Pan [109]. Similar techniques were used by Fraigniaud, Rajsbaum, and Travers [59] to derive hierarchies of tasks motivated by checkability issues.

Contractibility is undecidable because it reduces to the word problem for finitely presented groups: whether an expression reduces to the unit element. This problem was shown to be undecidable by S. P. Novikov [126] in 1955, and the isomorphism problem (whether two such groups are isomorphic) was shown to be undecidable by M. O. Rabin [127] in 1958. (For a more complete discussion of these problems, see Stillwell [142] or Sergeraert [140].)

Biran, Moran, and Zaks [21] study the round complexity of tasks in a message-passing system where at most one process can fail by crashing. Hoest and Shavit [94] consider nonuniform layered snapshot subdivisions to study the number of layers needed to solve a task in the wait-free case (see Exercise 5.21 about the complexity of solving colorless tasks).

Figure 5.12 Layered barycentric agreement message-passing protocol.

5.8 Exercises

Exercise 5.1

Show that the colorless complex corresponding to independently assigning values from a set to a set of processes is the -skeleton of a -dimensional simplex. Thus, it is homeomorphic to the -skeleton of a -disk.

Exercise 5.2

Show that any colorless task such that is nonempty for every input vertex is solvable by a -resilient layered snapshot colorless protocol and by a wait-free layered snapshot colorless protocol augmented with consensus objects.

Exercise 5.3

Prove Theorem 5.2.9: There is no -resilient layered snapshot protocol for -set agreement.

Exercise 5.4

Use the techniques of this chapter to show that there is a -resilient -set agreement layered snapshot protocol for a task if and only if there is a continuous map

carried by .

Exercise 5.5

Recall that the test-and-set atomically swaps 1 into a memory location and returns that location’s prior value. Give an -process protocol for solving -set agreement using layered snapshots and test-and-set instructions.

Exercise 5.6

Suppose we are given a “black box” object that solves -set agreement for processes. Give a wait-free -process layered snapshot protocol for -set agreement, where

Exercise 5.7

Prove Theorem 5.3.6: There is no -set layered snapshot protocol for -set agreement.

Exercise 5.8

Consider a model where message delivery is reliable, but the same message can be delivered more than once, and messages may be delivered out of order. Explain why that model is or is not equivalent to the one we use.

Exercise 5.9

Prove that the set agreement protocol of Figure 5.8 is correct.

Exercise 5.10

Show how to transform any -resilient message-passing protocol into a -resilient layered snapshot protocol, even when .

Exercise 5.11

Show that barycentric agreement is impossible if a majority of the processes can fail: . (Hint: A partition occurs when two disjoint sets of nonfaulty processes both complete their protocols without communicating.)

Exercise 5.12

Show that a barycentric agreement protocol is impossible if a process stops forwarding messages when it chooses an output value.

Exercise 5.13

Prove Theorem 5.5.5: There is no wait-free message-passing protocol for -set agreement. (Hint: Use Sperner’s Lemma.)

Exercise 5.14

Explain how to transform the set of cores of an adversary into the set of survivor sets, and vice versa. (Hint: Use disjunctive and conjunctive normal forms of Boolean logic.)

Exercise 5.15

Prove Theorem 5.4.5: There is no -resilient -set agreement layered snapshot protocol.

Exercise 5.16

Prove Theorem 5.5.4: For , has a -resilient message-passing protocol if and only if there is a subdivision of and a simplicial map

carried by .

Exercise 5.17

Prove Theorem 5.5.5: There is no -resilient message-passing protocol for -set agreement.

Exercise 5.18

Construct a loop that corresponds to the edge loop given by , , where . (Hint: Start by dividing the circle into equal parts.)

Exercise 5.19

Consider a model of computation where a colorless task has a protocol if and only if there is a continuous map

(5.8.1)

carried by . Prove that it is decidable whether a protocol exists for a colorless task in this model.

Exercise 5.20

Consider a model of computation where a colorless task has a protocol if and only if there is a continuous map

(5.8.2)

carried by . Prove that every loop agreement task is solvable in this model.

Exercise 5.21

Show that for any , and , there is a loop agreement task such that any -process -resilient snapshot protocol that solves it, requires more than layers. In more detail, suppose the number of edges in each path of the triangle loop of the task is . Then any -resilient snapshot protocol that solves it requires at least layers. (Hint: Use Lemma 5.2.3.)

Exercise 5.22

Show that the -resilient single-layer snapshot protocol for -set agreement protocol of Figure 5.2 still works if we replace the immediate snapshot with a nonatomic scan, reading the layer’s memory one word at a time.

Exercise 5.23

Rewrite the protocol of Figure 5.6 to use immediate snapshots.

Exercise 5.24

As noted, because message-passing protocols do not use shared memory, there is less motivation to use layered protocol. Figure 5.12 shows a layered message-passing barycentric agreement protocol. Is it correct?

Exercise 5.25

In the adversarial model, suppose we drop the requirement that faulty sets be closed under inclusion. Show that without this requirement, that if all and only sets of out of processes are faulty sets, then it is possible to solve consensus.