Consensus Potpourri: Approximate Majority With Catalytic Inputs - TIB AV-Portal

Consensus Potpourri: Approximate Majority With Catalytic Inputs

00:00

0

Related Material

Amir, Talley Aspnes, James Lazarsfeld, John

Formal Metadata

Title

Consensus Potpourri: Approximate Majority With Catalytic Inputs

Title of Series

OPODIS: International Conference on Principles of Distributed Systems, 2020

Number of Parts

30

Author

Lazarsfeld, John

License

CC Attribution 4.0 International:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.

Identifiers

10.5446/52878 (DOI)

Publisher

Release Date

Language

Content Metadata

Subject Area

Computer Science

Genre

Conference/Talk

Abstract

Third-state dynamics (Angluin et al. 2008; Perron et al. 2009) is a well-known process for quickly and robustly computing approximate majority through interactions between randomly-chosen pairs of agents. In this paper, we consider this process in a new model with persistent-state catalytic inputs, as well as in the presence of transient leak faults. Based on models considered in recent protocols for populations with persistent-state agents (Dudek et al. 2017; Alistarh et al. 2017; Alistarh et al. 2020), we formalize a Catalytic Input (CI) model comprised of n input agents and m worker agents. For m=Θ(n), we show that computing the PARITY of the input population with high probability requires at least Ω(n2) total interactions, demonstrating a strong separation between the CI and standard population models. On the other hand, we show that the third-state dynamics can be naturally adapted to this new model to solve approximate majority in O(nlogn) total steps with high probability when the input margin is Ω(√nlogn) , which preserves the time and space efficiency of the process in the original model. Moreover we show the robustness of third-state dynamics protocols to the transient leak faults considered by (Alistarh et al. 2017; Alistarh et al 2020). In both the original and CI models, these protocols successfully compute approximate majority with high probability in the presence of leaks occurring at each time step with probability β≤O(√nlognn) . The resilience of these dynamics to adversarial leaks exhibits a subtle connection to previous results involving Byzantine agents.

OPODIS: International Conference on Principles of Distributed Systems, 202017 / 30

1

23:04

Dynamic & Radio Networks: Broadcasting Competitively against Adaptive Adversary in Multi-channel Radio Networks

2

23:04

Dynamic & Radio Networks: Dynamic Byzantine Reliable Broadcast (DBRB)

3

15:43

Dynamic & Radio Networks: Broadcasting with mobile agents in dynamic networks

4

23:38

Dynamic & Radio Networks: On Broadcast in Generalized Network and Adversarial Models

5

13:33

Dynamic & Radio Networks: Maximally Resilient Replacement Paths for a Family of Product Graphs

6

20:21

Dynamic & Radio Networks: Self-stabilizing Byzantine-resilient communication in dynamic networks

7

22:39

Blockchain and BFT: Byzantine Lattice Agreement in Asynchronous Message Systems

8

25:18

Blockchain and BFT: Multi-Threshold Asynchronous Reliable Broadcast and Consensus

9

24:15

Blockchain and BFT: Heterogeneous Paxos

10

21:00

Blockchain and BFT: Echo-CGC: A Communication-Efficient Byzantine-tolerant Distributed Machine Learning Algorithm in Single-Hop Radio Network

11

18:38

Blockchain and BFT: Fast Byzantine SGD (Best Student Paper Award)

12

23:15

Blockchain and BFT: Security Analysis of Ripple Consensus

13

19:58

Consensus Potpourri: ACE: Abstract Consensus Encapsulation for Liveness Boosting of State Machine Replication

14

21:50

Consensus Potpourri: Information Theoretic HotStuff

15

22:53

Consensus Potpourri: Rational Behaviors in Committee-Based Blockchains

16

19:42

Consensus Potpourri: Locally Solvable Tasks and the Limitations of Valency Arguments

17

25:57

Consensus Potpourri: Approximate Majority With Catalytic Inputs

18

16:03

Consensus Potpourri: Optimal Resilience in Systems that Mix Shared Memory and Message Passing

19

22:19

Graph algorithms and population protocols: Fast Deterministic Algorithms for Highly-Dynamic Networks

20

22:14

Graph algorithms and population protocols: Approximating Bipartite Minimum Vertex Cover in the CONGEST model

21

20:52

Graph algorithms and population protocols: Distributed Distance Approximation

22

25:50

Graph algorithms and population protocols: Fast Hybrid Network Algorithms for Shortest Paths in Sparse Graphs

23

20:00

Graph algorithms and population protocols: Secured Distributed Algorithms without Hardness Assumptions

24

23:35

Graph algorithms and population protocols: Uniform Bipartition in Population Protocol Model with Arbitrary Communication Graphs

25

22:35

Concurrent data structures: Relaxed Queues and Stacks from Read/Write Operations

26

27:59

Concurrent data structures: Fast and Space-Efficient Queues via Relaxation

27

24:33

Concurrent data structures: Recoverable, Abortable, and Adaptive Mutual Exclusion with Sublogarithmic RMR Complexity

28

27:14

Concurrent data structures: CSR++: A Fast, Scalable, Update-Friendly Graph Data Structure

29

25:22

Formal methods: Monitoring Distributed Systems under Partial Synchrony

30

21:02

Formal methods: Decentralized Runtime Enforcement of Message Sequences in Message-Based Systems

Automatic playback

Speech

Text

Image

00:00

Data modeloutputState of matterProcess modelingBound stateCommunications protocolSample (statistics)Inheritance (object-oriented programming)ApproximationLogical constantComplete metric spacePhase transitionNL-vollständiges ProblemData structurePermianLeakEvent horizonAerodynamicsMathematical analysisApproximationMedical imagingFraction (mathematics)Data structureDynamical systemSequenceMathematicsOrder (biology)Self-organizationSoftwareFunction (mathematics)Type theoryBus (computing)Product (business)Level (video gaming)Random variableDrop (liquid)Phase transitionTask (computing)Flow separationTotal S.A.AnalogyMathematical optimizationEquivalence relationState of matterArithmetic meanBeta functionBitSeries (mathematics)Connected spaceLine (geometry)Green's functionGroup actionRandom walkComplex (psychology)LogarithmMereologyMarginal distributionSlide ruleResultantSampling (statistics)Spontaneous symmetry breakingTerm (mathematics)CountingNumberDataflowNear-ringQuicksortAreaConfiguration spaceRevision controlGoodness of fitLogical constantFamilySimilarity (geometry)MeasurementRegular graphCASE <Informatik>Process (computing)Strategy gameError messageRootInstance (computer science)State observerSquare numberHand fanAdditionPoint (geometry)Inheritance (object-oriented programming)Ocean currentSocial classSet (mathematics)outputArithmetic progressionCommunications protocolWord19 (number)Reading (process)Data conversionInteractive televisionLatent heatCorrespondence (mathematics)Chemical equationDecimalParity (mathematics)Event horizonBit rateView (database)Endliche ModelltheorieNP-hardDifferent (Kate Ryan album)NeuroinformatikCapability Maturity ModelLoginContext awarenessMultiplication sign2 (number)File archiverAssociative propertyMoore's lawValue-added networkMobile WebTwitterInvariant (mathematics)BijectionFunctional (mathematics)LeakPolygonDivisorBound stateSummierbarkeitComputer animation

Transcript: English(auto-generated)

00:01

Hi, my name is John Lazarsfeld and today I'm going to be talking about our paper, Approximate Majority with Catalytic Inputs, which is joint work with Talia Mir and James Osmanis. So our work is about the majority problem in a variant of the original population protocol model.

00:21

So we'll briefly review the original model where we have a population of n-mobile agents where at every step a pair of agents is chosen uniformly at random to interact. And the local state of both agents are updated according to some global transition function. So in the majority problem, we assume that initially every agent is in one of two states,

00:44

either a blue state or a red state. And we also assume that the input margin, which is just the absolute difference in the initial counts of the two states, is non-zero. And the goal then is to converge to a configuration where every agent ends up in the blue state,

01:01

which in this example is the initial majority state. So here are the set of transitions for a simple protocol for this task. And the protocol is referred to often as the third state dynamics, and it was independently discovered by Englund, Eisenstadt, and Osmanis in 2008, and by Perron, Vasudevan, and Vojnavic in 2009.

01:24

And here, whenever a blue and red agent interact, they both cancel each other out and transition into a third green state. Then whenever a green agent interacts with a blue or red agent, the green agent will transition into the state of its interacting partner.

01:44

And so the intuition here is that as long as the input margin is large enough and because the interactions are chosen uniformly at random, then a green agent will always be more likely to interact with a blue agent. And so eventually, we'll end up converging to a configuration

02:02

where every agent is in the blue state. So the complexity measures that we're interested in are one, the number of steps or interactions until we converge successfully. And second, we're interested in the number of states required by protocols. So in this case, the protocol used only three states.

02:26

So in this work, we consider the following variant of the original population model. So we have a population of N catalytic input agents, which are agents that never change state throughout the course of a computation,

02:40

as well as M worker agents that are able to change state. And we let capital N denote the total number of agents in the population. And several recent works by Alistair et al. in 2017 and 2020, as well as by Dudek and Kosowski in 2018, have considered populations that featured a similar distinction

03:02

between the two types of agents. And in particular, Alistair et al. noted that agents that never change state embody chemical catalysts because they may induce a state transition in another agent, while themselves never changing state. And in addition, population protocols have been used recently

03:22

to model both chemical reaction networks, as well as other types of chemical processes. So with this motivation in mind, we call this model in our work the catalytic input model, or the CI model for short. Now, just to visualize this a bit more,

03:40

whenever two catalytic input agents interact, neither will ever change state. But on the other hand, when a catalytic input and a worker agent interact, the worker agent might change state, but the catalytic input never will. And so the majority problem in this model extends naturally, where now we're interested in determining

04:02

which of the two catalytic input states is more prevalent in the population. And now we let the input margin denote just the absolute difference in the count of the two input states. And our goal is to eventually converge to a configuration where every worker reaches some state corresponding to the correct input majority.

04:22

So this might mean that every worker agent enters some blue worker state corresponding to a blue catalytic input majority. So we're interested in the majority problem in the catalytic input model. And so we can first recall

04:41

what we know about the majority problem in the original model. So in the case of general or exact majority, where the input margin could be as small as one, in the recent years, there have been several protocols which can do this quickly or in n-poly log n steps with high probability. Most recently by Ben-Nun et al in 2020.

05:04

But at the same time, early result by Alistair et al in 2017 showed that any protocol for exact majority in the original model requires at least log n states per agent in order to be correct with high probability.

05:21

So a rather large state requirement per agent. Now on the other hand, in a relaxed version of the problem called approximate majority, a correct answer is only expected when the input margin is sufficiently large. And in this case, this can be done using only three states

05:40

in n log n steps with high probability. In particular, using the third state dynamics protocol that we saw in the example on the first slide. And recently Tonden et al in 2019 improved the analysis of this protocol and showed that the protocol converges correctly

06:01

as long as the input margin is square root n log n. So what do we know about the majority problem in the catalytic input model? Well, a recent result by Alistair et al in 2020 gave a protocol for approximate majority in the CI model that converged correctly in n log n steps

06:22

with high probability but requires log n states per worker agent. And in addition, the protocol converged correctly only when the input margin was linear in the number of catalytic input agents. So it's remained open though, first, whether there is a protocol

06:41

for exact majority in this model that converges quickly. And also whether there is a constant state protocol for approximate majority in the CI model. And if so, under what requirements on the input margin. So the main contribution of this work

07:02

is to help partially resolve these two questions. Now, on the one hand, we show that in the case of exact majority, every protocol in the CI model requires at least n squared steps to be correct with high probability when the number of input and worker agents are both a constant fraction of the total population.

07:21

And this implies a strong separation between the CI and original population models. Now, on the other hand, we show that the third state dynamics of the original model can be adapted to give a constant state protocol for approximate majority in the CI model

07:40

that converges in n log n steps with high probability as long as the input margin is square root n log n. So for the remainder of the talk, we'll give a high level overview and intuition for these two results. And then we'll end by mentioning a second set of results that introduces a second variant to the original model.

08:04

So we'll start with the lower bound on protocols for exact majority in the CI model. And the key observation here is that we can never tell whether or not an input agent has previously interacted with a worker agent.

08:20

So whenever we have a interaction between a worker agent and a catalytic input agent, we can view this more as the worker agent taking a random sample with replacement from the input population. Now, for the sake of proving lower bounds in the CI model, we consider the following super CI model.

08:43

So the super CI model features a catalytic input population of n agents, as well as a computationally unbounded super worker w. And here, every time w interacts with the input population, it sees a blue input with probability p

09:03

and a red input with probability one minus p. And so again, we think of this as the super agent taking a random sample with replacement from the catalytic inputs. And so now in the context of majority, the goal of the super worker is to take s samples

09:22

and then use these samples to determine what it thinks the correct input majority is. Now, because the super worker is computationally unbounded, it's capable of simulating any regular CI model protocol. And so this means that if we have a lower bound

09:42

on the number of samples that the super worker needs, in order to produce a correct output with high probability, this implies a lower bound on the total number of interactions that any protocol in the regular CI model needs to be correct with high probability.

10:00

So now, for the exact majority problem, again, the hard case is when the input margin between the two catalytic input states is one. And so this means that the probability that the super worker sees a blue input is very close to the probability that it sees a red input. And moreover, we can show that the optimal strategy

10:24

in terms of minimizing its error of the super worker is just to take s samples and then to output the majority value of these samples. And so then it follows that in this hard instance, in order to overcome the sampling error,

10:42

the super worker needs at least n squared samples in order for half of them to be from the correct majority states. And now when the number of catalytic inputs is a constant fraction of the total population in the regular CI model,

11:01

this then implies the n squared lower bound on the total number of steps needed by any CI model protocol. So we note that this also, the same bound can be applied to determining the parity of the catalytic input states,

11:20

given that there's a one-to-one correspondence between the parity problem and the exact majority problem with input margin one. And again, because we know that exact majority can be done in the original model in n poly log n steps, that this implies a strong separation in the computational power of the two models.

11:47

So while exact majority in the CI model is hard, it turns out that we can successfully adapt the original model third state dynamics to give a constant state protocol for approximate majority in the CI model.

12:02

So specifically the protocol we present uses only five total states. So the two catalytic input states and the only three worker states. And the transitions of the protocol are again adapted from the original third state dynamics, which we saw in the example on the first slide.

12:22

And in particular, the set of interactions between two worker agents are identical to the transitions of the original third state dynamics. And we also add the following two transitions between catalytic input and a blank or green worker, where if a green worker interacts with a blue input,

12:43

the worker will transition to the blue worker state. And if the green worker interacts with the red input, it transitions to the red worker state. So again, the intuition is that if we assume that all the worker agents start in the green state, then as long as the input margin

13:02

between the blue and red input states is large enough, then worker agents in the green state will be more likely to transition into the blue worker state. And although that we will still have a small number of red inputs in the population that remain there over the course of the computation,

13:23

once the number of blue worker agents becomes large enough, then this again means that other worker agents will be more likely to transition into the blue worker state. And eventually this will allow us to converge to the all blue worker configuration corresponding to a blue input majority.

13:42

Now, specifically, we show that as long as the input margin is root n log n, and when the number of workers and inputs are both constant fractions of the total population, then we'll reach this converged configuration within n log n steps with high probability.

14:01

Now, to prove the correctness and the efficiency or the total number of steps needed until convergence, we reduce the problem to looking at a sequence of biased one-dimensional random walks. So in particular, we think about the bias as looking at the fraction of agents

14:22

in each of the five states at any given point of time in the computation. And although these counts and fractions are changing, we use a structure of phases that allows us to abound these counts throughout various points of time in the computation.

14:41

So specifically, we look at the following three progress measures. So I'll first point out that these lowercase letters are random variables denoting the number of agents in the corresponding state at any given point in the computation. And so the first progress measure p represents the total difference between

15:01

the number of agents supporting the blue input state and the number of agents supporting the red input state. And then the progress measures x hat and y hat capture the number of workers in the blue worker state and then the red worker state respectively.

15:21

And we notice the invariance where x hat plus y hat will always be equal to the total number of workers, which is m, throughout the computation. So with these progress measures in mind, we use the following set of three phases

15:40

which was first introduced by Condon et al in 2019 when looking at or when analyzing the original model third state dynamics. So in the first phase, we consider a logarithmic number of stages where at each stage the progress measure p doubles and this continues until p reaches a constant fraction

16:02

of the worker population plus the initial input margin. And then in the second phase, we consider a logarithmic number of stages where the progress measure y hat decreases by a factor of two. And this continues until y hat drops below some value that's logarithmic in the total number of workers.

16:23

And finally, in the third phase, we'd look at the progress measure y hat until it drops to zero. And at this point, when y hat is equal to zero, this corresponds to every worker being in the blue or x worker state. So the strategy of every phase

16:41

is to treat the progress measure we're considering as a biased one-dimensional random walk. And so first we bound the number of productive steps that are needed to complete the phase correctly where the productive steps are just this set of five

17:00

non-null transitions. And from there, we then bound the total number of steps for each phase that are needed to obtain the requisite number of productive steps with high probability and taking a union bound over every phase and stage then gives the final result. So just to visualize this a bit more,

17:21

here's an example execution of the protocol on a population with 5,000 inputs and worker agents where the blue, red, and green lines represent the number of agents in the corresponding states at the various steps of the computation.

17:41

So the first pink vertical line represents the end of the first phase, which we can think of as having a sufficiently large gap between the number of blue and red worker agents. And finally, the second vertical line represents the end of the second phase

18:01

where the progress measure y hat drops below some value that's logarithmic in the total number of workers. So in a sense, the sum of the green and red lines. And finally, in the third phase, the protocol continues until the progress measure y hat drops to zero.

18:21

In other words, meaning that every worker enters the blue worker state. So we now turn to discuss a second set of results, which considers an additional variant to the original population model. And this is called transient leak events. And these are events that were considered as well

18:42

in the work of Alistair et al in 2017 and 2020. And if we recall the connections between population protocols and modeling chemical processes that was mentioned earlier in the talk, we can think of a leak event as simulating the low probability event that a molecule undergoes

19:03

a reaction that would have typically taken place in the presence of a catalyst. So in population protocols, this can be modeled as a spontaneous state transition that occurs at a single agent at every step with some probability beta.

19:20

And in the catalytic input model, only worker agents are susceptible to this type of leak event given that catalytic input agents never change states. Now, similar to the work of Alistair et al, in our work, we consider these leak events to occur adversarially, meaning that they induce

19:41

the worst case state transition with respect to the progress of the protocol that we're interested in. So to visualize this a bit more, if we take our CI model protocol for approximate majority from the previous slide, so at any given step, we might have a regular interaction between two agents,

20:04

but with probability beta at every step, we have one of these transient leak events, which in the case of our protocol would mean that a blue worker agent suddenly flips to the red worker agent state.

20:20

So the protocol of Alistair et al for approximate majority in the CI model was shown to nearly converge within N log N steps with high probability when the leak rate beta was sufficiently small. And here, near convergence just means that the protocol reaches a configuration where all but a fraction of the worker agents

20:44

are in the worker state corresponding to the correct input majority. So our second set of results is to show a similar type of robustness to these leak events of the third state dynamics family of protocols.

21:00

So first, we show that our CI model protocol for approximate majority also nearly converges in N log N steps with high probability when the leak rate beta is sufficiently small. But we also show that the original third state dynamics protocol in the original population model where all agents are susceptible to these leak events,

21:23

that this protocol also nearly converges in N log N steps with high probability for small enough beta. So now I'll just briefly mention some more specifics about the convergence of our protocol in the CI model and the results

21:43

for the original model third state dynamics is follows analogously. So in the CI model, we show that when the input margin is at least root N log N, then the protocol within N log N steps will reach a configuration where at most

22:01

a beta fraction of the worker agents are in the incorrect worker state. So that would mean that a beta fraction of worker agents are in the red worker state even though we have a blue input majority. And the analysis for this near convergence

22:23

uses, it follows similarly from the analysis in the non-leak setting where we use a structure of phases and we consider a biased one-dimensional random walks. And so the strategy here is to first bound the number of these leak events that could occur at any phase of the computation.

22:43

And then from there, we bound the number of non-leak steps that are needed to still make progress throughout the protocol. So to visualize this a bit more, here again is an example execution of the protocol where we track the counts of worker agents

23:03

up to four N log N steps. And in this case, beta is zero, meaning we're in the non-leak setting. And here, just as we saw in the previous slide, we know the protocol will eventually reach a configuration where every worker is in the correct worker majority state.

23:26

But now when we have a leak rate of log N over N, again, we see at the end of the four N log N total steps that a large fraction of worker agents will be in the blue worker state, but that this convergence occurs much slower.

23:42

And finally, if we increase the leak rate to root N log N over N, we can see that after at this cutoff point, the fraction of agents in the blue worker state is still large but is dropped compared to the smaller leak rate. And this is in line with our theoretical results.

24:03

So finally, I'll mention that the third state dynamics protocols, both in the CI model and in the original model, behave similarly in the presence of these transient leak events as they do in the presence of Byzantine agents, which is a type of dishonest agent

24:21

which has been considered in previous work that looks at the original model third state dynamics. And so our final result is to show an equivalence in the behavior of the CI model and original model protocols in the presence of these leak events

24:41

and in the presence of the Byzantine agents. So in conclusion, we've shown that in these catalytic input model, which features the two distinct types of agents, there are some problems like exact majority which can be done quickly in the original model

25:02

but are hard to compute in the CI model. But at the same time, some techniques like the original third state dynamics can be easily adapted into the catalytic input model. So it remains an interesting open question to understand what other types of protocols

25:20

and techniques can be adapted successfully into the CI model. And finally, we showed that the original third state dynamics is robust to the type of transient leak events that we considered in the CI model. And so another interesting question is to better understand what class of protocols

25:41

is similarly robust to the transient leak events in the original population protocol model. So thank you for listening to this talk and we encourage you to read the full version of the paper which is on the archive. So thank you.