Francois Luus / Jacob Steeves / Ala Shaabana / Yuqian Hu / Sin Tai Liu
00/ Abstract
We formulate a stake-based consensus problem in terms of a two-player game and propose a protagonist consensus policy to optimize a Nash equilibrium via a weight reduction algorithm with a guarantee of minority stake deterioration. We generalize this to a two-team game and propose a smooth density evolution algorithm that outperforms coarser estimates. We perform a full-scale Monte Carlo analysis and confirm the accuracy of our theoretical results, and show the possibility of a 40% stake + 25% utility attack. The result is a variable-expense consensus algorithm that can be fit to blockchain compute constraints to reach accurate consensus in adversarial settings.
01/ Stake-based weight consensus
1.1 Problem definition
We consider a two-player game between (protagonist) honest stake0.5<sH≤1 and (adversarial) cabal stake (1−sH) competing for total fixed reward eH+eC=1with honest emission eH and cabal emission eC, respectively, followed by stake updates sH′=2sH+eH and sC′=21−sH+eC. The honest objective sH≤eH at least retains scoring power sH over all action transitions in the game, otherwise when eH≤sH honest emission will erode to 0 over time, despite a starting condition of 0.5<sH.
We assume honest stake sets objectively correct weights wH on itself, and 1−wH on the cabal, where honest weightwHrepresents an ongoing expense of the honest player, sustained throughout the game. However, cabal stake has an action policy that freely sets weight wC on itself, and 1−wC on the honest player, at no cost to the cabal player, with the objective to maximize the required honest self-weight expense wH viawC∗=argmaxwCE[wH∣sH=eH(sH,wH,wC)].
We then assume the honest majority sH>0.5 can counter with a consensus policy πallowed to modify all weights modulo player labels, so it is purely based on the anonymous weight distribution itself, optimizing the Nash equilibrium minπmaxwCE[wH∣sH=eH(sH,π(w))].
The majority stake enforces an independent and anonymous consensus policy π(e.g. through a blockchain solution) that modifies the weights to minimize the expense wH, which has been maximized by the cabal applying an objectively incorrect gratis self-weight wC. Consensus aims to produce π(w)→(wH′,wC′) so that wC′=1−wH′, by correcting the error ϵ=wC′+wH′−1>0. Note that the input cost wH remains fully expensed, and that wH′ merely modifies the reward distribution that follows, but not knowing which players are honest or cabal (anonymous property).
Figure 1 / Selfish weighting problem: Minority cabal sets wC=1 self-weight to maximally grow its relative stake, e.g. at (`1) honest majority stake of sH=0.6 and honest utility of wH=0.75 would require cabal to report self-weight wC<0.62 for honest stake to be retained. (b) Consensus solution: Stake-based consensus η=3 corrects excessive self-weight of minority stake, e.g. at (`2) sH=0.6 , wH=0.75 no selfish cabal weight can prevent honest stake retention, even wC=1 results in honest stake ratio gain. Zero-weight problem: Minority cabal is virtually the only scoring incentive recipient of the cabal utility reward when its actual utility is near-zero, e.g. at (`3) where honest stake deteriorates. (c) Weight trust solution: Require the majority stake to agree that a weight is non-zero, otherwise smoothly nullify the associated reward to the degree of mistrust, which then removes the honest stake deterioration region when wH>0.95. Consensus guarantee: Honest majority stake is retained when sH≥0.6 and wH≥0.75, despite strategic cabal weight setting.
Figure 2 / Retention line interpretation: (a) Honest incentive share contour plot for sH=0.6only, highlighting where the emission is eH=0.6, e.g. at (`1). However, at (`2) the contour recedes again due to the zero-weight problem. (b) Similarly, the specific emission contour plot for sH=0.7, highlighting the contour where the emission is eH=0.7, which means with inflation the honest share ratio of sH=0.7 can be retained if honest utility is at least wH>0.75 like at (`3). (c) Retention lines: A compound plot combines all the highlighted sH=eHcontours from individual contour plots (e.g. sH=0.6andsH=0.7), to show the overall retention profile. Generally, the higher the honest stake, the higher the honest utility requirement to retain stake proportion under adversarial weight setting.
Figure 3 / Scoring incentive: A percentage share of the utility rewards equal to stake in a score, as incentive to encourage honest scoring. (a) No scoring incentive leads to extreme selfish weight setting, since the cabal does not share in the honest rewards. (b)-(f) Higher scoring incentive reduces selfishness evidenced by receding honest self-weight requirement for stake retention. However, the zero-weight problem at (`1) increases as well, since only the cabal can claim reward share from both honest and cabal subsets while honest sets zero weight on the cabal. The weight trust solution with smooth edge coverage wH>0.95) can only be extended so far before legitimate low-utility is also nullified, which practically limits scoring incentive around 50%.
1.2 Reward emission
In the two-player characterization of the game, there are two bimodal weight distributions of (wH,1−wC) and (1−wH,wC) on the honest and cabal players, respectively. The stake proportions behind the bimodal distributions are (bHH,bCH)=(wHsH,(1−wC)(1−sH)) and(bHC,bCC)=((1−wH)sH,wC(1−sH)) , respectively.
Primary incentive i is the normalized sum of stake proportions, where honest rank rH=wHsH+(1−wC)(1−sH) and cabal rank rC=(1−wH)sH+wC(1−sH) are normalized to give iH=rH+rCrH and iC=rH+rCrC . An additional reward d is the scoring share of incentive, i.e. dH=rHwHsHiH+rC(1−wH)sHiC anddC=rH(1−wC)sCiH+rCwCsCiC. Finally, the complete reward emissions areeH=2iH+dH and eC=2iC+dC, such that eH+eC=1 .
Figure 4 / Larger minority excess: The excess weight above consensus is larger for minority-stake when wH<wC. (a) Positive contours when wH<wC at (`1) indicate regions of cabal error-correction potential. (b) Cabal error-correction region grows as majority stake increases sH=0.55→0.95. (c) However at (`2), larger majority excess appears on the right-side when wC<wH(to be avoided), which negatively impacts the majority weight more than the minority.
1.3 Consensus deviation
The weight consensus is the stake-proportion weight averagewj=∑i(siwij)wij/∑k(skwkj), and accordingly the consensus weights for the honest and cabal players are wH=sHwH+(1−sH)(1−wC)sHwH2+(1−sH)(1−wC)2 and , wC=sH(1−wH)+(1−sH)wCsH(1−wH)2+(1−sH)wC2 respectively
Under typical adversarial play with 1−wH<wC , the upper modes wH>wH and wC>wC of the honest and cabal weight distributions, respectively, will exceed the consensus. Honest excess wH<wH is present when 1−wC<wH:
Lemma 1 (Larger Minority Excess). Minority-stake excess weight is larger than majority-stake excess weight, i.e.wC−wC>wH−wH,whenwH<wC
We use a symbolic solver (Wolfram) to show that the cabal excess is larger when wH<wC, i.e. dwHdwC=wH−wHwC−wC>1 The cabal excess is larger with majority honest stake when
Stake-proportional consensus advantages the honest player with sH>0.5 , since it biases the consensus weight toward the honest vote and exposes the cabal excess self-weightwC>wC where dwC>dwH . Consequently, a consensus policy π(w)=min(w,w) can reduce excess weight above the consensusw , where cabal weight should decrease more than honest weight. The weight reductions normally only happen in the upper modes wH and wC of the honest and cabal weights, respectively.
Figure 5 / Larger minority excess: The excess weight above consensus is larger for minority-stake when wH<wC. (a) Positive contours when wH<wC at (`1) indicate regions of cabal error-correction potential. (b) Cabal error-correction region grows as majority stake increases sH=0.55→0.95. (c) However at (`2), larger majority excess appears on the right-side when wC<wH(to be avoided), which negatively impacts the majority weight more than the minority.
The consensus policy π(w)→(wH′,wC′) attempts to correct the error ϵ=wH+wC−1 so that
The approximate number of weight reduction steps is η, and the consensus policy is thus converted to an iterated function π=fη , where the function is repeated η times f3(w)=f(f(f(w))) . Note thatf(w)=min(w,w) recomputes the consensus weight w each time.
Figure 6 / Emission slashing: Iterative weight correction reduces effective scoring weight and incentive share. (a) Initial scoring weight is always 1, but weight correction reduces this whenever ∑w=1, with the largest reduction seen at (`1). (b) Emission is the average of scoring incentive and utility reward and cabal emission is slashed, i.e. e(η=3)<e(η=0) particularly in the wH<wC and 0.5<sH region around (`2). (c) Consequently, honest emission is boosted in region (`2), but the zero weight vulnerability at (`3) slashes honest emission, although comparatively little.
Figure 7 / Cabal slash (two-team). (a) Statistical analysis simplifies the two-player result. (b) The two-team generalization fully enacts the weight distribution, and a Monte Carlo analysis reveals the worst-case cabal slash closely following the two-player result. (c) Cabal incentive is boosted at the zero weight vulnerability of (`1), but weight trust solution ensures active cabal slash at (`2).
We compute η=dwH+dwCwH+wC−1 and compare with the correction factordwC/dwH to identify the optimal η avoiding over-correction in the detrimentalwC<wH region where dwC/dwH<1
We observe that higher η>3 values extend the correction further into the detrimentalwC<wH region where dwC−dwH<0 , hence an optimal η≈3 is identified, which is large enough to provide sufficient correction when wH<wC .
Application of the consensus policy π(w)=fη≈3(w) can partially correct the error ϵ=wC′+wH′−1>0 , in particular the previous expense wH=1 is reduced to wH<0.75 for sH=0.6 , even at Nash equilibrium with wC∗=0.8 . Importantly, the consensus policy π(w) operates on anonymized weights and do not assume the player identities, thus behaves impartially in terms of a stake-based consensus.
1.5 Smoothed weight reduction
The correction function f(w)=min(w,w) should be smoothed to ensurelimdw→0fη(w+dw)−fη(w)<ε where adjacent weights are corrected to a similar degree. The correction factor should also depend on the magnitude of deviation from consensus, in terms of a standard deviation σ. We opt for a stake-weighted mean absolute deviation, since it does not make the normal assumption as strongly as mean square deviation, as follows
σ(w)=∑kskwkj∑isiwij∣wij−w∣.
The standard correction 0≤α<1 fully applies whenw−w=σ(w), and amplifies whenw−w>σ(w) to a maximum correction atw with a proposed smoothed function iterate
Figure 8 / Weight trust: Majority stake should set non-zero weight to a player, otherwise its reward is nullified. (a) Two-team honest emission with just the consensus policy, with zero weight vulnerability at (`1). (b) Applying weight trust protects the 0.95<wH region and removes the exploit. (c) Honest retention is now monotonically possible at increasing honest self-weight.
The smoothed function iterate now requires more steps, where a larger α≈1−δresults in a larger η , such that ηδ≈3 , according to
We compare the previous retention graph (η=3,α=0)against (η=3/(1−0.95),α=0.95) and observe a reduced cost wH=0.7<0.75with the smoothed iterate with α>0.
We ensure monotonicity of the consensus policy by choosing the minimum ε>0added to the deviation σ(w).
Weight trust T=(W>0)S is the sum of stake assigning a non-zero weight to a player, and a consensusC=(1+exp(−ρ(T−κ)))−1 provides a smooth threshold at κ where exceedingκ ratio of stake quickly allows for high trust. A modified rankr′=rc multiplies rank with the weight trust consensus, which influences the emission so that zero cabal weight wC′=1−wH≈0 receives low consensus thereby penalizing cabal emissions.
The vulnerable region of wH=1 and 0.8<wC<0.95allows for cabal stake gain when sH=0.6, but the weight trust consensus smoothly pads the region aroundwH=1 and removes the vulnerability. The cabal can thus not claim reward when the honest majority deems cabal utility to be zero, despite the non-zero self-weight reported by the minority cabal.
Figure 9 / Evolution smoothness: Evolving through more smaller steps withη=59 reduces the zero weight exploit at black (`3), compared to η=3 at (`1), since fine-grained correction steps more accurately track changes in consensus. The two-team honest emissions (Monte Carlo worst-case analysis) tend to further reduce the zero weight exploits at (`2), (`4) compared to the two-player case at (`1), (`3).
02/ Density generalization
2.1 Overview
We generalize the two-player game to a two-team game with ∣H∣ honest and ∣C∣ cabal players, that have ∑i∈Hsi=sH honest stake and ∑i∈Csi=1−sH cabal stake. Honest players i∈H set ∑j∈Hwij=wH self-weight and ∑j∈Cwij=1−wH weight on cabal players, while cabal players i∈C set ∑j∈Cwij=wC self-weight and ∑j∈Hwij=1−wCweight on honest players. The rank components result in the same aggregates in the two-player game.
In particular, the weight consensus of an individual honest player is shown to be wh=∣H∣wH as follows (and similarly for cabal players wc=∣C∣wC)
Under the minimal assumption that the average self-weights set on honest and cabal players are∣H∣wH and ∣C∣wC we can construct weight densitiesph(w)=phh(w)+pch(w) and pc(w)=phc(w)+pcc(w), here according to the normal assumption (other densities with a similar first moment could possibly also be valid)
The consensus and mean absolute deviations of a weight density function p(w) are
p=∫wp(w)dw,andσ(p)=∫∣w−p∣p(w)dw.
We overload the iterated function f as a density evolution function f(p(w)) that contracts a densityp(w) above consensusp by a nominal degree of α at a single deviation σ(p)w−p, in order to correct the error ϵ=wH+wC−1. The density is contracted via g(w)=f−1(w)involving the original iterated function f.
f(p(w))=p(w∣w≤p)+p(g(w)∣p<w)g(w)wdwdg(w)
The final rank after applying the consensus policy π=fη is ri=∫fη(pi(w))dw, where a single function iteration contracts the consensus by p′−p, which is equal to
∫wp(w)dw−∫wp(g(w)∣μp<w)g(w)wdwdg(w)dw.
Figure 10 / Density evolution: Weight correction through density evolution reduces cabal weight consensus more than honest reduction (sH=0.6,wH=0.7,wC=0.8) . The honest weights and cabal weights have an equal starting consensus at (`1), but through density evolution the cabal reduces more to (`3), versus a higher end honest consensus at (`2). Density evolution thus succeeds in penalizing the minority cabal and allows for honest stake retention. The theoretical probability densities in (a), (c) closely match the stochastically sampled results in (b), (d). The crosshair markers indicate the consensus flanked by a standard deviation above and below.
Simulating the consensus policy (η=3/(1−0.95),α=0.95) on weight densities set on honest and cabal players where sH=0.6,wH=0.7,wC=0.8, we see an equal starting consensus weight reduce further for the cabal players (6.76→3.8) vs honest players (6.76→4.41). The consensus policy acts as an upper-mode resiliency test where cabal self-weight with minority stake fails comparatively to honest self-weight with majority stake.
2.2 Stochastic sampling
We move from theoretical density analysis to a stochastic sampling analysis, where the originalπ(w)=fη(w) can be applied directly to a weight sample w for a player, gradually contracting excess weight toward the consensus until an optimal contraction volume is reached. We observe very similar density evolution results as with the theoretical density analysis.
2.3 Two-team game
We perform a worst-case Monte Carlo analysis of a full-scale two-team game by sampling from normal densities, primarily to confirm the accuracy of the preceding aggregate analysis. We run a number of Monte Carlo iterations and record the worse-case results. A blockchain-based consensus algorithm has space and compute limitations, which would favor a smaller η number of density evolution operations, each of which requiresO(n2) operations. A small η=3 with α=0 produce a full-scale result very close to the aggregate result.
Increasing the number of density evolution steps to η=59 withα=0.95 manages to remove the zero-utility exploit at wH>0.98 seen at η=3. However, the aggregate result in the theoretical honest retention deviates slightly, likely due to deviation of the upper mode density below consensus not accounted for in the aggregate.