serai

mirror of https://github.com/serai-dex/serai.git synced 2024-12-23 12:09:37 +00:00

Author	SHA1	Message	Date
Luke Parker	2532423d42	Remove the RemoveParticipant protocol for having new DKGs specify the participants which were removed Obvious code cleanup is obvious.	2023-12-14 23:51:57 -05:00
Luke Parker	b60e3c2524	Replace PSTTrait and PstTxType with PublishSeraiTransaction	2023-12-14 16:06:08 -05:00
Luke Parker	77edd00725	Handle the combination of DKG removals with re-attempts With a DKG removal comes a reduction in the amount of participants which was ignored by re-attempts. Now, we determine n/i based on the parties removed, and deterministically obtain the context of who was removd.	2023-12-13 14:03:07 -05:00
Luke Parker	6a172825aa	Reattempts (#483 ) * Schedule re-attempts and add a (not filled out) match statement to actually execute them A comment explains the methodology. To copy it here: """ This is because we always re-attempt any protocol which had participation. That doesn't mean we should re-attempt this protocol. The alternatives were: 1) Note on-chain we completed a protocol, halting re-attempts upon 34%. 2) Vote on-chain to re-attempt a protocol. This schema doesn't have any additional messages upon the success case (whereas alternative #1 does) and doesn't have overhead (as alternative #2 does, sending votes and then preprocesses. This only sends preprocesses). """ Any signing protocol which reaches sufficient participation will be re-attempted until it no longer does. * Have the Substrate scanner track DKG removals/completions for the Tributary code * Don't keep trying to publish a participant removal if we've already set keys * Pad out the re-attempt match a bit more * Have CosignEvaluator reload from the DB * Correctly schedule cosign re-attempts * Actuall spawn new DKG removal attempts * Use u32 for Batch ID in SubstrateSignableId, finish Batch re-attempt routing The batch ID was an opaque [u8; 5] which also included the network, yet that's redundant and unhelpful. * Clarify a pair of TODOs in the coordinator * Remove old TODO * Final comment cleanup * Correct usage of TARGET_BLOCK_TIME in reattempt scheduler It's in ms and I assumed it was in s. * Have coordinator tests drop BatchReattempts which aren't relevant yet may exist * Bug fix and pointless oddity removal We scheduled a re-attempt upon receiving 2/3rds of preprocesses and upon receiving 2/3rds of shares, so any signing protocol could cause two re-attempts (not one more). The coordinator tests randomly generated the Batch ID since it was prior an opaque byte array. While that didn't break the test, it was pointless and did make the already-succeeded check before re-attempting impossible to hit. * Add log statements, correct dead-lock in coordinator tests * Increase pessimistic timeout on recv_message to compensate for tighter best-case timeouts * Further bump timeout by a minute AFAICT, GH failed by just a few seconds. This also is worst-case in a single instance, making it fine to be decently long. * Further further bump timeout due to lack of distinct error	2023-12-12 12:28:53 -05:00
Luke Parker	11fdb6da1d	Coordinator Cleanup (#481 ) * Move logic for evaluating if a cosign should occur to its own file Cleans it up and makes it more robust. * Have expected_next_batch return an error instead of retrying While convenient to offer an error-free implementation, it potentially caused very long lived lock acquisitions in handle_processor_message. * Unify and clean DkgConfirmer and DkgRemoval Does so via adding a new file for the common code, SigningProtocol. Modifies from_cache to return the preprocess with the machine, as there's no reason not to. Also removes an unused Result around the type. Clarifies the security around deterministic nonces, removing them for saved-to-disk cached preprocesses. The cached preprocesses are encrypted as the DB is not a proper secret store. Moves arguments always present in the protocol from function arguments into the struct itself. Removes the horribly ugly code in DkgRemoval, fixing multiple issues present with it which would cause it to fail on use. * Set SeraiBlockNumber in cosign.rs as it's used by the cosigning protocol * Remove unnecessary Clone from lambdas in coordinator * Remove the EventDb from Tributary scanner We used per-Transaction DB TXNs so on error, we don't have to rescan the entire block yet only the rest of it. We prevented scanning multiple transactions by tracking which we already had. This is over-engineered and not worth it. * Implement borsh for HasEvents, removing the manual encoding * Merge DkgConfirmer and DkgRemoval into signing_protocol.rs Fixes a bug in DkgConfirmer which would cause it to improperly handle indexes if any validator had multiple key shares. * Strictly type DataSpecification's Label * Correct threshold_i_map_to_keys_and_musig_i_map It didn't include the participant's own index and accordingly was offset. * Create TributaryBlockHandler This struct contains all variables prior passed to handle_block and stops them from being passed around again and again. This also ensures fatal_slash is only called while handling a block, as needed as it expects to operate under perfect consensus. * Inline accumulate, store confirmation nonces with shares Inlining accumulate makes sense due to the amount of data accumulate needed to be passed. Storing confirmation nonces with shares ensures that both are available or neither. Prior, one could be yet the other may not have been (requiring an assert in runtime to ensure we didn't bungle it somehow). * Create helper functions for handling DkgRemoval/SubstrateSign/Sign Tributary TXs * Move Label into SignData All of our transactions which use SignData end up with the same common usage pattern for Label, justifying this. Removes 3 transactions, explicitly de-duplicating their handlers. * Remove CurrentlyCompletingKeyPair for the non-contextual DkgKeyPair * Remove the manual read/write for TributarySpec for borsh This struct doesn't have any optimizations booned by the manual impl. Using borsh reduces our scope. * Use temporary variables to further minimize LoC in tributary handler * Remove usage of tuples for non-trivial Tributary transactions * Remove serde from dkg serde could be used to deserialize intenrally inconsistent objects which could lead to panics or faults. The BorshDeserialize derives have been replaced with a manual implementation which won't produce inconsistent objects. * Abstract Future generics using new trait definitions in coordinator * Move published_signed_transaction to tributary/mod.rs to reduce the size of main.rs * Split coordinator/src/tributary/mod.rs into spec.rs and transaction.rs	2023-12-10 20:21:44 -05:00
Luke Parker	6caf45ea1d	Downscope usage of futures	2023-12-10 19:32:52 -05:00
Luke Parker	7122e0faf4	Cache the block's events within TemporalSerai Event retrieval was prior: - Retrieve all events in the block, which may be hundreds of KB - Filter to just a few Since it's frequent to want multiple sets of events, each filtered in their own way, this caused the retrieval to happen multiple times. Now, it only will happen once. Also has the scoped clients take a reference, not an owned TemporalSerai.	2023-12-08 10:46:10 -05:00
David Bell	16b22dd105	Convert coordinator/substrate/db to use create_db macro (#436 ) * chore: implement create_db for substrate (fix broken branch) * Correct rebase artifacts * chore: remove todo statement * chore: rename BlockDb to NextBlock * chore: return empty tuple instead of empty array for event storage * Finish rebasing * .Minor tweaks to remove leftover variables These may be rebase artifacts. --------- Co-authored-by: Luke Parker <lukeparker5132@gmail.com>	2023-12-08 05:12:16 -05:00
Luke Parker	5c047ebe74	Log the reason for yielding BlockError::Fatal to Tendermint from the Tributary	2023-12-07 09:30:25 -05:00
econsta	91a024e119	coordinator/src/db.rs db macro implimentation (#431 ) * coordinator/src/db.rs db macro implimentation * fixed fmt errors * converted txn functions to get/set counterparts * use take_signed_transaction function * fix for two fo the tests * Misc tweaks * Minor tweaks --------- Co-authored-by: Luke Parker <lukeparker5132@gmail.com>	2023-12-07 09:30:11 -05:00
Luke Parker	c511a54d18	Move serai-client off serai-runtime, MIT licensing it Uses a full-fledged serai-abi to do so. Removes use of UncheckedExtrinsic as a pointlessly (for us) length-prefixed block with a more complicated signing algorithm than advantageous. In the future, we should considering consolidating the various primitives crates. I'm not convinced we benefit from one primitives crate per pallet.	2023-12-07 02:30:09 -05:00
Luke Parker	797ed49e7b	DKG Removals (#467 ) * Update ValidatorSets with a remove_participant call * Add DkgRemoval, a sign machine for producing the relevant MuSig signatures * Don't use position-dependent u8s yet Public when removing validators from the DKG * Add DkgRemovalPreprocess, DkgRemovalShares Implementation is via a new publish_tributary_tx lambda. This is code is a copy-pasted mess which will need to be cleaned up. * Only allow non-removed validators to vote for removals Otherwise, it's risked that the remaining validators fall below 67% of the original set. * Correct publish_serai_tx, which was prior publish_set_keys in practice	2023-12-04 07:04:44 -05:00
Luke Parker	4446a369b1	Remove old TODO	2023-12-03 00:04:58 -05:00
Luke Parker	ce038972df	Use as_slice instead of as_ref I don't know why this didn't trigger for me. Potentially a difference in the month of the nightly clippy?	2023-12-03 00:04:58 -05:00
Luke Parker	2f6fb93f87	Bridge the gap between the prior two commits	2023-12-03 00:04:58 -05:00
Luke Parker	1e6cb8044c	Domain Separate the coordinator's tributary transaction hashes	2023-12-03 00:04:58 -05:00
Luke Parker	c82d1283af	Move the coordinator to expect multiple nonces This mirrors how Provided TXs handle topics. Now, instead of managing a global nonce stream, we can use items such as plan IDs as topics. This massively benefits re-attempts, as else we'd need a NOP TX to clear unused nonces.	2023-12-03 00:04:58 -05:00
Luke Parker	b823413c9b	Use parity-db in current Dockerfiles (#455 ) * Use redb and in Dockerfiles The motivation for redb was to remove the multiple rocksdb compile times from CI. * Correct feature flagging of coordinator and message-queue in Dockerfiles * Correct message-queue DB type alias * Use consistent table typing in redb * Correct rebase artifacts * Correct removal of binaries feature from message-queue * Correct processor feature flagging * Replace redb with parity-db It still has much better compile times yet doesn't block when creating multiple transactions. It also is actively maintained and doesn't grow our tree. The MPT aspects are irrelevant. * Correct stray Redb * clippy warning * Correct txn get	2023-11-30 04:22:37 -05:00
Luke Parker	f0ff3a18d2	Use debug builds in our Dockerfiles to reduce CI times (#462 ) * Use debug builds in our Dockerfiles to reduce CI times Also enables only spawning the mdns service when debug in the coordinator. * Correct underflow in processor Prior undetected due to relase builds not having bounds checks enabled. * Restore Serai release due to CI/RPC failures caused by compiling it in debug mode This is probably worth an issue filed upstream, if it can be tracked down. * Correct failing debug asserts in Monero These debug asserts assumed there was a change address to take the remainder. If there's no change address, the remainder is shunted to the fee, causing the fee to be distinct from the estimate. We presumably need to modify monero-serai such that change: None isn't valid, and users must use Change::Fingerprintable(None).	2023-11-29 00:24:37 -05:00
Luke Parker	695d1f0ecf	Remove subxt (#460 ) * Remove subxt Removes ~20 crates from our Cargo.lock. Removes downloading the metadata and enables removing the getMetadata RPC route (relevant to #379). Moves forward #337. Done now due to distinctions in the subxt 0.32 API surface which make it justifiable to not update. * fmt, update due to deny triggering on a yanked crate * Correct the handling of substrate_block_notifier now that it's ephemeral, not long-lived * Correct URL in tests/coordinator from ws to http	2023-11-28 02:29:50 -05:00
Luke Parker	571195bfda	Resolve #360 (#456 ) * Remove NetworkId from processor-messages Because intent binds to the sender/receiver, it's not needed for intent. The processor knows what the network is. The coordinator knows which to use because it's sending this message to the processor for that network. Also removes the unused zeroize. * ProcessorMessage::Completed use Session instead of key * Move SubstrateSignId to Session * Finish replacing key with session	2023-11-26 12:14:23 -05:00
Luke Parker	b79cf8abde	Move message-queue to a fully binary representation (#454 ) * Move message-queue to a fully binary representation Additionally adds a timeout to the message queue test. * coordinator clippy * Remove contention for the message-queue socket by using per-request sockets * clippy	2023-11-26 11:22:18 -05:00
Luke Parker	b296be8515	Replace bincode with borsh (#452 ) * Add SignalsConfig to chain_spec * Correct multiexp feature flagging for rand_core std * Remove bincode for borsh Replaces a non-canonical encoding with a canonical encoding which additionally should be faster. Also fixes an issue where we used bincode in transcripts where it cannot be trusted. This ended up fixing a myriad of other bugs observed, unfortunately. Accordingly, it either has to be merged or the bug fixes from it must be ported to a new PR. * Make serde optional, minimize usage * Make borsh an optional dependency of substrate/ crates * Remove unused dependencies * Use [u8; 64] where possible in the processor messages * Correct borsh feature flagging	2023-11-25 04:01:11 -05:00
Luke Parker	eb1d00aa55	Update TX format in coordinator test	2023-11-23 11:45:00 -05:00
Luke Parker	88a1726399	Fixes for prior commit	2023-11-22 16:24:50 -05:00
akildemir	fcfdadc791	Integrate session pallet into validator-sets pallet (#440 ) * remove pallet-session * Store key shares in InSet * integrate grandpa to vs-pallet * integrate pallet babe * remove pallet-session & authority discovery from runtime * update the grandpa pallet path * cargo update grandpa * cargo update substrate * Misc tweaks Sets validators for BABE/GRANDPA in chain_spec, per Akil's realization that was the missing piece. * fix pr comments * bug fix & tidy up --------- Co-authored-by: Luke Parker <lukeparker5132@gmail.com>	2023-11-22 06:22:46 -05:00
Luke Parker	07c657306b	Remove duplicated genesis presence in Tributary Scanner DB keys This wasted 32-bytes per every single entry in the DB (ignoring de-duplication possible by the DB layer).	2023-11-22 04:43:49 -05:00
David Bell	9df8c9476e	Convert tributary to use create_db macro (#448 ) * chore: convert tributary to use create_db macro * chore: fix fmt * chore: break long line	2023-11-22 04:17:51 -05:00
Luke Parker	1822e31142	Grow the yamux buffers to exceed the maximum message size	2023-11-21 02:01:41 -05:00
Luke Parker	797604ad73	Replace usage of `io::Error::new(io::ErrorKind::Other,` with `io::Error::other` Newly possible with Rust 1.74.	2023-11-19 18:31:37 -05:00
Luke Parker	74a8df4c7b	Add a new primitive of a DB-backed channel The coordinator already had one of these, albeit implemented much worse than the one now properly introduced. It had to either be sending or receiving, whereas the new one can do both at the same time. This replaces said instance and enables pleasant patterns when implementing the processor/coordinator.	2023-11-19 02:05:01 -05:00
Luke Parker	6e4ecbc90c	Support trailing commas in create_db	2023-11-18 20:54:37 -05:00
Luke Parker	be48dcc4a4	Use multiple LibP2P topics We still need a peering protocol... Hopefully, we can read peers off of the Substrate node's DHT.	2023-11-18 20:37:55 -05:00
Luke Parker	ee50f584aa	Add saner log statements to the coordinator Disables trace on every single P2P message. Logs a short-form of each transaction.	2023-11-16 13:37:39 -05:00
Luke Parker	14f3f330db	Remove deadlock in cosign_evaluator	2023-11-16 13:32:15 -05:00
Luke Parker	a03a1edbff	Remove needless transposition in coordinator	2023-11-15 22:49:58 -05:00
Luke Parker	369af0fab5	\#339 addendum	2023-11-15 20:23:19 -05:00
Luke Parker	96f1d26f7a	Add a cosigning protocol to ensure finalizations are unique (#433 ) * Add a function to deterministically decide which Serai blocks should be co-signed Has a 5 minute latency between co-signs, also used as the maximal latency before a co-sign is started. * Get all active tributaries we're in at a specific block * Add and route CosignSubstrateBlock, a new provided TX * Split queued cosigns per network * Rename BatchSignId to SubstrateSignId * Add SubstrateSignableId, a meta-type for either Batch or Block, and modularize around it * Handle the CosignSubstrateBlock provided TX * Revert substrate_signer.rs to develop (and patch to still work) Due to SubstrateSigner moving when the prior multisig closes, yet cosigning occurring with the most recent key, a single SubstrateSigner can be reused. We could manage multiple SubstrateSigners, yet considering the much lower specifications for cosigning, I'd rather treat it distinctly. * Route cosigning through the processor * Add note to rename SubstrateSigner post-PR I don't want to do so now in order to preserve the diff's clarity. * Implement cosign evaluation into the coordinator * Get tests to compile * Bug fixes, mark blocks without cosigners available as cosigned * Correct the ID Batch preprocesses are saved under, add log statements * Create a dedicated function to handle cosigns * Correct the flow around Batch verification/queueing Verifying `Batch`s could stall when a `Batch` was signed before its predecessors/before the block it's contained in was cosigned (the latter being inevitable as we can't sign a block containing a signed batch before signing the batch). Now, Batch verification happens on a distinct async task in order to not block the handling of processor messages. This task is the sole caller of verify in order to ensure last_verified_batch isn't unexpectedly mutated. When the processor message handler needs to access it, or needs to queue a Batch, it associates the DB TXN with a lock preventing the other task from doing so. This lock, as currently implemented, is a poor and inefficient design. It should be modified to the pattern used for cosign management. Additionally, a new primitive of a DB-backed channel may be immensely valuable. Fixes a standing potential deadlock and a deadlock introduced with the cosigning protocol. * Working full-stack tests After the last commit, this only required extending a timeout. * Replace "co-sign" with "cosign" to make finding text easier * Update the coordinator tests to support cosigning * Inline prior_batch calculation to prevent panic on rotation Noticed when doing a final review of the branch.	2023-11-15 16:57:21 -05:00
David Bell	c328e5ea68	Convert coordinator/tributary/nonce_decider to use create_db macro (#423 ) * chore: convert nonce_deicer to use create_db macro * Restore pub NonceDecider * Remove extraneous comma I forgot to run git commit --amend on the prior commit :/ --------- Co-authored-by: Luke Parker <lukeparker5132@gmail.com>	2023-11-12 12:04:34 -05:00
Luke Parker	54f1929078	Route blame between Processor and Coordinator (#427 ) * Have processor report errors during the DKG to the coordinator * Add RemoveParticipant, InvalidDkgShare to coordinator * Route DKG blame around coordinator * Allow public construction of AdditionalBlameMachine Necessary for upcoming work on handling DKG blame in the processor and coordinator. Additionally fixes a publicly reachable panic when commitments parsed with one ThresholdParams are used in a machine using another set of ThresholdParams. Renames InvalidProofOfKnowledge to InvalidCommitments. * Remove unused error from dleq * Implement support for VerifyBlame in the processor * Have coordinator send the processor share message relevant to Blame * Remove desync between processors reporting InvalidShare and ones reporting GeneratedKeyPair * Route blame on sign between processor and coordinator Doesn't yet act on it in coordinator. * Move txn usage as needed for stable Rust to build * Correct InvalidDkgShare serialization	2023-11-12 07:24:41 -05:00
akildemir	d015ee96a3	Dex improvements (#422 ) * remove dex traits&balance types * remove liq tokens pallet in favor of coins-pallet instance * fix tests & benchmarks * remove liquidity tokens trait * fix CI * fix pr comments * Slight renamings * Add burn_with_instruction as a negative to LiquidityTokens CallFilter * Remove use of One, Zero, Saturating taits in dex pallet --------- Co-authored-by: Luke Parker <lukeparker5132@gmail.com>	2023-11-12 06:37:31 -05:00
Luke Parker	c03fb6c71b	Add dedicated BatchSignId	2023-11-06 20:06:36 -05:00
Luke Parker	de41be6e26	Slash on SignCompleted for unrecognized plan	2023-11-05 13:42:01 -05:00
akildemir	97fedf65d0	add reasons to slash evidence (#414 ) * add reasons to slash evidence * fix CI failing * Remove unnecessary clones .encode() takes &self * InvalidVr to InvalidValidRound * Unrelated to this PR: Clarify reasoning/potentials behind dropping evidence * Clarify prevotes in SlashEvidence test * Replace use of read_to_end * Restore decode_signed_message --------- Co-authored-by: Luke Parker <lukeparker5132@gmail.com>	2023-11-05 00:04:41 -04:00
Luke Parker	e05b77d830	Support multiple key shares per validator (#416 ) * Update the coordinator to give key shares based on weight, not based on existence Participants are now identified by their starting index. While this compiles, the following is unimplemented: 1) A conversion for DKG `i` values. It assumes the threshold `i` values used will be identical for the MuSig signature used to confirm the DKG. 2) Expansion from compressed values to full values before forwarding to the processor. * Add a fn to the DkgConfirmer to convert `i` values as needed Also removes TODOs regarding Serai ensuring validator key uniqueness + validity. The current infra achieves both. * Have the Tributary DB track participation by shares, not by count * Prevent a node from obtaining 34% of the maximum amount of key shares This is actually mainly intended to set a bound on message sizes in the coordinator. Message sizes are amplified by the amount of key shares held, so setting an upper bound on said amount lets it determine constants. While that upper bound could be 150, that'd be unreasonable and increase the potential for DoS attacks. * Correct the mechanism to detect if sufficient accumulation has occured It used to check if the latest accumulation hit the required threshold. Now, accumulations may jump past the required threshold. The required mechanism is to check the threshold wasn't prior met and is now met. * Finish updating the coordinator to handle a multiple key share per validator environment * Adjust stategy re: preventing noce reuse in DKG Confirmer * Add TODOs regarding dropped transactions, add possible TODO fix * Update tests/coordinator This doesn't add new multi-key-share tests, it solely updates the existing single key-share tests to compile and run, with the necessary fixes to the coordinator. * Update processor key_gen to handle generating multiple key shares at once * Update SubstrateSigner * Update signer, clippy * Update processor tests * Update processor docker tests	2023-11-04 19:26:13 -04:00
github-actions[bot]	a2089c61fb	November 2023 - Rust Nightly Update (#413 ) * Update nightly * Replace .get(0) with .first() * allow new clippy lint --------- Co-authored-by: GitHub Actions <> Co-authored-by: Luke Parker <lukeparker5132@gmail.com>	2023-11-03 05:28:07 -04:00
Luke Parker	652c878f54	Note slight malleability in batch verification	2023-10-27 23:08:31 -04:00
Luke Parker	f22aedc007	Add tweaks meant for prior commit	2023-10-24 04:34:47 -04:00
Luke Parker	0198d4cc46	Add a TributaryState struct with higher-level DB logic	2023-10-24 03:55:13 -04:00
Luke Parker	a1b2bdf0a2	clippy fixes	2023-10-19 06:30:58 -04:00

1 2 3 4 5

219 commits