serai

mirror of https://github.com/serai-dex/serai.git synced 2024-11-17 01:17:36 +00:00

Author	SHA1	Message	Date
Luke Parker	7eb388e546	PR to track down CI failures (#501) * Use an extended timeout for DKGs specifically * Add a log statement when message-queue connection fails * Add a 60 second keep-alive to connections * Use zalloc for processor/message-queue/coordinator An additional layer which protects us against edge cases with Zeroizing (objects which don't support it or don't miss it). * Add further logs to message-queue * Further increase re-attempt timeouts in CI * Remove misplaced continue in message-queue client Fixes observed CI failures. * Revert "Further increase re-attempt timeouts in CI" This reverts commit `3723530cf6`.	2024-01-04 01:08:13 -05:00
Luke Parker	02776c54a8	Increase reattempt delays in the GH CI, which is extremely latent	2023-12-30 22:11:04 -05:00
Luke Parker	b493e3e31f	Validator DHT (#494) * Route validators for any active set through sc-authority-discovery Additionally adds an RPC route to retrieve their P2P addresses. * Have the coordinator get peers from substrate * Have the RPC return one address, not up to 3 Prevents the coordinator from believing it has 3 peers when it has one. * Add missing feature to serai-client * Correct network argument in serai-client for p2p_validators call * Add a test in serai-client to check DHT population with a much quicker failure than the coordinator tests * Update to latest Substrate Removes distinguishing BABE/AuthorityDiscovery keys which causes sc_authority_discovery to populate as desired. * Update to a properly tagged substrate commit * Add all dialed to peers to GossipSub * cargo fmt * Reduce common code in serai-coordinator-tests with a more involved new_test * Use a recursive async function to spawn `n` DockerTests with the necessary networking configuration * Merge UNIQUE_ID and ONE_AT_A_TIME * Tidy up the new recursive code in tests/coordinator * Use a Mutex in CONTEXT to let it be set multiple times * Make complementary edits to full-stack tests * Augment coordinator P2p connection logs * Drop lock acquisitions before recursing * Better scope lock acquisitions in full-stack, preventing a deadlock * Ensure OUTER_OPS is reset across the test boundary * Add cargo deny allowance for dockertest fork	2023-12-22 21:09:18 -05:00
Luke Parker	00774c29d7	Replace remaining direct uses of futures with futures_util Slight downscope which helps combat the antipattern which is the futures glob crate. While futures_util is still a large crate, it has better defaults and is smaller by virtue of not pulling the executor.	2023-12-18 19:45:08 -05:00
Luke Parker	ea3af28139	Add workspace lints	2023-12-17 00:04:47 -05:00
Luke Parker	11fdb6da1d	Coordinator Cleanup (#481) * Move logic for evaluating if a cosign should occur to its own file Cleans it up and makes it more robust. * Have expected_next_batch return an error instead of retrying While convenient to offer an error-free implementation, it potentially caused very long lived lock acquisitions in handle_processor_message. * Unify and clean DkgConfirmer and DkgRemoval Does so via adding a new file for the common code, SigningProtocol. Modifies from_cache to return the preprocess with the machine, as there's no reason not to. Also removes an unused Result around the type. Clarifies the security around deterministic nonces, removing them for saved-to-disk cached preprocesses. The cached preprocesses are encrypted as the DB is not a proper secret store. Moves arguments always present in the protocol from function arguments into the struct itself. Removes the horribly ugly code in DkgRemoval, fixing multiple issues present with it which would cause it to fail on use. * Set SeraiBlockNumber in cosign.rs as it's used by the cosigning protocol * Remove unnecessary Clone from lambdas in coordinator * Remove the EventDb from Tributary scanner We used per-Transaction DB TXNs so on error, we don't have to rescan the entire block yet only the rest of it. We prevented scanning multiple transactions by tracking which we already had. This is over-engineered and not worth it. * Implement borsh for HasEvents, removing the manual encoding * Merge DkgConfirmer and DkgRemoval into signing_protocol.rs Fixes a bug in DkgConfirmer which would cause it to improperly handle indexes if any validator had multiple key shares. * Strictly type DataSpecification's Label * Correct threshold_i_map_to_keys_and_musig_i_map It didn't include the participant's own index and accordingly was offset. * Create TributaryBlockHandler This struct contains all variables prior passed to handle_block and stops them from being passed around again and again.
This also ensures fatal_slash is only called while handling a block, as needed as it expects to operate under perfect consensus. * Inline accumulate, store confirmation nonces with shares Inlining accumulate makes sense due to the amount of data accumulate needed to be passed. Storing confirmation nonces with shares ensures that both are available or neither. Prior, one could be yet the other may not have been (requiring an assert in runtime to ensure we didn't bungle it somehow). * Create helper functions for handling DkgRemoval/SubstrateSign/Sign Tributary TXs * Move Label into SignData All of our transactions which use SignData end up with the same common usage pattern for Label, justifying this. Removes 3 transactions, explicitly de-duplicating their handlers. * Remove CurrentlyCompletingKeyPair for the non-contextual DkgKeyPair * Remove the manual read/write for TributarySpec for borsh This struct doesn't have any optimizations booned by the manual impl. Using borsh reduces our scope. * Use temporary variables to further minimize LoC in tributary handler * Remove usage of tuples for non-trivial Tributary transactions * Remove serde from dkg serde could be used to deserialize internally inconsistent objects which could lead to panics or faults. The BorshDeserialize derives have been replaced with a manual implementation which won't produce inconsistent objects. * Abstract Future generics using new trait definitions in coordinator * Move published_signed_transaction to tributary/mod.rs to reduce the size of main.rs * Split coordinator/src/tributary/mod.rs into spec.rs and transaction.rs	2023-12-10 20:21:44 -05:00
Luke Parker	6caf45ea1d	Downscope usage of futures	2023-12-10 19:32:52 -05:00
Luke Parker	b823413c9b	Use parity-db in current Dockerfiles (#455) * Use redb and in Dockerfiles The motivation for redb was to remove the multiple rocksdb compile times from CI. * Correct feature flagging of coordinator and message-queue in Dockerfiles * Correct message-queue DB type alias * Use consistent table typing in redb * Correct rebase artifacts * Correct removal of binaries feature from message-queue * Correct processor feature flagging * Replace redb with parity-db It still has much better compile times yet doesn't block when creating multiple transactions. It also is actively maintained and doesn't grow our tree. The MPT aspects are irrelevant. * Correct stray Redb * clippy warning * Correct txn get	2023-11-30 04:22:37 -05:00
Luke Parker	b296be8515	Replace bincode with borsh (#452) * Add SignalsConfig to chain_spec * Correct multiexp feature flagging for rand_core std * Remove bincode for borsh Replaces a non-canonical encoding with a canonical encoding which additionally should be faster. Also fixes an issue where we used bincode in transcripts where it cannot be trusted. This ended up fixing a myriad of other bugs observed, unfortunately. Accordingly, it either has to be merged or the bug fixes from it must be ported to a new PR. * Make serde optional, minimize usage * Make borsh an optional dependency of substrate/ crates * Remove unused dependencies * Use [u8; 64] where possible in the processor messages * Correct borsh feature flagging	2023-11-25 04:01:11 -05:00
Luke Parker	96f1d26f7a	Add a cosigning protocol to ensure finalizations are unique (#433) * Add a function to deterministically decide which Serai blocks should be co-signed Has a 5 minute latency between co-signs, also used as the maximal latency before a co-sign is started. * Get all active tributaries we're in at a specific block * Add and route CosignSubstrateBlock, a new provided TX * Split queued cosigns per network * Rename BatchSignId to SubstrateSignId * Add SubstrateSignableId, a meta-type for either Batch or Block, and modularize around it * Handle the CosignSubstrateBlock provided TX * Revert substrate_signer.rs to develop (and patch to still work) Due to SubstrateSigner moving when the prior multisig closes, yet cosigning occurring with the most recent key, a single SubstrateSigner can be reused. We could manage multiple SubstrateSigners, yet considering the much lower specifications for cosigning, I'd rather treat it distinctly. * Route cosigning through the processor * Add note to rename SubstrateSigner post-PR I don't want to do so now in order to preserve the diff's clarity. * Implement cosign evaluation into the coordinator * Get tests to compile * Bug fixes, mark blocks without cosigners available as cosigned * Correct the ID Batch preprocesses are saved under, add log statements * Create a dedicated function to handle cosigns * Correct the flow around Batch verification/queueing Verifying `Batch`s could stall when a `Batch` was signed before its predecessors/before the block it's contained in was cosigned (the latter being inevitable as we can't sign a block containing a signed batch before signing the batch). Now, Batch verification happens on a distinct async task in order to not block the handling of processor messages. This task is the sole caller of verify in order to ensure last_verified_batch isn't unexpectedly mutated.
When the processor message handler needs to access it, or needs to queue a Batch, it associates the DB TXN with a lock preventing the other task from doing so. This lock, as currently implemented, is a poor and inefficient design. It should be modified to the pattern used for cosign management. Additionally, a new primitive of a DB-backed channel may be immensely valuable. Fixes a standing potential deadlock and a deadlock introduced with the cosigning protocol. * Working full-stack tests After the last commit, this only required extending a timeout. * Replace "co-sign" with "cosign" to make finding text easier * Update the coordinator tests to support cosigning * Inline prior_batch calculation to prevent panic on rotation Noticed when doing a final review of the branch.	2023-11-15 16:57:21 -05:00
David Bell	c328e5ea68	Convert coordinator/tributary/nonce_decider to use create_db macro (#423) * chore: convert nonce_decider to use create_db macro * Restore pub NonceDecider * Remove extraneous comma I forgot to run git commit --amend on the prior commit :/ --------- Co-authored-by: Luke Parker <lukeparker5132@gmail.com>	2023-11-12 12:04:34 -05:00
Luke Parker	05dc474cb3	Correct std feature-flagging If a crate has std set, it should enable std for all dependencies in order to let them properly select which algorithms to use. Some crates fall back to slower/worse algorithms on no-std. Also more aggressively sets default-features = false leading to a 10% reduction in the number of crates coordinator builds.	2023-10-31 07:44:02 -04:00
Luke Parker	0198d4cc46	Add a TributaryState struct with higher-level DB logic	2023-10-24 03:55:13 -04:00
Luke Parker	77f7794452	Remove lazy_static for proper use of channels	2023-09-25 18:23:52 -04:00
Luke Parker	5e02f936e4	Perform MuSig signing of generated keys	2023-08-14 06:08:55 -04:00
Luke Parker	f6f945e747	Add a LibP2P instantiation to coordinator It's largely unoptimized, and not yet exclusive to validators, yet has basic sanity (using message content for ID instead of sender + index). Fixes bugs as found. Notably, we used a time in milliseconds where the Tributary expected seconds. Also has Tributary::new jump to the presumed round number. This reduces slashes when starting new chains (whose times will be before the current time) and was the only way I was able to observe successful confirmations given current surrounding infrastructure.	2023-08-08 15:12:47 -04:00
Luke Parker	d5c787fea2	Add initial coordinator e2e tests	2023-08-01 19:00:48 -04:00
Luke Parker	3862731a12	Minimize features pulled in to try and reduce build times	2023-07-25 22:29:39 -04:00
Luke Parker	9effd5ccdc	Add a Docker-based test for the message-queue service	2023-07-20 18:53:11 -04:00
Luke Parker	a7c9c1ef55	Integrate coordinator with MessageQueue and RocksDB Also resolves a couple TODOs.	2023-07-18 01:53:51 -04:00
Luke Parker	d49c636f0f	Use serai- prefixes on Serai-specific packages Fixes deny.toml, also runs a minor cargo update shrinking the tree.	2023-07-03 08:50:23 -04:00
Luke Parker	168f2899f0	Create a vote transaction upon GeneratedKeyPair	2023-05-10 00:46:51 -04:00
Luke Parker	ad5522d854	Start handling P2P messages This defines the start of a very complex series of locks I'm really unhappy with. At the same time, there's not immediately a better solution. This also should work without issue.	2023-04-23 17:01:30 -04:00
Luke Parker	2b09309adc	Handle adding new Tributaries Removes last_block as an argument from Tendermint. It now loads from the DB as needed. While slightly less performant, it's easiest and should be fine.	2023-04-23 03:51:26 -04:00
Luke Parker	710e6e5217	Add Transaction::sign. While I don't love the introduction of empty_signed, it's practically fine.	2023-04-23 01:25:45 -04:00
Luke Parker	af84b7f707	Add a test for Tributary Further fleshes out the Tributary testing code.	2023-04-22 22:28:20 -04:00
Luke Parker	8c74576cf0	Add a test to the coordinator for running a Tributary Impls a LocalP2p for testing. Moves rebroadcasting into Tendermint, since it's what knows if a message is fully valid + original. Removes TributarySpec::validators() HashMap, as its non-determinism caused different instances to have different round robin schedules. It was already prior moved to a Vec for this issue, so I'm unsure why this remnant existed. Also renames the GH no-std workflow from the prior commit.	2023-04-22 10:49:52 -04:00
Luke Parker	9da0eb69c7	Use an enum for Coin/NetworkId It originally wasn't an enum so software which had yet to update before an integration wouldn't error (as now enums are strictly typed). The strict typing is preferable though.	2023-04-18 02:04:47 -04:00
Luke Parker	79655672ef	Make progress on handling NewSet events Further bones out the coordinator.	2023-04-16 00:51:56 -04:00
Luke Parker	eafd054296	Start defining the coordinator	2023-04-15 17:38:47 -04:00
Luke Parker	2cfee536f6	Define all coordinator transaction types	2023-04-11 19:04:53 -04:00
Luke Parker	de52c4db7f	Add empty coordinator	2023-04-11 09:21:35 -04:00

32 commits