serai

mirror of https://github.com/serai-dex/serai.git synced 2025-12-13 14:39:25 +00:00

Author	SHA1	Message	Date
Luke Parker	96f1d26f7a	Add a cosigning protocol to ensure finalizations are unique (#433) * Add a function to deterministically decide which Serai blocks should be co-signed Has a 5 minute latency between co-signs, also used as the maximal latency before a co-sign is started. * Get all active tributaries we're in at a specific block * Add and route CosignSubstrateBlock, a new provided TX * Split queued cosigns per network * Rename BatchSignId to SubstrateSignId * Add SubstrateSignableId, a meta-type for either Batch or Block, and modularize around it * Handle the CosignSubstrateBlock provided TX * Revert substrate_signer.rs to develop (and patch to still work) Due to SubstrateSigner moving when the prior multisig closes, yet cosigning occurring with the most recent key, a single SubstrateSigner can be reused. We could manage multiple SubstrateSigners, yet considering the much lower specifications for cosigning, I'd rather treat it distinctly. * Route cosigning through the processor * Add note to rename SubstrateSigner post-PR I don't want to do so now in order to preserve the diff's clarity. * Implement cosign evaluation into the coordinator * Get tests to compile * Bug fixes, mark blocks without cosigners available as cosigned * Correct the ID Batch preprocesses are saved under, add log statements * Create a dedicated function to handle cosigns * Correct the flow around Batch verification/queueing Verifying `Batch`s could stall when a `Batch` was signed before its predecessors/before the block it's contained in was cosigned (the latter being inevitable as we can't sign a block containing a signed batch before signing the batch). Now, Batch verification happens on a distinct async task in order to not block the handling of processor messages. This task is the sole caller of verify in order to ensure last_verified_batch isn't unexpectedly mutated.
When the processor message handler needs to access it, or needs to queue a Batch, it associates the DB TXN with a lock preventing the other task from doing so. This lock, as currently implemented, is a poor and inefficient design. It should be modified to the pattern used for cosign management. Additionally, a new primitive of a DB-backed channel may be immensely valuable. Fixes a standing potential deadlock and a deadlock introduced with the cosigning protocol. * Working full-stack tests After the last commit, this only required extending a timeout. * Replace "co-sign" with "cosign" to make finding text easier * Update the coordinator tests to support cosigning * Inline prior_batch calculation to prevent panic on rotation Noticed when doing a final review of the branch.	2023-11-15 16:57:21 -05:00
Luke Parker	863a7842ca	Have every node respond to Heartbeat so they don't download the messages over the net	2023-10-14 15:27:40 -04:00
Luke Parker	f414735be5	Redo new_tributary from being over ActiveTributary to TributaryEvent TributaryEvent also allows broadcasting a retire event.	2023-10-14 15:27:39 -04:00
Luke Parker	80e5ca9328	Move heartbeat_tributaries and handle_p2p to p2p.rs	2023-10-13 22:40:11 -04:00
Luke Parker	a73b19e2b8	Tweak coordinator test timing	2023-10-13 21:46:26 -04:00
Luke Parker	96c397caa0	Add content-based deduplication to the tests' shimmed P2P The tests have recently had their timing stilted, causing failures. The tests are... fine. They're fragile, as obvious, yet they're logical. The simplest fix is to unstilt their timing rather than to make them non-fragile. The recent change, which presumably caused said stilting, was the rebroadcasting added. This de-duplication prevents most of the impact of rebroadcasting. While there's still the async task, and the lock acquisition on attempt to rebroadcast, this hopefully is enough.	2023-10-13 19:47:58 -04:00
Luke Parker	32a9a33226	Adjust sync test timeout to resolve infrequent failure This isn't an unacceptable timeout. It matches a prior timeout. I'm unsure why it's now needed to be extended though. My best guess is the test runtime is single threaded and there's now new overhead in the task management (or perhaps higher latency now that messages per-tributary is serialized).	2023-09-26 17:28:41 -04:00
Luke Parker	9f3840d1cf	Localize Tributary HashMaps, offering flexibility and removing contention	2023-09-25 19:28:53 -04:00
Luke Parker	f6f945e747	Add a LibP2P instantiation to coordinator It's largely unoptimized, and not yet exclusive to validators, yet has basic sanity (using message content for ID instead of sender + index). Fixes bugs as found. Notably, we used a time in milliseconds where the Tributary expected seconds. Also has Tributary::new jump to the presumed round number. This reduces slashes when starting new chains (whose times will be before the current time) and was the only way I was able to observe successful confirmations given current surrounding infrastructure.	2023-08-08 15:12:47 -04:00
Luke Parker	f6a497f3ac	Slight terminology correction in sync test Also correct a mistake from merging the most recent polkadot version.	2023-06-28 15:20:50 -04:00
akildemir	790fe7ee23	fix tributary sync test	2023-06-28 15:01:55 -04:00
Luke Parker	e74b4ab94f	Add a TributaryReader which doesn't require a borrow to operate Reduces lock contention. Additionally changes block_key to include the genesis. While not technically needed, the lack of genesis introduced a side effect where any Tributary on the database could return the block of any other Tributary. While that wasn't a security issue, returning it suggested it was on-chain when it wasn't. This may have been usable to create issues.	2023-04-24 07:02:00 -04:00
Luke Parker	2feebe536e	Test handle_p2p and Tributary syncing Includes bug fixes.	2023-04-24 03:30:19 -04:00

13 Commits