-
Notifications
You must be signed in to change notification settings - Fork 747
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix flaky zombienet test zombienet-cumulus-0002-pov_recovery
#6573
Comments
This zombienet test also fails locally consistently at registering the parachain. The tx sent here is not finalized.
This is really strange. I found the issue was introduced with #6400. Not sure why removing the network starter causes grandpa issue. Reverting that PR makes the test pass again. @paritytech/networking Please take a deeper look into this. |
@skunert can you please more logs? What you mean by the "tx is not finalized"? Only the tx never makes it into a block or does finality doesn't work at all? |
Finalization doesn't work at all when we use the reserved flags Thx! |
@bkchr assembled some more logs:
All node logs: |
Let me clarify, the finalization is stalled if and only if both is true:
|
That is the working assumption. In my tests I focused on finding the commit that broke it. Did not verify the involvement of the reserved only flag. |
I just deployed to networks ( from latest image of master), with And without the flag finalization works as expected, here are the logs https://grafana.teleport.parity.io/goto/ss2DJdIHg?orgId=1 I will deploy other networks (without #6400) to check if the behavior is the same. Thx! |
Small update, I deployed the same config with the previous commit from master (05ad5475dec748a8a30685bc29a7caba6e63c7ab) and both cases works as expected. Here are the losg: Thx! |
Why are we passing Without |
What happens here is that due to removed network starter networking starts first and connects to reserved peers on default (sync) protocol before Because we abuse the reserved nodes mechanism on non-default protocols to track sync peers, When This issue can potentially manifest in the wild if a node is configured to connect only to |
The issue is fixed by #7011. |
To simplify debugging of peerset related issues like #6573 (comment). --------- Co-authored-by: command-bot <>
Introduce `SyncEvent::InitialPeers` message sent to new subscribers to allow them correctly tracking sync peers. This resolves a race condition described in #6573 (comment). Fixes #6573. --------- Co-authored-by: command-bot <> Co-authored-by: Bastian Köcher <[email protected]>
Some examples from Tuesday
Fail to start (node three)
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7786180
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7785532
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7784688
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7784356
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7784158
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7778904
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7778369
https://grafana.teleport.parity.io/goto/YZ5Wuv7Ng?orgId=1
Job timeout (1hr)
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7786118
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7785725
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7785632
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7785631
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7785467
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7785047
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7785046
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7784814
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7784395
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7784394
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7784326
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7783865
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7783800
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7783336
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7782963
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7782782
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7782653
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7782364
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7782290
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7782160
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7782095
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7782030
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7781962
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7781897
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7780855
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7780523
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7780495
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7780314
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7780056
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7779863
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7779772
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7779765
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7779680
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7779553
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7779490
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7779425
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7779332
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7779298
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7779102
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7779102
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7779036
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7778876
https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/7778239
cc: @paritytech/sdk-node
The text was updated successfully, but these errors were encountered: