Abstract
At the moment, an amaru node fails to reconnect to an upstream node if that node stops and then restarts.
We need to write a test showing that a node always reconnects after the upstream node has been temporarily disconnected.
Why?
We need operations to be reliable, so that SPOs don't need to restart the node manually in that situation.
How?
This looks like a bug, since the current Manager code in amaru-network is supposed to schedule a reconnection.
Testing Strategy / Acceptance Criteria
We should have a script (or possibly an in-memory test using TCP between 2 nodes, like this one) covering these 2 situations:
- The initiator node is started while the responder (upstream) node is down. Then the responder node is started, and the initiator node should connect and synchronize.
- Both nodes are started, then the responder node is restarted. We should observe a reconnection and the synchronization resuming.
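As a starting point for the in-memory variant, the two situations above could be sketched roughly as follows. This is only an illustration under stated assumptions: it uses plain std::net sockets as stand-ins for the two nodes, not the amaru-network Manager, and every name in it (retry, serve_once, run_initiator, free_addr, responder_started_late, responder_restarted) is hypothetical. "Synchronizing" is reduced to receiving one byte per session.

```rust
use std::io::{Read, Write};
use std::net::{SocketAddr, TcpListener, TcpStream};
use std::thread;
use std::time::{Duration, Instant};

// Assumed values for the sketch, not amaru's actual reconnection settings.
const RETRY_EVERY: Duration = Duration::from_millis(50);
const DEADLINE: Duration = Duration::from_secs(5);

/// Retry `op` until it succeeds or the deadline passes.
fn retry<T>(mut op: impl FnMut() -> std::io::Result<T>) -> Option<T> {
    let start = Instant::now();
    while start.elapsed() < DEADLINE {
        if let Ok(value) = op() {
            return Some(value);
        }
        thread::sleep(RETRY_EVERY);
    }
    None
}

/// Stand-in responder: accept one connection, stop listening,
/// send one byte, then close the connection (the node "goes down").
fn serve_once(listener: TcpListener, byte: u8) {
    let (mut conn, _) = listener.accept().expect("accept failed");
    drop(listener);
    conn.write_all(&[byte]).expect("write failed");
}

/// Stand-in initiator: (re)connect until `expected` bytes have been received.
fn run_initiator(addr: SocketAddr, expected: usize) -> Vec<u8> {
    let mut received = Vec::new();
    while received.len() < expected {
        let mut stream = match retry(|| TcpStream::connect(addr)) {
            Some(stream) => stream,
            None => break, // the responder never came back
        };
        // Read until the responder closes the connection; an error also
        // ends the session, and the loop then schedules a reconnection.
        let _ = stream.read_to_end(&mut received);
    }
    received
}

/// Pick a free local port, then release it so nothing listens on it.
fn free_addr() -> SocketAddr {
    TcpListener::bind("127.0.0.1:0")
        .expect("bind failed")
        .local_addr()
        .expect("no local address")
}

/// Situation 1: the initiator starts while the responder is down.
fn responder_started_late() -> Vec<u8> {
    let addr = free_addr();
    let initiator = thread::spawn(move || run_initiator(addr, 1));
    thread::sleep(Duration::from_millis(200)); // responder still down
    let listener = retry(|| TcpListener::bind(addr)).expect("bind failed");
    serve_once(listener, 1);
    initiator.join().expect("initiator panicked")
}

/// Situation 2: both are up, then the responder restarts.
fn responder_restarted() -> Vec<u8> {
    let addr = free_addr();
    let listener = retry(|| TcpListener::bind(addr)).expect("bind failed");
    let initiator = thread::spawn(move || run_initiator(addr, 2));
    serve_once(listener, 1); // first session, then the responder goes down
    thread::sleep(Duration::from_millis(200));
    let listener = retry(|| TcpListener::bind(addr)).expect("rebind failed");
    serve_once(listener, 2); // second session after the restart
    initiator.join().expect("initiator panicked")
}
```

In a real test, the two stand-ins would be replaced by actual amaru nodes, and the acceptance check would be that chain synchronization resumes rather than that a byte arrives. The shape of the assertions would stay the same: responder_started_late() should yield [1] and responder_restarted() should yield [1, 2].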
Discussion points
No response
Dependencies & Related Tasks
No response
Checklist