Skip to content

Commit 8fc9c6c

Browse files
committed
chore(soak): raise monitor wedge threshold 5->30min (known #767 residual self-recovers)
1 parent 0519800 commit 8fc9c6c

1 file changed

Lines changed: 5 additions & 2 deletions

File tree

scripts/soak/pots-wedge-monitor.sh

Lines changed: 5 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -17,8 +17,11 @@ NODE_PAT="release-prof/dugite-node.*db-mainnet-genesis"
1717
# the monitor follows the node across restarts.
1818
LOG=$(cat /tmp/dugite-mainnet-soak-current.log 2>/dev/null || echo /private/tmp/dugite-mainnet-soak-20260615-154625.log)
1919
REPORTS=/Users/michaelfazio/Source/dugite/reports
20-
WEDGE_STALL=5 # consecutive 60s ticks with no block advance before sampling
21-
# (>=5min; tolerates long Alonzo/Babbage snapshot+flush pauses)
20+
WEDGE_STALL=30 # consecutive 60s ticks with no block advance before sampling
21+
# (>=30min). Raised from 5: the KNOWN #767 residual (peer-Slow-
22+
# cascade) self-recovers within ~5-7min, so a 5min threshold spams
23+
# samples; only a genuine >30min hard wedge (beyond the residual)
24+
# should alert now.
2225
# Match ERROR/FATAL *log-level* lines (anchored on the tracing timestamp so we
2326
# don't match "ERROR" inside a message), plus panics and REAL divergence
2427
# signals. Deliberately NOT a bare "diverg" — that matches the benign

0 commit comments

Comments
 (0)