This checklist defines the minimum operational resilience gates before CFP go-live.
Execution automation:
- GitHub Actions workflow:
CFP Go-Live Resilience(.github/workflows/cfp-go-live-resilience.yml).
- PR merged into
main. - Latest
Production Releaseworkflow completed successfully. - Health endpoints validated:
/q/health//comunidad/eventos/proyectos
- Run first-level incident commands:
homedir-ir-first-level.sh statushomedir-ir-first-level.sh snapshothomedir-ir-first-level.sh shield-on- verify maintenance mode externally (HTTP
503) homedir-ir-first-level.sh shield-off
- Keep evidence in incident log directory.
- Generate encrypted backup with
homedir-dr-backup.sh. - Verify restore extraction with
homedir-dr-restore.py. - Execute
homedir-dr-recover.sh --dry-run --skip-data-restore --apply-hardening. - Confirm no placeholder secrets in
/etc/homedir.env.
homedir-cfp-traffic-guard.timerenabled and active.homedir-cfp-traffic-guard.sh checkpasses.- Alerting path configured (
homedir-discord-alert.sh).
- Execute synthetic CFP/community probe with multi-origin simulation.
- Enforce thresholds on:
- error rate
429volume- timeout volume
- Store artifact report in GitHub Actions run.