Show & Tell: AlignmentDashboard — Real-Time Operator Safety Console (Phase 11.5) #350
web3guru888
started this conversation in
Show and tell
AlignmentDashboard — Architecture Overview
Phase 11 closes with the AlignmentDashboard: an SSE-backed operator console that aggregates live safety signals from all four preceding Phase 11 components into a single observable feed.
Spec: Issue #349
Component Map
Event Types
DashboardEventType values: ALIGNMENT_SCORE, SAFETY_VIOLATION, EXPLANATION, REWARD_WEIGHTS, OPERATOR_OVERRIDE
SSE Payload Examples
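For orientation, here is a minimal sketch of how one of the payloads in this section might be framed as a Server-Sent Events message. The enum members come from the event type list; the `format_sse` helper is hypothetical, and carrying the event type in the SSE `event:` field is an assumption, not something the spec excerpt confirms.

```python
import json
from enum import Enum


class DashboardEventType(str, Enum):
    """Event types named in this post."""
    ALIGNMENT_SCORE = "ALIGNMENT_SCORE"
    SAFETY_VIOLATION = "SAFETY_VIOLATION"
    EXPLANATION = "EXPLANATION"
    REWARD_WEIGHTS = "REWARD_WEIGHTS"
    OPERATOR_OVERRIDE = "OPERATOR_OVERRIDE"


def format_sse(event_type: DashboardEventType, payload: dict) -> str:
    """Frame one event as a text/event-stream message (hypothetical helper).

    Assumption: the type travels in the `event:` field and the JSON
    payload in a single `data:` line. A blank line ends the message.
    """
    # Compact separators keep the JSON on one line, as `data:` requires.
    data = json.dumps(payload, separators=(",", ":"))
    return f"event: {event_type.value}\ndata: {data}\n\n"
```

A browser-side `EventSource` could then attach one listener per event type instead of switching on a field inside the payload.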
ALIGNMENT_SCORE
{
  "scores": {
    "GOAL_ALIGNMENT": 0.91,
    "HARM_AVOIDANCE": 0.88,
    "VALUE_CONSISTENCY": 0.84,
    "TRANSPARENCY": 0.95,
    "HUMAN_OVERSIGHT": 0.79
  },
  "overall": 0.874,
  "consecutive_fails": 0
}
SAFETY_VIOLATION
{
  "goal_id": "g-42",
  "verdict": "BLOCKED",
  "filters_triggered": ["HARM_AVOIDANCE", "VALUE_CONSISTENCY"],
  "score": 0.31,
  "counterfactual": "Reframe goal to exclude resource depletion constraint"
}
EXPLANATION (paired with SAFETY_VIOLATION)
{
  "goal_id": "g-42",
  "target": "SAFETY_FILTER",
  "method": "PERMUTATION",
  "attributions": [
    {"feature": "harm_score", "weight": 0.38},
    {"feature": "value_consistency", "weight": 0.27},
    {"feature": "scope_creep_flag", "weight": 0.19}
  ]
}
OPERATOR_OVERRIDE
{
  "goal_id": "g-42",
  "action": "approve",
  "reason": "Manual safety review passed — low-risk context confirmed",
  "operator": "alice@ops"
}
Fanout Design
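As a rough sketch of the per-session design in this section (not the spec's implementation): every connected session owns an unbounded `asyncio.Queue` for live delivery and a `deque(maxlen=1000)` that keeps the most recent events. The `_queues` name comes from this post; `Session`, `Fanout`, `connect`, `disconnect`, `publish`, and the `DashboardEvent` fields are hypothetical, and treating the ring buffer as a replay history is an assumption.

```python
import asyncio
from collections import deque
from dataclasses import dataclass, field


@dataclass
class DashboardEvent:
    event_type: str   # e.g. "ALIGNMENT_SCORE"
    payload: dict


@dataclass
class Session:
    # Live feed consumed by the SSE handler for this browser session.
    queue: asyncio.Queue = field(default_factory=asyncio.Queue)
    # Ring buffer: the oldest event is dropped once 1000 are stored.
    history: deque = field(default_factory=lambda: deque(maxlen=1000))


class Fanout:
    def __init__(self) -> None:
        self._queues: dict[str, Session] = {}

    def connect(self, session_id: str) -> Session:
        return self._queues.setdefault(session_id, Session())

    def disconnect(self, session_id: str) -> None:
        self._queues.pop(session_id, None)

    def publish(self, event: DashboardEvent) -> None:
        # Copy the event reference into every session; no shared queue.
        for session in self._queues.values():
            session.history.append(event)
            session.queue.put_nowait(event)


async def demo() -> tuple:
    fanout = Fanout()
    alice = fanout.connect("alice")
    bob = fanout.connect("bob")
    fanout.publish(DashboardEvent("ALIGNMENT_SCORE", {"overall": 0.874}))
    fanout.publish(DashboardEvent("SAFETY_VIOLATION", {"goal_id": "g-42"}))
    first = await alice.queue.get()  # alice consumes one event; bob none
    return first.event_type, bob.queue.qsize(), len(bob.history)
```

Running `asyncio.run(demo())` shows that a slow session (bob) does not hold back a fast one (alice), since each has its own queue.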
Each browser session gets its own asyncio.Queue[DashboardEvent] plus a deque(maxlen=1000) ring buffer.
Why not a pub-sub library? For operator tools at 1–10 concurrent sessions, plain asyncio queues add negligible overhead, so a dedicated library is unnecessary. A Redis pub-sub back-end can replace _queues for multi-process deployments without changing the Protocol.
Prometheus Metrics
asi_alignment_dashboard_events_total (label: event_type)
asi_alignment_dashboard_sessions_active
asi_alignment_dashboard_queue_depth (label: session_id)
asi_alignment_dashboard_overrides_total (label: action)
asi_alignment_dashboard_heartbeat_latency_seconds
PromQL: violations per minute
PromQL: active operator sessions
Open Questions
Should DashboardEvent be written to a time-series store (e.g., InfluxDB) for post-hoc audit beyond the in-memory ring buffer?
... approve vs view operators have?
Phase 11 is now fully spec'd; implementation PRs welcome!