You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Update ARCHITECTURE and TODO docs with v0.2 features, add zh/ar translations
Rewrite ARCHITECTURE.md to reflect current state: 4-step Research pipeline,
mixin-based orchestrator, skills system, conda isolation, DB state management,
9 agents. Add Chinese and Arabic translations. Update TODO.md with recently
completed v0.2 items and correct test count (115).
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
| 3 | Planner | Generate initial research plan from survey results |
51
+
| 4 | Experimenter | Run first round of experiments based on plan |
52
+
53
+
### Review Loop
54
+
55
+
Each iteration runs 5 steps: Compile → Review → Plan → Execute → Validate.
56
+
57
+
The Planner outputs structured YAML action plans:
58
+
59
+
```yaml
60
+
actions:
61
+
- agent: experimenter
62
+
task: "Run perplexity validation experiment"
63
+
priority: 1
64
+
- agent: writer
65
+
task: "Update Section 4.2"
66
+
priority: 2
35
67
```
36
68
37
69
## Core Components
@@ -52,7 +84,7 @@ class SimpleMemory:
52
84
```
53
85
54
86
Additional features:
55
-
-**Issue tracking**: Counts how many times each issue reappears across iterations
87
+
- **Issue tracking**: Content-based dedup — counts how many times each issue reappears across iterations
56
88
- **Repair validation**: Verifies that attempted fixes actually resolved the issue
57
89
- **Strategy escalation**: Automatically bans ineffective methods and suggests alternatives
58
90
- **Meta-debugging**: Triggers diagnostic when the system is stuck
@@ -61,50 +93,54 @@ Additional features:
61
93
62
94
Every agent invocation includes a constant "Goal Anchor" that describes the project's core objectives. This prevents agents from drifting off-topic over many iterations.
63
95
64
-
The Goal Anchor is project-specific and should be configured per project.
65
-
66
-
### 3. Planner Agent
96
+
### 3. Orchestrator (`orchestrator.py`)
67
97
68
-
The **core decision-maker**. Outputs a structured action plan:
98
+
Mixin-based design with 5 mixins:
69
99
70
-
```yaml
71
-
actions:
72
-
- agent: experimenter
73
-
task: "Run perplexity validation experiment"
74
-
priority: 1
75
-
- agent: writer
76
-
task: "Update Section 4.2"
77
-
priority: 2
100
+
```python
101
+
class Orchestrator(ResearchMixin, DevMixin, ReviewMixin, FigureMixin, BaseMixin):
102
+
# Dispatches to the correct phase based on mode
103
+
# Syncs status to DB after each step
104
+
# Handles Telegram notifications
78
105
```
79
106
80
-
### 4. Orchestrator (`orchestrator.py`)
107
+
### 4. Skills System (`skills/`)
81
108
82
-
Minimal control flow:
109
+
Modular instruction sets loaded at runtime:
83
110
84
-
```python
85
-
def run_paper_iteration():
86
-
# 1. Review
87
-
review = run_agent("reviewer")
88
-
score = parse_score(review)
89
-
memory.record_score(score)
90
-
91
-
# 2. Stagnation detection
92
-
if memory.is_stagnating():
93
-
send_notification("Human intervention needed")
94
-
95
-
# 3. Planner decides + execute
96
-
run_planner_cycle(review)
97
-
98
-
# 4. Visualize + commit
99
-
run_figure_phase()
100
-
compile_latex()
101
-
git_commit()
102
-
```
111
+
| Skill | Purpose |
112
+
|:------|:--------|
113
+
| **research-integrity** | Anti-simulation: agents must run real experiments |
114
+
| **human-intervention** | Escalation protocol via Telegram |
0 commit comments