Add CLI conda env support and update docs for v0.2 features

JihaoXin · claude · JihaoXin · commit 8de72525e991 · 2026-04-15T12:25:25.000Z
ark run now auto-detects per-project conda env (matching webapp behavior).
Update all three READMEs with: 4-step research pipeline, skills system,
environment isolation, anti-simulation enforcement, human intervention
protocol, rich Telegram notifications, and phase badges in web portal.

Co-Authored-By: Claude Opus 4.6 (1M context) &lt;noreply@anthropic.com&gt;
diff --git a/README.md b/README.md
@@ -18,7 +18,7 @@
   <a href="https://github.com/kaust-ark/ARK/actions/workflows/ci.yml"><img src="https://github.com/kaust-ark/ARK/actions/workflows/ci.yml/badge.svg" alt="CI"></a>
   <img src="https://img.shields.io/badge/agents-8-orange.svg" alt="8 Agents">
   <img src="https://img.shields.io/badge/venues-11+-purple.svg" alt="11+ Venues">
-  <img src="https://img.shields.io/badge/tests-106-brightgreen.svg" alt="106 Tests">
+  <img src="https://img.shields.io/badge/tests-115-brightgreen.svg" alt="115 Tests">
 </p>
 
 <p align="center">
@@ -59,7 +59,7 @@ ARK runs three phases in sequence. The Review phase loops until the paper reache
 
 | Phase | What Happens |
 |:------|:-------------|
-| **Research** | Gemini Deep Research runs a literature survey and gathers background knowledge |
+| **Research** | 4-step pipeline: Deep Research &rarr; Initializer (bootstrap env &amp; citations) &rarr; Planner &rarr; Experimenter |
 | **Dev** | Iterative experiment cycle: plan &rarr; run on Slurm &rarr; analyze &rarr; write initial draft |
 | **Review** | Compile &rarr; Review &rarr; Plan &rarr; Execute &rarr; Validate, repeating until score &ge; threshold |
 
@@ -110,6 +110,40 @@ The loop repeats until the score reaches the acceptance threshold &mdash; or you
 | **Formatting** | Broken layouts, LaTeX errors, manual cleanup | Hard-coded LaTeX + venue templates (NeurIPS, ACL, IEEE&hellip;) |
 | **Citations** | LLMs fabricate plausible-looking references | Every citation verified against DBLP &mdash; no fake references |
 | **Figures** | Default styles, wrong sizes, no page awareness | Nano Banana + venue-aware canvas, column widths, and fonts |
+| **Isolation** | Shared env &mdash; projects interfere with each other | Per-project conda env, sandboxed HOME, full multi-tenant isolation |
+| **Integrity** | LLMs simulate results instead of running real experiments | Anti-simulation prompts + builtin skills enforce real execution |
+
+---
+
+## Environment Isolation
+
+Each project runs in its own **per-project conda environment**, cloned from a base env at project creation. This ensures full multi-tenant isolation:
+
+- **Sandboxed Python** &mdash; per-project `.env/` directory with its own packages
+- **Isolated HOME** &mdash; each orchestrator runs with `HOME` set to the project directory
+- **No cross-contamination** &mdash; `PYTHONNOUSERSITE=1` prevents leaking user-site packages
+- **Automatic provisioning** &mdash; `ark run` and the Web Portal detect and use the project conda env; the pipeline bootstraps it if missing
+
+```bash
+# The conda env is created automatically on first run.
+# ark run will detect and use it:
+ark run myproject
+#   Conda env: /path/to/projects/myproject/.env
+```
+
+## Skills System
+
+ARK ships with **builtin skills** &mdash; modular instruction sets that agents load at runtime to enforce best practices:
+
+| Skill | Purpose |
+|:------|:--------|
+| **research-integrity** | Anti-simulation prompts: agents must run real experiments, not fabricate outputs |
+| **human-intervention** | Escalation protocol: agents pause and ask via Telegram before irreversible actions |
+| **env-isolation** | Enforces per-project environment boundaries |
+| **figure-integrity** | Validates figure content matches data; prevents placeholder or hallucinated plots |
+| **page-adjustment** | Maintains page limits by adjusting content density, not deleting sections |
+
+Skills live in `skills/builtin/` and are auto-installed during pipeline bootstrap.
 
 ---
 
@@ -149,7 +183,7 @@ ARK parses the PDF with PyMuPDF + Claude Haiku, pre-fills the wizard, and kicks
 | Command | Description |
 |:--------|:------------|
 | `ark new <name>` | Create project via interactive wizard |
-| `ark run <name>` | Launch the autonomous pipeline |
+| `ark run <name>` | Launch the pipeline (auto-detects per-project conda env) |
 | `ark status [name]` | Score, iteration, phase, cost |
 | `ark monitor <name>` | Live dashboard: agent activity, score trend |
 | `ark update <name>` | Inject a mid-run instruction |
@@ -167,7 +201,7 @@ ARK parses the PDF with PyMuPDF + Claude Haiku, pre-fills the wizard, and kicks
 
 ## Web Portal
 
-ARK includes a web-based portal for managing projects, viewing scores, and steering agents.
+ARK includes a web-based portal for managing projects, viewing scores, and steering agents. The portal shows **live phase badges** (Research / Dev / Review), per-project conda env status, and real-time cost tracking.
 
 ### Configuration
 
@@ -220,10 +254,11 @@ ark setup-bot    # one-time: paste BotFather token, auto-detect chat ID
 ```
 
 What you get:
-- **Live notifications** &mdash; score changes, phase transitions, errors
-- **Send instructions** &mdash; steer the current iteration
+- **Rich notifications** &mdash; formatted score changes, phase transitions, agent activity, and errors
+- **Send instructions** &mdash; steer the current iteration in real time
 - **Request PDFs** &mdash; latest compiled paper sent to chat
-- **Proactive confirmations** &mdash; ARK asks before key decisions
+- **Human intervention** &mdash; agents escalate decisions to you before irreversible actions
+- **HPC-friendly** &mdash; handles self-signed SSL certificates on enterprise/HPC networks
 
 ---
 
diff --git a/README_ar.md b/README_ar.md
@@ -18,7 +18,7 @@
   <a href="https://github.com/kaust-ark/ARK/actions/workflows/ci.yml"><img src="https://github.com/kaust-ark/ARK/actions/workflows/ci.yml/badge.svg" alt="CI"></a>
   <img src="https://img.shields.io/badge/agents-8-orange.svg" alt="8 وكلاء">
   <img src="https://img.shields.io/badge/venues-11+-purple.svg" alt="+11 مؤتمر">
-  <img src="https://img.shields.io/badge/tests-106-brightgreen.svg" alt="106 اختبار">
+  <img src="https://img.shields.io/badge/tests-115-brightgreen.svg" alt="115 اختبار">
 </p>
 
 <p align="center">
@@ -59,8 +59,8 @@
 
 | المرحلة | ما يحدث |
 |:--------|:--------|
-| **Research** | Gemini Deep Research يجمع المعرفة الخلفية والمسح الأدبي |
-| **Dev** | دورة تجارب تكرارية: تخطيط &rarr; تشغيل Slurm &rarr; تحليل &rarr; كتابة مسودة أولية |
+| **Research** | خط أنابيب من ٤ خطوات: Deep Research &rarr; المُهيّئ (تمهيد البيئة والاستشهادات) &rarr; المخطط &rarr; المُجرِّب |
+| **Dev** | دورة تجا��ب تكرارية: تخطيط &rarr; تشغيل Slurm &rarr; تحل��ل &rarr; كتابة مسودة أولية |
 | **Review** | تجميع &rarr; مراجعة &rarr; تخطيط &rarr; تنفيذ &rarr; تحقّق، تكرار حتى الدرجة &ge; العتبة |
 
 ### دورة المراجعة
@@ -110,6 +110,40 @@
 | **التنسيق** | تخطيطات مضطربة، أخطاء LaTeX، إصلاح يدوي | LaTeX مُبرمَج + قوالب مؤتمرات (NeurIPS، ACL، IEEE&hellip;) |
 | **الاستشهادات** | LLM يختلق مراجع تبدو معقولة لكنها غير موجودة | كل اقتباس يُتحقَّق عبر DBLP — لا مراجع وهمية |
 | **الأشكال** | أنماط افتراضية، أحجام خاطئة، لا مراعاة لقيود الصفحة | Nano Banana + أبعاد لوحة وعرض أعمدة وخطوط دقيقة |
+| **العزل** | بيئة مشتركة — المشاريع تتداخل مع بعضها | بيئة conda لكل مشروع، HOME معزول، عزل كامل متعدد المستأجرين |
+| **النزاهة** | LLM يحاكي النتائج بدلاً من تشغيل تجارب حقيقية | موجّهات مضادة للمحاكاة + مهارات مدمجة تفرض التنفيذ الحقيقي |
+
+---
+
+## عزل البيئات
+
+يعمل كل مشروع في **بيئة conda مستقلة لكل مشروع**، مُستنسخة من بيئة أساسية عند الإنشاء. هذا يضمن عزلاً كاملاً متعدد المستأجرين:
+
+- **Python معزول** &mdash; مجلد `.env/` لكل مشروع بحزمه الخاصة
+- **HOME معزول** &mdash; كل مُنسق يعمل بمجلد المشروع كـ HOME
+- **لا تلوث متبادل** &mdash; `PYTHONNOUSERSITE=1` يمنع تسرب حزم المستخدم
+- **توفير تلقائي** &mdash; `ark run` وبوابة الويب يكتشفان ويستخدمان بيئة conda للمشروع؛ خط الأنابيب يُنشئها تلقائياً عند الحاجة
+
+```bash
+# بيئة conda تُنشأ تلقائياً عند أول تشغيل
+# ark run يكتشفها ويستخدمها:
+ark run myproject
+#   Conda env: /path/to/projects/myproject/.env
+```
+
+## نظام المهارات
+
+يأتي ARK مع **مهارات مدمجة** &mdash; مجموعات تعليمات نمطية يحمّلها الوكلاء أثناء التشغيل لفرض أفضل الممارسات:
+
+| المهارة | الغرض |
+|:--------|:-------|
+| **research-integrity** | موجّهات مضادة للمحاكاة: الوكلاء يجب أن يُجروا تجارب حقيقية لا أن يختلقوا نتائج |
+| **human-intervention** | بروتوكول التصعيد: الوكلاء يتوقفون ويسألون عبر Telegram قبل الإجراءات غير القابلة للعكس |
+| **env-isolation** | فرض حدود البيئة لكل مشروع |
+| **figure-integrity** | التحقق من تطابق محتوى الأشكال مع البيانات؛ منع الرسوم البيانية الوهمية |
+| **page-adjustment** | الحفاظ على حدود الصفحات بتعديل كثافة المحتوى لا بحذف الأقسام |
+
+المهارات موجودة في `skills/builtin/` وتُثبّت تلقائياً أثناء مرحلة التمهيد.
 
 ---
 
@@ -149,7 +183,7 @@ ark new mma --from-pdf proposal.pdf
 | الأمر | الوظيفة |
 |:------|:--------|
 | `ark new <name>` | إنشاء مشروع عبر معالج تفاعلي |
-| `ark run <name>` | بدء الـ pipeline المستقل |
+| `ark run <name>` | بدء الـ pipeline (يكتشف تلقائياً بيئة conda للمشروع) |
 | `ark status [name]` | التقييم، التكرار، المرحلة، التكلفة |
 | `ark monitor <name>` | لوحة مراقبة مباشرة: نشاط الوكلاء، اتجاه التقييم |
 | `ark update <name>` | إدخال تعليمات أثناء التشغيل |
@@ -167,7 +201,7 @@ ark new mma --from-pdf proposal.pdf
 
 ## بوابة الويب
 
-يتضمن ARK بوابة ويب لإدارة المشاريع وعرض الدرجات وتوجيه الوكلاء.
+يتضمن ARK بوابة ويب لإدارة المشاريع وعرض الدرجات وتوجيه الوكلاء. تعرض البوابة **شارات مراحل مباشرة** (Research / Dev / Review)، حالة بيئة conda لكل مشروع، وتتبع التكلفة في الوقت الحقيقي.
 
 ### الإعدادات
 
@@ -220,10 +254,11 @@ ark setup-bot    # مرة واحدة: الصق رمز BotFather، كشف تلق
 ```
 
 ما تحصل عليه:
-- **إشعارات مباشرة** &mdash; تغيرات التقييم، انتقالات المراحل، الأخطاء
-- **إرسال تعليمات** &mdash; توجيه التكرار الحالي
+- **إشعارات غنية** &mdash; تغيرات التقييم المنسقة، انتقالات المراحل، نشاط الوكلاء، والأخطاء
+- **إرسال تعليمات** &mdash; توجيه التكرار الحالي في الوقت الحقيقي
 - **طلب PDF** &mdash; أحدث ورقة مترجمة تُرسل للمحادثة
-- **تأكيدات استباقية** &mdash; يسأل ARK قبل القرارات المهمة
+- **تدخل بشري** &mdash; الوكلاء يصعّدون القرارات إليك قبل الإجراءات غير القابلة للعكس
+- **متوافق مع HPC** &mdash; يدعم شهادات SSL الموقّعة ذاتياً على شبكات المؤسسات/HPC
 
 ---
 
diff --git a/README_zh.md b/README_zh.md
@@ -18,7 +18,7 @@
   <a href="https://github.com/kaust-ark/ARK/actions/workflows/ci.yml"><img src="https://github.com/kaust-ark/ARK/actions/workflows/ci.yml/badge.svg" alt="CI"></a>
   <img src="https://img.shields.io/badge/agents-8-orange.svg" alt="8 Agents">
   <img src="https://img.shields.io/badge/venues-11+-purple.svg" alt="11+ Venues">
-  <img src="https://img.shields.io/badge/tests-106-brightgreen.svg" alt="106 Tests">
+  <img src="https://img.shields.io/badge/tests-115-brightgreen.svg" alt="115 Tests">
 </p>
 
 <p align="center">
@@ -59,7 +59,7 @@ ARK 按三个阶段依次执行。Review 阶段循环迭代直到论文达到目
 
 | 阶段 | 执行内容 |
 |:------|:---------|
-| **Research** | Gemini Deep Research 文献调研与背景知识收集 |
+| **Research** | 4 步流水线：Deep Research &rarr; 初始化器（环境引导 &amp; 引用准备）&rarr; 规划器 &rarr; 实验者 |
 | **Dev** | 迭代实验循环：规划 &rarr; Slurm 运行 &rarr; 分析 &rarr; 撰写初稿 |
 | **Review** | 编译 &rarr; 审稿 &rarr; 规划 &rarr; 执行 &rarr; 验证，循环直到分数 &ge; 阈值 |
 
@@ -110,6 +110,40 @@ ARK 按三个阶段依次执行。Review 阶段循环迭代直到论文达到目
 | **排版** | 布局混乱、LaTeX 报错、大量人工修复 | 硬编码 LaTeX + 会议模板（NeurIPS、ACL、IEEE……） |
 | **引用** | LLM 编造看似合理但不存在的引用 | 每条引用经 DBLP API 验证，杜绝虚假文献 |
 | **图表** | 默认样式、尺寸失控、无视页面约束 | Nano Banana + 会议画布尺寸、栏宽、字号精确匹配 |
+| **隔离** | 共享环境，项目之间互相干扰 | 每项目独立 conda 环境、沙盒 HOME、完全多租户隔离 |
+| **完整性** | LLM 模拟结果而非运行真实实验 | 反模拟提示 + 内置 Skills 强制真实执行 |
+
+---
+
+## 环境隔离
+
+每个项目运行在独立的 **项目级 conda 环境** 中，在创建时从基础环境克隆。这确保了完全的多租户隔离：
+
+- **沙盒 Python** &mdash; 每项目 `.env/` 目录，拥有独立的包
+- **隔离 HOME** &mdash; 每个 orchestrator 以项目目录作为 HOME 运行
+- **无交叉污染** &mdash; `PYTHONNOUSERSITE=1` 防止用户级包泄露
+- **自动配置** &mdash; `ark run` 和 Web 门户自动检测并使用项目 conda 环境；流水线在缺失时自动引导创建
+
+```bash
+# conda 环境在首次运行时自动创建
+# ark run 会检测并使用它：
+ark run myproject
+#   Conda env: /path/to/projects/myproject/.env
+```
+
+## Skills 系统
+
+ARK 内置 **builtin skills** &mdash; 模块化指令集，智能体在运行时加载以强制执行最佳实践：
+
+| Skill | 用途 |
+|:------|:-----|
+| **research-integrity** | 反模拟提示：智能体必须运行真实实验，不得伪造输出 |
+| **human-intervention** | 升级协议：智能体在执行不可逆操作前通过 Telegram 暂停并请求确认 |
+| **env-isolation** | 强制项目级环境边界 |
+| **figure-integrity** | 验证图表内容与数据一致；防止占位或虚构的图表 |
+| **page-adjustment** | 通过调整内容密度维持页数限制，而非删除章节 |
+
+Skills 位于 `skills/builtin/`，在流水线引导阶段自动安装。
 
 ---
 
@@ -149,7 +183,7 @@ ARK 使用 PyMuPDF + Claude Haiku 解析 PDF，预填向导，从提取的规格
 | 命令 | 功能 |
 |:-----|:-----|
 | `ark new <name>` | 通过交互式向导创建项目 |
-| `ark run <name>` | 启动自主 pipeline |
+| `ark run <name>` | 启动 pipeline（自动检测项目级 conda 环境） |
 | `ark status [name]` | 分数、迭代、阶段、成本 |
 | `ark monitor <name>` | 实时仪表板：智能体活动、分数趋势 |
 | `ark update <name>` | 注入运行中指令 |
@@ -167,7 +201,7 @@ ARK 使用 PyMuPDF + Claude Haiku 解析 PDF，预填向导，从提取的规格
 
 ## Web 门户
 
-ARK 提供基于 Web 的门户，用于管理项目、查看评分和控制智能体。
+ARK 提供基于 Web 的门户，用于管理项目、查看评分和控制智能体。门户展示 **实时阶段徽章**（Research / Dev / Review）、项目级 conda 环境状态和实时成本追踪。
 
 ### 配置
 
@@ -220,10 +254,11 @@ ark setup-bot    # 一次性配置：粘贴 BotFather token，自动检测 chat
 ```
 
 功能：
-- **实时通知** &mdash; 分数变化、阶段转换、错误报告
-- **发送指令** &mdash; 引导当前迭代方向
+- **富��本通知** &mdash; 格式化的分数变化、阶段转换、智能体活动和��误报告
+- **发送指令** &mdash; 实时引导当前迭代方向
 - **请求 PDF** &mdash; 获取最新编译论文
-- **主动确认** &mdash; 关键决策前主动询问
+- **人工干预** &mdash; 智能体在执行不可逆操作前向你请求确认
+- **HPC 友好** &mdash; 支持企业/HPC 网络的自签名 SSL ��书
 
 ---
 
diff --git a/ark/cli.py b/ark/cli.py
@@ -1890,9 +1890,28 @@ def cmd_run(args):
         except Exception:
             pass
 
-    # Launch orchestrator in background
-    cmd = [
-        sys.executable, "-m", "ark.orchestrator",
+    # Launch orchestrator in background, preferring per-project conda env
+    try:
+        from ark.webapp.jobs import (
+            find_conda_binary, project_env_ready, project_env_prefix,
+        )
+        conda_bin = find_conda_binary()
+        if conda_bin and project_env_ready(project_dir):
+            python_prefix = [conda_bin, "run", "--no-capture-output",
+                             "--prefix", str(project_env_prefix(project_dir)),
+                             "python"]
+            print(f"  Conda env:      {project_env_prefix(project_dir)}")
+        elif conda_bin and config.get("conda_env"):
+            python_prefix = [conda_bin, "run", "--no-capture-output",
+                             "-n", config["conda_env"], "python"]
+            print(f"  Conda env:      {config['conda_env']}")
+        else:
+            python_prefix = [sys.executable]
+    except ImportError:
+        python_prefix = [sys.executable]
+
+    cmd = python_prefix + [
+        "-m", "ark.orchestrator",
         "--project", name,
         "--mode", mode,
         "--model", model,
@@ -1906,6 +1925,9 @@ def cmd_run(args):
 
     # Strip CLAUDECODE so orchestrator can call claude CLI freely
     env = {k: v for k, v in os.environ.items() if k != "CLAUDECODE"}
+    # Ensure orchestrator can find the ark package in a project-local conda env
+    ark_root = str(Path(__file__).resolve().parents[1].parent)
+    env["PYTHONPATH"] = ark_root + ((":" + env["PYTHONPATH"]) if env.get("PYTHONPATH") else "")
 
     with open(log_file, "w") as lf:
         process = subprocess.Popen(