docs: update CLAUDE.md to reflect current state of the project

S1M0N38 · S1M0N38 · commit 9bb83f835478 · 2026-01-06T16:14:07.000+01:00
diff --git a/CLAUDE.md b/CLAUDE.md
@@ -2,54 +2,171 @@
 
 This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
 
+**GitHub Repository**: [`s1m0n38/balatrollm`](https://github.com/s1m0n38/balatrollm)
+
 ## Overview
 
-BalatroLLM is an LLM-powered bot that plays Balatro using the BalatroBot JSON-RPC API. It communicates with the game via HTTP/JSON-RPC 2.0 and uses OpenAI-compatible APIs (OpenAI, OpenRouter, etc.) for strategic decision making.
+BalatroLLM is an LLM-powered bot that autonomously plays Balatro using strategic decision-making. It uses the BalatroBot API (a Lua mod exposing JSON-RPC 2.0 endpoints) to control the game while leveraging large language models for gameplay decisions.
+
+**Key capabilities:**
+- Autonomous gameplay from start to completion
+- LLM-driven strategic decision-making using OpenAI function calling
+- Configurable strategies via Jinja2 templates
+- Parallel execution of multiple game instances
+- Comprehensive data collection for analysis and benchmarking
+
+### Make Commands
+
+Available make targets:
+
+| Target           | Description                                             |
+| ---------------- | ------------------------------------------------------- |
+| `make help`      | Show all available targets                              |
+| `make lint`      | Run ruff linter (check only)                            |
+| `make format`    | Run ruff and mdformat formatters                        |
+| `make typecheck` | Run type checker                                        |
+| `make quality`   | Run all code quality checks (lint + typecheck + format) |
+| `make all`       | Run all quality checks                                  |
+| `make install`   | Install dependencies                                    |
+
+**Important rules:**
+
+1. **Only run make commands when explicitly asked.** Do not proactively run `make quality`, `make lint`, etc.
+2. **Never run bare linting/formatting/typechecking tools.** Always use make targets instead:
+   - Use `make lint` instead of `ruff check`
+   - Use `make format` instead of `ruff format`
+   - Use `make typecheck` instead of `ty check`
+   - Use `make quality` for all checks combined
 
 ## Architecture
 
-### Core Components
+### Python Components (`src/balatrollm/`)
+
+The bot is structured as a pipeline: CLI → Executor → Bot → (LLMClient + BalatroClient) → Game.
+
+#### CLI (`cli.py`)
+
+Entry point for the `balatrollm` command. Parses CLI arguments and YAML config files, generates task combinations from game parameters (seed, deck, stake, strategy, model). Config precedence: environment variables < YAML < CLI args.
+
+#### Bot (`bot.py`)
+
+Core game loop with LLM-powered decision-making. Handles game states (SELECTING_HAND, SHOP, ROUND_EVAL, BLIND_SELECT, GAME_OVER). Each decision: sends game state + strategy + memory to LLM → receives tool call → executes via BalatroClient. Tracks consecutive failures (max 3) and logs all data via Collector.
+
+#### Client (`client.py`)
+
+JSON-RPC 2.0 HTTP client for BalatroBot mod. Async client to `http://127.0.0.1:12346` (default 30s timeout). Methods: `gamestate`, `start`, `select`, `cash_out`, `play`, `buy`, `sell`, `use`, `screenshot`, etc.
+
+#### LLMClient (`llm.py`)
+
+Async OpenAI client wrapper with retry logic. Provider-agnostic (works with OpenAI, OpenRouter, any compatible API). Default: OpenRouter. Exponential backoff (max 3 retries), aborts after 3 consecutive timeouts. Default timeout: 240s.
+
+#### StrategyManager (`strategy.py`)
+
+Manages Jinja2 strategy templates. Each strategy folder contains: `STRATEGY.md.jinja` (game rules/tactics), `GAMESTATE.md.jinja` (current state), `MEMORY.md.jinja` (action history), `TOOLS.json` (function calling tools), `manifest.json` (metadata). Current: `default` (conservative, financially-focused).
 
-1. **Client** (`src/balatrollm/client.py`):
+#### Config (`config.py`)
 
-    - JSON-RPC 2.0 over HTTP client using `httpx`.
-    - Communicates with BalatroBot (Lua) on port 12346 request/response.
-    - Handles `BalatroError` exceptions.
+Multi-source configuration. Precedence: environment variables < YAML < CLI args. Key env vars: `BALATROLLM_API_KEY`, `BALATROLLM_BASE_URL`, `BALATROLLM_MODEL`, `BALATROLLM_SEED/DECK/STAKE`, `BALATROLLM_STRATEGY`, `BALATROLLM_PARALLEL`, `BALATROLLM_HOST/PORT`.
 
-2. **Bot** (`src/balatrollm/bot.py`):
+#### Executor (`executor.py`)
 
-    - Main game loop implementation.
-    - Flow: Fetch state -> Render Strategy -> Query LLM -> Execute Action.
-    - Manages game lifecycle (start, play, game over).
+Orchestrates parallel execution. Manages pool of Balatro instances via `balatrobot` package, assigns consecutive ports (12346, 12347, ...), uses task queue with worker pool pattern.
 
-3. **Strategies** (`src/balatrollm/strategies/`):
+#### Collector (`collector.py`)
 
-    - Template-based prompt generation system.
-    - `GAMESTATE.md.jinja`: Renders current game state for LLM.
-    - `STRATEGY.md.jinja`: High-level strategic guidance.
-    - `TOOLS.json`: JSON-RPC tool definitions exposed to the LLM.
+Data collection and statistics. Output: `runs/v1.0.0/{strategy}/{vendor}/{model}/{timestamp}_{deck}_{stake}_{seed}/`. JSONL format, OpenAI Batch API compatible.
 
-### JSON-RPC Protocol
+### Strategy System
 
-- **Transport**: HTTP POST to `http://127.0.0.1:12346`
-- **Format**: `{"jsonrpc": "2.0", "method": "...", "params": {...}, "id": 1}`
-- **Key Endpoints**:
-    - `start`: Start a new run (params: `deck`, `stake`, `seed`).
-    - `gamestate`: specific get state call.
-    - `play` / `discard`: Action on selected cards.
-    - `buy` / `sell` / `use`: Shop and consumable actions.
-    - `select` / `skip`: Blind interactions.
+Template-based system using Jinja2. Templates render strategy guidance, game state, and action history into LLM prompts. At each decision point: render templates → send to LLM with tools → execute tool call. Custom strategies: create folder in `src/balatrollm/strategies/` with required template files.
 
-## Key Changes in V1
+### Relationship with BalatroBot
 
-- **State Enum**: Game states are now string enums (e.g., `SELECTING_HAND`, `SHOP`, `GAME_OVER`) instead of integers.
-- **Card Structure**: 0-based indexing. Structure is flattened with short codes (e.g., Suit "H", Rank "A", Edition "FOIL").
-- **Split Actions**: Complex actions are split (e.g., `shop` -> `buy`, `reroll`, `next_round`).
+**BalatroBot** is a Lua mod that runs inside the Balatro game and exposes a JSON-RPC 2.0 HTTP API for programmatic control.
+
+**How BalatroLLM uses BalatroBot:**
+
+```
+┌─────────────────────────────────────────┐
+│          BalatroLLM (Python)            │
+│                                         │
+│  ┌────────┐      ┌──────────────────┐   │
+│  │  Bot   │──────│   LLMClient      │   │◄──── OpenAI API
+│  └───┬────┘      └──────────────────┘   │      (OpenRouter, etc.)
+│      │                                  │
+│  ┌───▼────────────────────────────┐     │
+│  │      BalatroClient             │     │
+│  │  (JSON-RPC 2.0 HTTP Client)    │     │
+│  └───┬────────────────────────────┘     │
+└──────┼──────────────────────────────────┘
+       │ HTTP POST
+       │ JSON-RPC 2.0
+┌──────▼──────────────────────────────────┐
+│         BalatroBot (Lua Mod)            │
+│    JSON-RPC 2.0 HTTP API Server         │
+└──────┬──────────────────────────────────┘
+       │
+┌──────▼──────────────────────────────────┐
+│          Balatro Game                   │
+│           (Love2D)                      │
+└─────────────────────────────────────────┘
+```
+
+**Integration**: BalatroLLM imports `BalatroInstance` from `balatrobot` to manage game instances. `BalatroClient` sends JSON-RPC requests to BalatroBot's HTTP server (default port 12346).
+
+## Data Collection & Output Structure
+
+Output directory: `runs/v1.0.0/{strategy}/{vendor}/{model}/{timestamp}_{deck}_{stake}_{seed}/`
+
+**Files generated:**
+- `requests.jsonl` - LLM requests (OpenAI Batch API format)
+- `responses.jsonl` - LLM responses with timing/token data
+- `gamestates.jsonl` - Game state snapshots after each action
+- `screenshots/` - PNG screenshots at each decision point
+- `task.json` - Run configuration
+- `strategy.json` - Strategy metadata
+- `stats.json` - Aggregated statistics (outcome, tokens, timing, cost, provider distribution)
 
 ## Key Files
 
-- `src/balatrollm/client.py`: JSON-RPC client implementation.
-- `src/balatrollm/bot.py`: Main bot logic.
-- `src/balatrollm/data_collection.py`: Run logging and stats.
-- `balatro.sh`: Helper script to manage Balatro game instances.
-- `runs/`: Directory where game execution logs and artifacts are stored.
+### Python
+
+| File | Purpose |
+|------|---------|
+| `src/balatrollm/cli.py` | CLI entry point and argument parsing |
+| `src/balatrollm/bot.py` | Core game loop and LLM integration |
+| `src/balatrollm/client.py` | BalatroBot JSON-RPC client |
+| `src/balatrollm/llm.py` | OpenAI client wrapper with retry logic |
+| `src/balatrollm/strategy.py` | Strategy template management |
+| `src/balatrollm/config.py` | Multi-source configuration management |
+| `src/balatrollm/executor.py` | Parallel execution orchestration |
+| `src/balatrollm/collector.py` | Data collection and statistics |
+
+### Configuration
+
+| File | Purpose |
+|------|---------|
+| `pyproject.toml` | Python dependencies and tool configuration |
+| `config/example.yaml` | Example YAML configuration file |
+| `Makefile` | Development commands (lint, format, typecheck) |
+
+### Strategies
+
+| File | Purpose |
+|------|---------|
+| `src/balatrollm/strategies/default/` | Default strategy templates |
+
+## Error Handling
+
+Tracks consecutive failures (max 3 before aborting). Two failure types: **error calls** (invalid LLM response) and **failed calls** (valid tool call but execution failed). LLM retry with exponential backoff (max 3 retries), aborts after 3 consecutive timeouts. Error context included in next prompt's memory for recovery.
+
+## Configuration
+
+Config precedence: environment variables < YAML config file < CLI arguments (highest priority).
+
+**Environment variables** (prefix: `BALATROLLM_`):
+- `API_KEY`, `BASE_URL`, `MODEL` - LLM configuration
+- `SEED`, `DECK`, `STAKE`, `STRATEGY` - Game parameters
+- `PARALLEL`, `HOST`, `PORT` - Execution settings
+
+**Usage**: `balatrollm config/example.yaml --model openai/gpt-4o --seed AAAAAAA --deck RED --stake WHITE`