AutomateThePlanet
diff --git a/‎CLAUDE.md‎
Lines changed: 1 addition & 0 deletions b/‎CLAUDE.md‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎PROJECT-KNOWLEDGE.md‎
Lines changed: 1 addition & 0 deletions b/‎PROJECT-KNOWLEDGE.md‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎docs/cli-reference.md‎
Lines changed: 2 additions & 0 deletions b/‎docs/cli-reference.md‎
Lines changed: 2 additions & 0 deletions
diff --git a/‎docs/skills-integration.md‎
Lines changed: 14 additions & 0 deletions b/‎docs/skills-integration.md‎
Lines changed: 14 additions & 0 deletions
diff --git a/‎specs/033-from-description-chat-flow/checklists/requirements.md‎
Lines changed: 35 additions & 0 deletions b/‎specs/033-from-description-chat-flow/checklists/requirements.md‎
Lines changed: 35 additions & 0 deletions
diff --git a/‎specs/033-from-description-chat-flow/plan.md‎
Lines changed: 165 additions & 0 deletions b/‎specs/033-from-description-chat-flow/plan.md‎
Lines changed: 165 additions & 0 deletions
@@ -209,6 +209,7 @@ spectra config list-automation-dirs                 # List dirs with existence s
 - **Tests:** xUnit with structured results (never throw on validation errors)
 
 ## Recent Changes
+- 033-from-description-chat-flow: ✅ COMPLETE - From-description chat flow & doc-aware manual tests. Updated `spectra-generate` SKILL with dedicated "When the user wants to create a specific test case" section (numbered 5-step sequence) and intent-routing table mapping topic-vs-scenario signals to `--focus`, `--from-description`, or `--from-suggestions`. Updated `spectra-generation` agent prompt with new "Test Creation Intent Routing" section (Intent 1: explore area → `--focus`; Intent 2: specific test → `--from-description`; Intent 3: from suggestions → `--from-suggestions`) and explicit "do NOT ask about count or scope" rule. Enhanced `UserDescribedGenerator` with new public static `BuildPrompt()` method (testable prompt construction) and optional `documentContext` / `criteriaContext` / `sourceRefPaths` parameters on `GenerateAsync()`. `GenerateHandler.ExecuteFromDescriptionAsync` now best-effort loads matching documentation (capped at 3 docs × 8000 chars via `SourceDocumentLoader`) and acceptance criteria (via existing `LoadCriteriaContextAsync`) before calling the generator — failures are swallowed (best-effort, non-blocking). Resulting tests get populated `source_refs` (from loaded doc paths) and `criteria` (from AI-matched IDs) when context is available; `grounding.verdict` remains `manual` regardless. New `FilterDocsForSuite` and `FormatDocContext` private helpers in `GenerateHandler`. New tests: `UserDescribedGeneratorTests` (9 prompt-builder tests) and `GenerateSkillContentTests` (10 SKILL/agent content tests). `GenerationAgent_LineCount` limit raised 100→140 to fit the new routing section. 19 new tests. 1453 total tests passing.
 - 032-quickstart-skill-usage-guide: ✅ COMPLETE - Quickstart SKILL & USAGE.md offline guide. New `spectra-quickstart` SKILL (12th bundled SKILL) — workflow-oriented onboarding that responds to "help me get started", "tutorial", "walk me through" with 12 workflow walkthroughs and example conversations. Teaching-only (no CLI execution); delegates actual workflow execution to the corresponding workflow SKILLs. New `USAGE.md` bundled doc written to project root by `spectra init` (offline mirror of the quickstart SKILL, free of in-chat tool references). Both artifacts hash-tracked by the existing `update-skills` system. New `ProfileFormatLoader.LoadEmbeddedUsageGuide()` method. New `InitHandler.CreateUsageGuideAsync` (gated by `--skip-skills`). Generation and execution agent prompts gain a `**QUICKSTART**` delegation line directing onboarding intents to the new SKILL. Updated SKILL count test (11→12). 7 new tests (quickstart SKILL content, USAGE.md content + offline-clean assertions, init creates both files, --skip-skills skips both files, both agents reference quickstart). 1434 total tests passing.
 - 030-prompt-templates: ✅ COMPLETE - Customizable root prompt templates. Introduced `.spectra/prompts/` directory with 5 markdown templates (behavior-analysis, test-generation, criteria-extraction, critic-verification, test-update) controlling all AI operations. Templates use `{{placeholder}}`, `{{#if}}`, `{{#each}}` syntax with built-in defaults as embedded resources. New `PlaceholderResolver`, `PromptTemplateParser`, `PromptTemplateLoader`, `BuiltInTemplates` in `Spectra.CLI/Prompts/`. Replaced hardcoded prompts in `BehaviorAnalyzer`, `CopilotGenerationAgent`, `CriteriaExtractor`, `CriticPromptBuilder` with template-driven approach (legacy fallback preserved). New `analysis.categories` config section with 6 default categories (happy_path, negative, edge_case, boundary, error_handling, security). New `spectra prompts list/show/reset/validate` CLI commands with JSON output. New `spectra-prompts` SKILL (11th bundled SKILL). Init creates `.spectra/prompts/` with defaults. `update-skills` tracks template hashes for safe updates. 65+ new tests. 1417 total tests passing.
 - 029-spectra-update-skill: ✅ COMPLETE - Added spectra-update SKILL (10th bundled SKILL) for test update workflow via Copilot Chat. SKILL wraps `spectra ai update` with progress page, result file, classification breakdown (UP_TO_DATE, OUTDATED, ORPHANED, REDUNDANT). Agent delegation tables updated (both generation and execution agents delegate update requests to SKILL). Extended `UpdateResult` with `success`, `totalTests`, `testsFlagged`, `flaggedTests`, `duration` fields. Generation agent inline update section replaced with delegation row. 6 new tests (SKILL content, step format, do-NOTHING instruction, tools list, agent delegation). Documentation updated (SKILL count 9→10). Version 1.35.0.
 
@@ -310,6 +310,7 @@ Three-section unified coverage with distinct semantics:
 
 | # | Feature | Key Changes |
 |---|---------|-------------|
+| 033 | From-Description Chat Flow | Dedicated `--from-description` SKILL section, agent intent routing (focus vs from-description vs from-suggestions), doc-aware manual tests with populated `source_refs` and `criteria` (verdict stays manual) |
 | 029 | spectra-update SKILL (10th) | Agent delegation, documentation sync, version 1.35.0 |
 | 028 | Coverage & Criteria Pipeline | Fixed criteria propagation in parser, wired criteria into generation pipeline, always write criteria: [] |
 | 027 | SKILL/Agent Deduplication | Agents delegate to SKILLs, execution ~120 lines, generation ~81 lines, SKILL consistency fixes |
 
@@ -179,6 +179,8 @@ Session state is stored in `.spectra/session.json` and expires after 1 hour.
 
 User-described tests are marked with `grounding.verdict: manual` and `source: user-described`.
 
+When a project has documentation in `docs/` and acceptance criteria in `docs/criteria/`, `--from-description` runs in **doc-aware mode**: it best-effort loads matching docs (capped at 3 docs × 8000 chars) and matching `.criteria.yaml` entries as formatting context, then populates the new test's `source_refs` (with the doc paths used) and `criteria` fields (with any IDs the AI matches to your description). The grounding verdict stays `manual` — doc context is used for terminology and navigation alignment only, never for verification. If no docs or criteria exist, the flow is identical to the no-context behavior.
+
 Duplicate detection warns when a new test has >80% title similarity to an existing test.
 
 **Exit codes:** `0` = success, `1` = error, `3` = missing required args with `--no-interaction`.
 
@@ -78,6 +78,20 @@ spectra ai generate --suite {suite} --from-description "{text}" --context "{ctx}
 spectra ai generate --suite {suite} --auto-complete --output-format json --verbosity quiet
 ```
 
+### Intent Routing in Chat (spec 033)
+
+The `spectra-generate` SKILL contains a dedicated section for `--from-description` and an intent-routing table that the `spectra-generation` agent uses to choose between flows:
+
+| User intent | Signal | Flow |
+|-------------|--------|------|
+| Explore a feature area | "Generate tests for...", "Cover... module" | Main analyze → generate flow with `--focus` |
+| Create a specific test | "Add a test for...", "I need a test that verifies..." | `--from-description` (1 test, no analysis, no count question) |
+| Generate from suggestions | "Use the previous suggestions" | `--from-suggestions` |
+
+**Key rule**: if you can read the user's request as a single test case title, the agent routes to `--from-description`. If it's a topic to explore, the agent routes to `--focus`. The agent never asks the user for count or scope to disambiguate — the topic-vs-scenario shape is the only signal.
+
+When `--from-description` runs in a project that has documentation and acceptance criteria, the CLI best-effort loads matching docs (capped at 3 docs × 8000 chars) and matching `.criteria.yaml` entries as formatting context. The resulting test case has populated `source_refs` and `criteria` fields, but `grounding.verdict` stays `manual` — doc context is used for terminology alignment only, never for verification.
+
 ### Non-Interactive Mode
 
 For CI pipelines and automated workflows:
 
@@ -0,0 +1,35 @@
+# Specification Quality Checklist: From-Description Chat Flow & Doc-Aware Manual Tests
+
+**Purpose**: Validate specification completeness and quality before proceeding to planning
+**Created**: 2026-04-10
+**Feature**: [spec.md](../spec.md)
+
+## Content Quality
+
+- [x] No implementation details (languages, frameworks, APIs)
+- [x] Focused on user value and business needs
+- [x] Written for non-technical stakeholders
+- [x] All mandatory sections completed
+
+## Requirement Completeness
+
+- [x] No [NEEDS CLARIFICATION] markers remain
+- [x] Requirements are testable and unambiguous
+- [x] Success criteria are measurable
+- [x] Success criteria are technology-agnostic (no implementation details)
+- [x] All acceptance scenarios are defined
+- [x] Edge cases are identified
+- [x] Scope is clearly bounded
+- [x] Dependencies and assumptions identified
+
+## Feature Readiness
+
+- [x] All functional requirements have clear acceptance criteria
+- [x] User scenarios cover primary flows
+- [x] Feature meets measurable outcomes defined in Success Criteria
+- [x] No implementation details leak into specification
+
+## Notes
+
+- Spec references implementation file names (`UserDescribedGenerator`, `GenerateHandler`, `LoadCriteriaContextAsync`) in the Assumptions section. These are intentional anchor references for the implementer; the user-facing requirements (FR-001..FR-017) and success criteria are technology-agnostic.
+- All items pass on first iteration. Spec is ready for `/speckit.plan`.
@@ -0,0 +1,165 @@
+# Implementation Plan: From-Description Chat Flow & Doc-Aware Manual Tests
+
+**Branch**: `033-from-description-chat-flow` | **Date**: 2026-04-10 | **Spec**: [spec.md](./spec.md)
+
+## Summary
+
+Three-part feature: (1) update `spectra-generate` SKILL with a dedicated single-test "from-description" flow and an intent routing table, (2) update the `spectra-generation` agent prompt with explicit intent-classification rules, (3) enhance `UserDescribedGenerator` to load relevant docs and acceptance criteria as best-effort formatting context — populating `source_refs` and `criteria` on the resulting test while keeping `grounding.verdict: manual`.
+
+## Technical Context
+
+**Language/Version**: C# 12, .NET 8
+**Primary Dependencies**: Spectra.CLI (existing), Spectra.Core (TestCase, GroundingMetadata, AcceptanceCriterion)
+**Storage**: File-based — embedded SKILL/agent `.md` resources in `Spectra.CLI`; SHA-256 hashes computed at install time
+**Testing**: xUnit (`Spectra.CLI.Tests`)
+**Target Platform**: Cross-platform .NET CLI
+**Project Type**: CLI (single project)
+**Constraints**: Best-effort doc/criteria loading must not block or fail the command. Doc context capped at 3 docs × 8000 chars.
+**Scale/Scope**: ~3 source files modified, 1 SKILL md file, 1 agent md file, ~10 new tests, 9 doc files updated.
+
+## Constitution Check
+
+No constitution file. Standard CLAUDE.md guidelines apply: no unnecessary refactors, only test the changed paths, prefer small focused changes.
+
+## Project Structure
+
+### Documentation (this feature)
+
+```text
+specs/033-from-description-chat-flow/
+├── spec.md
+├── plan.md
+├── tasks.md
+├── checklists/
+│   └── requirements.md
+└── (no contracts/ — internal CLI feature, no API)
+```
+
+### Source Code (touched paths)
+
+```text
+src/Spectra.CLI/
+├── Commands/Generate/
+│   ├── UserDescribedGenerator.cs   # MODIFIED: add documentContext + criteriaContext params, refactor prompt builder for testability
+│   └── GenerateHandler.cs          # MODIFIED: ExecuteFromDescriptionAsync loads doc + criteria context, populates source_refs
+└── Skills/Content/
+    ├── Skills/spectra-generate.md  # MODIFIED: add "create a specific test case" section + routing table
+    └── Agents/spectra-generation.agent.md  # MODIFIED: add Test Creation Intent Routing section
+
+tests/Spectra.CLI.Tests/
+├── Commands/Generate/
+│   └── UserDescribedGeneratorTests.cs  # NEW: prompt-building tests
+└── Skills/
+    └── GenerateSkillContentTests.cs    # NEW: SKILL/agent content assertions
+```
+
+## Phases
+
+### Phase 0 — Research / discovery
+
+No external research needed. All primitives exist:
+- `SourceDocumentLoader.LoadAllAsync(basePath, maxDocuments, maxContentLengthPerDoc, ct)` already supports caps.
+- `LoadCriteriaContextAsync` (private static in `GenerateHandler`, line 1943) is the criteria primitive — promote to internal/static-helper-callable from the from-description branch.
+- `SkillContent` / `AgentContent` already load embedded resources via `SkillResourceLoader`. SHA-256 hashes are computed at install time, not stored — so editing the `.md` resources is sufficient; no manifest table to regenerate.
+
+### Phase 1 — SKILL & agent content (no code changes beyond .md files)
+
+1. Add new section to `Skills/Content/Skills/spectra-generate.md`:
+   - Heading: `## When the user wants to create a specific test case`
+   - Numbered Step 1..5 sequence (open progress page → runInTerminal → awaitTerminal → readFile → present).
+   - Command line: `spectra ai generate --suite {suite} --from-description "{description}" --context "{context}" --no-interaction --output-format json --verbosity quiet`.
+   - Explicit "Do NOT run analysis. Do NOT ask how many tests. Always 1 test." line.
+   - Routing table mapping intent signal → flow.
+
+2. Add new section to `Skills/Content/Agents/spectra-generation.agent.md`:
+   - Heading: `## Test Creation Intent Routing`.
+   - Three intent classes (Intent 1: explore area → `--focus`, Intent 2: specific test → `--from-description`, Intent 3: from suggestions → `--from-suggestions`) with examples and actions.
+   - Ambiguous-intent rule: topic-vs-scenario; never ask about count.
+
+3. Verify SkillContent/AgentContent dictionaries still resolve (smoke test in build).
+
+### Phase 2 — Doc-aware `--from-description` (CLI code)
+
+1. **`UserDescribedGenerator.cs`** — refactor:
+   - Add public `static string BuildPrompt(string description, string? context, string suite, IReadOnlyCollection<string> existingIds, string? documentContext, string? criteriaContext)` method that returns the AI prompt string. This makes prompt construction testable without invoking AI.
+   - Add optional parameters `string? documentContext = null`, `string? criteriaContext = null`, and `IReadOnlyList<string>? sourceRefPaths = null` to `GenerateAsync(...)`.
+   - When `documentContext` is non-null: insert "## Reference Documentation (for formatting context only)" section in the prompt.
+   - When `criteriaContext` is non-null: insert "## Related Acceptance Criteria" section.
+   - When `sourceRefPaths` is non-null: populate the returned `TestCase.SourceRefs` from those paths instead of `[]`.
+   - Keep AI's `criteria` output (already populated by `agent.GenerateTestsAsync`) flowing into `TestCase.Criteria`.
+   - Keep `grounding.verdict = Manual` unconditionally.
+
+2. **`GenerateHandler.cs`** — modify `ExecuteFromDescriptionAsync`:
+   - Promote `LoadCriteriaContextAsync` from `private static` to allow reuse, OR call directly (it is already in the same class).
+   - After loading config, before calling `generator.GenerateAsync`, perform best-effort load:
+     ```csharp
+     string? docContext = null;
+     IReadOnlyList<string> docPaths = [];
+     try
+     {
+         var loader = new SourceDocumentLoader(config.Source);
+         var allDocs = await loader.LoadAllAsync(currentDir, maxDocuments: null, maxContentLengthPerDoc: 8000, ct);
+         var matching = allDocs
+             .Where(d => MatchesSuite(d, suite))
+             .Take(3)
+             .ToList();
+         if (matching.Count > 0)
+         {
+             docContext = FormatDocContext(matching);
+             docPaths = matching.Select(d => d.Path).ToList();
+         }
+     }
+     catch { /* best-effort */ }
+
+     string? criteriaContext = null;
+     try { criteriaContext = await LoadCriteriaContextAsync(currentDir, suite, config, ct); }
+     catch { /* best-effort */ }
+     ```
+   - `MatchesSuite` is a small private helper: case-insensitive contains on `doc.Path` filename or `doc.Title`.
+   - `FormatDocContext` produces a delimited string of `## {title}\n{content}\n`.
+   - Pass `docContext`, `criteriaContext`, `docPaths` to `generator.GenerateAsync`.
+
+3. **JSON result** — no shape change. `source_refs` and `criteria` are persisted via `TestFileWriter`, which already writes them. No `GenerateResult` schema change needed.
+
+### Phase 3 — Tests
+
+1. **`UserDescribedGeneratorTests`** (new):
+   - `BuildPrompt_WithoutContext_DoesNotIncludeRefSection`
+   - `BuildPrompt_WithDocContext_IncludesRefDocumentationSection`
+   - `BuildPrompt_WithCriteriaContext_IncludesAcceptanceCriteriaSection`
+   - `BuildPrompt_WithBothContexts_IncludesBoth`
+   - `BuildPrompt_IncludesUserDescriptionAsSourceOfTruth`
+
+2. **`GenerateSkillContentTests`** (new):
+   - `GenerateSkill_HasFromDescriptionSection` — asserts `SkillContent.Generate.Contains("create a specific test case")`.
+   - `GenerateSkill_HasIntentRoutingTable` — asserts the table headers ("User intent", "Signal", "Flow") all present.
+   - `GenerateSkill_FromDescriptionUsesCorrectFlags` — asserts `--from-description` line contains `--no-interaction` and `--output-format json` and `--verbosity quiet`.
+   - `GenerationAgent_HasIntentRoutingSection` — asserts agent content contains "Test Creation Intent Routing" + "--from-description" + "--focus".
+   - `GenerationAgent_RoutesToFromDescriptionForSpecificTest` — asserts agent content includes the example "Add a test for".
+   - `GenerationAgent_DoesNotAskAboutCountInRoutingRules` — asserts the "do NOT ask clarifying questions about count" instruction exists.
+
+3. **Integration tests** — deferred. The from-description path invokes AgentFactory which requires real AI. Coverage of FR-008..FR-014 is via the prompt-building unit tests + manual smoke; no integration test will be added in this spec to keep the test suite isolated from network/AI dependencies.
+
+### Phase 4 — Documentation updates
+
+Update the 9 doc files listed in the spec:
+- `CLAUDE.md` — add 033 to Recent Changes.
+- `PROJECT-KNOWLEDGE.md` — add 033 implemented entry.
+- `README.md` — add "create a specific test" example near Quick Start.
+- `docs/getting-started.md` — add from-description example.
+- `docs/cli-reference.md` — note `--from-description` doc/criteria context.
+- `docs/skills-integration.md` — describe new from-description flow + intent routing.
+- `docs/test-format.md` — note `source_refs`/`criteria` may be populated for manual tests.
+- `docs/cli-vs-chat-generation.md` — update Dimension 8.
+- `docs/coverage.md` — note manual tests can now contribute to coverage.
+
+(If any of these files do not exist, skip that line item — they are optional polish.)
+
+## Risks & Mitigations
+
+| Risk | Mitigation |
+|------|------------|
+| Doc loading slows from-description noticeably | Cap at 3 docs × 8000 chars; load synchronously inside best-effort try block; no timeout needed since file I/O is bounded. |
+| AI emits criteria IDs that don't exist | Acceptable — coverage analyzer will simply not match them. The criteria context tells the AI which IDs are valid, so this should be rare. |
+| SKILL .md changes break existing skill content tests | Search existing tests for hardcoded SKILL strings before edit; update them in the same change. |
+| Refactoring `BuildPrompt` to static breaks existing call site | The existing `GenerateAsync` will still build the prompt internally (calling `BuildPrompt`), so call sites are unchanged. |