All notable changes to this project will be documented in this file.
- `plan-issue-delivery` now ships explicit runtime adapter install/sync/status tooling plus adapter templates for Codex, Claude Code, and OpenCode via `scripts/plan-issue-adapter` (#233).
- Markdown linting now uses `rumdl` behind the existing audit/check entrypoints, with related prompt/docs cleanup and refreshed third-party artifact metadata (#234).
- The `image-processing` skill/docs now align with the current multi-format `convert --in` CLI, including raster inputs, `jpg` output, and current report artifacts (#235).
- `plan-issue-delivery` now tracks the Codex adapter config template as part of the runtime adapter rollout (#233).
- Shared `create-plan`/`create-plan-rigorous` plan-authoring baseline reference at `skills/workflows/plan/_shared/references/PLAN_AUTHORING_BASELINE.md` and shared pytest helpers under `skills/workflows/plan/_shared/python/` (#232).
- `create-plan` now points to a single shared baseline for plan authoring, executability, and grouping rules while keeping only base-skill deltas locally (#231, #232).
- `create-plan-rigorous` now builds on the same shared baseline and keeps only rigorous-specific sizing, scorecard, and review guidance locally (#231, #232).
- The shared plan template now includes optional execution metadata and rigorous scorecard placeholders so the scaffold matches the actual workflow contract more closely (#232).
- Final outcomes artifact at `docs/plans/skills-review-final-outcomes.md` with auditable keep/remove decisions and explicit migration mapping for removed entrypoints.
- Repo-level docs were aligned to the finalized skill/check surface, including canonical `scripts/check.sh` gates and entrypoint-drift guards in `PROJECT_DEV_WORKFLOW.md`.
- `README.md` structure notes now reflect current tracked directories and remove stale progress-log wording.
- `docs/testing/script-smoke.md` plan-issue cleanup example now uses the current `<owner__repo>` workspace slug convention.
- Obsolete `docs/plans` artifacts were removed, and legacy simplification notes were dropped from issue workflow script help text.
- `scripts/README.md` was refreshed to match the current script inventory and remove outdated sections.
- Docs freshness audit coverage, including a dedicated helper and lint/local workflow integration (#214, #215).
- CI ownership/stale-script guardrails with refreshed regression smoke coverage for script specs (#212, #213).
- Local pre-commit check wrapper and updated developer command guidance (#219).
- CI phase orchestration now centralizes bootstrap/setup and adds parity guardrails for check flows (#210, #211).
- `plan-issue-delivery` merge behavior now prefers squash, with merge fallback guidance for protected workflows (#220).
- Repository plan-doc artifacts and related plan checks were pruned as part of CI cleanup/refactor work (#219).
- CI stability issues across docs/ownership/lint/pytest lanes were resolved to unblock runner-safe execution (#217, #218).
- Deterministic third-party artifact generation via `scripts/generate-third-party-artifacts.sh` for `THIRD_PARTY_LICENSES.md` and `THIRD_PARTY_NOTICES.md` (#205).
- Third-party artifact regression coverage, including dedicated pytest cases and smoke-spec entries for generator/audit script help flows (#205).
- Local/CI required checks now enforce strict third-party artifact freshness through `scripts/ci/third-party-artifacts-audit.sh` in `scripts/check.sh --all` and the lint workflow (#205).
- Developer docs (`DEVELOPMENT.md`, `scripts/README.md`) now document the third-party artifact generation/audit workflow and command entrypoints (#205).
- `plan-issue-delivery` now enforces plan-branch integration and sync gates across task-lane flows (#204).
- `release-workflow` now enforces plain issue/PR references in release guidance and audits (#204).
- `issue-pr-review` now ships a reusable review-evidence template to document decision rationale and merge/follow-up outcomes (#203).
- Issue delivery workflows now standardize task-lane continuity and post-review sync expectations across main-agent/subagent handoffs (#200).
- `plan-issue-delivery` now requires main-agent init snapshot artifacts before dispatching sprint work (#202).
- `issue-pr-review` merge/request-followup/close flows now require evidence-gated decision inputs and validation hooks (#203).
- `scripts/project-resolve` now fails fast when required option values are missing, with updated smoke coverage (#201).
- `plan-issue-delivery` now includes a worktree cleanup helper and associated smoke docs (#198).
- `plan-issue` now requires dispatch snapshot artifacts for run traceability.
- `plan-issue-delivery` test coverage now includes sprint1/sprint2 fixture artifact and PR normalization checks (#195, #196).
- Plan-issue prompt/runbook guidance now clarifies worktree usage, approval flow, and sprint close-gate checklists (#197, #199).
- `plan-issue-delivery` now documents runtime workspace policy and resolves prompt paths via `AGENT_HOME`.
- Skill/docs metadata were normalized with strict markdown linting and updated issue-delivery automation slug naming.
- `issue-subagent-pr` now asserts dispatch snapshot environment variable names.
- README automation-skill listings now remove duplicate entries and restore missing plan-issue prompt preset references.
- New bug PR workflows: `create-bug-pr`, `deliver-bug-pr`, and `close-bug-pr`.
- New issue workflows: `issue-lifecycle`, `issue-pr-review`, and `issue-subagent-pr`.
- New issue-delivery automation flows: `issue-delivery` and `plan-issue-delivery`.
- Plan-issue delivery prompts now support rendered subagent prompt enforcement.
- Issue-delivery flows now use plan-issue CLI binaries and updated prompt/checklist guidance.
- `create-plan-rigorous` now includes split-PR sizing guidance and sprint scorecard guardrails.
- Legacy progress PR workflows were removed, with feature tooling simplified around issue/bug delivery paths.
- Issue workflows now enforce `pr-isolated` execution mode.
- `plan-issue-delivery` now returns clearer usage errors.
- Issue workflow smoke scripts now support Bash 3.2 compatibility.
- New `docs-plan-cleanup` workflow skill to prune outdated `docs/plans` content and reconcile plan-related docs safely.
- `docs-plan-cleanup` output now renders as markdown tables, with a bundled response template and test coverage.
- Documentation cleanup removed obsolete plan/runbook docs and refreshed progress index references.
- `find-and-fix-bugs` guidance now clarifies GitHub issue triage behavior.
- Feature PR close-cleanup scripts now handle git worktrees safely.
- `image-processing` skill docs/tests now align with the SVG-first CLI flow.
- Docker auth/home-path defaults are aligned for runtime tooling.
- Docker runtime defaults now point `CODEX_AUTH_FILE` to `$HOME/.codex/auth.json` in compose and workspace launch flows.
- `docker/agent-env/bin/entrypoint.sh` now falls back to `$HOME/.codex` for `CODEX_HOME` and defaults `CODEX_AUTH_FILE` to `$CODEX_HOME/auth.json`.
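
The fallback order in the entry above can be sketched as follows. This is an illustration in Python, not the actual `entrypoint.sh` logic; the function name `resolve_codex_paths` is hypothetical, and treating an empty variable the same as an unset one is an assumption.

```python
import posixpath
from typing import Dict, Mapping


def resolve_codex_paths(env: Mapping[str, str]) -> Dict[str, str]:
    """Resolve CODEX_HOME and CODEX_AUTH_FILE using the fallbacks named above.

    Hypothetical helper: the real entrypoint is a shell script.
    """
    home = env.get("HOME", "")
    # Assumption: an empty CODEX_HOME is treated like an unset one.
    codex_home = env.get("CODEX_HOME") or posixpath.join(home, ".codex")
    # CODEX_AUTH_FILE defaults to auth.json inside the resolved CODEX_HOME.
    auth_file = env.get("CODEX_AUTH_FILE") or posixpath.join(codex_home, "auth.json")
    return {"CODEX_HOME": codex_home, "CODEX_AUTH_FILE": auth_file}
```

An explicit `CODEX_AUTH_FILE` wins over the derived default, and an explicit `CODEX_HOME` changes where the default auth file is looked up.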
- `close-feature-pr`: automatically ready draft PRs before merge.
- `deliver-feature-pr`: add explicit preflight ambiguity bypass support.
- `create-project-skill`: auto-prefix generated skill names.
- Devex: add the `codex-notify` desktop notification wrapper.
- Scripts: add a `clean-untracked` helper command.
- Environment/home-variable migration: standardize on `AGENT_HOME`/`agents_home` naming across docs and scripts, plus `agent-env` naming updates.
- Branding/docs: sync `codex-kit` references to `agent-kit` and refresh workspace-launcher guidance.
- Docker: simplify agent-environment path defaults and add an `agent-env` overview.
- Workflows: update env-var references and align preflight/release documentation.
- Env resolution: remove inconsistent `AGENTS_HOME` fallback usage in runtime scripts and skills.
- `agent-doc-init`: align home-resolution behavior with `AGENT_HOME`.
- `workspace-launcher`: normalize workspace container naming.
- Media tests: align screen-record test environment-variable handling.
- `create-feature-pr`: support kickoff-first draft PR flow.
- `deliver-feature-pr`: add dirty preflight triage support.
- Runbooks: document Codex Cloud setup for Ubuntu and ensure Linuxbrew path guidance.
- Docker: pin `zsh-kit` reference to `nils-cli` for image builds.
- `create-feature-pr`: remove legacy Status section and harden progress URL resolution checks.
- `deliver-feature-pr`: handle empty arrays safely under `set -u`.
- CI/scripts: harden Homebrew install workflow and add install-homebrew help mode.
- Semgrep profile: allow scanning the `commands/` directory.
- Align `screen-record` skill contract and guide with current CLI behavior: screenshot mode, selector/mode gates, diagnostics flags (`--metadata-out`, `--diagnostics-out`), and `--if-changed*`.
- Refresh `screen-record` assistant response template to distinguish recording vs screenshot completion details.
- Expand `macos-agent-ops` workflow docs with permission preflight (`preflight --include-probes`) and diff-aware screenshot triage patterns.
- Add doc-guard tests for `screen-record` and `macos-agent-ops` skills so key CLI usage examples do not drift.
- New `deliver-feature-pr` workflow skill for create-PR -> CI-fix loop -> close-PR delivery.
- New `create-project-skill` workflow under skill-management.
- New `macos-agent-ops` skill for Homebrew `macos-agent` app automation routines.
- Browser tooling now includes `chrome-devtools-debug-companion` in place of site-search workflow.
- `create-skill` now updates the skill catalog during scaffolding.
- Agent-doc dispatcher docs tightened strict preflight flow and baseline fallback handling.
- Playwright workflow now requires MCP output under `out/playwright`.
- Developer/docs updates for local executable workflow and README skill index clarifications.
- Legacy command binaries were removed from `commands/`.
- `macos-agent-ops` guidance and rules now align with AX/input-source workflows.
- US input-source detection and empty-string initializer handling were hardened in macOS agent scripts.
- `deliver-feature-pr` skill metadata quoting was corrected.
- `deliver-feature-pr` script now uses shell-style single-quoted empty `local` initializers.
- `release-workflow`: harden strict audit allow-dirty array handling under `set -u`.
- `release-workflow`: handle empty allow-dirty input safely in strict mode.
- Agent docs: roll out core `startup`/`task-tools`/`project-dev`/`skill-dev` contexts and trial tooling.
- Tests: add missing smoke specs for previously uncovered scripts.
- Docs: document the research workflow and update README media-skill platform support.
- Docs: align `gh-fix-ci` CI-watch command guidance with 10-second interval behavior.
- Tests: remove the orphan `image-processing` smoke spec.
- `agent-doc-init`: avoid `mapfile` usage in project required-file parsing for broader shell compatibility.
- `create-feature-pr`: omit optional PR sections when they resolve to `None`.
- `release-workflow`: allow strict release audit with changelog-only dirty state via `--allow-dirty-path`.
- `script-smoke`: align feature PR smoke cases with optional progress-section handling.
- Browser automation: add the `playwright` skill and wrapper CLI help flow.
- Media capture: add `screen-record` and `screenshot` skills, including desktop screenshot mode.
- Testing: add `api-test-runner` local GraphQL fixture coverage in CI.
- Skills/docs: align `screen-record` and `screenshot` contracts with Linux and desktop-target behavior.
- Automation: relocate `gh-fix-ci` under automation workflows and refresh CI/workflow naming/filters.
- Tooling: migrate script entrypoints to `nils-cli` command wrappers and remove deprecated helper wrappers.
- Worktree cleanup: handle worktree paths containing spaces.
- Screenshot flow: avoid unnecessary macOS permission prompt when running list/discovery modes.
- PR workflows/tests: normalize empty-string quote handling and progress-section cleanup behavior.
- Skills: publish `docs/runbooks/skills/SKILL_MD_FORMAT_V1.md` and `scripts/skills/audit_skill_md_format.py` for SKILL.md format enforcement.
- Tests: add script-spec smoke coverage for `scripts/skills/audit_skill_md_format.py`.
- Skills: enforce `## Contract` placement (Contract-first, short preamble) and update `create-skill` to scaffold SKILL.md from a shared template.
- Docs: expand and reorganize `image-processing` SKILL.md guidance to follow the Contract-first format.
- `semantic-commit`: forbid extra repo inspection commands to keep commit generation strictly staged-context driven.
- Plan tooling: detect TODO/TBD placeholders in required plan fields.
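
A placeholder check of the kind described in the entry above might look like the sketch below. It is not the plan tooling's implementation: the function name, the field names in the usage note, and the case-sensitive whole-word match are all assumptions made for illustration.

```python
import re
from typing import List, Mapping, Sequence

# Assumption: placeholders are the literal upper-case words TODO or TBD.
PLACEHOLDER = re.compile(r"\b(?:TODO|TBD)\b")


def find_placeholder_fields(fields: Mapping[str, str], required: Sequence[str]) -> List[str]:
    """Return the required field names whose values still contain a placeholder."""
    return [name for name in required if PLACEHOLDER.search(fields.get(name, ""))]
```

For example, with hypothetical fields `{"Location": "src/app.py", "Validation": "TBD"}` and both names required, only `Validation` is reported.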
- Docs: add `docker/agent-workspace-launcher/README.md`.
- Tests: add script-spec smoke coverage for more commands and skill scripts.
- Docs: consolidate progress templates and refresh progress PR workflow docs.
- Skills: rename assistant response template references.
- `git-scope`: re-bundle from zsh-kit.
- `git-scope`: exit 0 when no matches are found.
- SQL skills: `sql-postgres`, `sql-mysql`, and `sql-mssql` (plus shared tooling under `skills/tools/sql/_shared`).
- Plan: `docs/plans/sql-skills-db-migration-plan.md` for migrating existing DB tooling.
- Scripts: consolidate DB connect tooling under the SQL skills and remove legacy `scripts/db-connect/{psql,mysql,mssql}.zsh`.
- Docs: update SQL/testing documentation to match the new layout.
- SQL scripts: pass shell style checks.
- Plan workflows: shared Plan Format v1 template at `skills/workflows/plan/_shared/assets/plan-template.md`.
- Plan tooling: `plan-tooling scaffold` helper to generate `docs/plans/*-plan.md` from the shared template.
- `create-plan`/`create-plan-rigorous`: reference the shared plan template and requirements.
- Docs: trim README skill governance and skill management sections.
- Tests: update plan-tooling smoke script paths after plan tooling refactor.
- None.
- Docs: backfill `v2.0.1` changelog entry.
- Docs: clarify credential instructions for `agent-env`.
- None.
- None.
- Docs: clarify workflow docs.
- Docs: document `image-processing` skill.
- None.
- Skill lifecycle tooling: `create-skill` and `remove-skill` skills.
- Skill governance tooling: `skill-governance` skill with layout + contract validation scripts.
- Per-skill tests for tracked skills, enforced via audits + CI.
- `image-processing` skill for convert/resize/crop/optimize workflows via ImageMagick.
- Breaking: skills structure reorg (v2). The v1 `skills/` layout and prior skill entrypoints are not backward compatible.
- Plan tooling is now shipped under `skills/workflows/**`.
- Lint workflows: route checks through the v2 skill governance entrypoints.
- `image-processing`: add missing shebang for `image_processing.py`.
- Tests: ignore `.worktrees` to prevent noisy collection.
- Plan toolchain: plan lint/parse/batches scripts and `scripts/check.sh --plans` to keep plans executable and parallelizable.
- CI: consolidate publish workflows into a single pipeline.
- Scripts: set `AGENT_HOME` to repo root by default for more resilient runs.
- Plans: remove internal dogfood/review planning docs (keep format + toolchain docs).
- `bundle-wrapper`: improve parsing of array-style arguments.
- Tests: fix git commit ban regex enforcement.
- Planning workflows: `create-plan`, `create-plan-rigorous`, and `execute-plan-parallel` skills (plus `docs/plans/` convention).
- None.
- `git-scope`: use literal prefix matching for tracked filters.
- `graphql-api-testing`: prevent xh/HTTPie from reading stdin in non-interactive runs.
- `agent-env`: `prefetch-zsh-plugins.sh` with retry/backoff for plugin installs.
- `agent-env`: `PREFETCH_ZSH_PLUGINS` build arg to skip plugin prefetch.
- `agent-env`: `ZSH_PLUGIN_FETCH_RETRIES` build arg to tune retry attempts.
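
As a rough illustration of the retry/backoff pattern mentioned above, here is a Python sketch. It does not reproduce `prefetch-zsh-plugins.sh`; the function name, the exponential delay schedule, and the reading of `retries` as the total number of attempts (analogous to what `ZSH_PLUGIN_FETCH_RETRIES` might control) are assumptions.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def fetch_with_retry(
    fetch: Callable[[], T],
    retries: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fetch() up to `retries` times, doubling the wait after each failure."""
    if retries < 1:
        raise ValueError("retries must be at least 1")
    for attempt in range(1, retries + 1):
        try:
            return fetch()
        except Exception:
            if attempt == retries:
                raise  # Out of attempts: surface the last error.
            # Exponential backoff: base_delay, 2 * base_delay, 4 * base_delay, ...
            sleep(base_delay * 2 ** (attempt - 1))
    raise AssertionError("unreachable")
```

The `sleep` parameter is injectable so the schedule can be checked without waiting in real time.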
- Dockerfile: move image metadata ARGs to the top for consistency.
- `agent-env`: move `CODEX_AUTH_FILE` export into `entrypoint.sh`.
- `api-report`: resolve `--out`/`--response` paths relative to the derived project root.
- `agent-workspace`: launcher contract for capability discovery + JSON output; wrapper migration docs.
- `agent-workspace`: `--no-clone` option for bringing up an existing workspace without cloning.
- `agent-env`: GitHub Actions workflows for GHCR/Docker Hub publishing, including multi-arch (arm64) support and OCI labels.
- Lint: pyright typechecking in the Python lint workflow.
- `script_smoke`: spec coverage for `audit-skill-layout.sh`.
- `agent-env`: use `tini` as init; add `rsync`/linuxbrew directory; disable weather/quote on boot; and improve mount override flows.
- CI: set `AGENT_HOME` globally, optimize multi-arch builds, and refresh runner labels.
- Docs: canonicalize script references to `$AGENT_HOME` and use `$HOME/` in path examples.
- `find-and-fix-bugs`: add problem + reproduction sections to the skill and PR template.
- Workspace auth: remove token env vars from the container for safer Git authentication.
- `api-report`: expand tilde paths and guard stdin response clashes.
- `git-scope`: handle `mktemp` fallback on macOS.
- `agent-workspace`: handle long container names when computing hostnames.
- Progress templates: repair the progress template symlink.
- Docs and tooling: fix duplicated `codex_home` references and clarify desktop notification word limit guidance.
- Docker agent env (Ubuntu 24.04): root `Dockerfile` + compose, tool install scripts, and compose overlays for secrets/SSH/local overrides.
- Workspace launcher: `docker/agent-env/bin/agent-workspace` (up/ls/shell/tunnel/rm) with `--secrets-mount` support and improved auth/mount flows.
- Docker agent env docs: `docker/agent-env/README.md` and `docker/agent-env/WORKSPACE_QUICKSTART.md` (plus root README link).
- Git commit context JSON: new `commands/git-commit-context-json` wrapper and `git-tools` JSON output support.
- Docker env: clean up environment variables; add `CODEX_AUTH_FILE` config; default `CODEX_COMMANDS_PATH` and `ZSH_FEATURES`.
- `semantic-commit`: staged context now outputs a JSON + patch bundle and falls back to `git diff --staged` when wrappers are unavailable.
- Shell style fixer: preserve initializer handling in `$AGENT_HOME/scripts/fix-zsh-typeset-initializers.zsh`.
- Skill layout audit now enforces `TEMPLATE` markdown placement under `references/` or `assets/templates/`.
- Docs: update template placement guidance.
- None.
- Semgrep tooling: `.semgrep.yaml`, `.semgrepignore`, and `$AGENT_HOME/scripts/semgrep-scan.sh` with curated defaults.
- `semgrep-find-and-fix` automation skill, including local config and PR/report templates.
- Repo verification tooling: `$AGENT_HOME/scripts/check.sh` and `$AGENT_HOME/scripts/lint.sh` (shellcheck/bash -n/zsh -n, ruff, mypy) plus dev configs (`ruff.toml`, `mypy.ini`, `requirements-dev.txt`).
- Shell style fixers: `$AGENT_HOME/scripts/fix-shell-style.zsh`, `$AGENT_HOME/scripts/fix-typeset-empty-string-quotes.zsh`, `$AGENT_HOME/scripts/fix-zsh-typeset-initializers.zsh`.
- API test report templates/metadata plus `api-gql/api-rest report-from-cmd` workflow helpers (REST + GraphQL).
- CI: lint workflow now validates skill contracts and runs stricter Python type checks.
- Repo checks: rename `verify.sh` to `check.sh` and split checks into modular flags.
- GraphQL/REST helpers: improve report formatting, quoting, and metadata.
- Workflows: remove `eval` usage and parse `project-resolve` JSON safely during releases.
- Semgrep: sanitize test fixtures for stable scans.
- `api-test-runner`: fix quoting for `ACCESS_TOKEN` in the docs snippet.
- Top-level `commands/` directory exposing reusable primitives (`git-scope`, `git-tools`, `project-resolve`).
- Functional script coverage reporting for smoke tests.
- `open-changed-files-review` code-path override option.
- Command wrappers are now shipped via `commands/` (instead of a `scripts/` loader).
- Standardized commands path resolution via `CODEX_COMMANDS_PATH`/`$AGENT_HOME/commands`.
- Release workflow moved into automation, resolves guides/templates deterministically, and audits the changelog pre-publish.
- PR workflows reduce redundant `gh` metadata lookups.
- Docs: commit workflow, automation commit guidance, and find-and-fix-bugs classification updates.
- Prompts: remove obsolete openspec prompt files.
- `.gitignore`: ignore `tmp/` directory.
- `git-tools`: clean up commit context temp file.
- `chrome-devtools-mcp`: use `AGENT_HOME` for default paths and expand tilde paths.
- `graphql-api-testing`: quote `AGENT_HOME` during script path rewrites.
- Shell scripts: address minor shellcheck warnings.
- `script_smoke` pytest suite with spec-driven + fixture-based coverage across agent-kit scripts.
- Hermetic stubs under `tests/stubs/bin/**` (DB clients, HTTP clients, `gh`, and misc tools) for CI-friendly runs.
- Docs and helpers for managing the smoke test expansion plan.
- CI: upload `script_smoke` artifacts and add API test runner workflows for demos/fixtures.
- Smoke coverage expanded via Step 2 planned PRs.
- `git-scope` smoke spec now tracks the archived plan file path.
- Pytest-based script regression suite and docs.
- MIT license.
- CI: run pytest in lint workflow.
- PR workflows: standardize planning PR references and reduce `gh` calls.
- PR merge script now avoids unsupported `gh pr merge --yes` flag.
- Initial release of agent-kit (prompts, skills, scripts, and docs).
- Release workflow fallback template and helper scripts for changelog-driven GitHub releases.
- None (initial release).
- None (initial release).