This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
Create-llama is a monorepo containing CLI tools and server frameworks for building LlamaIndex-powered applications. The repository combines TypeScript/Node.js and Python components in a unified development environment.
- `packages/create-llama/`: Main CLI tool for scaffolding LlamaIndex applications
- `python/llama-index-server/`: Python/FastAPI server framework
- Root: Workspace configuration and shared development tools
- Package Manager: pnpm with workspace configuration
- Build Tools: bunchee (TypeScript), Next.js, hatchling (Python)
- Testing: Playwright for e2e, pytest for Python
- Version Management: changesets for TypeScript packages, manual for Python
```shell
pnpm dev     # Start all packages in development mode
pnpm build   # Build all packages
pnpm lint    # ESLint across TypeScript packages
pnpm format  # Prettier formatting
pnpm e2e     # Run end-to-end tests
```

```shell
cd packages/create-llama
npm run build  # Build CLI using bash script and ncc
npm run dev    # Watch mode development
npm run e2e    # Playwright tests for generated projects
npm run clean  # Clean build artifacts and template caches
```

```shell
cd python/llama-index-server
uv run generate  # Index data files
fastapi dev      # Start development server with hot reload
pytest           # Run test suite
```

The CLI uses a sophisticated template system in `packages/create-llama/templates/`:
- `types/`: Base project structures (streaming, reflex, llamaindexserver)
- `components/`: Reusable components across frameworks
  - `engines/`: Chat and agent engines
  - `loaders/`: File, web, database loaders
  - `providers/`: AI model configurations
  - `vectordbs/`: Vector database integrations
  - `use-cases/`: Workflow implementations
- Templates support multiple frameworks (Next.js, Express, FastAPI)
- Component system allows mix-and-match functionality
- E2E tests validate generated projects work correctly
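The mix-and-match idea above — a base template type overlaid with selected component directories — can be sketched in a few lines. This is a hypothetical illustration only; the real CLI's copying logic lives in `packages/create-llama` and differs in detail, and the `scaffold` function name and its arguments are invented for this sketch.

```python
import shutil
from pathlib import Path

def scaffold(base: Path, components: list[Path], dest: Path) -> None:
    """Overlay a base template type with selected component directories.

    Hypothetical sketch of the mix-and-match idea, not the CLI's actual code.
    """
    # Copy the base project structure first.
    shutil.copytree(base, dest, dirs_exist_ok=True)
    for component in components:
        # Later copies overwrite earlier files, so a component can
        # specialize files provided by the base template.
        shutil.copytree(component, dest, dirs_exist_ok=True)
```

The overwrite-on-conflict behavior is what lets a vector-database or loader component replace a stub file shipped by the base type.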
- Core: `LlamaIndexServer` class extending FastAPI
- Architecture: Workflow factory pattern for stateless request handling
- UI Generation: AI-powered React component generation from Pydantic schemas
- Development: Hot reloading support with dev mode
Both server frameworks use factory patterns:

```typescript
// TypeScript
const server = new LlamaIndexServer({
  workflow: (context) => createWorkflow(context),
});
```

```python
# Python
def create_workflow(chat_request: ChatRequest) -> Workflow:
    return MyWorkflow(chat_request.messages)
```

Structured events for UI communication:
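The point of the factory pattern is that the server stores only the factory, never a workflow instance, so every request gets fresh per-request state. A minimal stand-alone sketch of that property, using plain-Python stand-ins (`ChatRequest`, `MyWorkflow`, and `handle_request` here are illustrative, not the framework's actual types):

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class ChatRequest:
    # Stand-in for the framework's request type.
    messages: list[str]

@dataclass
class MyWorkflow:
    # Stand-in workflow holding mutable per-request state.
    messages: list[str]
    steps_run: int = 0

    def run(self) -> str:
        self.steps_run += 1
        return f"handled {len(self.messages)} messages"

WorkflowFactory = Callable[[ChatRequest], MyWorkflow]

def handle_request(factory: WorkflowFactory, request: ChatRequest) -> str:
    # A fresh workflow per request: concurrent requests never share state.
    workflow = factory(request)
    return workflow.run()
```

Because each call to the factory builds a new workflow, no mutable state leaks between requests — which is what makes the request handling stateless from the server's perspective.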
- UIEvent: Custom components with Pydantic/Zod schemas
- ArtifactEvent: Code/documents for Canvas panel
- SourceNodesEvent: Document sources with metadata
- AgentRunEvent: Tool usage and progress tracking
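To make the event shapes concrete, here is a hedged sketch of what a custom UI event might look like on the wire. The real framework derives schemas from Pydantic (Python) or Zod (TypeScript); the `WeatherCardData` payload, the `component` field, and the serialized envelope below are all assumptions for illustration, built with plain dataclasses.

```python
import json
from dataclasses import asdict, dataclass

@dataclass
class WeatherCardData:
    # Hypothetical payload schema for a custom UI component.
    location: str
    temperature_c: float

@dataclass
class UIEvent:
    # Stand-in for the framework's UIEvent: a component name plus typed data.
    component: str
    data: WeatherCardData

    def to_json(self) -> str:
        # Serialized form streamed to the frontend alongside chat tokens.
        return json.dumps({
            "type": "ui_event",
            "component": self.component,
            "data": asdict(self.data),
        })
```

The frontend would match on `component` to pick a React component and feed it the `data` object; the actual field names in create-llama's protocol may differ.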
- Both servers auto-mount `data/` and `output/` directories
- LlamaCloud integration for remote file access
- Static file serving through framework-specific methods
- Playwright tests in `packages/create-llama/e2e/`
- Tests both Python and TypeScript generated projects
- Validates CLI generation and application functionality
- Python: pytest with comprehensive API and service tests
- TypeScript: Integrated testing through build process
- TypeScript compilation with bash script
- ncc bundling for standalone executable
- Template validation and caching
- `prebuild`: Clean directories
- `build`: bunchee compilation to ESM/CJS
- `postbuild`: Next.js preparation and static asset generation
- `prepare:py-static`: Python integration assets
```shell
pnpm release  # Build all + publish npm packages + Python release
```

- Node.js >=16.14.0
- Python with uv package manager
- pnpm for package management
- Clone repository and run `pnpm install`
- For CLI development: work in `packages/create-llama/`
- For server development: choose TypeScript or Python package
- Use `pnpm dev` for concurrent development across packages
- Run `pnpm e2e` to validate changes with generated projects
- Changes to templates require rebuilding CLI
- E2E tests validate template functionality across frameworks
- Template caching system speeds up repeated builds
- Server package builds static assets for Python integration
- Version synchronization between TypeScript and Python packages
- Shared UI components and styling across implementations
- CLI uses caching for template operations
- Server frameworks support streaming responses
- Background processing for file operations and LlamaCloud integration