Open Ask AI Server

A deploy-ready, serverless documentation assistant AI agent: it can be hosted on Vercel, calls an LLM through the Vercel AI Gateway, and performs agentic search over an in-memory filesystem using bash-tool, all for free (the Vercel Hobby plan comes with a generous usage quota).

Open Ask AI Server on GitHub is a template repository, ready to deploy as a serverless function on Vercel.

Once deployed, it provides an API endpoint /api/stream that accepts conversation messages and streams back AI-generated responses based on pre-scanned markdown documentation files.

Then integrate it with the Open Ask AI Widget on your documentation site.

Usage

  1. Click "Use this template" to create your own repo, or fork this repository.
  2. Edit projects.json to define your documentation projects.
  3. Add your markdown documentation files in the projects/ directory.
  4. Optionally, add an AGENTS.md file in each project folder to provide additional agent instructions, such as what the project is about and how the docs are structured.
  5. Deploy to Vercel using the Vercel CLI.
npm i -g vercel
vercel login
vercel --prod

Your AI agent API will be live at https://<your-vercel-project>.vercel.app/api/stream.

It accepts a POST request with conversation messages and streams back AI-generated responses. See the "API Endpoints" section below for details.

Features

  • Multi-project support: Serve multiple documentation projects from a single deployment
  • Conversation-based API: Full conversation history support with UIMessage format
  • Streaming AI responses: Real-time interaction with ToolLoopAgent
  • Pre-generated documentation: Fast in-memory file access from pre-scanned JSON
  • Read-only bash access: Secure bash commands for searching markdown documentation
  • Production-ready: Vercel Functions with Fluid compute for cost-efficient scaling

Architecture

Core Components

  • API Endpoint: /api/stream - POST endpoint with streaming UIMessage responses
  • Agent: AI SDK v6 ToolLoopAgent with bash and readFile tools
  • Documentation Scanner: Pre-scans project docs into JSON files for fast access
  • Project System: Multi-project configuration via projects.json
  • Model: OpenAI GPT-OSS-120B with low reasoning effort for fast responses
  • Deployment: Vercel Functions with Fluid compute (60s max duration)
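
The repository ships its own vercel.json; for orientation, a function duration cap of this shape is how Vercel configures it (illustrative only, not the repo's exact file):

{
  "functions": {
    "api/**/*.ts": {
      "maxDuration": 60
    }
  }
}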

How It Works

  1. Pre-generation: Run npm run build to scan all projects in projects/ directory
  2. Generated Files: Creates JSON files in generated/ with all markdown content
  3. Runtime: API loads project JSON into memory and creates bash-tool with files
  4. Agent Execution: ToolLoopAgent uses bash commands to search pre-loaded files
  5. Streaming: Returns UIMessage stream compatible with AI SDK UI components
Client Request → Vercel Function → AI SDK
                                      ↓
                                ToolLoopAgent
                                      ↓
                                  bash-tool
                                      ↓
                          Pre-loaded Files (in-memory)
                                      ↓
                          Streaming Response → Client
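
As a rough TypeScript sketch of steps 2-4, assuming the generated JSON maps relative paths to file contents (the real schema comes from scripts/scan-docs.ts, and the actual wiring lives in api/stream.ts, lib/bash-tool-setup.ts, and lib/agent.ts):

import { readFileSync } from "node:fs";

// Load the pre-generated project bundle once per function instance.
// Assumed shape: { "getting-started.md": "# Getting Started\n...", ... }
const files: Record<string, string> = JSON.parse(
  readFileSync("generated/my-project.json", "utf8"),
);

// A toy stand-in for what the bash-tool's grep does against the
// in-memory filesystem: list every file containing a keyword.
function grepFiles(keyword: string): string[] {
  return Object.entries(files)
    .filter(([, content]) => content.toLowerCase().includes(keyword.toLowerCase()))
    .map(([path]) => path);
}

console.log(grepFiles("authentication")); // e.g. ["guides/authentication.md"]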

Prerequisites

  • Node.js 20+
  • Markdown documentation files

Setup

1. Install Dependencies

npm install

2. Configure Projects

Edit projects.json to define your documentation projects:

{
  "my-project": {
    "name": "My Project"
  }
}
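
Each top-level key is a project ID, and several projects can be served side by side from one deployment; for example (second entry hypothetical):

{
  "my-project": {
    "name": "My Project"
  },
  "another-project": {
    "name": "Another Project"
  }
}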

3. Add Documentation

Create a directory for each project in projects/:

projects/
├── my-project/
│   ├── AGENTS.md             # (Optional) Agent instructions
│   ├── getting-started.md
│   ├── api-reference.md
│   └── guides/
│       └── authentication.md
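
A minimal AGENTS.md might look like this (free-form; write whatever orients the agent):

# My Project Documentation

These docs cover the My Project SDK. Getting-started and API reference
live at the top level; task-oriented guides live under guides/.
When citing a file, quote its exact heading.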

4. Generate Documentation Files

Scan all projects and generate JSON files:

npm run build

This creates files in generated/:

generated/
└── my-project.json
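
The exact schema is whatever scripts/scan-docs.ts emits; conceptually, each bundle maps relative file paths to their markdown content, along these lines (illustrative, not the real format):

{
  "AGENTS.md": "# My Project Documentation\n...",
  "getting-started.md": "# Getting Started\n...",
  "guides/authentication.md": "# Authentication\n..."
}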

5. Initialize Vercel Project

npm install -g vercel
vercel login
vercel

6. Run Locally

vercel dev

Visit http://localhost:3000/api/health to verify the server is running.

API Endpoints

GET /api/health

Health check endpoint.

Response:

{
  "status": "ok",
  "timestamp": "2026-02-04T12:00:00.000Z",
  "version": "1.0.0"
}

POST /api/stream

Streaming conversation endpoint with multi-project support.

Request:

{
  "messages": [
    {
      "role": "user",
      "parts": [
        {
          "type": "text",
          "text": "How do I configure authentication?"
        }
      ]
    }
  ],
  "project": "my-project"
}

Parameters:

  • messages (optional): Array of UIMessage objects for conversation history
  • project (optional): Project ID to use (defaults to the first project in projects.json)

Response: Server-Sent Events stream with UIMessage format
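
The stream can be consumed from any HTTP client; here is a minimal TypeScript sketch that just prints the raw event stream (on a real docs site you would use the Open Ask AI Widget or AI SDK UI components instead):

// Node 20+ (global fetch); swap process.stdout.write for your own sink in the browser.
const res = await fetch("http://localhost:3000/api/stream", {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({
    messages: [
      { role: "user", parts: [{ type: "text", text: "How do I configure authentication?" }] },
    ],
    project: "my-project",
  }),
});

// Print each streamed chunk as it arrives.
const reader = res.body!.getReader();
const decoder = new TextDecoder();
for (;;) {
  const { done, value } = await reader.read();
  if (done) break;
  process.stdout.write(decoder.decode(value, { stream: true }));
}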

Usage Examples

Health Check

curl http://localhost:3000/api/health

Simple Query

curl -X POST http://localhost:3000/api/stream \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {
        "role": "user",
        "parts": [
          {
            "type": "text",
            "text": "What topics are covered in the documentation?"
          }
        ]
      }
    ],
    "project": "my-project"
  }'

Query Specific Project

curl -X POST http://localhost:3000/api/stream \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {
        "role": "user",
        "parts": [
          {
            "type": "text",
            "text": "Find API endpoints"
          }
        ]
      }
    ],
    "project": "my-project"
  }'

Conversation with History

curl -X POST http://localhost:3000/api/stream \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {
        "role": "user",
        "parts": [
          {
            "type": "text",
            "text": "What is authentication?"
          }
        ]
      },
      {
        "role": "assistant",
        "parts": [
          {
            "type": "text",
            "text": "Authentication is..."
          }
        ]
      },
      {
        "role": "user",
        "parts": [
          {
            "type": "text",
            "text": "How do I implement it?"
          }
        ]
      }
    ],
    "project": "my-project"
  }'

Project Structure

open-ask-ai-server/
├── api/
│   ├── health.ts                    # GET /api/health
│   └── stream.ts                    # POST /api/stream
├── lib/
│   ├── types.ts                     # TypeScript interfaces
│   ├── utils.ts                     # Error handling utilities
│   ├── bash-tool-setup.ts           # OverlayFs + createBashTool config
│   └── agent.ts                     # Agent configuration
├── scripts/
│   └── scan-docs.ts                 # Documentation scanner
├── projects/
│   └── [project-id]/                # Project documentation directories
│       └── *.md                     # Markdown files
├── generated/
│   └── [project-id].json            # Generated project files
├── projects.json                    # Project configuration
├── package.json                     # Dependencies
├── tsconfig.json                    # TypeScript config
├── vercel.json                      # Vercel deployment config
└── README.md                        # Documentation

Agent Capabilities

The agent can execute bash commands to explore documentation:

  • find - Locate markdown files
  • grep - Search for keywords
  • cat - Read file contents
  • head/tail - Preview files
  • Pipes and command combinations

All commands operate on pre-loaded in-memory files for fast access.
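
For example, to answer a question about authentication, the agent might run:

grep -ril "authentication" .
cat guides/authentication.md
head -n 40 getting-started.md

Since just-bash is a simulated environment, the supported flags may be a practical subset of full GNU coreutils behavior.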

Deployment

Deploy to Vercel

vercel --prod

Environment Variables

The following environment variables can be configured to customize the LLM behavior. It's recommended to configure them in the Vercel Dashboard (Settings > Environment Variables) for production deployments.

Variable               Description                                    Default Value
LLM_MODEL              Model to use for the agent                     openai/gpt-oss-120b
LLM_REASONING_EFFORT   Reasoning effort level (low, medium, high)     low
LLM_TEXT_VERBOSITY     Text verbosity level (low, medium, high)       medium
MAX_STEPS              Maximum number of steps the agent can take     16

For local development, set them in .env.local:

LLM_MODEL=openai/gpt-oss-120b
LLM_REASONING_EFFORT=low
LLM_TEXT_VERBOSITY=medium
MAX_STEPS=16

For production deployment on Vercel:

  1. Go to your project in the Vercel Dashboard
  2. Navigate to Settings > Environment Variables
  3. Add the environment variables you want to customize
  4. Redeploy your project for the changes to take effect

Development

Type Checking

npx tsc --noEmit

Local Development

vercel dev

Performance

  • Pre-generated Files: All documentation loaded into memory at startup
  • Fluid Compute: Enabled for cost-efficient scaling
  • maxDuration: 60: Functions can run up to 60 seconds
  • just-bash: A simulated bash environment with an in-memory virtual filesystem
  • Streaming: Real-time responses improve perceived performance

Security

  • Read-Only Access: just-bash provides a read-only in-memory virtual filesystem
  • Input Validation: Messages validated before processing
  • No LLM API Keys: Vercel Functions call Vercel AI Gateway directly
  • No File System Access: All files pre-loaded from JSON

License

MIT