vibe-blog/README_EN.md at main · datawhalechina/vibe-blog

Turn complex tech into stories everyone can understand.

中文 | English

A multi-Agent AI-powered long-form blog generator with deep research, smart illustrations, and Mermaid diagrams,
transforming technical knowledge into easy-to-understand articles for everyone

🎯 Lower the barrier to technical writing, making knowledge sharing simpler

If this project is useful to you, please star🌟 & fork🍴

✨ Project Origin

Have you ever been stuck in this situation: you want to write a technical blog, but don't know how to make it understandable for non-technical readers; you have lots of technical knowledge in mind, but struggle to explain it with vivid metaphors?

Traditional technical blog writing has the following pain points:

1️⃣ Time-consuming: A high-quality technical article takes hours or even days
2️⃣ Illustration difficulties: Hard to find suitable images, Mermaid syntax is complex
3️⃣ Lack of depth: No time for deep research, content tends to be superficial
4️⃣ Single audience: Difficult to adjust content depth for readers of different technical levels
5️⃣ Tedious distribution: Need to manually adapt to different platform format requirements

vibe-blog was born to solve these problems. Based on multi-Agent collaborative architecture, it automatically completes the entire process of research, planning, writing, and illustration, letting you focus on the knowledge itself.

👨‍💻 Use Cases

Tech Bloggers: Quickly generate high-quality technical articles, saving writing time
Developer Advocates: Transform complex technology into easy-to-understand content, expanding influence
Educators: Generate teaching materials, using life-like metaphors to help students understand
Product Managers: Quickly understand technical concepts, better communicate with development teams
Tech Beginners: Easily get started with new technologies through AI-generated articles

🖼️ Demo & Results

Homepage - Clean and Elegant Input Interface

Input topic, select article type and length, generate with one click

Article Types:

📚 Tutorial: Step-by-step teaching, master technology from zero to one
🔧 Problem Solving: Targeted solutions for specific problems
📊 Comparative Analysis: Multi-solution comparison to help with tech selection

Article Length:

Length	Chapters	Reading Time	Depth	Use Case
📄 Short	3-5 chapters	~30 min	shallow	Quick introduction, fast start
📑 Medium	5-8 chapters	~60 min	medium	Concrete examples + step-by-step, deep learning
📚 Long	8-12 chapters	~90+ min	deep	Principle analysis + data support + edge cases, comprehensive mastery

💡 Questioning Depth: The system automatically adjusts content review standards based on article length. Long articles trigger stricter depth checks to ensure each concept has data support and principle analysis.

AI Workflow Status - Real-time Generation Progress Tracking


Step 1: Material Collection Intelligent web search for resources	Step 2-3: Outline Planning & Content Writing Generate structured outline, write by chapter

Step 4: Depth Questioning Check content depth, supplement details	Step 5: Code Generation Generate runnable example code

Step 6: Image Generation Mermaid diagrams + AI images	Step 7: Quality Review Score and provide improvement suggestions

Step 8: Document Assembly Assemble complete document, extract summary	🎉 Generation Complete Auto-save Markdown file

Blog Results - Professional Technical Articles

Complete blog content preview, supports image export and Markdown download

🎨 Blog Generation Examples

Blog Title	Local Preview	CSDN
Triton Deployment Practical Guide: From Design Principles to Production	Markdown	View
vLLM Inference Engine Deep Dive: Core Acceleration Mechanisms and Component Principles	Markdown	View
Message Queue Getting Started: Building an Async Communication System from Scratch	Markdown	View
Distributed Lock Practical Guide: Master High-Concurrency Resource Synchronization in 30 Minutes	Markdown	View
RAG Evolution Illustrated: Traditional RAG vs Graph RAG Architecture Comparison	Markdown	View
Redis Quick Start Tutorial: Building a High-Performance Cache System from Scratch	Markdown	View

🎯 Feature Introduction

1. Multi-Agent Collaborative Architecture

Based on LangGraph, multi-Agent workflow with clear division of labor and efficient collaboration:

Agent	Role	Core Capability
Orchestrator	Director	Coordinate entire workflow, manage Agent communication
Researcher	Researcher	Web search, knowledge extraction, document fusion
SearchCoordinator	Multi-round Search	Multi-round search based on Writer/Questioner feedback, detect knowledge gaps
Planner	Planner	Generate structured outlines, design article framework
Writer	Writer	Write each chapter content in loop, ensure logical coherence
Questioner	Depth Checker	Core role for article length control, deep check Writer output, expand content based on depth type
Coder	Coder	Generate example code, provide runnable code
Artist	Illustrator	Generate Mermaid diagrams, AI cover images
Reviewer	Quality Controller	Core quality control role, check and score Writer/Questioner output, regenerate if below threshold
Assembler	Assembler	Final document assembly, multi-format export

All Agents share unified state management and Prompt template library, ensuring efficient collaboration and consistent output quality.

2. Deep Research Capability

Zhipu Search Integration: Automatically search the web for latest technical materials
Knowledge Extraction: Extract key information from search results
Citation Annotation: Automatically annotate information sources, ensuring credibility

3. Smart Illustration System

Mermaid Diagrams: Automatically generate flowcharts, architecture diagrams, sequence diagrams
AI Cover Images: Generate cartoon-style covers based on nano-banana-pro
Context-Aware: Generate unique illustrations based on section content

4. Multi-Format Export

Markdown: Standard Markdown format, ready for direct publishing
Image Export: One-click export article as long image
Live Preview: Real-time Markdown and Mermaid rendering in frontend

🗺️ Development Roadmap

Status	Milestone
✅ Completed	Multi-Agent architecture (10 Agents: Orchestrator/Researcher/SearchCoordinator/Planner/Writer/Questioner/Coder/Artist/Reviewer/Assembler)
✅ Completed	Zhipu search service integration
✅ Completed	Mermaid diagram auto-generation
✅ Completed	AI cover image generation (nano-banana-pro)
✅ Completed	SSE real-time progress push
✅ Completed	Markdown live preview rendering
✅ Completed	Export article as image
🧭 Planned	AI Smart Reading Guide (Mind Map + Interactive Reading)
🧭 Planned	PDF knowledge parsing and deep research
🧭 Planned	Podcast format output (TTS synthesis)
🧭 Planned	Tutorial video generation
🧭 Planned	Multi-audience adaptation (students/children/professionals)
🧭 Planned	Comic format output
🧭 Planned	One-click publish to social media platforms

📦 Usage

⚡️ Recommended: Docker deployment (simple, consistent)

📖 Full Guide: Docker Deployment Guide

Method 1: Docker Deployment (Recommended)

Configure environment variables

cp backend/.env.example backend/.env
# Edit .env to configure API keys

Edit backend/.env file to configure environment variables:

# AI Provider format
AI_PROVIDER_FORMAT=openai

# OpenAI compatible API
OPENAI_API_KEY=your-api-key-here
OPENAI_API_BASE=https://dashscope.aliyuncs.com/compatible-mode/v1
TEXT_MODEL=qwen3-max-preview

# Zhipu Search API (optional, for deep research)
ZAI_SEARCH_API_KEY=your-zhipu-api-key

# Nano Banana Pro API (optional, for AI cover images)
NANO_BANANA_API_KEY=your-nano-banana-api-key
NANO_BANANA_API_BASE=https://grsai.dakka.com.cn

🚀 Quick Start (One-Click Redeploy)

./docker/redeploy.sh

Start services

docker compose -f docker/docker-compose.yml up -d

Access the application
- Frontend: http://localhost:3000
- API: http://localhost:5000

Management commands

# View logs
docker compose -f docker/docker-compose.yml logs -f
# Stop services
docker compose -f docker/docker-compose.yml down

Method 2: Local Development Deployment

Clone the repository

git clone https://github.com/datawhalechina/vibe-blog

Create virtual environment

cd backend
python -m venv venv
source venv/bin/activate  # Linux/Mac
# venv\Scripts\activate  # Windows

Install dependencies

pip install -r requirements.txt

# frontend dependencies
cd frontend
npm install

Configure environment variables

cp .env.example .env
# Edit .env to configure necessary environment variables

Edit backend/.env file to configure environment variables:

# AI Provider format
AI_PROVIDER_FORMAT=openai

# OpenAI compatible API
OPENAI_API_KEY=your-api-key-here
OPENAI_API_BASE=https://dashscope.aliyuncs.com/compatible-mode/v1
TEXT_MODEL=qwen3-max-preview

# Zhipu Search API (optional, for deep research)
ZAI_SEARCH_API_KEY=your-zhipu-api-key

# Nano Banana Pro API (optional, for AI cover images)
NANO_BANANA_API_KEY=your-nano-banana-api-key
NANO_BANANA_API_BASE=https://grsai.dakka.com.cn

Start the service

# backend
cd backend
python app.py

# frontend
cd frontend
npm run dev

Access the application
- Frontend: http://localhost:5001
- API: http://localhost:5001/api

🛠️ Technical Architecture

Backend Tech Stack

Language: Python 3.10+
Framework: Flask 3.0
AI Framework: LangGraph (Multi-Agent orchestration)
Template Engine: Jinja2 (Prompt management)
Real-time Communication: Server-Sent Events (SSE)

AI Models & Services

Function	Provider	Model/API	Description
Text Generation	Alibaba Bailian	Qwen (Qianwen)	Used for Agent text generation and reasoning
Web Search	Zhipu	Web Search API	Used for Researcher Agent's deep research
AI Image Generation	Nano Banana	nano-banana-pro	Used for AI cover images and illustrations

API Endpoints

Text Model: OpenAI-compatible API format
Search Service: https://open.bigmodel.cn/api/paas/v4/web_search
Image Generation: https://grsai.dakka.com.cn (China direct)

Frontend Tech Stack

Rendering: Native HTML + JavaScript
Markdown: marked.js
Code Highlighting: highlight.js
Diagram Rendering: Mermaid.js
Image Export: html2canvas

📁 Project Structure

banana-blog/
├── backend/                              # Flask backend application
│   ├── app.py                            # Flask application entry + API routes
│   ├── config.py                         # Configuration file
│   ├── requirements.txt                  # Python dependencies
│   ├── .env.example                      # Environment variable example
│   ├── static/
│   │   └── index.html                    # Frontend page (HTML + JS)
│   ├── outputs/                          # Generated article output directory
│   │   └── images/                       # AI generated images
│   └── services/
│       ├── llm_service.py                # LLM service wrapper
│       ├── image_service.py              # Image generation service (Nano Banana)
│       ├── task_service.py               # SSE task management
│       ├── database_service.py           # Database service
│       ├── file_parser_service.py        # File parser service (PDF/MD/TXT)
│       ├── knowledge_service.py          # Knowledge management service
│       ├── pipeline_service.py           # Pipeline service
│       ├── transform_service.py          # Transform service
│       ├── prompts/                      # Service layer Prompt templates
│       │   ├── document_summary.j2       # Document summary Prompt
│       │   └── image_caption.j2          # Image caption Prompt
│       └── blog_generator/               # Blog generator core
│           ├── blog_service.py           # Blog generation service entry
│           ├── generator.py              # LangGraph workflow definition
│           ├── agents/                   # 10 Agent implementations
│           │   ├── researcher.py         # Research Agent - web search
│           │   ├── search_coordinator.py # Search Coordinator Agent - multi-round search
│           │   ├── planner.py            # Planning Agent - outline generation
│           │   ├── writer.py             # Writing Agent - content writing
│           │   ├── questioner.py         # Questioner Agent - depth check
│           │   ├── coder.py              # Code Agent - example generation
│           │   ├── artist.py             # Artist Agent - Mermaid + AI images
│           │   ├── reviewer.py           # Reviewer Agent - quality scoring
│           │   └── assembler.py          # Assembler Agent - document synthesis
│           ├── templates/                # Jinja2 Prompt templates
│           │   ├── researcher.j2         # Research Prompt
│           │   ├── planner.j2            # Planning Prompt
│           │   ├── writer.j2             # Writing Prompt
│           │   ├── writer_enhance.j2     # Writing enhancement Prompt
│           │   ├── writer_enhance_knowledge.j2  # Knowledge-enhanced writing Prompt
│           │   ├── questioner.j2         # Questioner Prompt
│           │   ├── coder.j2              # Code Prompt
│           │   ├── artist.j2             # Artist Prompt
│           │   ├── cover_image_prompt.j2 # Cover image Prompt
│           │   ├── reviewer.j2           # Reviewer Prompt
│           │   ├── search_query.j2       # Search query Prompt
│           │   ├── search_summarizer.j2  # Search summarizer Prompt
│           │   ├── knowledge_gap_detector.j2  # Knowledge gap detector Prompt
│           │   ├── assembler_header.j2   # Assembler header Prompt
│           │   └── assembler_footer.j2   # Assembler footer Prompt
│           ├── prompts/
│           │   └── prompt_manager.py     # Prompt rendering manager
│           ├── schemas/
│           │   └── state.py              # Shared state definition
│           ├── post_processors/
│           │   └── markdown_formatter.py # Markdown post-processor
│           ├── utils/
│           │   └── helpers.py            # Utility functions
│           └── services/
│               └── search_service.py     # Zhipu search service
├── logo/                                 # Logo resources
└── README.md

🔧 Environment Variables

📋 Full Configuration (Click to Expand)

AI Model Configuration

Variable	Description	Default
`AI_PROVIDER_FORMAT`	AI Provider format	openai
`TEXT_MODEL`	Text generation model	qwen3-max-preview
`IMAGE_CAPTION_MODEL`	Image caption model	qwen3-vl-plus-2025-12-19

OpenAI Compatible API

Variable	Description	Example
`OPENAI_API_KEY`	OpenAI compatible API Key	sk-xxx
`OPENAI_API_BASE`	OpenAI compatible API Base URL	https://dashscope.aliyuncs.com/compatible-mode/v1

Image Generation (Nano Banana)

Variable	Description	Example
`NANO_BANANA_API_KEY`	Nano Banana API Key	sk-xxx
`NANO_BANANA_API_BASE`	Nano Banana API Base URL	https://grsai.dakka.com.cn
`NANO_BANANA_MODEL`	Nano Banana model name	nano-banana-pro

Search Configuration (Zhipu Web Search)

Variable	Description	Example
`ZAI_SEARCH_API_KEY`	Zhipu Web Search API Key	xxx
`ZAI_SEARCH_API_BASE`	Zhipu Search API Base URL	https://open.bigmodel.cn/api/paas/v4/web_search
`ZAI_SEARCH_ENGINE`	Search engine type	search_pro_quark
`ZAI_SEARCH_MAX_RESULTS`	Max search results	5
`ZAI_SEARCH_CONTENT_SIZE`	Content size	medium
`ZAI_SEARCH_RECENCY_FILTER`	Recency filter	noLimit

Multi-Search Configuration

Variable	Description	Default
`MULTI_SEARCH_MAX_SHORT`	Max search rounds for short articles	3
`MULTI_SEARCH_MAX_MEDIUM`	Max search rounds for medium articles	5
`MULTI_SEARCH_MAX_LONG`	Max search rounds for long articles	8

Knowledge Fusion Configuration

Variable	Description	Default
`KNOWLEDGE_MAX_CONTENT_LENGTH`	Max content length	8000
`KNOWLEDGE_MAX_DOC_ITEMS`	Max document items	10
`KNOWLEDGE_CHUNK_SIZE`	Chunk size	2000
`KNOWLEDGE_CHUNK_OVERLAP`	Chunk overlap	200

🤝 Contributing

Welcome to contribute to this project through Issue and Pull Request!

📄 License

CC BY-NC-SA 4.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

✨ Project Origin

👨‍💻 Use Cases

🖼️ Demo & Results

Homepage - Clean and Elegant Input Interface

AI Workflow Status - Real-time Generation Progress Tracking

Blog Results - Professional Technical Articles

🎨 Blog Generation Examples

🎯 Feature Introduction

1. Multi-Agent Collaborative Architecture

2. Deep Research Capability

3. Smart Illustration System

4. Multi-Format Export

🗺️ Development Roadmap

📦 Usage

Method 1: Docker Deployment (Recommended)

🚀 Quick Start (One-Click Redeploy)

Method 2: Local Development Deployment

🛠️ Technical Architecture

Backend Tech Stack

AI Models & Services

API Endpoints

Frontend Tech Stack

📁 Project Structure

🔧 Environment Variables

AI Model Configuration

OpenAI Compatible API

Image Generation (Nano Banana)

Search Configuration (Zhipu Web Search)

Multi-Search Configuration

Knowledge Fusion Configuration

🤝 Contributing

📄 License

FilesExpand file tree

README_EN.md

Latest commit

History

README_EN.md

File metadata and controls

✨ Project Origin

👨‍💻 Use Cases

🖼️ Demo & Results

Homepage - Clean and Elegant Input Interface

AI Workflow Status - Real-time Generation Progress Tracking

Blog Results - Professional Technical Articles

🎨 Blog Generation Examples

🎯 Feature Introduction

1. Multi-Agent Collaborative Architecture

2. Deep Research Capability

3. Smart Illustration System

4. Multi-Format Export

🗺️ Development Roadmap

📦 Usage

Method 1: Docker Deployment (Recommended)

🚀 Quick Start (One-Click Redeploy)

Method 2: Local Development Deployment

🛠️ Technical Architecture

Backend Tech Stack

AI Models & Services

API Endpoints

Frontend Tech Stack

📁 Project Structure

🔧 Environment Variables

AI Model Configuration

OpenAI Compatible API

Image Generation (Nano Banana)

Search Configuration (Zhipu Web Search)

Multi-Search Configuration

Knowledge Fusion Configuration

🤝 Contributing

📄 License