docs: add key features section to README

arabold · arabold · commit 49b83fafb3c3 · 2025-04-08T11:03:13.000-07:00
diff --git a/README.md b/README.md
@@ -2,9 +2,18 @@
 
 A MCP server for fetching and searching 3rd party package documentation.
 
-This project provides a Model Context Protocol (MCP) server designed to scrape, process, index, and search documentation for various software libraries and packages. It fetches content from specified URLs, splits it into meaningful chunks using semantic splitting techniques, generates vector embeddings using OpenAI, and stores the data in an SQLite database. The server utilizes `sqlite-vec` for efficient vector similarity search and FTS5 for full-text search capabilities, combining them for hybrid search results. It supports versioning, allowing documentation for different library versions (including unversioned content) to be stored and queried distinctly.
+## ✨ Key Features
+
+- 🌐 **Scrape & Index:** Fetch documentation from web sources or local files.
+- 🧠 **Smart Processing:** Utilize semantic splitting and OpenAI embeddings for meaningful content chunks.
+- 💾 **Efficient Storage:** Store data in SQLite, leveraging `sqlite-vec` for vector search and FTS5 for full-text search.
+- 🔍 **Hybrid Search:** Combine vector and full-text search for relevant results across different library versions.
+- ⚙️ **Job Management:** Handle scraping tasks asynchronously with a job queue and management tools (MCP & CLI).
+- 🐳 **Easy Deployment:** Run the server using the provided Docker image.
 
-The scraping process is managed by an asynchronous job queue (`PipelineManager`), allowing multiple scrape jobs to run concurrently.
+## Overview
+
+This project provides a Model Context Protocol (MCP) server designed to scrape, process, index, and search documentation for various software libraries and packages. It fetches content from specified URLs, splits it into meaningful chunks using semantic splitting techniques, generates vector embeddings using OpenAI, and stores the data in an SQLite database. The server utilizes `sqlite-vec` for efficient vector similarity search and FTS5 for full-text search capabilities, combining them for hybrid search results. It supports versioning, allowing documentation for different library versions (including unversioned content) to be stored and queried distinctly.
 
 The server exposes MCP tools for: