1.10.0 (2025-04-21)

Bug Fixes

  • ci: set PLAYWRIGHT_LAUNCH_ARGS for tests (55ea901)
  • correct Playwright dependencies in Dockerfile (6f19fc0)
  • deps: remove drizzle dependencies (ad6a09a), closes #57
  • scraper: replace domcontentloaded with load event in Playwright (9345152), closes #62
  • silence JSDOM virtual console output (61e41be), closes #53

Features

  • add initial JS sandbox utility and executor middleware (#18) (19dea10)
  • cli: add --scrape-mode option and update README (e8e4beb)
  • cli: add --scrape-mode option to fetch-url command (cc6465a)
  • refactor content processing to middleware pipeline (00f9a2f), closes #17
  • scraper: add HtmlPlaywrightMiddleware for dynamic content rendering (ee3118f), closes #19
  • scraper: enable external script fetching in sandbox (88b7e7a)
  • scraper: replace JSDOM with Cheerio for HTML parsing (5dd624a)

1.9.0 (2025-04-14)

Bug Fixes

  • scraper: use JSDOM title property for robust HTML title extraction (dee350f), closes #41

Features

  • increase default maxPages and add constants (7b10eba), closes #43

1.8.0 (2025-04-14)

Bug Fixes

  • disabled removal of form elements (3b6afde)
  • preserve line breaks in pre tags (b94b1e3)
  • remove overly aggressive html filtering (6c76509), closes #36
  • resolve store path correctly when not in project root (49a3c1f)
  • search: remove exactMatch flag from MCP API, improve internal handling (e5cb8d1), closes #24

Features

  • add fetch-url tool to CLI and MCP server (604175f), closes #34

1.7.0 (2025-04-11)

Features

  • embeddings: add support for multiple embedding providers (e197bec), closes #28

1.6.0 (2025-04-11)

Features

  • #26: add environment variables to Dockerfile (51b7059), closes #26
  • #26: handle different embedding model dimensions via padding (f712c9b), closes #26
  • #26: support OpenAI API base URL and model name config (66b70bb), closes #26
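
The padding entry above can be illustrated with a minimal sketch. It assumes zero-padding up to a fixed target dimension; the function name `padEmbedding` and the `targetDim` parameter are hypothetical and are not taken from the project's code.

```typescript
// Hypothetical helper (not the project's actual implementation):
// pad an embedding with trailing zeros up to a fixed target dimension,
// so vectors from models with smaller dimensions fit one storage schema.
function padEmbedding(vector: number[], targetDim: number): number[] {
  if (vector.length > targetDim) {
    // Truncating would silently lose information, so fail loudly instead.
    throw new Error(
      `Embedding has ${vector.length} dimensions, exceeding the target of ${targetDim}`,
    );
  }
  const padding = new Array<number>(targetDim - vector.length).fill(0);
  return [...vector, ...padding];
}
```

Trailing zeros leave dot products and norms between padded vectors unchanged, which is why zero-padding is a common choice for this kind of fix.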

1.5.0 (2025-04-08)

Bug Fixes

  • ci: increase allowed footer line length (afbc62c)

Features

  • scraper: enhance crawler controls with scope and redirect options (45d0e93), closes #15

1.4.5 (2025-04-08)

Bug Fixes

  • empty commit to trigger patch release (ca62a92)

1.4.4 (2025-04-08)

Bug Fixes

  • empty commit to trigger patch release (be47616)
  • empty commit to trigger patch release (ff7f518)
  • workflow: update semantic-release configuration and output variables (7725875)
  • workflow: update semantic-release configuration and output variables (7628854)

1.4.3 (2025-04-08)

Bug Fixes

  • empty commit to trigger patch release (50bb240)

1.4.2 (2025-04-08)

Bug Fixes

  • empty commit to trigger patch release (c8f9a0f)

1.4.1 (2025-04-08)

Bug Fixes

  • docs: clarify docker volume creation in README (03a58d6)

1.4.0 (2025-04-08)

Features

  • docker: add configurable storage path & improve support (9f35c54)
  • store: implement dynamic database path selection (527d9f9)

1.3.0 (2025-04-03)

Features

  • search: provide suggestions for unknown libraries (d6628bb), closes #12

1.2.1 (2025-04-01)

Bug Fixes

  • store: escape FTS query to handle special characters (bcf01a8), closes #10
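
One common way to escape a full-text query is sketched below. It assumes SQLite FTS5 string syntax, where a phrase is wrapped in double quotes and embedded double quotes are doubled; the name `escapeFtsQuery` is hypothetical, and the actual fix in bcf01a8 may differ.

```typescript
// Hypothetical helper, assuming SQLite FTS5 string syntax:
// treat the whole user input as one quoted phrase so that characters
// such as '-', ':' or '*' are not parsed as query operators.
function escapeFtsQuery(query: string): string {
  // Inside an FTS5 string, a literal double quote is written as two.
  const escaped = query.replace(/"/g, '""');
  return `"${escaped}"`;
}
```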

1.2.0 (2025-03-30)

Features

  • deploy: add Smithery.ai deployment configuration (3763168)

1.1.0 (2025-03-30)

Features

  • implement log level control via CLI flags (b2f8b73)

1.0.0 (2025-03-30)

Bug Fixes

  • Cleaned up log messages in MCP server (db2c82e)
  • Cleaned up README (0ac054e)
  • Fixed concatenation of chunks in the DocumentRetrieverService (ae4ff6b)
  • Fixed several linter and formatter issues (a2e4594)
  • package: remove relative prefix from bin paths in package.json (22f74e3)
  • removed unnecessary file extensions in imports (117903f)
  • restore progress callbacks in scraper (0cebe97)
  • various linter issues and type cleanup (14b02bd)

Code Refactoring

  • improve type organization and method signatures (da16170)

Features

  • Add comprehensive logging system (ba8a6f1)
  • add configurable concurrency for web scraping (f6c3baa)
  • Add document ordering and URL tracking (11ff1c8)
  • Add pipeline management tools to MCP server (e01d31e)
  • Add remove documents functionality (642a320)
  • add store clearing before scraping (9557014)
  • Add vitest tests for MCP tools (0c40c9e)
  • Added .env.example to repository (93c47f1)
  • Added Cline custom instructions file (aabb806)
  • ci: configure automated releases with semantic-release (8af5595)
  • enhance web scraping and error handling (d3aa894)
  • Implement optional version handling and improve CLI (9b41856)
  • improve document processing and architecture docs (b996d19)
  • Improve scraping, indexing, and URL handling (3fc0931)
  • improve search capabilities with PostgreSQL integration (4e04aa7)
  • Make search tool version and limit optional and update dependencies (bd83392)
  • Refactor scraper and introduce document processing pipeline (6229f97)
  • scraper: implement configurable subpage scraping behavior (1dc2a11)
  • scraper: Implement local file scraping and refactor strategy pattern (d058b48)
  • Simplify pipeline job data returned by MCP tools (35c3279)
  • switch to jsdom for DOM processing and improve database queries (ba4768f)
  • tooling: configure CI/CD, semantic-release, and commit hooks (3d9b7a3)
  • Updated dependencies (2b345c7)

BREAKING CHANGES

  • DocumentStore and VectorStoreService method signatures have changed
  • Reorganize types across domains:

    • Move domain-specific types closer to their implementations
    • Keep only shared types in src/types/index.ts
    • Add domain prefixes to type names for clarity
  • Standardize method signatures:

    • Replace filter objects with explicit library/version parameters
    • Make parameter order consistent across all methods
    • Update all tests to match new signatures
  • Improve type naming:

    • Rename DocContent -> Document
    • Rename PageResult -> ScrapedPage
    • Rename ScrapeOptions -> ScraperOptions
    • Rename ScrapingProgress -> ScraperProgress
    • Rename SearchResult -> StoreSearchResult
    • Rename VersionInfo -> LibraryVersion
    • Rename SplitterOptions -> MarkdownSplitterOptions

The changes improve code organization, make dependencies clearer, and provide a more consistent and explicit API across the codebase.
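
As a rough illustration of the signature change described above, the sketch below contrasts a filter-object call with explicit library/version parameters. All names here (`LegacyFilter`, `describeSearchLegacy`, `describeSearch`) and the parameter order are hypothetical; they are not the real `DocumentStore` or `VectorStoreService` signatures.

```typescript
// Hypothetical "before" style: library and version travel inside a filter object.
interface LegacyFilter {
  library: string;
  version: string;
}

function describeSearchLegacy(query: string, filter: LegacyFilter): string {
  return `${filter.library}@${filter.version}: ${query}`;
}

// Hypothetical "after" style: explicit parameters in one consistent order.
function describeSearch(library: string, version: string, query: string): string {
  return `${library}@${version}: ${query}`;
}

// Migrating a call site means unpacking the filter into positional arguments.
const before = describeSearchLegacy("hooks", { library: "react", version: "18.0.0" });
const after = describeSearch("react", "18.0.0", "hooks");
```

Callers that built filter objects would also need their type imports updated to the renamed types listed above, for example `ScrapeOptions` to `ScraperOptions`.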