1.10.0 (2025-04-21)
- ci: set PLAYWRIGHT_LAUNCH_ARGS for tests (55ea901)
- correct Playwright dependencies in Dockerfile (6f19fc0)
- deps: remove drizzle dependencies (ad6a09a), closes #57
- scraper: replace domcontentloaded with load event in Playwright (9345152), closes #62
- silence JSDOM virtual console output (61e41be), closes #53
- add initial JS sandbox utility and executor middleware (#18) (19dea10)
- cli: add --scrape-mode option and update README (e8e4beb)
- cli: add --scrape-mode option to fetch-url command (cc6465a)
- refactor content processing to middleware pipeline (00f9a2f), closes #17
- scraper: add HtmlPlaywrightMiddleware for dynamic content rendering (ee3118f), closes #19
- scraper: enable external script fetching in sandbox (88b7e7a)
- scraper: replace JSDOM with Cheerio for HTML parsing (5dd624a)
1.9.0 (2025-04-14)
1.8.0 (2025-04-14)
- disabled removal of form elements (3b6afde)
- preserve line breaks in pre tags (b94b1e3)
- remove overly aggressive html filtering (6c76509), closes #36
- resolve store path correctly when not in project root (49a3c1f)
- search: remove exactMatch flag from MCP API, improve internal handling (e5cb8d1), closes #24
1.7.0 (2025-04-11)
1.6.0 (2025-04-11)
- #26: add environment variables to Dockerfile (51b7059), closes #26
- #26: handle different embedding model dimensions via padding (f712c9b), closes #26
- #26: support OpenAI API base URL and model name config (66b70bb), closes #26
1.5.0 (2025-04-08)
- ci: increase allowed footer line length (afbc62c)
1.4.5 (2025-04-08)
- empty commit to trigger patch release (ca62a92)
1.4.4 (2025-04-08)
- empty commit to trigger patch release (be47616)
- empty commit to trigger patch release (ff7f518)
- workflow: update semantic-release configuration and output variables (7725875)
- workflow: update semantic-release configuration and output variables (7628854)
1.4.4 (2025-04-08)
- empty commit to trigger patch release (ff7f518)
- workflow: update semantic-release configuration and output variables (7725875)
- workflow: update semantic-release configuration and output variables (7628854)
1.4.4 (2025-04-08)
- workflow: update semantic-release configuration and output variables (7725875)
- workflow: update semantic-release configuration and output variables (7628854)
1.4.3 (2025-04-08)
- empty commit to trigger patch release (50bb240)
1.4.2 (2025-04-08)
- empty commit to trigger patch release (c8f9a0f)
1.4.1 (2025-04-08)
- docs: clarify docker volume creation in README (03a58d6)
1.4.0 (2025-04-08)
- docker: add configurable storage path & improve support (9f35c54)
- store: implement dynamic database path selection (527d9f9)
1.3.0 (2025-04-03)
1.2.1 (2025-04-01)
1.2.0 (2025-03-30)
- deploy: add Smithery.ai deployment configuration (3763168)
1.1.0 (2025-03-30)
- implement log level control via CLI flags (b2f8b73)
- Cleaned up log messages in MCP server (db2c82e)
- Cleaned up README (0ac054e)
- Fixed concatenation of chunks in the DocumentRetrieverService (ae4ff6b)
- Fixed several linter and formatter issues (a2e4594)
- package: remove relative prefix from bin paths in package.json (22f74e3)
- removed unnecessary file extends in imports (117903f)
- restore progress callbacks in scraper (0cebe97)
- various linter issues and type cleanup (14b02bd)
- improve type organization and method signatures (da16170)
- Add comprehensive logging system (ba8a6f1)
- add configurable concurrency for web scraping (f6c3baa)
- Add document ordering and URL tracking (11ff1c8)
- Add pipeline management tools to MCP server (e01d31e)
- Add remove documents functionality (642a320)
- add store clearing before scraping (9557014)
- Add vitest tests for MCP tools (0c40c9e)
- Added .env.example to repository (93c47f1)
- Added Cline custom instructions file (aabb806)
- ci: configure automated releases with semantic-release (8af5595)
- enhance web scraping and error handling (d3aa894)
- Implement optional version handling and improve CLI (9b41856)
- improve document processing and architecture docs (b996d19)
- Improve scraping, indexing, and URL handling (3fc0931)
- improve search capabilities with PostgreSQL integration (4e04aa7)
- Make search tool version and limit optional and update dependencies (bd83392)
- Refactor scraper and introduce document processing pipeline (6229f97)
- scraper: implement configurable subpage scraping behavior (1dc2a11)
- scraper: Implement local file scraping and refactor strategy pattern (d058b48)
- Simplify pipeline job data returned by MCP tools (35c3279)
- switch to jsdom for DOM processing and improve database queries (ba4768f)
- tooling: configure CI/CD, semantic-release, and commit hooks (3d9b7a3)
- Updated dependencies (2b345c7)
- DocumentStore and VectorStoreService method signatures have changed
-
Reorganize types across domains:
- Move domain-specific types closer to their implementations
- Keep only shared types in src/types/index.ts
- Add domain prefixes to type names for clarity
-
Standardize method signatures:
- Replace filter objects with explicit library/version parameters
- Make parameter order consistent across all methods
- Update all tests to match new signatures
-
Improve type naming:
- Rename DocContent -> Document
- Rename PageResult -> ScrapedPage
- Rename ScrapeOptions -> ScraperOptions
- Rename ScrapingProgress -> ScraperProgress
- Rename SearchResult -> StoreSearchResult
- Rename VersionInfo -> LibraryVersion
- Rename SplitterOptions -> MarkdownSplitterOptions
The changes improve code organization, make dependencies clearer, and provide a more consistent and explicit API across the codebase.