-
Notifications
You must be signed in to change notification settings - Fork 16
[0.2.0v][Feature] PR indexing with dedicated collection and pr-duplicate CLI #43
Copy link
Copy link
Closed
Copy link
Labels
coreRelated to core engineRelated to core engineenhancementNew feature or requestNew feature or requesttriageLabel for incoming issuesLabel for incoming issues
Milestone
Description
Overview
Extend the simili index command to optionally index pull requests into a dedicated Qdrant collection, and add a simili pr-duplicate CLI command to detect duplicate PRs against both issues and PRs.
Scope
- Add
--include-prsflag tosimili indexto index PR metadata (title, description, changed file paths, linked issues) into a separate collection - Add
QDRANT_PR_COLLECTION/qdrant.pr_collectionconfig option for the dedicated PR collection - Add
simili pr-duplicateCLI command to query both collections and run LLM duplicate detection processPullRequestworker: fetch PR details + file paths, build metadata text, embed and upsert- Dry-run support for PR indexing
Acceptance Criteria
-
simili index --include-prsindexes PRs into the configured PR collection - PR collection is created automatically if it does not exist
-
simili pr-duplicate --repo owner/repo --number 123returns duplicate candidates from both collections - Falls back gracefully if no PR collection is configured
- All existing issue indexing behaviour is unchanged
- Tests cover the new indexing and query paths
Notes
Extracted from PR #40. Implementation reference: nick1udwig/simili-bot@index-and-query-prs.
Depends on: #42 (OpenAI provider support) or can be implemented independently with Gemini only.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
coreRelated to core engineRelated to core engineenhancementNew feature or requestNew feature or requesttriageLabel for incoming issuesLabel for incoming issues
Type
Projects
Status
Done