Skip to content

feat(scanner): add hallucination detection scanner#21

Open
SleuthCo wants to merge 1 commit intomasterfrom
pr/hallucination-scanner
Open

feat(scanner): add hallucination detection scanner#21
SleuthCo wants to merge 1 commit intomasterfrom
pr/hallucination-scanner

Conversation

@SleuthCo
Copy link
Copy Markdown
Owner

Summary

  • Add response scanner that detects hallucinated URLs, citations, and fabricated references in LLM responses
  • Pattern matching for suspicious URL structures, fake DOI/ISBN patterns, and confidence scoring
  • Designed to run as part of the ClawShield response pipeline

Test plan

  • Unit tests pass (go test ./proxy/internal/scanner/...)
  • Integration with evaluator pipeline
  • Verify false positive rate on known-good responses

🤖 Generated with Claude Code

Add response scanner that detects hallucinated URLs, citations, and
fabricated references in LLM responses. Includes pattern matching for
suspicious URL structures, fake DOI/ISBN patterns, and confidence
scoring. Designed to run as part of the ClawShield response pipeline.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant