feat(extract): send spatial layout annotations from OCR to LLM
Send compact line-level bounding boxes from tesseract OCR to the LLM
during extraction, improving accuracy for invoices, forms, and tabular
documents. The format is [left,top,width] per line (~2x token overhead
vs plain text), with confidence scores shown only for suspect lines
(below a configurable threshold, default 70).
- Add SpatialTextFromTSV() to convert raw tesseract TSV into the compact spatial format
- Drop height from bounding boxes (nearly constant, no signal)
- Threshold-based confidence: only annotate lines with minConf < threshold
- New config: ocr_tsv (default true), ocr_conf_threshold (default 70)
- Toggle in extraction overlay: press 't' to switch layout on/off on rerun
- Thread config through pipeline, prompt builder, and app plumbing
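
The conversion described above can be sketched roughly as follows. This is a minimal illustration, not the code from this commit: the lowercase `spatialTextFromTSV` name, its signature, the `spatialLine` type, and the ` (conf N)` suffix are all assumptions made for the example. It relies only on tesseract's standard 12-column TSV layout (level, page_num, block_num, par_num, line_num, word_num, left, top, width, height, conf, text), groups word rows by line, and emits `[left,top,width]` per line, annotating a line only when its minimum word confidence falls below the threshold.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// spatialLine accumulates the words of one OCR line (hypothetical type).
type spatialLine struct {
	left, top, right int
	minConf          float64
	words            []string
}

// spatialTextFromTSV is a hypothetical stand-in for SpatialTextFromTSV().
// It converts tesseract TSV into one "[left,top,width] text" row per line
// and appends a confidence note only when minConf < confThreshold.
// The " (conf N)" suffix is an assumed format, not taken from the commit.
func spatialTextFromTSV(tsv string, confThreshold float64) string {
	var order []string // preserves first-seen line order
	lines := map[string]*spatialLine{}

	for _, row := range strings.Split(tsv, "\n") {
		f := strings.Split(strings.TrimRight(row, "\r"), "\t")
		// Level 5 marks word rows; this also skips the header and blank rows.
		if len(f) < 12 || f[0] != "5" {
			continue
		}
		text := strings.TrimSpace(f[11])
		left, errL := strconv.Atoi(f[6])
		top, errT := strconv.Atoi(f[7])
		width, errW := strconv.Atoi(f[8])
		conf, errC := strconv.ParseFloat(f[10], 64)
		if errL != nil || errT != nil || errW != nil || errC != nil || text == "" {
			continue
		}

		// A line is identified by page, block, paragraph, and line numbers.
		key := strings.Join(f[1:5], "/")
		ln, ok := lines[key]
		if !ok {
			ln = &spatialLine{left: left, top: top, right: left + width, minConf: conf}
			lines[key] = ln
			order = append(order, key)
		}
		if left < ln.left {
			ln.left = left
		}
		if top < ln.top {
			ln.top = top
		}
		if left+width > ln.right {
			ln.right = left + width
		}
		if conf < ln.minConf {
			ln.minConf = conf
		}
		ln.words = append(ln.words, text)
	}

	var b strings.Builder
	for _, key := range order {
		ln := lines[key]
		// Height is deliberately omitted from the box.
		fmt.Fprintf(&b, "[%d,%d,%d] %s", ln.left, ln.top, ln.right-ln.left, strings.Join(ln.words, " "))
		if ln.minConf < confThreshold {
			fmt.Fprintf(&b, " (conf %.0f)", ln.minConf)
		}
		b.WriteByte('\n')
	}
	return b.String()
}
```

In this sketch, calling `spatialTextFromTSV(tsv, 70)` mirrors the default `ocr_conf_threshold`: lines whose weakest word scores 70 or higher carry only the three-number box, so the confidence overhead is paid only on suspect lines.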
Closes #699
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>