You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat(extract): send spatial layout annotations from OCR to LLM (#724)
## Summary
- Send compact spatial layout annotations from tesseract OCR to the LLM
during document extraction, improving accuracy for invoices, forms, and
tabular documents
- Format: `[left,top,width]` per line (~2x token overhead vs plain
text), with `[left,top,width;conf]` only for suspect lines below a
configurable confidence threshold
- Drop height from bounding boxes (nearly constant across lines, no
useful signal)
- New config: `[extraction.ocr]` subtable with `tsv` (default `true`)
and `confidence_threshold` (default `70`)
- Toggle in extraction overlay: press `t` to switch spatial layout
on/off and rerun LLM extraction
- Env vars: `MICASA_EXTRACTION_OCR_TSV`,
`MICASA_EXTRACTION_OCR_CONFIDENCE_THRESHOLD`
closes#699
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
0 commit comments