You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
fix(extract): address PR review for spatial OCR annotations
- Fall back to plain text when TSV-to-spatial conversion yields empty
- Validate confidence_threshold is 0-100 in config loading
- Detect page breaks in concatenated per-page TSV via block number decrease
- Fix doc comment to mention paragraph breaks
- Add tests for page break detection, spatial fallback, and threshold validation
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
0 commit comments