Docling: Unsafe Zip Extraction in EasyOCR Model Download

Impact

In versions < 2.91.0, The EasyOCR model download functionality extracted ZIP archives without validating member paths, enabling Zip Slip attacks. If an attacker could compromise the model download source (via supply chain attack, DNS spoofing, or MITM), they could write arbitrary files to any location writable by the process, potentially achieving:

Remote code execution by overwriting Python files or system binaries
Persistent backdoors by modifying startup scripts or SSH keys
Data corruption or system compromise

Patches

Fixed in version 2.91.0. The extraction process now validates each archive member path using os.path.realpath() to ensure it remains within the target directory, raising a SecurityError for any path traversal attempts.

Workarounds

Ensure model downloads occur over secure, authenticated channels. Use integrity verification (checksums) for downloaded models. Run the application with minimal file system permissions.

References

Fix release: v2.91.0

References

dolfim-ibm published to docling-project/docling Jun 2, 2026

Published to the GitHub Advisory Database Jun 3, 2026

Reviewed Jun 3, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Package

Affected versions

Patched versions

Description

Impact

Patches

Workarounds

References

References

Severity

CVSS overall score

CVSS v3 base metrics

CVSS v3 base metrics

EPSS score

Exploit Prediction Scoring System (EPSS)

Weaknesses

Improper Limitation of a Pathname to a Restricted Directory ('Path Traversal')

CVE ID

GHSA ID

Source code

Uh oh!