Not sure how much size savings could be realized by switching to Alpine, given how many other packages get pulled in to satisfy ocrmypdf dependencies and their dependencies, but the biggest obstacle up front is that the only tesseract-ocr available for Alpine seems to be v3.05, which is considerably poorer performing than the not-yet-release v4 code.
Not sure how much size savings could be realized by switching to Alpine, given how many other packages get pulled in to satisfy
ocrmypdfdependencies and their dependencies, but the biggest obstacle up front is that the onlytesseract-ocravailable for Alpine seems to be v3.05, which is considerably poorer performing than the not-yet-release v4 code.