ocrmypdf 6.2.2-1 (amd64 binary) in ubuntu cosmic
OCRmyPDF generates a searchable PDF/A file from a regular PDF
containing only images, allowing it to be searched.
.
It uses the Tesseract OCR engine and so supports all the languages
that Tesseract does.
.
Some other main features:
.
* Places OCR text accurately below the image to ease copy / paste
* Keeps the exact resolution of the original embedded images
* When possible, inserts OCR information as a lossless operation
without rendering vector information
* Keeps file size about the same
* If requested deskews and/or cleans the image before performing OCR
* Validates input and output files
* Provides debug mode to enable easy verification of the OCR results
* Processes pages in parallel when more than one CPU core is
available
* Battle-tested on thousands of PDFs, a test suite and continuous
integration.
Details
- Package version:
- 6.2.2-1
- Status:
- Superseded
- Component:
- universe
- Priority:
- Optional
Downloadable files
- ocrmypdf_6.2.2-1_all.deb (74.2 KiB)
Package relationships
- Depends on:
- ghostscript (>= 9.18~dfsg~)
- icc-profiles-free
- liblept5
- python3 (>= 3.3.2-2~)
- python3-cffi-backend-api-max (>= 9729)
- python3-cffi-backend-api-min (<= 9729)
- python3-defusedxml
- python3-img2pdf (>= 0.2.1)
- python3-pil
- python3-pkg-resources
- python3-pypdf2 (>= 1.26)
- python3-reportlab
- python3-ruffus
- qpdf (>= 7.0.0)
- tesseract-ocr
- zlib1g
- Suggests:
- Recommends: