mirror of https://github.com/tldr-pages/tldr.git synced 2025-07-23 20:35:25 +02:00

Máté Gyöngyösi cefbc950da

ocrmypdf: replace semi-duplicate with optimization (#17175 )

Co-authored-by: Managor <42655600+Managor@users.noreply.github.com>

2025-07-09 06:12:10 +03:00

995 B

Raw Blame History

ocrmypdf

Generate a searchable PDF or PDF/A from a scanned PDF or an image of text. More information: https://ocrmypdf.readthedocs.io/en/latest/cookbook.html.

Create a new searchable PDF/A file from a scanned PDF or image file:

ocrmypdf {{path/to/input}} {{path/to/output.pdf}}

Skip pages of a mixed-format input PDF file that already contain text:

ocrmypdf --skip-text {{path/to/input.pdf}} {{path/to/output.pdf}}

Clean, de-skew, and rotate pages of a poor scan:

ocrmypdf --clean --deskew --rotate-pages {{path/to/input.pdf}} {{path/to/output.pdf}}

Perform lossy optimization on a PDF without performing any OCR:

ocrmypdf --tesseract-timeout 0 --optimize 2 --skip-text {{path/to/input.pdf}} {{path/to/output.pdf}}

Set the metadata of a searchable PDF file:

ocrmypdf --title "{{title}}" --author "{{author}}" --subject "{{subject}}" --keywords "{{keyword; key phrase; ...}}" {{path/to/input.pdf}} {{path/to/output.pdf}}

Display help:

ocrmypdf --help

995 B Raw Blame History

ocrmypdf

995 B

Raw Blame History