PDF tools

From Simson Garfinkel
Revision as of 16:31, 7 April 2023 by Simson (talk | contribs)

PDF page manipulation

  • pdftk - combines, removes, and rotates pages in PDFs
  • pdfjam - resizes pages (by running through LaTeX)
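
As a rough sketch of how the two tools above are typically invoked, the Python helpers below only build the command lines. The file names are placeholders, and the helper names (pdftk_extract_cmd, pdftk_merge_cmd, pdfjam_resize_cmd, run_if_installed) are made up for this note. The rotation keyword "east" assumes a reasonably recent pdftk; older releases may spell it differently.

```python
import shutil
import subprocess


def pdftk_extract_cmd(src, pages, dst):
    """Build a pdftk command that keeps only the listed page ranges.

    `pages` is a list of pdftk range strings such as "1-3" or "5-end".
    A rotation suffix like "1east" rotates page 1 by 90 degrees clockwise
    (assumption: keyword form used by recent pdftk versions).
    """
    return ["pdftk", src, "cat", *pages, "output", dst]


def pdftk_merge_cmd(sources, dst):
    """Build a pdftk command that concatenates several PDFs in order."""
    return ["pdftk", *sources, "cat", "output", dst]


def pdfjam_resize_cmd(src, dst, paper="a4paper"):
    """Build a pdfjam command that re-imposes the pages on another paper size."""
    return ["pdfjam", "--paper", paper, "--outfile", dst, src]


def run_if_installed(cmd):
    """Run the command only if its executable is on PATH; report whether it ran."""
    if shutil.which(cmd[0]) is None:
        return False
    subprocess.run(cmd, check=True)
    return True


if __name__ == "__main__":
    # Placeholder file names; nothing is executed here, only printed.
    print(pdftk_extract_cmd("in.pdf", ["1-3", "5-end"], "subset.pdf"))
    print(pdftk_merge_cmd(["a.pdf", "b.pdf"], "merged.pdf"))
    print(pdfjam_resize_cmd("letter.pdf", "a4.pdf"))
```

Building the argument list separately from running it makes it easy to inspect the command before touching any files.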

PDF OCR

  • ocrmypdf - adds a searchable OCR text layer to scanned PDFs by running tesseract, and writes PDF/A files
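
A minimal sketch of calling ocrmypdf from Python, assuming the command-line tool is installed and on PATH. The helper name ocrmypdf_cmd and the file names are placeholders invented for this note. The flags used (-l, --output-type, --deskew) are standard ocrmypdf options, but check `ocrmypdf --help` for the installed version.

```python
import subprocess


def ocrmypdf_cmd(src, dst, language="eng", deskew=False):
    """Build an ocrmypdf command that OCRs `src` and writes PDF/A to `dst`.

    `language` is a tesseract language code; the matching tesseract
    language data must be installed for OCR to succeed.
    """
    cmd = ["ocrmypdf", "-l", language, "--output-type", "pdfa"]
    if deskew:
        # Straighten crooked scans before OCR.
        cmd.append("--deskew")
    cmd += [src, dst]
    return cmd


if __name__ == "__main__":
    cmd = ocrmypdf_cmd("scan.pdf", "scan-ocr.pdf", deskew=True)
    print(" ".join(cmd))
    # To actually run it (requires ocrmypdf and tesseract):
    # subprocess.run(cmd, check=True)
```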

HTML to PDF

Other sources:

Extract text from PDF

  • pymupdf - Python module (PyMuPDF) for reading PDFs and extracting their text
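
A short sketch of text extraction with pymupdf, assuming the package is installed (pip install pymupdf). Recent releases import as `pymupdf`, older ones as `fitz`, so the example tries both. The function name extract_text and the file name are placeholders for this note.

```python
def extract_text(path):
    """Return a list with the plain text of each page of the PDF at `path`."""
    try:
        import pymupdf  # module name in recent PyMuPDF releases
    except ImportError:
        import fitz as pymupdf  # module name in older PyMuPDF releases

    with pymupdf.open(path) as doc:
        # get_text() with no arguments returns the page's plain text.
        return [page.get_text() for page in doc]


if __name__ == "__main__":
    import sys

    # Usage: python extract.py some.pdf
    for number, text in enumerate(extract_text(sys.argv[1]), start=1):
        print(f"--- page {number} ---")
        print(text)
```

Scanned PDFs with no text layer will come back as empty strings; running them through OCR first gives them extractable text.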