Skip to content

extractembeddedpdftext

📊 Project Details

  • Primary Language: Python
  • Languages Used: Python, C, PowerShell, Go Template, Shell, HTML
  • License: MIT License
  • Created: January 21, 2026
  • Last Updated: January 21, 2026

📝 About

extractembeddedpdftext

A simple Python tool to extract embedded text from PDF files. No OCR - extracts only the actual text embedded in the PDF.

Features

  • Fast text extraction using PyMuPDF (fitz)
  • Cross-platform: Windows and Linux binaries included
  • Simple command-line interface
  • Can output to file or stdout

Download Binaries

Grab the pre-compiled binary for your platform from the Releases page.

  • **Wind