extractembeddedpdftext
🔗 Quick Links
- Latest Release: v1.0.0 (January 21, 2026)
📊 Project Details
- Primary Language: Python
- Languages Used: Python, C, PowerShell, Go Template, Shell, HTML
- License: MIT License
- Created: January 21, 2026
- Last Updated: January 21, 2026
📝 About
A simple Python tool that extracts embedded text from PDF files. It does not perform OCR; it extracts only the text actually embedded in the PDF.
Features
- Fast text extraction using PyMuPDF (fitz)
- Cross-platform: Windows and Linux binaries included
- Simple command-line interface
- Writes output to a file or to stdout
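The features above can be illustrated with a minimal sketch of how such a tool might be put together with PyMuPDF. This is not the project's actual source: the function names (`extract_embedded_text`, `write_output`, `build_parser`, `main`) and the `-o/--output` flag are assumptions made for illustration, and the real command-line options may differ. Only `fitz.open` and `page.get_text("text")` are PyMuPDF calls.

```python
import argparse
import sys


def extract_embedded_text(pdf_path):
    """Return the embedded text of every page, joined by form feeds."""
    # PyMuPDF is imported lazily so the CLI helpers below work without it.
    import fitz

    with fitz.open(pdf_path) as doc:
        # get_text("text") reads the existing text layer only; no OCR is run.
        return "\f".join(page.get_text("text") for page in doc)


def write_output(text, out_path=None, stream=None):
    """Write text to out_path if given, otherwise to stream (stdout by default)."""
    if out_path:
        with open(out_path, "w", encoding="utf-8") as fh:
            fh.write(text)
    else:
        (stream or sys.stdout).write(text)


def build_parser():
    # Hypothetical flags for this sketch; the released tool may use others.
    parser = argparse.ArgumentParser(
        description="Extract embedded text from a PDF (no OCR)."
    )
    parser.add_argument("pdf", help="path to the input PDF")
    parser.add_argument("-o", "--output", help="output file (default: stdout)")
    return parser


def main(argv=None):
    args = build_parser().parse_args(argv)
    write_output(extract_embedded_text(args.pdf), args.output)
```

With PyMuPDF installed, calling `main(["input.pdf", "-o", "out.txt"])` would write the extracted text to `out.txt`, and omitting `-o` would print it to stdout. Pages are separated by a form feed here purely as a design choice for the sketch.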
Download Binaries
Grab the pre-compiled binary for your platform from the Releases page.
- **Wind