Commit Graph

1 Commits

Author SHA1 Message Date
Nicolas Sanchez 392522402d Initial release: Fast PDF text extraction CLI and API daemon
Features:
- CLI tool (pdf_extractor.py) for local files and URLs using PyMuPDF
- FastAPI daemon (pdf_daemon.py) with GET /extract endpoint
- Query parameter-based API for easier agent integration
- Comprehensive test suites included

Performance:
- ~40-60x faster than pdfplumber (~50ms average extraction time)
- Handles PDFs up to 36+ MB efficiently

Documentation:
- README.md with full API reference
- QUICKSTART.md for both CLI and daemon modes
2026-03-16 12:03:22 -03:00