π¦ ClawHub
by @wu-uk
PDF manipulation toolkit. Extract text/tables, create PDFs, merge/split, fill forms, for programmatic document processing and analysis.
π‘ Examples
from pypdf import PdfReader, PdfWriterRead a PDF
reader = PdfReader("document.pdf")
print(f"Pages: {len(reader.pages)}")Extract text
text = ""
for page in reader.pages:
text += page.extract_text()
TERMINAL
clawhub install find-topk-similiar-chemicals-pdf