π¦ ClawHub
universal-pdf-vision-parser
by @mingensiie
Extract multilingual document content and language learning notes (French, German, Japanese, Spanish, etc.) from PDFs using multimodal vision (Qwen-VL-Max)....
π‘ Examples
User: "Convert this German-Chinese note to markdown: notes.pdf"
Agent Action:
python scripts/vision_parse.py --pdf notes.pdf --out notes.md
βοΈ Configuration
1. DashScope API Key: A valid key from Alibaba Cloud Bailian with qwen-vl-max access.
2. Environment:
pip install pymupdf dashscope
TERMINAL
clawhub install universal-pdf-vision-parse