🎁 Get the FREE AI Skills Starter Guide β€” Subscribe β†’
BytesAgainBytesAgain
πŸ¦€ ClawHub

claw-text-and-pics

by @photon78

Extract text and embedded images from scanned documents, PDFs, and photos via Mistral OCR API. Use when reading receipts, invoices, contracts, handwritten no...

⚑ When to Use
TriggerAction
- Pull embedded images from PDFs or scans
- Convert handwritten notes or photos to searchable text
- Send extracted images directly to Telegram
πŸ’‘ Examples

# Extract text only
python3 ocr.py --input scan.jpg

Extract text from PDF (3 pages)

python3 ocr.py --input document.pdf --pages 3

Extract embedded images

python3 ocr.py --input scan.jpg --extract-images --output-dir ./images/

Extract images and send to Telegram

python3 ocr.py --input scan.jpg --extract-images --send --target 123456789

Works with URLs too

python3 ocr.py --input https://example.com/document.pdf

βš™οΈ Configuration

Set in ~/.openclaw/.env or as environment variables:

| Variable | Required | Description | |----------|----------|-------------| | MISTRAL_API_KEY | Yes | Your Mistral API key | | TELEGRAM_BOT_TOKEN | Only for --send | Your Telegram bot token | | TELEGRAM_CHAT_ID | Optional | Default chat ID (overridable with --target) |

View on ClawHub
TERMINAL
clawhub install claw-text-and-pics

πŸ§ͺ Use this skill with your agent

Most visitors already have an agent. Pick your environment, install or copy the workflow, then run the smoke-test prompt above.

πŸ” Can't find the right skill?

Search 60,000+ AI agent skills β€” free, no login needed.

Search Skills β†’