π¦ ClawHub
Document Ingestion
by @samledger67-dotcom
Process raw accounting source documents (PDFs, CSVs, bank statements, invoices, receipts) into standardized transaction records for QBO import. Use when batc...
π‘ Examples
# Process a directory of mixed documents
python3 scripts/pipelines/document-ingestion.py \
--slug sb-paulson \
--input-dir ~/Downloads/month-end-docsSingle file
python3 scripts/pipelines/document-ingestion.py \
--slug sb-paulson \
--file ~/Downloads/invoice_march.pdfMultiple files + custom output dir
python3 scripts/pipelines/document-ingestion.py \
--slug glowlabs \
--file ~/Downloads/stmt.csv \
--file ~/Downloads/payroll.csv \
--out ~/Desktop/ingestedOffline mode (no QBO auth needed)
python3 scripts/pipelines/document-ingestion.py \
--slug sb-paulson \
--input-dir ./docs \
--no-qbo-coaQBO sandbox
python3 scripts/pipelines/document-ingestion.py \
--slug sb-paulson \
--input-dir ./docs \
--sandbox
All CLI Flags
| Flag | Default | Description | |---|---|---| |--slug | required | Company slug (QBO + client vendor map) |
| --input-dir | β | Directory of docs to process |
| --file | β | Single file (repeatable) |
| --out | ~/Desktop | Output directory |
| --no-qbo-coa | false | Use built-in COA only (offline) |
| --sandbox | false | QBO sandbox mode |TERMINAL
clawhub install document-ingestion