🎁 Get the FREE AI Skills Starter Guide β€” Subscribe β†’
BytesAgainBytesAgain
πŸ¦€ ClawHub

PinchBench

by @olearycrew

Run PinchBench benchmarks to evaluate OpenClaw agent performance across real-world tasks. Use when testing model capabilities, comparing models, submitting b...

Versionv1.0.0
Installs3
Comments2
πŸ’‘ Examples

cd 

Run benchmark with a specific model

uv run benchmark.py --model anthropic/claude-sonnet-4

Run only automated tasks (faster)

uv run benchmark.py --model anthropic/claude-sonnet-4 --suite automated-only

Run specific tasks

uv run benchmark.py --model anthropic/claude-sonnet-4 --suite task_01_calendar,task_02_stock

Skip uploading results

uv run benchmark.py --model anthropic/claude-sonnet-4 --no-upload

βš™οΈ Configuration

  • Python 3.10+
  • uv package manager
  • OpenClaw instance (this agent)
  • View on ClawHub
    TERMINAL
    clawhub install pinchbench

    πŸ§ͺ Use this skill with your agent

    Most visitors already have an agent. Pick your environment, install or copy the workflow, then run the smoke-test prompt above.

    πŸ” Can't find the right skill?

    Search 60,000+ AI agent skills β€” free, no login needed.

    Search Skills β†’