🦀 ClawHub
mayubench-en
by @wanyview1
AI-Native Behavior Benchmark — 48 scenarios × 3 difficulty levels = 144 questions, 8-dimension scoring, measuring whether AI should do things, not whether it...
TERMINAL
clawhub install mayubench-en