Skill Test
by @ivangdavila
Test skills before using or publishing. Trial, compare, evaluate in isolation without affecting your environment.
Trial a skill:
sessions_spawn(
task="Test skill X: Load ONLY its SKILL.md, run [sample task], report quality",
model="anthropic/claude-haiku"
)
Compare two skills: 1. Run same task through each (separate sub-agents) 2. Present outputs side-by-side 3. Ask: "Which works better? Why?"
Test Modes
Trial Mode β Before installing
Evaluation Mode β Before publishing
evaluate.md)Sandbox Isolation
β οΈ Never load test skill into your main context.
Sub-agent approach (recommended):
sessions_spawn(
task="You have ONE skill loaded: [skill content]. Test by doing: [task]",
model="anthropic/claude-haiku"
)
What to check:
Edge Cases
Skill requires credentials: Ask user for test credentials or skip auth-dependent features.
Skill not found: Verify slug with npx clawhub info before testing.
Test fails mid-way: Sub-agent terminates cleanly. Review logs, adjust test task, retry.
Skill has many auxiliary files: Load SKILL.md first, reference others only if needed during test.
*Test thoroughly. Install only after explicit user approval.*
clawhub install skill-test