Improvement Discriminator
by @lanyasheng
当需要对改进候选多人盲审打分、用 LLM 做语义评估、判断候选是否应被接受、或打分结果全是 hold 想知道为什么时使用。支持 --panel 多审阅者盲审和 --llm-judge 语义评估。不用于结构评估(用 improvement-learner)或门禁决策(用 improvement-gate)。
⚡ When to Use
| Trigger | Action |
|---|
| - 运行多审阅者盲审(CONSENSUS/VERIFIED/DISPUTED 认知标签),降低单人偏见 |
| - 用 LLM-as-Judge 评估 4 个语义维度(clarity, specificity, consistency, safety) |
| - 组合 --panel + --llm-judge 获得最全面的评估(两者不互斥) |
| - 调试为什么所有候选都被标为 hold——通常是 risk_penalty 过高或缺少 source_refs |
| - 在 orchestrator pipeline 第 2 阶段自动调用 |
| - 需要可解释的评分明细时(每个维度独立打分,附 judge_notes) |
| - 需要对比多轮改进的候选质量趋势时 |
clawhub install improvement-discriminator
🧪 Use this skill with your agent
Most visitors already have an agent. Pick your environment, install or copy the workflow, then run the smoke-test prompt above.
🔍 Can't find the right skill?
Search 60,000+ AI agent skills — free, no login needed.
Search Skills →