π¦ ClawHub
Nm Leyline Evaluation Framework
by @athola
Patterns for building evaluation and scoring systems, quality gates, rubrics, and decision frameworks. Use for any scored assessment
β‘ When to Use
π‘ Examples
Define Your Evaluation
1. Identify criteria: What aspects matter for your domain? 2. Assign weights: Which criteria are most important? (sum to 1.0) 3. Create scoring guides: What does each score range mean? 4. Set thresholds: What total scores trigger which decisions?
Example: Code Review Evaluation
criteria:
correctness: {weight: 0.40, description: Does code work as intended?}
maintainability: {weight: 0.25, description: Is it readable?}
performance: {weight: 0.20, description: Meets performance needs?}
testing: {weight: 0.15, description: Tests detailed?}thresholds:
85-100: Approve immediately
70-84: Approve with minor feedback
50-69: Request changes
0-49: Reject, major issues
Verification: Run pytest -v to verify tests pass.Evaluation Workflow
Verification: Run the command with --help flag to verify availability.
1. Review artifact against each criterion
2. Assign 0-100 score for each criterion
3. Calculate: total = Ξ£(score Γ weight)
4. Compare total to thresholds
5. Take action based on threshold range
Verification: Run the command with --help flag to verify availability.π Tips & Best Practices
Common Issues
Command not found Ensure all dependencies are installed and in PATH
Permission errors Check file permissions and run with appropriate privileges
Unexpected behavior
Enable verbose logging with --verbose flag
TERMINAL
clawhub install nm-leyline-evaluation-framework