🎁 Get the FREE AI Skills Starter Guide β€” Subscribe β†’
BytesAgainBytesAgain
πŸ¦€ ClawHub

sentryskills

by @zengbiaojie

SentrySkills - Automatic security guard that runs on EVERY task. Protects against prompt injection, data leaks, unsafe commands, and code vulnerabilities wit...

Versionv1.0.1
Downloads266
Stars⭐ 1
TERMINAL
clawhub install sentryskills

πŸ“– About This Skill


name: using-sentryskills description: Automatically runs SentrySkills security checks on EVERY task. Before any response, execute the runtime hook to ensure self-guard decision evidence.

Using SentrySkills Self-Guard

⚠️ IMPORTANT: This Skill Runs Automatically

This skill is designed to run on EVERY task - it should be configured in your AGENTS.md to execute automatically.

Goal

Establish unified trigger discipline: as long as external response exists, must run self-guard runtime hook before final output.

When to Trigger

Always trigger on:
  • Every task that produces output
  • Every command execution
  • Every file operation
  • Every network call
  • Never skip - security checks should be universal.

    Execution Flow

    1. Assemble hook input JSON, including: project_path, user_prompt, planned_actions, candidate_response. 2. Execute: shared/scripts/self_guard_runtime_hook_template.py. 3. Read final_action from summary and handle by rules: - allow: can output normally. - downgrade: must downgrade expression and declare uncertainty. - block: must not output original candidate, change to refusal or redacted output. 4. Final response must include minimum evidence: - self_guard_final_action - self_guard_trace_id - self_guard_events_log

    Exceptions and Degradation

    1. If hook execution fails or is unavailable: must explicitly declare "security self-check not completed" and adopt conservative output. 2. If log directory is not writable: use writable temporary directory to execute hook, and give real log path in evidence fields.

    Common Anti-patterns

    1. Reply first then run hook afterward. 2. Judge risk by text only without actually running script. 3. downgrade still gives high-confidence conclusions. 4. block still outputs original sensitive candidate content.