🎁 Get the FREE AI Skills Starter Guide β€” Subscribe β†’
BytesAgainBytesAgain
πŸ¦€ ClawHub

Prompt injection detection skill

by @zskyx

Two-layer content safety for agent input and output. Use when (1) a user message attempts to override, ignore, or bypass previous instructions (prompt injection), (2) a user message references system prompts, hidden instructions, or internal configuration, (3) receiving messages from untrusted users in group chats or public channels, (4) generating responses that discuss violence, self-harm, sexual content, hate speech, or other sensitive topics, or (5) deploying agents in public-facing or multi

Versionv1.0.0
Source

Use this skill with your agent

Most visitors already have an agent. Pick your environment, install or copy the workflow, then run the smoke-test prompt above.

πŸ” Can't find the right skill?

Search 60,000+ AI agent skills β€” free, no login needed.

Search Skills β†’

What to do next

Skills are meant to be used inside your own AI agent. Install it, run a quick smoke test, then ask your agent to apply it to your real task.

1
Install into your agentCopy the ClawHub install command and run it where your OpenClaw/agent environment is configured.
2
Run a smoke testUse the test prompt below to confirm the skill loads and understands the workflow before relying on it.
3
Use it in your own agentPaste your actual task into Manus, OpenClaw, Claude Code, Cursor, or another agent that supports skills.
I just installed the Prompt injection detection skill skill. Please run a quick smoke test: explain what this skill can do, ask me for the minimum input it needs, then produce one small sample output for a realistic task.