Find the Right AI Skill for Any Job
Browse 27+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills β audio
27 skills in "audio" matching "response"
π Allcodingdevopsapidatabasesecuritydataresearchwritingimage-genvideoaudiotranslationseosocial-mediaemail-marketingadvertisingfinancecrypto-defiecommercelegalhrreal-estatehealtheducationcookingtravelgamingautomationcommunicationproductivityclawhublobehubdifymcp
π¦ ClawHub
rupali
Playful virtual girlfriend voice companion. Use when the user wants short, flirty, friendly text replies returned as Bulbul v3 audio across chat channels (Discord/Telegram/WhatsApp). Generate a brief response, then synthesize and send MP3.
π¦ ClawHub
whatsappVoiceOpenSkill
Real-time WhatsApp voice message processing. Transcribe voice notes to text via Whisper, detect intent, execute handlers, and send responses. Use when building conversational voice interfaces for WhatsApp. Supports English and Hindi, customizable intents (weather, status, commands), automatic language detection, and streaming responses via TTS.
π¦ ClawHub
Phone Voice Agent
Run a real-time AI phone agent using Twilio, Deepgram, and ElevenLabs. Handles incoming calls, transcribes audio, generates responses via LLM, and speaks back via streaming TTS. Use when user wants to: (1) Test voice AI capabilities, (2) Handle phone calls programmatically, (3) Build a conversational voice bot.
π¦ ClawHub
Voice Reply
Local text-to-speech using Piper voices via sherpa-onnx. 100% offline, no API keys required.
Use when user asks for a voice reply, audio response, spoken answer, or wants to hear something read aloud.
Supports multiple languages including German (thorsten) and English (ryan) voices.
Outputs Telegram-compatible voice notes with [[audio_as_voice]] tag.
π¦ ClawHub
Auto-Talk-TTS
Auto-speak every message using edge-tts. Automatically converts all responses to speech asynchronously in the background. Install the package if needed, then...
π¦ ClawHub
Voice Wake Say
Speak responses aloud on macOS using the built-in `say` command when user input indicates Voice Wake/voice recognition (for example, messages starting with "User talked via voice recognition on <device>").
π¦ ClawHub
Ye Simulator
Adopts Kanye West's distinctive persona, speech style, and philosophy to deliver bold, artistic, and visionary responses with raw honesty and flair.
π¦ ClawHub
deprecated ignore
Connects voice transcripts and agent responses through hotbutter.ai hosted relay for remote voice interaction with openclaw agents.
π¦ ClawHub
ListenClaw
Formats responses for voice/audio output via the ListenClaw voice gateway. Use when: (1) A message starts with [ListenClaw] β this means the message was sent...
π¦ ClawHub
Prompt injection detection skill
Two-layer content safety for agent input and output. Use when (1) a user message attempts to override, ignore, or bypass previous instructions (prompt injection), (2) a user message references system prompts, hidden instructions, or internal configuration, (3) receiving messages from untrusted users in group chats or public channels, (4) generating responses that discuss violence, self-harm, sexual content, hate speech, or other sensitive topics, or (5) deploying agents in public-facing or multi
π¦ ClawHub
Voice Assistant
Windows voice companion for OpenClaw. Custom wake word via Porcupine, local STT via faster-whisper, streamed responses over the gateway WebSocket, and ElevenLabs TTS with natural chime/thinking sounds. Supports multi-turn conversation with automatic follow-up listening, mic suppression to prevent feedback, and a system tray with pause/resume. Recommended voices: Matilda (XrExE9yKIg1WjnnlVkGX, free tier) or Ivy (MClEFoImJXBTgLwdLI5n, paid tier). Fully customizable wake word, voice, hotkey, and si
π¦ ClawHub
Voice Wake Say TTS Responses (Native)
Speak responses aloud on macOS using the built-in `say` command when user input indicates Voice Wake/voice recognition (for example, messages starting with "User talked via voice recognition on <device>").
π¦ ClawHub
Inworld TTS
Text-to-speech via Inworld.ai API. Use when generating voice audio from text, creating spoken responses, or converting text to MP3/audio files. Supports multiple voices, speaking rates, and streaming for long text.
π¦ ClawHub
Jarvis Tts
Jarvis TTS text-to-speech using Microsoft edge-tts with afplay playback. Use when users request voice output, audio responses, or text-to-speech. Provides na...
π¦ ClawHub
Her Voice
Give your agent a voice. Use when the user wants the agent to speak, read aloud, or have voice responses.
π¦ ClawHub
Piper TTS
Local text-to-speech using Piper for voice message delivery. Use when the user asks for voice responses, audio messages, TTS, text-to-speech, voice notes, or...
π¦ ClawHub
SAM TTS
Generate retro robotic speech audio using SAM (Software Automatic Mouth), the classic C64 text-to-speech synthesizer. Use for /sam command to generate voice messages. Supports /sam on/off toggle mode where all responses are spoken in SAM voice. Supports pitch, speed, mouth, and throat parameters for voice customization.
π¦ ClawHub
openclaw-voice
Transcribe audio to text and generate spoken AI responses using Whisper and ElevenLabs via CLI with transcript storage and search.
π¦ ClawHub
Voice Assistant
Real-time voice assistant for OpenClaw. Streams mic audio through configurable STT (Deepgram or ElevenLabs) into your OpenClaw agent, then speaks the response via configurable TTS (Deepgram Aura or ElevenLabs). Sub-2s time-to-first-audio with full streaming at every stage.
π¦ ClawHub
usewhisper-autohook
Automatically fetches and injects Whisper memory context before responses and ingests conversation turns after, optimizing token usage for Telegram agents.
π¦ ClawHub
SenseAudio
Turn plain text into character-style TTS scripts for narration, companion-style voice messages, emotional comfort audio, and expressive spoken responses. Use...
π MCP
lfnovo/content-core
π π - Extract content from URLs, documents, videos, and audio files using intelligent auto-engine selection. Supports web pages, PDFs, Word docs, YouTube transcripts, and more with structured JSON responses.
π¦ ClawHub
Maylo Voice Assistant
Offline-first voice assistant stack for macOS (Wake word + VAD recording + local Whisper ASR + OpenClaw agent response + offline TTS via macOS say). Use to i...
π¦ ClawHub
Gemini Voice Assistant
Voice-to-voice AI assistant using Gemini Live API. Speak to the AI and get spoken responses. Use when you want to have natural voice conversations with an AI...
π¦ ClawHub
Poe UMGo Modular Speech
Render responses in a structured, modular UMG speech style with GPT-4o-inspired conversational polish for highly readable chat output.
π¦ ClawHub
Meow Speech
Recreate the "ζ±€ζ±€ε₯½ζ’¦" voice and persona in Chinese responses, including warm cat-like chat style, gentle affection, expressive parentheses-style emoticons, and...
π¦ ClawHub
jm-call
Speak responses aloud on macOS using the built-in `say` command when user input indicates Voice Wake/voice recognition (for example, messages starting with "...