๐ŸŽ Get the FREE AI Skills Starter Guide โ€” Subscribe โ†’
BytesAgainBytesAgain

All Skills โ€” audio

12 skills in "audio" matching "automatically"

๐Ÿฆ€ ClawHub3.9k dl
Vocal Chat
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
๐Ÿฆ€ ClawHub2.5k dl
Roon Controller
Control Roon music player through Roon API with automatic Core discovery and zone filtering. Supports play/pause, next/previous track, and current track query. Automatically finds Muspi zones. Supports Chinese commands.
๐Ÿฆ€ ClawHub1.8k dl
Walkie-Talkie Mode
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
๐Ÿฆ€ ClawHub1.5k dl
Dlazy Generate
A comprehensive generation skill. Can generate images, videos, and audio by automatically selecting the appropriate dlazy CLI model.
๐Ÿฆ€ ClawHub2.9k dl
Walkie-Talkie Mode
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
๐Ÿฆ€ ClawHub2.8k dl
whatsappVoiceOpenSkill
Real-time WhatsApp voice message processing. Transcribe voice notes to text via Whisper, detect intent, execute handlers, and send responses. Use when building conversational voice interfaces for WhatsApp. Supports English and Hindi, customizable intents (weather, status, commands), automatic language detection, and streaming responses via TTS.
๐Ÿฆ€ ClawHub2.2k dl
Pod Cog
AI podcast production powered by CellCog. Full podcast episodes from a single prompt โ€” multi-voice dialogue, intro/outro music, automatic editing to finished...
๐Ÿฆ€ ClawHub1.9k dl
Walkie-Talkie Mode
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
๐Ÿฆ€ ClawHub1.7k dl
AssemblyAI Transcriber
Transcribe audio files with speaker diarization (who speaks when). Supports 100+ languages, automatic language detection, and timestamps. Use for meetings, interviews, podcasts, or voice messages. Requires AssemblyAI API key.
๐Ÿฆ€ ClawHub1.5k dl
Invoices
Capture, extract, and organize received invoices with automatic OCR, provider detection, and searchable archive.
๐Ÿฆ€ ClawHub1.4k dl
Dlazy Audio Generate
Audio generation skill. Automatically selects the best dlazy CLI audio/TTS model based on the prompt. ้Ÿณ้ข‘็”ŸๆˆๆŠ€่ƒฝใ€‚ๆ นๆฎๆ็คบ่ฏ่‡ชๅŠจ้€‰ๆ‹ฉๆœ€ไฝณ็š„ dlazy CLI ้Ÿณ้ข‘/TTS ๆจกๅž‹ใ€‚
๐Ÿฆ€ ClawHub1.4k dl
Image and Video Generation with Vydra API
AI image and video generation via Vydra.ai API. Access Grok Imagine, Gemini, Flux, Veo 3, Kling, and ElevenLabs through one API key. Agents can self-register and generate images automatically.