Find the Right AI Skill for Any Job
Browse 2,510+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — audio
2,510 skills in "audio"
🌐 All, coding, devops, api, database, security, data, research, writing, image-gen, video, audio, translation, seo, social-media, email-marketing, advertising, finance, crypto-defi, ecommerce, legal, hr, real-estate, health, education, cooking, travel, gaming, automation, communication, productivity, clawhub, lobehub, dify, mcp
🦀 ClawHub
hum
AI content writer that researches, outlines, drafts, publishes, and manages engagement for LinkedIn and X using your voice and style guidelines.
🦀 ClawHub
Tiktok Add Music
Get music-backed videos ready to post, without touching a single slider. Upload your video clips (MP4, MOV, AVI, WebM, up to 500MB), say something like "add...
🦀 ClawHub
Best Video Audio Replace
Replace the audio track in your video files with this skill. Works with MP4, MOV, AVI, WebM files up to 500MB. YouTubers, content creators, marketer...
🦀 ClawHub
Movie Producer Scene
Create high-end cinematic scene prompts and production-ready scene briefs in a Hollywood producer voice. Use when the user asks for movie scene generation, s...
🦀 ClawHub
ClickSend
ClickSend API integration with managed authentication. Send SMS, MMS, and voice messages, manage contacts and lists.
Use this skill when users want to send text messages, make voice calls, manage contact lists, or track message delivery.
For other third party apps, use the api-gateway skill (https://clawhub.ai/byungkyu/api-gateway).
🦀 ClawHub
Best Add Music To
Turn video clips into music-backed videos with this skill. Works with MP4, MOV, AVI, WebM files up to 500MB. Content creators use it for adding background mus...
🦀 ClawHub
Ai Scam Defense
Identify and defend against AI-powered scams including deepfakes, voice cloning, AI phishing, and fake job offers. Use when someone received a suspicious cal...
🦀 ClawHub
Summarize Garrison
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
🦀 ClawHub
Twilio
Twilio API integration with managed OAuth. SMS, voice calls, phone numbers, and communications.
Use this skill when users want to send SMS messages, make voice calls, manage phone numbers, or work with Twilio resources.
For other third party apps, use the api-gateway skill (https://clawhub.ai/byungkyu/api-gateway).
Requires network access and valid Maton API key.
🦀 ClawHub
ton
Ton namespace for Netsnek e.U. audio and media processing tools. Handles audio transcription, format conversion, waveform analysis, and podcast production wo...
🦀 ClawHub
Voice Log
Background voice journaling with Soniox realtime STT for OpenClaw. Requires SONIOX_API_KEY. Get/create your Soniox API key at https://soniox.com/speech-to-te...
🦀 ClawHub
AI Avatar
Guide users to VideoAny AI Avatar tool to create talking avatar videos from an image and voice.
🦀 ClawHub
🗣️ Edge-TTS Skill using uvx
Text-to-speech conversion using `uvx edge-tts` for generating audio from text.
Use when:
(1) User requests audio/voice output with the "tts" trigger or keyword.
(2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking).
(3) User wants a specific voice, speed, pitch, or format for TTS output.
⭐ GitHub
Ask Dr. Andrew Huberman
Maximize your productivity, physical and mental health with neuroscience. Trained with all the podcast episodes from Huberman Lab by [@jyboy](https://github.com/jyboy)
🦀 ClawHub
Percept Listen
Captures ambient audio from wearable devices, transcribes locally, and streams searchable, speaker-tagged conversation data to your OpenClaw agent.
🦀 ClawHub
transcription
Transcribe audio and video files using OpenAI Whisper API. Use when user wants to transcribe audio/video files, extract speech from media, or get text from r...
🦀 ClawHub
feishu-minimax-t2a-voice
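Feishu voice message sending and receiving: incoming voice messages are automatically transcribed to text (Feishu native Transcript, with Whisper as a fallback), and replies are synthesized by MiniMax T2A and sent as voice messages.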
🦀 ClawHub
Dental Ai Receptionist
Complete AI voice receptionist system for dental practices. 12 workflows covering inbound call routing, appointment booking, reminders, no-show followup, can...
🦀 ClawHub
Characteristic Voice
Use this skill whenever the user wants speech to sound more human, companion-like, or emotionally expressive. Triggers include: any mention of 'say like', 't...
🦀 ClawHub
Urdu
Write Urdu that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Feishu Voice API Sender
Send Feishu voice messages: uploads OPUS audio through the official Feishu API and sends it as a voice message, working around the missing duration parameter in OpenClaw's built-in sender.
🦀 ClawHub
Podcast Strategist
Expert AI agent specializing in podcast strategy. From The Agency (github.com/msitarzewski/agency-agents).
🦀 ClawHub
Fal.ai API
Fal.Ai Media Generation — Generate images, videos, and audio via fal.ai API (FLUX, SDXL, Whisper, etc.)
🦀 ClawHub
Openai Whisper 1.0.0
Local speech-to-text with the Whisper CLI (no API key).
🦀 ClawHub
Zen Koan Daily
Daily Zen Buddhist koan (禅宗公案) lecture with Chinese ink wash illustration and TTS audio. Generates detailed lecture (origin, background, interpretation, mode...
⭐ GitHub
enginesound
A GUI and command line application used to procedurally generate semi-realistic engine sounds. Featuring in-depth configuration, variable sample rate and a frequency analysis window.
⭐ GitHub
Spotifyd
An open source Spotify client running as a UNIX daemon.
🦀 ClawHub
Spark Bitcoin L2 Proxy for AI Agents
Use a Spark Bitcoin L2 wallet proxy for AI agents via HTTP API. Check balances, send payments, create invoices, pay L402 paywalls — all without holding the m...
🦀 ClawHub
Kokoro Agent Voices
Local zero-cost text-to-speech with per-agent voice profiles using Kokoro TTS (82M params). 54 voices available, named agent mappings, WAV output. Use when b...
🦀 ClawHub
Slopbuster
AI text humanizer for prose, code, and academic writing. Strips AI-generated patterns and restores human voice. Use when editing or reviewing text to make it...
🦀 ClawHub
Tiktok Image To Video
Skip the learning curve of professional editing software. Describe what you want — turn these photos into a TikTok video with music and transitions — and get...
🦀 ClawHub
Best Free Video Editor
Browser-based free video editor with AI auto captions, silence removal, voiceover, no watermark, and 1080p exports for Windows and Mac in 2025.
🦀 ClawHub
lnd macaroon bakery
Bake, inspect, and manage lnd macaroons for least-privilege agent access. Use when an agent needs scoped credentials — pay-only, invoice-only, read-only, or custom permissions. Also covers signer macaroon scoping and macaroon rotation.
🦀 ClawHub
Miranda SAG (ElevenLabs TTS say-UX)
ElevenLabs text-to-speech with mac-style say UX.
🦀 ClawHub
Experience Cma Fest Nashville
Four days. Fifty thousand voices. The longest-running country music festival in the world. Nashville is going to teach you what a song does when an entire ci...
🦀 ClawHub
Video AD Prod
Generate optimized video ads from text briefs using InVideo AI, producing scripts, voiceovers, captions, CTAs, and platform-specific exports for Facebook, In...
🦀 ClawHub
vietnam-invoice
Vietnam invoice verification: extracts invoice information and checks its authenticity through the Vietnamese tax authority API.
🦀 ClawHub
Chinese
Write Chinese that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
douyin-research-kit
Extract and analyze Douyin (抖音) content using yt-dlp. Supports video metadata, caption extraction, user profile analysis, music/sound info, and engagement st...
🦀 ClawHub
Bilibili Subtitles
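Use yt-dlp to extract existing or auto-generated subtitles from public Bilibili videos (without downloading the full video). Use when the user mentions Bilibili (B站), a BV ID, video subtitles, pulling subtitles, summarizing, or answering questions based on video content. v1 only supports videos for which the platform already provides a subtitle track; videos without subtitles need another source or a later approach such as Whisper.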
🦀 ClawHub
Feishu Voice Sender
Feishu voice message sending skill: automatically selects the sending method based on the channel.
🦀 ClawHub
Kre Video Translator
Translate local audio or video files into multilingual .srt subtitles with KreTrans. Use when a user wants audio/video translation, subtitle generation, tran...
🦀 ClawHub
Qwen3-TTS VoiceDesign
Text-to-speech with Qwen3-TTS VoiceDesign. Design custom voices via natural language descriptions + seed-based timbre fixation. Includes OpenAI-compatible AP...
🦀 ClawHub
Ringg Voice Agent
Integrate Ringg AI voice agents with OpenClaw for making, receiving, and managing phone calls powered by Ringg's Voice OS. Use this skill when the user wants to: (1) make outbound voice calls via Ringg AI agents, (2) trigger Ringg AI campaigns from OpenClaw, (3) check call status or retrieve call history/analytics from Ringg, (4) manage Ringg AI assistants (list, create, update), (5) connect OpenClaw to Ringg's voice platform for automated phone interactions like lead qualification, feedback col
🦀 ClawHub
Add Music To Video
Add Music to Video: AI Background Music and Audio for Video Editing. Silent footage kills the mood. Add Music to Video lets you describe the vibe - 'upb...
🦀 ClawHub
TelCall Twilio
Make emergency phone calls via Twilio. Use when you need to call someone and play a voice message programmatically (e.g., server down alerts, security notifi...
🦀 ClawHub
Content Humanizer
Makes AI-generated content sound genuinely human — not just cleaned up, but alive. Use when content feels robotic, uses too many AI clichés, lacks personalit...
🦀 ClawHub
Donotify Voice Call Reminder
Send immediate voice call reminders or schedule future calls via DoNotify.