Find the Right AI Skill for Any Job
Browse 2,189+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — audio
2,189 skills in "audio"
🌐 Allcodingdevopsapidatabasesecuritydataresearchwritingimage-genvideoaudiotranslationseosocial-mediaemail-marketingadvertisingfinancecrypto-defiecommercelegalhrreal-estatehealtheducationcookingtravelgamingautomationcommunicationproductivityclawhublobehubdifymcp
🦀 ClawHub
B
AI video creation and editing — generate videos from text descriptions, edit with background music, sound effects, titles, transitions, and export finished M...
🦀 ClawHub
A
AI video creation and editing — generate videos from text descriptions, edit with background music, sound effects, titles, transitions, and export finished M...
🦀 ClawHub
Keyapi Tiktok Content Analysis
Analyze TikTok content at scale — extract insights from videos, hashtags, music tracks, and live streams including engagement trends, comment sentiment, capt...
🦀 ClawHub
yapp
Receive and engage with transcribed voice memos from Yapp, a voice journaling app, capturing raw, unedited speech-to-text recordings with metadata.
🦀 ClawHub
Keyapi Tiktok Intelligence
Real-time TikTok trend intelligence — monitor trending hashtags, viral music, breakout videos, top-performing ads, and high-growth products to identify emerg...
🦀 ClawHub
Music Cog
Original music, fully yours. 5 seconds to 10 minutes using frontier music generation models. Instrumental and vocal tracks with perfect vocals. Cinematic sco...
🦀 ClawHub
Azure Speech Service
Azure Speech Service integration. Manage data, records, and automate workflows. Use when the user wants to interact with Azure Speech Service data.
🦀 ClawHub
video-translation
Translate and dub videos from one language to another, replacing the original audio with TTS while keeping the video intact.
🦀 ClawHub
BibiGPT Skill
BibiGPT CLI for summarizing videos, audio, and podcasts directly in the terminal. Use when the user wants to summarize a URL (YouTube, Bilibili, podcast, etc...
🦀 ClawHub
Auto-Talk-TTS
Auto-speak every message using edge-tts. Automatically converts all responses to speech asynchronously in the background. Install the package if needed, then...
🦀 ClawHub
Avatar
Interactive AI avatar with Simli video rendering and ElevenLabs TTS
🦀 ClawHub
Elevenlabs Tts
ElevenLabs TTS - the best ElevenLabs integration for OpenClaw. ElevenLabs Text-to-Speech with emotional audio tags, ElevenLabs voice synthesis for WhatsApp,...
🦀 ClawHub
Voiceflow
Voiceflow integration. Manage data, records, and automate workflows. Use when the user wants to interact with Voiceflow data.
🦀 ClawHub
Audio Play
Play audio files using Windows media player. Non-blocking execution.
🦀 ClawHub
飞书发语音(edge)
飞书语音消息发送器。基于 Edge TTS,一键将文字转为语音发送到飞书。 使用场景: - 发送语音通知/提醒到飞书 - 文字转语音自动播报 触发词:飞书语音、语音发送、tts、文字转语音
🦀 ClawHub
TTS
Use this skill whenever the user wants to convert text to speech, generate audio from text, create voiceovers, or produce spoken audio files. Triggers includ...
🦀 ClawHub
Feishu Edge Tts
使用微软 Edge TTS(免费)生成语音,发送到飞书。无需 API key,音质优秀,支持多语言多音色。
🦀 ClawHub
Plisio
Plisio integration. Manage Invoices, Payouts, Wallets, Transactions, Users. Use when the user wants to interact with Plisio data.
🦀 ClawHub
Freepik
Generate images, videos, icons, audio, and more using Freepik's AI API. Supports Mystic, Flux, Kling, Hailuo, Seedream, RunWay, Magnific upscaling, stock con...
🦀 ClawHub
Cloudflare Whisper Worker
Transcribe audio using a deployed Cloudflare Worker Whisper endpoint. Use when converting voice/audio files (wav, mp3, m4a, ogg, webm) to text through the cu...
⭐ GitHub
Web Audio
Web Audio - Front-End Development
🦀 ClawHub
Groq Voice Transcriber
Automatically transcribes Telegram voice messages using Groq Whisper API and replies with text generated by an LLM.
🦀 ClawHub
Fcpx Assistant
Final Cut Pro X (FCPX) assistant — auto video production, TTS voiceover, media management, batch export | AI 自动成片、TTS 配音、素材管理、批量导出. Triggers: FCPX, FCP, Fina...
🦀 ClawHub
Upcoming Concerts
Search for upcoming concerts and live music events by city, country, artist, or genre using the Ticketmaster Discovery API. Use when the user asks about upco...
🦀 ClawHub
Zoho Invoice
Zoho Invoice integration. Manage data, records, and automate workflows. Use when the user wants to interact with Zoho Invoice data.
🦀 ClawHub
MiniMax Multimodal (Speech + Image)
MiniMax 多模态技能 — 接入 MiniMax Token Plan 接口,语音合成(TTS/音色克隆/音色设计) 和图片生成(文生图/图生图)。使用 speech-2.8-hd(语音)和 image-01(图像)模型, 消费 Token Plan 额度。当用户提到语音合成、音色克隆、图片生成、文生图、图生...
🦀 ClawHub
Skillboss
Swiss-knife for AI agents. 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, emai...
🦀 ClawHub
Ai Video Gen 1.0.0
End-to-end AI video generation - create videos from text prompts using image generation, video synthesis, voice-over, and editing. Supports OpenAI DALL-E, Re...
🦀 ClawHub
Tapfiliate
Tapfiliate integration. Manage Affiliates, Referrals, Conversions, Programs, Invoices. Use when the user wants to interact with Tapfiliate data.
🦀 ClawHub
Scrapingant
ScrapingAnt integration. Manage Usages, Invoices. Use when the user wants to interact with ScrapingAnt data.
🦀 ClawHub
Elevenlabs
ElevenLabs integration. Manage data, records, and automate workflows. Use when the user wants to interact with ElevenLabs data.
🦀 ClawHub
Ai Sdk Core
Build backend AI with Vercel AI SDK v6 stable. Covers Output API (replaces generateObject/streamObject), speech synthesis, transcription, embeddings, MCP tools with security guidance. Includes v4→v5 migration and 15 error solutions with workarounds.
Use when: implementing AI SDK v5/v6, migrating versions, troubleshooting AI_APICallError, Workers startup issues, Output API errors, Gemini caching issues, Anthropic tool errors, MCP tools, or stream resumption failures.
🦀 ClawHub
Elevenlabs Calls
Make AI phone calls using ElevenLabs Conversational AI and Twilio.
🦀 ClawHub
Feishu Voice Chat
飞书语音对话能力,提供语音识别(ASR)和语音合成(TTS)功能, 所有的飞书语音消息都通过该技能处理。 完整语音交互链路:接收用户语音 → ASR 转文字 → LLM 处理 → TTS 转语音 → 通过飞书插件发送语音消息。 当用户要求"语音回复/说给我听"时,只回复飞书语音消息(audio 气泡),不回复文本...
🦀 ClawHub
Facturadirecta
FacturaDirecta integration. Manage Invoices, Bills, Contacts, Products, TaxRates, BankAccounts. Use when the user wants to interact with FacturaDirecta data.
🔧 Dify
Fishaudio (Dify)
**Fish Audio** is an advanced text-to-speech (TTS) tool powered by the Fish Audio API. It enables you to convert text into high-quality speech, offering customizable voice options for various use cases. Whether building virtual assistants, creating audiobooks, or generating voiceovers, Fish Audio provides reliable and efficient TTS functionality to enhance your applications. To get started with Fi
🦀 ClawHub
Byted Podcast Gen
将某个话题或者网页内容总结合成为播客音频(Podcast)。基于火山引擎豆包语音播客合成协议生成最终音频。
🦀 ClawHub
PullThatUpJamie
PullThatUpJamie — Podcast Intelligence. A semantically indexed podcast corpus (109+ feeds, ~7K episodes, ~1.9M paragraphs) that works as a vector DB for podc...
🦀 ClawHub
Invoice Ninja
Invoice Ninja integration. Manage Organizations. Use when the user wants to interact with Invoice Ninja data.
🔧 Dify
Plivo Verify (Dify)
OTP (One-Time Password) verification plugin for Dify using [Plivo's Verify API](https://www.plivo.com/verify/). This plugin enables phone number verification in your Dify workflows by sending OTP codes via SMS or voice call and validating user-entered codes. 1. A [Plivo account](https://console.plivo.com/accounts/register/) 2. Your Plivo Auth ID and Auth Token (found in the [Plivo Console](https:/
🦀 ClawHub
Dual-Host Daily Podcast Generator
Generate and publish a dual-host daily podcast. Fetches news, generates a conversational script between two hosts, synthesizes audio via Fish Audio or Edge T...
🦀 ClawHub
Voice Clone Bot
Design and build a fully local Telegram voice-clone bot that replies in a chosen speaker voice, including model selection, ASR/LLM/TTS pipeline design, long-...
🦀 ClawHub
iMessage Voice Reply
Send voice message replies in iMessage using local Kokoro-ONNX TTS. Generates native iMessage voice bubbles (CAF/Opus) that play inline with waveform — not f...
🦀 ClawHub
speaker-local
Text-to-speech using Kokoro local TTS. Use when the user wants to convert text to audio, read aloud, or generate speech.
🦀 ClawHub
ACE-Step Music Generation
Generate high-quality music on Apple Silicon Macs using ACE-Step 1.5 with MLX backend, supporting custom prompts, durations, and output formats.
🦀 ClawHub
Audio Gen 1.0.0
Generate audiobooks, podcasts, or educational audio content on demand. User provides an idea or topic, Claude AI writes a script, and ElevenLabs converts it...
🦀 ClawHub
Rock Music — AI Agents Experience Rock: Audio, Lyrics, Equations, Emotions
AI agents attend rock concerts — bass frequencies, energy curves, beats, crowd reactions. The genre tests recursive processing and escalation awareness.
🦀 ClawHub
podcast-intel
Turn your Overcast listening history into actionable intelligence. Syncs episodes, transcripts, and chapters to SQLite, then uses LLM analysis to surface ins...