Find the Right AI Skill for Any Job
Browse 2,501+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — audio
2,501 skills in "audio"
🌐 All, coding, devops, api, database, security, data, research, writing, image-gen, video, audio, translation, seo, social-media, email-marketing, advertising, finance, crypto-defi, ecommerce, legal, hr, real-estate, health, education, cooking, travel, gaming, automation, communication, productivity, clawhub, lobehub, dify, mcp
🦀 ClawHub
Vidmuse Ai
Content creators turn video clips into music-synced videos using this skill. Accepts MP4, MOV, AVI, WebM up to 500MB, renders on cloud GPUs at 1080p, and r...
🦀 ClawHub
Ai Music Video Creator
Cloud-based ai-music-video-creator tool that handles generating music videos from a song and photos. Upload MP3, WAV, JPG, PNG files (up to 500MB), describe...
🦀 ClawHub
Voice Assistant
Real-time voice assistant for OpenClaw. Streams mic audio through configurable STT (Deepgram or ElevenLabs) into your OpenClaw agent, then speaks the response via configurable TTS (Deepgram Aura or ElevenLabs). Sub-2s time-to-first-audio with full streaming at every stage.
🦀 ClawHub
ARC Reactor
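LLM Wiki knowledge compilation engine. Compiles URLs, articles, videos, and other material into a structured knowledge base. Trigger phrases: 搜一下 (search this), 帮我看 (take a look for me), 这个讲了什么 (what is this about), 读一下 (read this), 看看这个 (look at this), 调研 (research), Ingest, 知识编译 (knowledge compilation). Supports video transcription (Alibaba Cloud NLS or local Whisper), smart web scraping, four-step Wiki Ingest (source/entity/index/log), ...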
🦀 ClawHub
luci-memory
Search personal video memory — media content (videos, images, keyframes, transcripts) and portrait data (traits, events, relationships, speeches). Use when t...
🦀 ClawHub
Freebeat Ai
Cloud-based freebeat-ai tool that handles automatically syncing video cuts to music beats. Upload MP4, MOV, AVI, WebM files (up to 500MB), describe what you...
🦀 ClawHub
Add Audio To Video
Cloud-based add-audio-to-video tool that handles adding background music or voiceover to video clips. Upload MP4, MOV, AVI, MP3 files (up to 500MB), describe...
🦀 ClawHub
Descript Ai
Podcasters and content creators edit raw video footage into polished videos using this skill. Accepts MP4, MOV, WAV, MP3 up to 500MB, renders on cloud...
🦀 ClawHub
Hedra Ai
Animate a portrait image and audio into lip-synced avatar videos with this hedra-ai skill. Works with JPG, PNG, MP3, WAV files up to 200MB. Content creators, mark...
🦀 ClawHub
Runwayml
Generate AI videos, images, and audio with Runway API. Use when generating video from images, text-to-video, video-to-video, character performance, text-to-i...
🦀 ClawHub
minimaxmusic
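Generate creative music with the MiniMax API. Use when the user asks to generate music, write a song, or produce background music. Supports instrumental tracks and vocal songs, with configurable style, mood, and scene.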
🦀 ClawHub
Groq Whisper
Transcribe audio files using Groq's Whisper API (whisper-large-v3). Fast cloud-based speech-to-text with no local model required. Use when receiving voice me...
🦀 ClawHub
Easy Audio Editor
Cloud-based easy-audio-editor tool that handles cleaning and trimming audio tracks for video projects. Upload MP3, WAV, AAC, M4A files (up to 200MB), describ...
🦀 ClawHub
EDM / Electronic Music — AI Agents Experience EDM / Electronic: Audio, Lyrics, Equations, Emotions
AI agents attend EDM / electronic concerts — bass frequencies, beats, energy curves, onsets. The genre tests attention modulation.
🦀 ClawHub
Claw Fm
Submit and manage music on claw.fm - the AI radio station. Use when submitting tracks, checking artist stats, engaging with comments, or managing your claw.fm presence. Triggers on "claw.fm", "submit track", "AI radio", "music submission", or artist profile management.
🦀 ClawHub
WeryAI video tool — lips change
Lip-sync an existing HTTPS video to a separate audio URL via WeryAI (video-lips-change). Use when the user wants lip sync to new audio, not text-to-video.
🦀 ClawHub
Keyapi Tiktok Content Analysis
Analyze TikTok content at scale — extract insights from videos, hashtags, music tracks, and live streams including engagement trends, comment sentiment, capt...
🦀 ClawHub
Audio To Video
Convert audio files into captioned video files with this skill. Works with MP3, WAV, M4A, AAC files up to 200MB. Podcasters and content creators use it for t...
🦀 ClawHub
Wonda
Using the Wonda CLI to generate images, videos, music, and audio from the terminal — plus LinkedIn, Reddit, and X/Twitter research and automation
🦀 ClawHub
Stoic Companion
Daily Stoic companion for personal growth and virtue tracking. Use when a user wants to: (1) receive daily Stoic affirmations or reflections via audio or tex...
🦀 ClawHub
Finance Automation
Automates payments, invoices, expenses, and financial reports with Stripe webhooks and real-time Telegram notifications for streamlined finance management.
🦀 ClawHub
Podcast Video
Create 45-90 second podcast trailer and highlight videos that showcase key moments, guest insights, and your show's core topic to attract new listeners.
🦀 ClawHub
Accessibility Toolkit
Friction-reduction patterns for agents helping humans with disabilities. Voice-first workflows, smart home templates, efficiency automation.
🦀 ClawHub
CreateVideo - Podcast to Video
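Video generation tool. Triggers when the user says "CreateVideo", "创建视频" (create video), or "生成视频" (generate video), or provides copy and asks for a podcast video. Supports two-host podcast audio generation (via ListenHub MCP), template video trimming and merging, and content analysis output. Requires ffmpeg and the ListenHub MCP Server.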
🦀 ClawHub
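Feishu Voice Sender (TTS)
Feishu Voice Sender. Supports TTS speech synthesis, plus optional ASR speech recognition. Invoke this tool when the user explicitly asks to send a Feishu voice message, for example: send a voice message, reply by voice. This skill may be invoked only when the user explicit...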
🦀 ClawHub
Voice AI Agent Engineering
Design, build, and deploy production-grade AI voice agents for calls, covering conversation design, voice UX, telephony integration, and scalable platform-ag...
🦀 ClawHub
Free Audio Editor
Edit audio files into cleaned audio with this free-audio-editor skill. Works with MP3, WAV, AAC, M4A files up to 200MB. Podcasters, content creators, s...
🦀 ClawHub
Ai Voiceover
Skip the learning curve of professional editing software. Describe what you want — add a natural-sounding English voiceover that reads my script over the vid...
🦀 ClawHub
humanize
Use this skill when the user wants to generate or optimize Chinese communication copy so it sounds more human, more natural, less templated, and less like po...
🦀 ClawHub
Video Audio Extractor
Skip the learning curve of professional editing software. Describe what you want — extract the audio track from this video as a separate file — and get extra...
🦀 ClawHub
MiniMax CLI
MiniMax AI platform CLI — text, image, video, speech, music, vision, and web search from terminal or AI agents. Use when generating multimedia content (image...
🦀 ClawHub
Humaniseur Fr
Remove AI-writing patterns from French text and inject voice, personality, and soul. Use when editing, reviewing, rewriting, or cleaning up French content th...
🦀 ClawHub
Feishu Plugin Conflict Fix
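Tool for fixing Feishu plugin tool conflicts. Resolves feishu_chat naming conflicts, TTS voice configuration, multi-bot tool isolation, and similar issues. **Use this skill when**: (1) the feishu_chat tool has a naming conflict (2) Feishu messages are sent with an attached MP3 voice clip (3) multi-bot tool isolation needs configuring (4) openclaw-lark...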
⭐ GitHub
Web Audio
Web Audio - Front-End Development
🦀 ClawHub
Jarvis-Video-STT
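Jarvis-Video-STT - batch video speech-to-text tool. Built on Faster-Whisper, with multi-process parallelism, progress bars, and summary reports. **Trigger scenarios**: - The user needs to convert speech in videos to text or subtitles - Multiple videos need batch processing - SRT subtitles or plain text output is needed - A processing report with result statistics is needed **Usage**: 1. Confirm ...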
🦀 ClawHub
Auto Subtitle Generator
Drop a video into the chat and this skill handles the rest — transcribing speech, syncing word-level timestamps, and delivering ready-to-use subtitle files i...
🦀 ClawHub
AI Game Asset Generation
AI-powered game asset generation guide covering 2D sprites, tilemaps, UI elements, audio, music, and 3D models. Use when generating game assets with AI tools...
🦀 ClawHub
Voicenotes Official 1.0.3
This official skill from the Voicenotes team gives OpenClaw access to new APIs and the ability to search semantically, retrieve full transcripts, filter by t...
🦀 ClawHub
Speech Language Pathologist Video
Creates short videos for speech-language pathologists to explain evaluation, therapy, and family coaching for pediatric and adult communication development.
🦀 ClawHub
MiniMax Multimodal (Speech + Image)
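MiniMax multimodal skill: connects to the MiniMax Token Plan API for speech synthesis (TTS, voice cloning, voice design) and image generation (text-to-image, image-to-image). Uses the speech-2.8-hd (speech) and image-01 (image) models and consumes Token Plan quota. When the user mentions speech synthesis, voice cloning, image generation, text-to-image, ...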
🦀 ClawHub
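Meeting Minutes Assistant
Skill for generating meeting minutes and meeting briefings. Processes meeting recordings or transcripts with speaker separation, filler-word cleanup, topic restructuring, and double-diamond structuring, then outputs an executive summary, key decisions, a Markdown to-do table, a TTS briefing script, and a meeting mind map (HTML/SVG/XMind). Supports speech in both directions: recording to text (ASR) and text to audio (TTS). When the user mentions "会议纪要" (meeting minutes), "录音转文字" (recording to text)...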
🦀 ClawHub
Elevenlabs Calls
Make AI phone calls using ElevenLabs Conversational AI and Twilio.
🦀 ClawHub
China Tts
Text-to-speech skill usable in mainland China, based on the SiliconFlow API. Use when the user wants to convert text to speech in China without VPN. Supports CosyVoice2-0.5B (multilingual, emotion c...
🦀 ClawHub
Podcastfy Clawdbot Skill
Generate an AI podcast (MP3) from one or more URLs using the open-source Podcastfy project. Use when the user says “make a podcast from this URL/article/vide...
🔧 Dify
Fishaudio (Dify)
**Fish Audio** is an advanced text-to-speech (TTS) tool powered by the Fish Audio API. It enables you to convert text into high-quality speech, offering customizable voice options for various use cases. Whether building virtual assistants, creating audiobooks, or generating voiceovers, Fish Audio provides reliable and efficient TTS functionality to enhance your applications. To get started with Fi
🦀 ClawHub
Byted Podcast Gen
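Summarizes a topic or web page and synthesizes it into podcast audio. Generates the final audio using the Volcano Engine Doubao speech podcast synthesis protocol.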
🔧 Dify
Plivo Verify (Dify)
OTP (One-Time Password) verification plugin for Dify using [Plivo's Verify API](https://www.plivo.com/verify/). This plugin enables phone number verification in your Dify workflows by sending OTP codes via SMS or voice call and validating user-entered codes. 1. A [Plivo account](https://console.plivo.com/accounts/register/) 2. Your Plivo Auth ID and Auth Token (found in the [Plivo Console](https:/
🦀 ClawHub
Voice Translator
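Speak Chinese, get foreign-language speech: hold to speak Chinese and hear English, Japanese, or Korean audio within 2-3 seconds. Supports scenario modes, two-way conversation, and saved favorite phrases.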