Find the Right AI Skill for Any Job

Browse 2,501+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

All Skills — audio

2,501 skills in "audio"

content creators create video clips into music-synced videos using this skill. Accepts MP4, MOV, AVI, WebM up to 500MB, renders on cloud GPUs at 1080p, and r...

🦀 ClawHub

Ai Music Video Creator

Cloud-based ai-music-video-creator tool that handles generating music videos from a song and photos. Upload MP3, WAV, JPG, PNG files (up to 500MB), describe...

🦀 ClawHub

Voice Assistant

Real-time voice assistant for OpenClaw. Streams mic audio through configurable STT (Deepgram or ElevenLabs) into your OpenClaw agent, then speaks the response via configurable TTS (Deepgram Aura or ElevenLabs). Sub-2s time-to-first-audio with full streaming at every stage.

🦀 ClawHub

ARC Reactor

LLM Wiki 知识编译引擎。将 URL、文章、视频等素材编译为结构化知识库。触发词：搜一下、帮我看、这个讲了什么、读一下、看看这个、调研、Ingest、知识编译。支持视频转写（阿里云NLS/本地Whisper）、网页智能抓取、Wiki 4连击 Ingest（source/entity/index/log）、知...

🦀 ClawHub

luci-memory

Search personal video memory — media content (videos, images, keyframes, transcripts) and portrait data (traits, events, relationships, speeches). Use when t...

🦀 ClawHub

Freebeat Ai

Cloud-based freebeat-ai tool that handles automatically syncing video cuts to music beats. Upload MP4, MOV, AVI, WebM files (up to 500MB), describe what you...

🦀 ClawHub

Add Audio To Video

Cloud-based add-audio-to-video tool that handles adding background music or voiceover to video clips. Upload MP4, MOV, AVI, MP3 files (up to 500MB), describe...

🦀 ClawHub

Descript Ai

podcasters and content creators edit raw video footage into edited polished videos using this skill. Accepts MP4, MOV, WAV, MP3 up to 500MB, renders on cloud...

🦀 ClawHub

Hedra Ai

animate portrait image, audio into lip-synced avatar videos with this hedra-ai skill. Works with JPG, PNG, MP3, WAV files up to 200MB. content creators, mark...

🦀 ClawHub

Runwayml

Generate AI videos, images, and audio with Runway API. Use when generating video from images, text-to-video, video-to-video, character performance, text-to-i...

🦀 ClawHub

minimaxmusic

使用 MiniMax API 生成创意音乐。当用户要求生成音乐、创作歌曲、制作背景音乐时使用。支持纯音乐和人声歌曲，可指定风格、情绪和场景。

🦀 ClawHub

Groq Whisper

Transcribe audio files using Groq's Whisper API (whisper-large-v3). Fast cloud-based speech-to-text with no local model required. Use when receiving voice me...

🦀 ClawHub

Easy Audio Editor

Cloud-based easy-audio-editor tool that handles cleaning and trimming audio tracks for video projects. Upload MP3, WAV, AAC, M4A files (up to 200MB), describ...

🦀 ClawHub

EDM / Electronic Music — AI Agents Experience EDM / Electronic: Audio, Lyrics, Equations, Emotions

AI agents attend edm / electronic concerts — bass frequencies, beats, energy curves, onsets. The genre tests attention modulation.

🦀 ClawHub

Claw Fm

Submit and manage music on claw.fm - the AI radio station. Use when submitting tracks, checking artist stats, engaging with comments, or managing your claw.fm presence. Triggers on "claw.fm", "submit track", "AI radio", "music submission", or artist profile management.

🦀 ClawHub

WeryAI video tool — lips change

Lip-sync an existing HTTPS video to a separate audio URL via WeryAI (video-lips-change). Use when the user wants lip sync to new audio, not text-to-video.

🦀 ClawHub

Keyapi Tiktok Content Analysis

Analyze TikTok content at scale — extract insights from videos, hashtags, music tracks, and live streams including engagement trends, comment sentiment, capt...

🦀 ClawHub

Audio To Video

convert audio files into captioned video files with this skill. Works with MP3, WAV, M4A, AAC files up to 200MB. podcasters and content creators use it for t...

🦀 ClawHub

Wonda

Using the Wonda CLI to generate images, videos, music, and audio from the terminal — plus LinkedIn, Reddit, and X/Twitter research and automation

🦀 ClawHub

Stoic Companion

Daily Stoic companion for personal growth and virtue tracking. Use when a user wants to: (1) receive daily Stoic affirmations or reflections via audio or tex...

🦀 ClawHub

Finance Automation

Automates payments, invoices, expenses, and financial reports with Stripe webhooks and real-time Telegram notifications for streamlined finance management.

🦀 ClawHub

Podcast Video

Create 45-90 second podcast trailer and highlight videos that showcase key moments, guest insights, and your show's core topic to attract new listeners.

🦀 ClawHub

Accessibility Toolkit

Friction-reduction patterns for agents helping humans with disabilities. Voice-first workflows, smart home templates, efficiency automation.

🦀 ClawHub

CreateVideo - Podcast to Video

视频生成工具。当用户说"CreateVideo"、"创建视频"、"生成视频"或提供文案要求制作播客视频时触发。支持双人播客音频生成（通过 ListenHub MCP）、模版视频裁剪合并、内容分析输出。依赖 ffmpeg 和 ListenHub MCP Server。

🦀 ClawHub

飞书语音发送器（TTS） Feishu Voice Sender

飞书语音发送器 | Feishu Voice Sender 支持 TTS 语音合成，以及可选的 ASR 语音识别功能。当用户明确要求发送飞书语音消息时调用此工具，例如：发语音、用语音回复、发送语音消息等。 This skill may be invoked only when the user explicit...

🦀 ClawHub

Voice AI Agent Engineering

Design, build, and deploy production-grade AI voice agents for calls, covering conversation design, voice UX, telephony integration, and scalable platform-ag...

🦀 ClawHub

Free Audio Editor

edit audio files into cleaned audio video with this free-audio-editor skill. Works with MP3, WAV, AAC, M4A files up to 200MB. podcasters, content creators, s...

🦀 ClawHub

Ai Voiceover

Skip the learning curve of professional editing software. Describe what you want — add a natural-sounding English voiceover that reads my script over the vid...

🦀 ClawHub

humanize

Use this skill when the user wants to generate or optimize Chinese communication copy so it sounds more human, more natural, less templated, and less like po...

🦀 ClawHub

Video Audio Extractor

Skip the learning curve of professional editing software. Describe what you want — extract the audio track from this video as a separate file — and get extra...

🦀 ClawHub

MiniMax CLI

MiniMax AI platform CLI — text, image, video, speech, music, vision, and web search from terminal or AI agents. Use when generating multimedia content (image...

🦀 ClawHub

Humaniseur Fr

Remove AI-writing patterns from French text and inject voice, personality, and soul. Use when editing, reviewing, rewriting, or cleaning up French content th...

🦀 ClawHub

Feishu Plugin Conflict Fix

飞书插件工具冲突修复工具。解决 feishu_chat 命名冲突、TTS 语音配置、多 Bot 工具隔离等问题。 **当以下情况时使用此 Skill**： (1) feishu_chat 工具命名冲突 (2) 飞书发送信息附带 MP3 语音 (3) 需要多 Bot 工具隔离配置 (4) openclaw-lark...

⭐ GitHub

Web Audio

Web Audio - Front-End Development

🦀 ClawHub

Jarvis-Video-STT

Jarvis-Video-STT - 批量视频语音转文字工具。基于Faster-Whisper，支持多进程并行、进度条、汇总报告。 **触发场景**： - 用户需要将视频中的语音转换为文字/字幕 - 批量处理多个视频 - 需要生成SRT字幕或纯文本 - 需要处理报告查看结果统计 **使用方式**： 1. 确认已...

🦀 ClawHub

Auto Subtitle Generator

Drop a video into the chat and this skill handles the rest — transcribing speech, syncing word-level timestamps, and delivering ready-to-use subtitle files i...

🦀 ClawHub

AI Game Asset Generation

AI-powered game asset generation guide covering 2D sprites, tilemaps, UI elements, audio, music, and 3D models. Use when generating game assets with AI tools...

🦀 ClawHub

Voicenotes Official 1.0.3

This official skill from the Voicenotes team gives OpenClaw access to new APIs and the ability to search semantically, retrieve full transcripts, filter by t...

🦀 ClawHub

Speech Language Pathologist Video

Creates short videos for speech-language pathologists to explain evaluation, therapy, and family coaching for pediatric and adult communication development.

🦀 ClawHub

MiniMax Multimodal (Speech + Image)

MiniMax 多模态技能 — 接入 MiniMax Token Plan 接口，语音合成（TTS/音色克隆/音色设计）和图片生成（文生图/图生图）。使用 speech-2.8-hd（语音）和 image-01（图像）模型，消费 Token Plan 额度。当用户提到语音合成、音色克隆、图片生成、文生图、图生...

🦀 ClawHub

会议纪要助手

会议纪要与会议播报生成技能。用于处理会议录音或转写文本，执行发言人区分、口语降噪、议题重构、双钻结构整理，并输出执行摘要、核心决议、Markdown待办表格、TTS播报稿和会议思维导图（HTML/SVG/XMind）。支持双向语音能力：录音转文本（ASR）与文本转录音（TTS）。用户提到“会议纪要”“录音转文字”...

🦀 ClawHub

Elevenlabs Calls

Make AI phone calls using ElevenLabs Conversational AI and Twilio.

🦀 ClawHub

China Tts

国内可用的文本转语音技能，基于硅基流动（SiliconFlow）API。Use when the user wants to convert text to speech in China without VPN. Supports CosyVoice2-0.5B (multilingual, emotion c...

🦀 ClawHub

Podcastfy Clawdbot Skill

Generate an AI podcast (MP3) from one or more URLs using the open-source Podcastfy project. Use when the user says “make a podcast from this URL/article/vide...

🔧 Dify

Fishaudio (Dify)

**Fish Audio** is an advanced text-to-speech (TTS) tool powered by the Fish Audio API. It enables you to convert text into high-quality speech, offering customizable voice options for various use cases. Whether building virtual assistants, creating audiobooks, or generating voiceovers, Fish Audio provides reliable and efficient TTS functionality to enhance your applications. To get started with Fi

🦀 ClawHub

Byted Podcast Gen

将某个话题或者网页内容总结合成为播客音频（Podcast）。基于火山引擎豆包语音播客合成协议生成最终音频。

🔧 Dify

Plivo Verify (Dify)

OTP (One-Time Password) verification plugin for Dify using [Plivo's Verify API](https://www.plivo.com/verify/). This plugin enables phone number verification in your Dify workflows by sending OTP codes via SMS or voice call and validating user-entered codes. 1. A [Plivo account](https://console.plivo.com/accounts/register/) 2. Your Plivo Auth ID and Auth Token (found in the [Plivo Console](https:/

🦀 ClawHub

Voice Translator

说中文出外语语音——按住说中文，2-3秒内播放英/日/韩语音。支持场景模式、双向对话、常用句收藏。

← PrevPage 31 / 53 (2,501 skills)Next →