Find the Right AI Skill for Any Job

Browse 401+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

All Skills — audio

401 skills in "audio" matching "Generate"

Generate AI music videos end-to-end. Creates music with Suno (sunoapi.org), generates visuals with OpenAI/Seedream/Google/Seedance, and assembles into music...

🦀 ClawHub

Agent Tool Scout

Give AI hands to control any Mac app. Auto-discover installed apps, generate CLI wrappers, return structured JSON. Works with Music, Finder, Chrome, Word, Fi...

🦀 ClawHub

Tomoviee Video Background Music

Generate music tailored to video content. Use when users request video_soundtrack operations or related tasks.

🦀 ClawHub

Construction Daily Report Generator

Generate a structured daily site progress report from unstructured input such as voice transcription, rough notes, or conversational messages.

🦀 ClawHub

Construction Meeting Minutes Generator

Generate structured construction meeting minutes from rough notes or voice transcription, with separated action items, decision tracking, and contractual fla...

🦀 ClawHub

Mayar.id Payment

Integrate Mayar.id for Indonesian payments to create invoices, generate payment links, track transactions, manage subscriptions, and automate payment workflo...

🦀 ClawHub

Phone Voice Agent

Run a real-time AI phone agent using Twilio, Deepgram, and ElevenLabs. Handles incoming calls, transcribes audio, generates responses via LLM, and speaks back via streaming TTS. Use when user wants to: (1) Test voice AI capabilities, (2) Handle phone calls programmatically, (3) Build a conversational voice bot.

🦀 ClawHub

Business Document Generator

Generate professional, customizable business documents including proposals, quotes, invoices, contracts, and letters tailored to your industry and needs.

🦀 ClawHub

WeChat Video Editor - AI Video Editing for Douyin Xiaohongshu and TikTok

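Supports export in WeChat Channels, Douyin, Xiaohongshu, and TikTok formats. Edit through Chinese-language chat, with no need to open any software. AI video creation and editing: generate videos from text descriptions, edit with background music, sound effects...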

🦀 ClawHub

notebooklm-cli

Command-line interface to manage Google NotebookLM notebooks, sources, and generate audio, quizzes, reports, presentations, and visual study materials progra...

🦀 ClawHub

Slides/PPT generation and voice narration

AI-powered presentation generation using 2slides API. Create slides from text content, match reference image styles, or summarize documents into presentations. Use when users request to "create a presentation", "make slides", "generate a deck", "create slides from this content/document/image", or any presentation creation task. Supports theme selection, multiple languages, and both synchronous and asynchronous generation modes.

🦀 ClawHub

FlowVoice — Clone Any Voice From a Short Audio Sample

Clone any voice from a short audio sample and generate speech with it. Powered by LuxTTS (150x realtime, local, free, no API key). Use when asked to clone a...

🦀 ClawHub

Suno AI

Generate music via Suno with the local browser-backed flow. Use when the user wants Suno songs, instrumental tracks, lyric-based songs, Suno credit checks, o...

🦀 ClawHub

Video Subtitles

Generate SRT subtitles from video/audio with translation support. Transcribes Hebrew (ivrit.ai) and English (whisper), translates between languages, burns subtitles into video. Use for creating captions, transcripts, or hardcoded subtitles for WhatsApp/social media.

🦀 ClawHub

AudioPod

Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise reduction, speech-to-text transcription, speaker separation, and media extraction. Use when the user needs to generate music/songs/rap from text, split a song into stems/vocals/instruments, generate speech from text, clean up noisy audio, transcribe audio/video, or extract audio from YouTube/URLs. Requires AUDIOPOD_API

🦀 ClawHub

Nex Einvoice

Generate Belgian-compliant e-invoices in the Peppol BIS 3.0 UBL format from natural language input in Dutch or English, satisfying mandatory requirements for...

🦀 ClawHub

add narration to a video automatically

Generate narration for silent screen-recording videos. Extracts key frames, analyzes on-screen content, writes a presentation-style voiceover script, synthes...

🦀 ClawHub

Podcast Generation with Microsoft Foundry

Generate AI-powered podcast-style audio narratives using Azure OpenAI's GPT Realtime Mini model via WebSocket. Use when building text-to-speech features, audio narrative generation, podcast creation from content, or integrating with Azure OpenAI Realtime API for real audio output. Covers full-stack implementation from React frontend to Python FastAPI backend with WebSocket streaming.

🦀 ClawHub

seedance2.0-guide

The ultimate Seedance 2.0 storyboard director. Generate movie-grade 9:16 vlogs, cinematic prompts, and auto-audio scripts from multimodal inputs. Optimized f...

🦀 ClawHub

Generate ai Music

AI music generation assistant powered by MakebestMusic. Use when user wants to create AI-generated music, songs, or audio tracks. Perfect for content creator...

🦀 ClawHub

Ai Music

AI music generation assistant powered by MakebestMusic. Use when user wants to create AI-generated music, songs, or audio tracks. Perfect for content creator...

🦀 ClawHub

Text to Music

AI music generation assistant powered by MakebestMusic. Use when user wants to create AI-generated music, songs, or audio tracks. Perfect for content creator...

🦀 ClawHub

generate-drama

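Automatically generate a multi-character audio drama from a given topic, calling the SenseAudio TTS API to synthesize the audio and stitch the clips into the final output.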

🦀 ClawHub

Comfy Story Video

Generate illustrated children's story videos with AI images and TTS narration using ComfyUI running locally.

🦀 ClawHub

Ai Humanizer Backup

Humanize AI-generated text by detecting and removing patterns typical of LLM output. Rewrites text to sound natural, specific, and human. Uses 24 pattern det...

🦀 ClawHub

xiaomi-mimo-v2-tts

Generate speech audio (WAV) from text using Xiaomi MiMo TTS (mimo-v2-tts model). Supports preset voices (mimo_default, default_zh, default_en), style control...

🦀 ClawHub

Giggle Generation Music

Use when the user wants to create, generate, or compose music—whether from text description, custom lyrics, or instrumental background music. Triggers: gener...

🦀 ClawHub

Book Summary

Generate podcast-style audio scripts summarizing books with 3 key ideas, actionable takeaways, and estimated duration for single-narrator delivery.

🦀 ClawHub

An OpenClaw skill for AI-powered multimedia generation (image, video, audio, 3D) via 170+ RunningHub API endpoints — zero dependencies, pure Python.

Generate images, videos, audio, and 3D models via RunningHub API (170+ endpoints) and run any RunningHub AI Application (custom ComfyUI workflow) by webappId...

🦀 ClawHub

VoiceClaw

Local voice I/O for OpenClaw agents. Transcribe inbound audio/voice messages using local Whisper (whisper.cpp) and generate voice replies using local Piper T...

🦀 ClawHub

AI video creation and editing — generate videos from text descriptions, edit with background music, sound effects, titles, transitions, and export finished M...

🦀 ClawHub

TTS

Use this skill whenever the user wants to convert text to speech, generate audio from text, create voiceovers, or produce spoken audio files. Triggers includ...

🦀 ClawHub

Freepik

Generate images, videos, icons, audio, and more using Freepik's AI API. Supports Mystic, Flux, Kling, Hailuo, Seedream, RunWay, Magnific upscaling, stock con...

🦀 ClawHub

Groq Voice Transcriber

Automatically transcribes Telegram voice messages using Groq Whisper API and replies with text generated by an LLM.

🦀 ClawHub

Ai Sdk Core

Build backend AI with Vercel AI SDK v6 stable. Covers Output API (replaces generateObject/streamObject), speech synthesis, transcription, embeddings, MCP tools with security guidance. Includes v4→v5 migration and 15 error solutions with workarounds. Use when: implementing AI SDK v5/v6, migrating versions, troubleshooting AI_APICallError, Workers startup issues, Output API errors, Gemini caching issues, Anthropic tool errors, MCP tools, or stream resumption failures.

🦀 ClawHub

Dual-Host Daily Podcast Generator

Generate and publish a dual-host daily podcast. Fetches news, generates a conversational script between two hosts, synthesizes audio via Fish Audio or Edge T...

🦀 ClawHub

iMessage Voice Reply

Send voice message replies in iMessage using local Kokoro-ONNX TTS. Generates native iMessage voice bubbles (CAF/Opus) that play inline with waveform — not f...

🦀 ClawHub

speaker-local

Text-to-speech using Kokoro local TTS. Use when the user wants to convert text to audio, read aloud, or generate speech.

🦀 ClawHub

ACE-Step Music Generation

Generate high-quality music on Apple Silicon Macs using ACE-Step 1.5 with MLX backend, supporting custom prompts, durations, and output formats.

🦀 ClawHub

Audio Gen 1.0.0

Generate audiobooks, podcasts, or educational audio content on demand. User provides an idea or topic, Claude AI writes a script, and ElevenLabs converts it...

🦀 ClawHub

Invoicy

Generate, download, and email professional invoices with GST/IGST support and flexible payment terms.

🦀 ClawHub

Humanizer

Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Combines Wikipedia's "Sig...

🦀 ClawHub

SatsRail MCP — Bitcoin Lightning Payments for AI Agents

Enable AI agents to create Bitcoin Lightning payment orders, generate invoices, check payment status, and manage payments via natural language with SatsRail...

🦀 ClawHub

Vidu API comic strip short film generation capability, with built-in AI-generated videos, images, and TTS.

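Turn a user's idea or script into a complete anime film, using the Vidu API for the entire pipeline from scriptwriting to automatic stitching, including image generation, video generation, and TTS; non-Vidu models are prohibited. Use when the user needs to produce an anime or animated short, or provides a creative theme or detailed script requirements. Depends on ffmpeg and configured Vidu API credentials.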

🦀 ClawHub

Productivity Improving

Personal productivity tracking and analysis skill. Records work and life activities via voice/text input, tracks time, categorizes tasks, and generates daily...

🦀 ClawHub

Podcast Show Notes Mcp

Generate podcast show notes from audio: timestamps, topics, guest bios, key quotes, SEO summaries.

🦀 ClawHub

rupali

Playful virtual girlfriend voice companion. Use when the user wants short, flirty, friendly text replies returned as Bulbul v3 audio across chat channels (Discord/Telegram/WhatsApp). Generate a brief response, then synthesize and send MP3.
