Find the Right AI Skill for Any Job
Browse 401+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — audio
401 skills in "audio" matching "Generate"
🦀 ClawHub
AI Music Video
Generate AI music videos end-to-end. Creates music with Suno (sunoapi.org), generates visuals with OpenAI/Seedream/Google/Seedance, and assembles into music...
🦀 ClawHub
Agent Tool Scout
Give AI hands to control any Mac app. Auto-discover installed apps, generate CLI wrappers, return structured JSON. Works with Music, Finder, Chrome, Word, Fi...
🦀 ClawHub
Tomoviee Video Background Music
Generate music tailored to video content. Use when users request video_soundtrack operations or related tasks.
🦀 ClawHub
Construction Daily Report Generator
Generate a structured daily site progress report from unstructured input such as voice transcription, rough notes, or conversational messages.
🦀 ClawHub
Construction Meeting Minutes Generator
Generate structured construction meeting minutes from rough notes or voice transcription, with separated action items, decision tracking, and contractual fla...
🦀 ClawHub
Mayar.id Payment
Integrate Mayar.id for Indonesian payments to create invoices, generate payment links, track transactions, manage subscriptions, and automate payment workflo...
🦀 ClawHub
Phone Voice Agent
Run a real-time AI phone agent using Twilio, Deepgram, and ElevenLabs. Handles incoming calls, transcribes audio, generates responses via LLM, and speaks back via streaming TTS. Use when user wants to: (1) Test voice AI capabilities, (2) Handle phone calls programmatically, (3) Build a conversational voice bot.
🦀 ClawHub
Business Document Generator
Generate professional, customizable business documents including proposals, quotes, invoices, contracts, and letters tailored to your industry and needs.
🦀 ClawHub
WeChat Video Editor - AI Video Editing for Douyin Xiaohongshu and TikTok
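Supports export in WeChat Channels, Douyin, Xiaohongshu, and TikTok formats. Edit through Chinese-language conversation, with no need to open any software. AI video creation and editing: generate videos from text descriptions, edit with background music, sound effects...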
🦀 ClawHub
notebooklm-cli
Command-line interface to manage Google NotebookLM notebooks, sources, and generate audio, quizzes, reports, presentations, and visual study materials progra...
🦀 ClawHub
Slides/PPT generation and voice narration
AI-powered presentation generation using 2slides API. Create slides from text content, match reference image styles, or summarize documents into presentations. Use when users request to "create a presentation", "make slides", "generate a deck", "create slides from this content/document/image", or any presentation creation task. Supports theme selection, multiple languages, and both synchronous and asynchronous generation modes.
🦀 ClawHub
FlowVoice — Clone Any Voice From a Short Audio Sample
Clone any voice from a short audio sample and generate speech with it. Powered by LuxTTS (150x realtime, local, free, no API key). Use when asked to clone a...
🦀 ClawHub
Suno AI
Generate music via Suno with the local browser-backed flow. Use when the user wants Suno songs, instrumental tracks, lyric-based songs, Suno credit checks, o...
🦀 ClawHub
Video Subtitles
Generate SRT subtitles from video/audio with translation support. Transcribes Hebrew (ivrit.ai) and English (whisper), translates between languages, burns subtitles into video. Use for creating captions, transcripts, or hardcoded subtitles for WhatsApp/social media.
🦀 ClawHub
AudioPod
Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise reduction, speech-to-text transcription, speaker separation, and media extraction. Use when the user needs to generate music/songs/rap from text, split a song into stems/vocals/instruments, generate speech from text, clean up noisy audio, transcribe audio/video, or extract audio from YouTube/URLs. Requires AUDIOPOD_API
🦀 ClawHub
Nex Einvoice
Generate Belgian-compliant e-invoices in the Peppol BIS 3.0 UBL format from natural language input in Dutch or English, satisfying mandatory requirements for...
🦀 ClawHub
add narration to a video automatically
Generate narration for silent screen-recording videos. Extracts key frames, analyzes on-screen content, writes a presentation-style voiceover script, synthes...
🦀 ClawHub
Podcast Generation with Microsoft Foundry
Generate AI-powered podcast-style audio narratives using Azure OpenAI's GPT Realtime Mini model via WebSocket. Use when building text-to-speech features, audio narrative generation, podcast creation from content, or integrating with Azure OpenAI Realtime API for real audio output. Covers full-stack implementation from React frontend to Python FastAPI backend with WebSocket streaming.
🦀 ClawHub
seedance2.0-guide
The ultimate Seedance 2.0 storyboard director. Generate movie-grade 9:16 vlogs, cinematic prompts, and auto-audio scripts from multimodal inputs. Optimized f...
🦀 ClawHub
Generate ai Music
AI music generation assistant powered by MakebestMusic. Use when user wants to create AI-generated music, songs, or audio tracks. Perfect for content creator...
🦀 ClawHub
Ai Music
AI music generation assistant powered by MakebestMusic. Use when user wants to create AI-generated music, songs, or audio tracks. Perfect for content creator...
🦀 ClawHub
Text to Music
AI music generation assistant powered by MakebestMusic. Use when user wants to create AI-generated music, songs, or audio tracks. Perfect for content creator...
🦀 ClawHub
generate-drama
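Automatically generate a multi-character audio short drama from a theme, calling the SenseAudio TTS API to synthesize the audio and stitch it into the final output.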
🦀 ClawHub
Comfy Story Video
Generate illustrated children's story videos with AI images and TTS narration using ComfyUI running locally.
🦀 ClawHub
Ai Humanizer Backup
Humanize AI-generated text by detecting and removing patterns typical of LLM output. Rewrites text to sound natural, specific, and human. Uses 24 pattern det...
🦀 ClawHub
xiaomi-mimo-v2-tts
Generate speech audio (WAV) from text using Xiaomi MiMo TTS (mimo-v2-tts model). Supports preset voices (mimo_default, default_zh, default_en), style control...
🦀 ClawHub
Giggle Generation Music
Use when the user wants to create, generate, or compose music—whether from text description, custom lyrics, or instrumental background music. Triggers: gener...
🦀 ClawHub
Book Summary
Generate podcast-style audio scripts summarizing books with 3 key ideas, actionable takeaways, and estimated duration for single-narrator delivery.
🦀 ClawHub
An OpenClaw skill for AI-powered multimedia generation (image, video, audio, 3D) via 170+ RunningHub API endpoints — zero dependencies, pure Python.
Generate images, videos, audio, and 3D models via RunningHub API (170+ endpoints) and run any RunningHub AI Application (custom ComfyUI workflow) by webappId...
🦀 ClawHub
VoiceClaw
Local voice I/O for OpenClaw agents. Transcribe inbound audio/voice messages using local Whisper (whisper.cpp) and generate voice replies using local Piper T...
🦀 ClawHub
B
AI video creation and editing — generate videos from text descriptions, edit with background music, sound effects, titles, transitions, and export finished M...
🦀 ClawHub
A
AI video creation and editing — generate videos from text descriptions, edit with background music, sound effects, titles, transitions, and export finished M...
🦀 ClawHub
TTS
Use this skill whenever the user wants to convert text to speech, generate audio from text, create voiceovers, or produce spoken audio files. Triggers includ...
🦀 ClawHub
Freepik
Generate images, videos, icons, audio, and more using Freepik's AI API. Supports Mystic, Flux, Kling, Hailuo, Seedream, RunWay, Magnific upscaling, stock con...
🦀 ClawHub
Groq Voice Transcriber
Automatically transcribes Telegram voice messages using Groq Whisper API and replies with text generated by an LLM.
🦀 ClawHub
Ai Sdk Core
Build backend AI with Vercel AI SDK v6 stable. Covers Output API (replaces generateObject/streamObject), speech synthesis, transcription, embeddings, MCP tools with security guidance. Includes v4→v5 migration and 15 error solutions with workarounds.
Use when: implementing AI SDK v5/v6, migrating versions, troubleshooting AI_APICallError, Workers startup issues, Output API errors, Gemini caching issues, Anthropic tool errors, MCP tools, or stream resumption failures.
🦀 ClawHub
Dual-Host Daily Podcast Generator
Generate and publish a dual-host daily podcast. Fetches news, generates a conversational script between two hosts, synthesizes audio via Fish Audio or Edge T...
🦀 ClawHub
iMessage Voice Reply
Send voice message replies in iMessage using local Kokoro-ONNX TTS. Generates native iMessage voice bubbles (CAF/Opus) that play inline with waveform — not f...
🦀 ClawHub
speaker-local
Text-to-speech using Kokoro local TTS. Use when the user wants to convert text to audio, read aloud, or generate speech.
🦀 ClawHub
ACE-Step Music Generation
Generate high-quality music on Apple Silicon Macs using ACE-Step 1.5 with MLX backend, supporting custom prompts, durations, and output formats.
🦀 ClawHub
Audio Gen 1.0.0
Generate audiobooks, podcasts, or educational audio content on demand. User provides an idea or topic, Claude AI writes a script, and ElevenLabs converts it...
🦀 ClawHub
Invoicy
Generate, download, and email professional invoices with GST/IGST support and flexible payment terms.
🦀 ClawHub
Humanizer
Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Combines Wikipedia's "Sig...
🦀 ClawHub
SatsRail MCP — Bitcoin Lightning Payments for AI Agents
Enable AI agents to create Bitcoin Lightning payment orders, generate invoices, check payment status, and manage payments via natural language with SatsRail...
🦀 ClawHub
Vidu API comic strip short film generation capability, with built-in AI-generated videos, images, and TTS.
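Turn a user's idea or script into a complete finished anime film. The whole pipeline, from scriptwriting to automatic stitching, uses the Vidu API for image generation, video generation, and TTS, and no non-Vidu models may be used. Use when the user wants to make an anime or animated short, or provides a creative theme or detailed script requirements. Depends on ffmpeg and configured Vidu API credentials.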
🦀 ClawHub
Productivity Improving
Personal productivity tracking and analysis skill. Records work and life activities via voice/text input, tracks time, categorizes tasks, and generates daily...
🦀 ClawHub
Podcast Show Notes Mcp
Generate podcast show notes from audio: timestamps, topics, guest bios, key quotes, SEO summaries.
🦀 ClawHub
rupali
Playful virtual girlfriend voice companion. Use when the user wants short, flirty, friendly text replies returned as Bulbul v3 audio across chat channels (Discord/Telegram/WhatsApp). Generate a brief response, then synthesize and send MP3.