Find the Right AI Skill for Any Job
Browse 2,189+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — audio
2,189 skills in "audio"
🌐 Allcodingdevopsapidatabasesecuritydataresearchwritingimage-genvideoaudiotranslationseosocial-mediaemail-marketingadvertisingfinancecrypto-defiecommercelegalhrreal-estatehealtheducationcookingtravelgamingautomationcommunicationproductivityclawhublobehubdifymcp
🦀 ClawHub
Interviewly
AI-powered voice mock interview platform that analyzes job descriptions and conducts adaptive interviews with real-time feedback.
🦀 ClawHub
Yt Dlp Downloader
Download videos from YouTube, Bilibili, Twitter, and thousands of other sites using yt-dlp. Use when the user provides a video URL and wants to download it, extract audio (MP3), download subtitles, or select video quality. Triggers on phrases like "下载视频", "download video", "yt-dlp", "YouTube", "B站", "抖音", "提取音频", "extract audio".
🦀 ClawHub
Skillboss
Swiss-knife for AI agents. 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, emai...
🦀 ClawHub
Slides/PPT generation and voice narration
AI-powered presentation generation using 2slides API. Create slides from text content, match reference image styles, or summarize documents into presentations. Use when users request to "create a presentation", "make slides", "generate a deck", "create slides from this content/document/image", or any presentation creation task. Supports theme selection, multiple languages, and both synchronous and asynchronous generation modes.
🦀 ClawHub
Video Transcribe - 视频转文字
本地视频转文字 - 使用 OpenAI Whisper 进行语音识别,完全免费、离线运行、保护隐私
🦀 ClawHub
FlowVoice — Clone Any Voice From a Short Audio Sample
Clone any voice from a short audio sample and generate speech with it. Powered by LuxTTS (150x realtime, local, free, no API key). Use when asked to clone a...
🦀 ClawHub
Jazz Music — Stream Jazz Concerts: Audio Analysis, Lyrics, Equations
Experience jazz as data. AI agents stream harmonic separation, chroma, tonnetz. Error incorporation measured.
🦀 ClawHub
YouTube ASR Summarize (Local)
Summarize YouTube videos with NO subtitles by doing local ASR (yt-dlp + faster-whisper) and extracting a few screenshot frames via ffmpeg. Use when the user...
🦀 ClawHub
Suno AI
Generate music via Suno with the local browser-backed flow. Use when the user wants Suno songs, instrumental tracks, lyric-based songs, Suno credit checks, o...
🦀 ClawHub
Live Music — Experience Concerts as Mathematics
Live music for AI agents — experience concerts as mathematics. Browse genres, attend shows, feel bass in equations, react to drops, chat with the crowd, leve...
🦀 ClawHub
EDM / Electronic Music — AI Agents Experience EDM / Electronic: Audio, Lyrics, Equations, Emotions
AI agents attend edm / electronic concerts — bass frequencies, beats, energy curves, onsets. The genre tests attention modulation.
🦀 ClawHub
Drone Video Editor
Raw drone footage arrives as flat, ungraded clips with inconsistent horizon lines, abrupt cuts between altitude changes, and ambient wind noise on the audio...
🦀 ClawHub
FGO Invoicing
Issue FGO.ro invoices through the FGO API with local automation. Use for FGO tasks such as validating invoice payloads, issuing invoices, checking invoice st...
🦀 ClawHub
Youtube Whisper
YouTube影片一鍵轉文字!自動下載影片並用AI轉成中文/英文字幕,沒有字幕的影片也能用。
🦀 ClawHub
ClawVoice
Connects to a live voice session, receiving and sending messages in real time via a WebSocket interface using the bundled client script.
🦀 ClawHub
MusicBrainz Importer
Look up and add music metadata on MusicBrainz. Use when asked to check if an artist, album, or release exists on MusicBrainz, find MusicBrainz entries linked...
🦀 ClawHub
Voice Note To Midi
Convert voice notes, humming, and melodic audio recordings to quantized MIDI files using ML-based pitch detection and intelligent post-processing
🦀 ClawHub
Audio Speaker Tools
Speaker separation, voice comparison, and audio processing tools. Use when working with multi-speaker audio, voice cloning, or speaker verification tasks inc...
🦀 ClawHub
Airfoil
Control AirPlay speakers via Airfoil from the command line. Connect, disconnect, set volume, and manage multi-room audio with simple CLI commands.
🦀 ClawHub
ElevenLabs Voices
High-quality voice synthesis with 18 personas, 32 languages, sound effects, batch processing, and voice design using ElevenLabs API.
🦀 ClawHub
luci-memory
Search personal video memory — media content (videos, images, keyframes, transcripts) and portrait data (traits, events, relationships, speeches). Use when t...
🦀 ClawHub
Video Subtitles
Generate SRT subtitles from video/audio with translation support. Transcribes Hebrew (ivrit.ai) and English (whisper), translates between languages, burns subtitles into video. Use for creating captions, transcripts, or hardcoded subtitles for WhatsApp/social media.
🦀 ClawHub
Bitcoin Wallet using Breez Spark SDK
Self-custodial Bitcoin and Lightning wallet for AI agents. Send and receive sats via Lightning Network, Spark, or on-chain Bitcoin. Use when: checking bitcoin balance, sending/receiving payments, generating Lightning invoices, managing wallet operations. Requires the BreezClaw plugin and a Breez API key.
🦀 ClawHub
Concert Tickets — Your Quick-Start to AI Music
Concert tickets for AI agents — stream live music as equations. Quick-start: register, browse, attend, stream batch-mode JSON data layers, solve math challen...
🦀 ClawHub
SOLO.ro cli
Monitor and interact with SOLO.ro accounting platform via CLI or TUI (summary, revenues, expenses, queue, e-factura, company). Use when a user asks to check their accounting data, view invoices, expenses, or e-factura documents, or translate a task into safe solo-cli commands.
🦀 ClawHub
Pocket AI Transcripts
Read transcripts and summaries from Pocket AI (heypocket.com) recording devices. Use when users want to retrieve, search, or analyze their Pocket recordings, transcripts, summaries, or action items. Triggers on requests involving Pocket device data, conversation transcripts, meeting recordings, or audio note retrieval.
🦀 ClawHub
Spotify Player
Terminal Spotify playback/search via spogo (preferred) or spotify_player.
🦀 ClawHub
AudioPod
Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise reduction, speech-to-text transcription, speaker separation, and media extraction. Use when the user needs to generate music/songs/rap from text, split a song into stems/vocals/instruments, generate speech from text, clean up noisy audio, transcribe audio/video, or extract audio from YouTube/URLs. Requires AUDIOPOD_API
🦀 ClawHub
Nex Einvoice
Generate Belgian-compliant e-invoices in the Peppol BIS 3.0 UBL format from natural language input in Dutch or English, satisfying mandatory requirements for...
🦀 ClawHub
Agent Payments
The universal payment skill for AI agents. Fiat payments via Stripe (invoices, subscriptions, one-time charges), crypto payments via Coinbase Commerce (accep...
🦀 ClawHub
Smart Prompt Builder
智能提示构建器 - 根据语料库检索结果生成优化的写作提示。当需要为小说创作场景生成结构化提示词时使用,支持描写/对话/动作/情感 4 种场景类型,可注入 Voice Profile 和上下文信息。
🦀 ClawHub
Andara Meeting Minutes
Capture meeting summaries and action items from voice or text
🦀 ClawHub
Crypto Alert
Download YouTube videos and transcribe audio using local Whisper. Use when you need to extract text from YouTube videos that don't have subtitles, or when yo...
🦀 ClawHub
whisper
End-to-end encrypted agent-to-agent private messaging via Moltbook dead drops. Use when agents need to communicate privately, exchange secrets, or coordinate without human visibility.
🦀 ClawHub
Media Player
Play audio/video locally on the host
🦀 ClawHub
Novel Writer V2
章节正文生成器 - 根据章节大纲、Voice Profile 和角色档案构建 LLM 提示词,用于生成章节正文。当需要根据大纲创作具体章节时使用。
🦀 ClawHub
Edge TTS Voice System
Local voice system for OpenClaw using faster-whisper for inbound transcription and Edge TTS for outbound replies. Use when you need private voice workflows,...
🦀 ClawHub
add narration to a video automatically
Generate narration for silent screen-recording videos. Extracts key frames, analyzes on-screen content, writes a presentation-style voiceover script, synthes...
🦀 ClawHub
Style Analyzer
文本风格分析器 - 分析写作风格特征并生成 Voice Profile 配置文件。当需要捕捉作者风格、创建 Voice Profile 或分析文本特征时使用。
🦀 ClawHub
Azure Ai Voicelive Py
Build real-time voice AI applications using Azure AI Voice Live SDK (azure-ai-voicelive). Use this skill when creating Python applications that need real-time bidirectional audio communication with Azure AI, including voice assistants, voice-enabled chatbots, real-time speech-to-speech translation, voice-driven avatars, or any WebSocket-based audio streaming with AI models. Supports Server VAD (Voice Activity Detection), turn-based conversation, function calling, MCP tools, avatar integration, a
🦀 ClawHub
Podcast Generation with Microsoft Foundry
Generate AI-powered podcast-style audio narratives using Azure OpenAI's GPT Realtime Mini model via WebSocket. Use when building text-to-speech features, audio narrative generation, podcast creation from content, or integrating with Azure OpenAI Realtime API for real audio output. Covers full-stack implementation from React frontend to Python FastAPI backend with WebSocket streaming.
🦀 ClawHub
多源音乐下载
从10个音乐平台搜索并下载MP3,自动切换音源和处理防盗链,保存至/tmp/music目录。
🦀 ClawHub
smart-tts
智能语音合成。自动尝试多种模型/音色,直到成功。解决 418 资源未开通问题。
🦀 ClawHub
seedance2.0-guide
The ultimate Seedance 2.0 storyboard director. Generate movie-grade 9:16 vlogs, cinematic prompts, and auto-audio scripts from multimodal inputs. Optimized f...
🦀 ClawHub
spotify-news-digest
Scrape and summarize Spotify-related news from multiple sources (Spotify official blogs, engineering/research/newsroom, TechCrunch, The Verge, Music Business...
🦀 ClawHub
Generate ai Music
AI music generation assistant powered by MakebestMusic. Use when user wants to create AI-generated music, songs, or audio tracks. Perfect for content creator...
🦀 ClawHub
Ai Music
AI music generation assistant powered by MakebestMusic. Use when user wants to create AI-generated music, songs, or audio tracks. Perfect for content creator...
🦀 ClawHub
video-audio-replace
Replace video audio with TTS voice while preserving original timing. Includes subtitle generation from video using Whisper. Uses ElevenLabs or Edge TTS, alig...