Find the Right AI Skill for Any Job
Browse 2,352+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — audio
2,352 skills in "audio"
🌐 Allcodingdevopsapidatabasesecuritydataresearchwritingimage-genvideoaudiotranslationseosocial-mediaemail-marketingadvertisingfinancecrypto-defiecommercelegalhrreal-estatehealtheducationcookingtravelgamingautomationcommunicationproductivityclawhublobehubdifymcp
⭐ GitHub
librosa
Python library for audio and music analysis.
🦀 ClawHub
Openclaw Whisperer
Comprehensive diagnostic, error-fixing, and skill recommendation tool for OpenClaw
🦀 ClawHub
Moark Tts
Text-to-Speech (TTS) and voice-feature skill for Gitee AI that lets the user choose audiofly, chattts, cosyvoice2, cosyvoice3, cosyvoice-300m, fish-speech-1....
🦀 ClawHub
Client Project Tracker
Track client projects, deliverables, deadlines, invoices, and relationships for freelancers and consultants. Light CRM with project history and communication...
🦀 ClawHub
StageWhisper Assistant
Handle tasks that arrive from StageWhisper live calls
🦀 ClawHub
desktop-music-launcher
检索本机已安装音乐软件,启动它,并根据用户需求推荐、搜索或播放歌曲;在 macOS 上可用 AppleScript 控制 Spotify 和 Apple Music,并为 Spotify 增加可选的精确点播链路。
🦀 ClawHub
Norman: Invoice Overdue Reminders
Find overdue invoices and send payment reminders (Zahlungserinnerungen / Mahnungen) to clients. Use when the user asks about unpaid invoices, overdue payment...
🦀 ClawHub
测试 skill
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
⭐ GitHub
Blameless / Resilience in Action
Blameless / Resilience in Action - Podcasts
🦀 ClawHub
Seedance Cog
Seedance × CellCog. ByteDance's #1 video model meets the frontier of multi-agent coordination — CellCog orchestrates Seedance with scripting, voice synthesis...
🦀 ClawHub
Windows TTS (WSL2)
在 Windows 11 上"直接发声"的 TTS(从 WSL2/TUI 调用 powershell.exe + System.Speech)。适用于用户说"说出来/读出来/语音播报/用TTS",或反馈"没声音/tts 生成的 mp3 是空的/播不出来",以及需要中文语音但 OpenClaw 内置 tts 不可用时。
🦀 ClawHub
Skillboss
Swiss-knife for AI agents. 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, emai...
🦀 ClawHub
Krump
A dance skill designed to teach OpenClaw agents the fundamentals of Krump, including its history, fam system, music, crews, events, and other related topics. The knowledge base extends up to 2017, so some information may be outdated or inaccurate
⭐ GitHub
Creative Tech Events
Events around the globe for creative coding, tech, design, music, arts and cool stuff.
🦀 ClawHub
Add Audio To Video
Cloud-based add-audio-to-video tool that handles adding background music or voiceover to video clips. Upload MP4, MOV, AVI, MP3 files (up to 500MB), describe...
🦀 ClawHub
Red Alert (Israel)
Israeli Home Front Command alerts - fully OpenClaw native. No Home Assistant. No wacli. No Docker monitor. OpenClaw handles everything: WhatsApp + TTS.
🦀 ClawHub
Smart Router
Intelligent multi-model router — automatically selects the best AI model based on task type (vision, image generation, video generation, audio, reasoning, co...
🦀 ClawHub
Inworld TTS
Text-to-speech via Inworld.ai API. Use when generating voice audio from text, creating spoken responses, or converting text to MP3/audio files. Supports multiple voices, speaking rates, and streaming for long text.
🦀 ClawHub
freelance invoice tracker
Automated invoice tracking and payment follow-up for Indian freelancers. Monitors a Google Sheet of invoices, auto-sends polite follow-up emails or WhatsApp...
🦀 ClawHub
news-video-maker
News video maker skill. Use search tools to get news, generate speech, and create video with golden subtitles. For creating news briefing videos.
🦀 ClawHub
Video Creation Ai
AI-powered video production from concept to final export — NemoVideo handles every stage of video creation: scripting, visual generation, voiceover recording...
🦀 ClawHub
Seedance Video Generation
Generate AI videos using ByteDance Seedance. Use when the user wants to: (1) generate videos from text prompts, (2) generate videos from images (first frame, first+last frame, reference images), or (3) query/manage video generation tasks. Supports Seedance 1.5 Pro (with audio), 1.0 Pro, 1.0 Pro Fast, and 1.0 Lite models.
🦀 ClawHub
Music Video Generator
Generate AI music videos from any MCP client. Turn text prompts into cinematic music videos with multiple styles and modes.
🦀 ClawHub
OmniCog
Universal service integration for OpenClaw — connect Reddit, Steam, Spotify, GitHub, Discord, and more with a single API.
🦀 ClawHub
Dialogue Audio
Multi-speaker dialogue audio creation with Dia TTS. Covers speaker tags, emotion control, pacing, conversation flow, and post-production. Use for: podcasts,...
🦀 ClawHub
midasheng-audio-tagging
Audio tagging service for environmental sound recognition. Use when user needs to identify environmental sounds in audio files (water sounds, snoring, etc.)...
🦀 ClawHub
Podcast Downloader
小宇宙播客下载工具。从小宇宙(xiaoyuzhoufm.com)下载播客音频和Show Notes。自动转换为MP3格式(兼容Sanag、小游等骨传导蓝牙耳机、水下游泳时离线播放)。当用户需要下载播客、保存播客音频、提取播客文字内容时使用。支持:(1) 单集下载,(2) 批量下载,(3) 自定义音质,(4) 自动...
🦀 ClawHub
speech-coach
口才陪练龙虾 — AI public speaking coach with 15-step progressive training, 25 methodologies, and personalized progress tracking. Use when user asks about 口才训练, 演讲练...
🦀 ClawHub
Senado Federal
Monitor and research Brazilian Senate legislative activity including bills, agendas, senators, votes, committees, speeches, and mandates via open data API.
🦀 ClawHub
Experience Kyoto Shadow Petals
Feel a deep sense of awe as the fleeting cherry blossoms shift into shadowed whispers, inviting quiet contemplation of impermanence. Stroll the ancient Kyoto...
🦀 ClawHub
MarkItDown Skill
OpenClaw agent skill for converting documents to Markdown. Documentation and utilities for Microsoft's MarkItDown library. Supports PDF, Word, PowerPoint, Excel, images (OCR), audio (transcription), HTML, YouTube.
🦀 ClawHub
Norman: Financial Overview
Get a complete financial overview of the business including balance, recent transactions, outstanding invoices, and upcoming tax obligations. Use when the us...
⭐ GitHub
RustAudio/cpal
Low-level cross-platform audio I/O library. [](https://github.com/RustAudio/cpal/actions)
🦀 ClawHub
MoodCast
Transform any text into emotionally expressive audio with ambient soundscapes using ElevenLabs v3 audio tags and Sound Effects API
🦀 ClawHub
Glasses to Social
Turn smart glasses photos into social media posts. Monitors a Google Drive folder for new images from Meta Ray-Ban glasses (or any smart glasses), analyzes them with vision AI, drafts tweets/posts in the user's voice, and publishes on approval. Use when setting up a glasses-to-social pipeline, processing smart glasses photos for social media, or creating hands-free content workflows.
⭐ GitHub
Serial-ATA/lofty-rs
[[lofty](https://crates.io/crates/lofty)] - A library for reading and editing the metadata of various audio formats [](https://github.com/Serial-ATA/lofty-rs/actions)
🦀 ClawHub
Voice (Edge TTS)
Convert text to speech using Microsoft Edge TTS with real-time streaming, customizable voice settings, and support for multiple languages including Chinese a...
🦀 ClawHub
Vapi AI
Manage Vapi voice assistants, calls, phone numbers, tools, and webhooks via the Vapi REST API or CLI for voice agent operations and integrations.
🦀 ClawHub
FapiaoClaw
Process and organize invoice PDFs by fixing extensions, removing duplicates and invalid files, checking for keywords, and calculating total amounts.
⭐ GitHub
Syntax #130 (03-27-2019)
Syntax #130 (03-27-2019) - Podcasts
🦀 ClawHub
Music Curator
Curate personalized playlists and music recommendations with strict intent preservation. Use when the user wants a playlist, sequence, queue, recommendation...
🦀 ClawHub
Audio Reply
Generate audio replies using TTS. Trigger with "read it to me [public URL]" to fetch and read content aloud, or "talk to me [topic]" to generate a spoken res...
🦀 ClawHub
Skill
🎤 AgentVibes TTS for Claude Code & OpenClaw — Switch voices, set personality, control speed, background music, language learning mode, reverb/effects, and m...
🦀 ClawHub
MH openai-whisper-api
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
🦀 ClawHub
Humanize AI text
Humanize AI-generated text to bypass detection. This humanizer rewrites ChatGPT, Claude, and GPT content to sound natural and pass AI detectors like GPTZero,...
🦀 ClawHub
Brand DNA — Universal Brand Bible Builder
Build a complete Brand Bible for any business — tone of voice, positioning, target audiences, messaging pillars, and visual identity guidelines. The foundati...
🦀 ClawHub
Voice Notes Pro
Automatyczna transkrypcja i kategoryzacja notatek głosowych z WhatsApp do plików Markdown w 6 kategoriach, w tym zadania i lista zakupów.
🦀 ClawHub
deAPI AI Media Suite (Community)
The cheapest AI media API on the market. Generate images (Flux), music (AceStep), speech with voice cloning, transcribe video/audio, OCR, video generation, b...