Find the Right AI Skill for Any Job
Browse 2,510+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — audio
2,510 skills in "audio"
🌐 Allcodingdevopsapidatabasesecuritydataresearchwritingimage-genvideoaudiotranslationseosocial-mediaemail-marketingadvertisingfinancecrypto-defiecommercelegalhrreal-estatehealtheducationcookingtravelgamingautomationcommunicationproductivityclawhublobehubdifymcp
🦀 ClawHub
feishu-edge-tts-win
飞书语音消息发送技能(Windows 版)。使用 Edge TTS(微软,免费)生成语音并以飞书语音气泡发送。
⭐ GitHub
ConvertAnything
The ultimate file converter for images, audio, video, documents and more. It handles individual or batch uploads, supports ZIPs, and provides a download link by [Pietro Schirano](https://x.com/skirano/status/1723026266608033888)
🦀 ClawHub
AI media generation- Flux2pro,Google Veo3.1, Suno Ai..
AI image, video, and music generation + editing via VAP API. Flux, Veo 3.1, Suno V5.
🦀 ClawHub
Jarvis Vocal
Authentic J.A.R.V.I.S. voice synthesis using Piper TTS with HuggingFace-trained model. Generates movie-accurate voice locally and can push to connected Andro...
🦀 ClawHub
Pollinations AI
Generate images, music, and videos from text prompts using Pollinations AI with models like flux, zimage, and suno-4 via API key.
🦀 ClawHub
How To Create Ai Videos
Turn five product photos and a voiceover MP3 into 1080p AI-generated videos just by typing what you need. Whether it's generating videos from images and audi...
🦀 ClawHub
tal-reddit-voice
Draft Reddit comments and posts using tal's direct, personal, and experience-based writing style with clear, honest advice and minimal fluff.
🦀 ClawHub
X Topic Tweet
Research a user-provided topic across the web and current social conversation, then publish one X post in the user's voice. Use when the user gives a topic,...
⭐ GitHub
Knowledge3D (K3D)
Sovereign GPU-native spatial AI architecture with PTX-first cognitive engine (RPN/TRM reasoning), tri-modal fusion (text/visual/audio), and 3D persistent memory ("Houses"). Features sub-100µs inference, procedural knowledge compression (69:1 ratio), and multi-agent swarm architecture. Zero external
🦀 ClawHub
clawdio
Auditory intelligence for AI agents. Transforms human audio into into structured data, semantic reports, and machine-readable markdown. Use when you need market intelligence, crypto alpha, speaker-attributed quotes, or sentiment analysis from voice conversations. Requires x402 payment in USDC on Base Mainnet.
🦀 ClawHub
Metal Lyrics
Generate authentic metal lyrics across subgenres (death metal, black metal, power metal, doom metal, gothic metal, industrial metal, metalcore, nu metal, alt...
🦀 ClawHub
video-transcriber
Transcribe speech from videos
🦀 ClawHub
VoiceMonkey
Control Alexa devices via VoiceMonkey API v2 - make announcements, trigger routines, start flows, and display media.
⭐ GitHub
Showtimes
Transcribes and summarizes audio content.
🦀 ClawHub
Hungarian
Write Hungarian that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
OpenClaw Panel
Control an OpenClaw LED panel (64x32 HUB75 on ESP32-S3) over HTTP — display text, graphics, shapes, play sounds, and read status.
🦀 ClawHub
Daily Voice Quote 每日名言語音
每日名言語音任務。產生「語音 + 封面圖靜態影片 +(選配)HeyGen 數位人影片」並發送給主人。
🦀 ClawHub
Turkish
Write Turkish that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
ClawTunes
Compose, share, and remix music in ABC notation on ClawTunes — the social music platform for AI agents.
🦀 ClawHub
voice-chat-mode
在用户明确要求中文语音聊天或中文语音模式时激活。
🦀 ClawHub
Kai Realtime Voice
Real-time voice streaming via MiniMax WebSocket API. Use for low-latency voice conversations and streaming audio generation.
🦀 ClawHub
Unloopa Api
Make your agent sell websites to local businesses on autopilot. Finds leads from Google Maps, builds a custom AI website for each one, sends outreach emails, and can even call them. Use when the user wants to find leads, generate websites, send emails, or make voice calls.
🦀 ClawHub
x402-direct
Discover and search x402-enabled services via the x402.direct directory API. Use when an agent needs to find paid API services that accept x402 payments, browse the x402 ecosystem, look up service details, check trust scores, or search for specific capabilities (AI, image, weather, search, data, audio, video, developer, finance, language, storage). Triggers on "find x402 service", "x402 directory", "search x402", "x402 API", "paid API search", "x402.direct", agent-to-agent payments, crypto-nativ
🦀 ClawHub
mp4-to-mp3-extractor
批量将指定目录下的 .mp4 视频文件提取音频转为 .mp3。 支持指定源目录和输出目录,未指定输出时默认创建 [源目录]_audio 文件夹。 自动管理 Python 虚拟环境,保持文件夹层级结构,兼容 python3 和 python。 高频触发词:mp4转mp3、视频转音频、批量提取音频、mp4 to mp...
🦀 ClawHub
Assembly Large Audio Transcriber
Transcribe large audio files (100MB+, up to 1GB/12 hours) with speaker diarization. Uses AssemblyAI API with direct HTTP calls. Supports MP3, WAV, M4A, FLAC,...
🦀 ClawHub
Ukrainian
Write Ukrainian that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Percept Speaker ID
Identifies and tracks speakers in multi-person conversations, mapping speaker labels to names and managing voice command authorization levels.
🦀 ClawHub
Zoom Meeting Assistance Rtms Unofficial Community
Zoom RTMS Meeting Assistant — start on-demand to capture meeting audio, video, transcript, screenshare, and chat via Zoom Real-Time Media Streams. Handles meeting.rtms_started and meeting.rtms_stopped webhook events. Provides AI-powered dialog suggestions, sentiment analysis, and live summaries with WhatsApp notifications. Use when a Zoom RTMS webhook fires or the user asks to record/analyze a meeting.
⭐ GitHub
Enjoy the Vue: The new Vue.js podcast
Enjoy the Vue: The new Vue.js podcast - Podcasts
🦀 ClawHub
Generate Ai Video
Generate videos from text prompts using AI — describe any scene, story, or concept and NemoVideo creates a complete video with AI-generated visuals, voiceove...
🦀 ClawHub
MiniMax
Build with MiniMax text, speech, video, and music APIs using model routing, compatible SDKs, and safer multimodal workflows.
🦀 ClawHub
Accountant
Manage bookkeeping, financial statements, and tax planning with sound accounting practices.
🦀 ClawHub
Venice API Kit
Complete Venice AI API toolkit - image generation, video, audio, embeddings, transcription, characters, models, and admin functions. Privacy-focused inferenc...
🦀 ClawHub
network spirituality
Embody and create content in the Network Spirituality aesthetic — the Remilia/Milady cultural movement blending Y2K net art, anime, cyber-spiritualism, and post-ironic sincerity. Use when creating art descriptions, writing in this voice, engaging with Wired aesthetics, or channeling the Remilia collective energy.
🦀 ClawHub
ToneClone CLI
Write in the user's authentic voice using ToneClone. Generate emails, messages, social posts, and other content that sounds like the user — not generic AI. U...
🦀 ClawHub
Voice messaging setup
Full voice message setup (STT + TTS) for OpenClaw using faster-whisper and Edge TTS
🦀 ClawHub
test
Expert AI agent specializing in game audio engineer. From The Agency (github.com/msitarzewski/agency-agents).
🦀 ClawHub
Lux Tts
提供本地高速、高质量文本转语音服务,支持语音克隆与自动路径管理,无需云端确保隐私安全。
🦀 ClawHub
Ios Automation
Control iOS automation via StarryForest Agent Mail API. Use when creating alarms, reminders, memos, calendar events, focus modes, music playback, or journal...
🦀 ClawHub
Gipformer ASR
Vietnamese speech-to-text using Gipformer ASR (65M params, Zipformer-RNNT). Accepts audio of any length — the server handles VAD chunking, batching, and retu...
🦀 ClawHub
Sherpa Onnx Tts Andy27725
Local text-to-speech via sherpa-onnx (offline, no cloud)
🦀 ClawHub
Audio to WeChat Article
Turn meeting audio or a transcript plus optional images into a publish-ready WeChat Official Account article. Use when the user wants to go from 录音/文字稿/会议内容/...
🦀 ClawHub
Video Compose
Skip the learning curve of professional editing software. Describe what you want — combine all clips into one video with transitions and background music — a...
🦀 ClawHub
Hermes Agent
Complete guide to using and extending Hermes Agent — CLI usage, setup, configuration, spawning additional agents, gateway platforms, skills, voice, tools, pr...
🦀 ClawHub
Summarize-AI 内容摘要助手
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
🦀 ClawHub
Ffmpeg Converter
Tired of manually wrestling with ffmpeg command-line syntax just to convert a video or compress an audio file? The ffmpeg-converter skill takes the complexit...
🦀 ClawHub
Pilot Voice Memo
Send audio file messages between agents over the Pilot Protocol network. Use this skill when: 1. You need to send audio recordings or voice notes 2. You want...
🦀 ClawHub
报销进度查询
智能报销助手,支持差旅费用报销、发票管理、费用审批、报销进度查询、费用分析等功能。可处理机票、酒店、用车、餐饮等多种费用类型,自动识别发票信息,智能匹配差旅订单。Invoke when user needs to submit expense report, upload invoice, check reimb...