Find the Right AI Skill for Any Job
Browse 2,342+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — audio
2,342 skills in "audio"
🌐 Allcodingdevopsapidatabasesecuritydataresearchwritingimage-genvideoaudiotranslationseosocial-mediaemail-marketingadvertisingfinancecrypto-defiecommercelegalhrreal-estatehealtheducationcookingtravelgamingautomationcommunicationproductivityclawhublobehubdifymcp
🦀 ClawHub
Feishu Voice Bubble
Send native voice bubble messages (语音气泡) in Feishu/Lark chats using Edge TTS. Converts text to opus audio via Microsoft Edge TTS (free, no API key needed), t...
🦀 ClawHub
invoice-merger
合并发票文件。PDF 按两两上下排版,图片按四宫格排版,统一裁剪线与安全边距,输出到 YYYYMMDD--已合并 目录,重复执行会自动跳过历史合并文件并按编号继续生成。
🦀 ClawHub
Construction Daily Report Generator
Generate a structured daily site progress report from unstructured input such as voice transcription, rough notes, or conversational messages.
🦀 ClawHub
Humanizer
Remove signs of AI-generated writing from text. Use when editing or reviewing
text to make it sound more natural and human-written. Based on Wikipedia's
comprehensive "Signs of AI writing" guide. Detects and fixes patterns including:
inflated symbolism, promotional language, superficial -ing analyses, vague
attributions, em dash overuse, rule of three, AI vocabulary words, negative
parallelisms, and excessive conjunctive phrases.
🦀 ClawHub
VibeVoice TTS
Local Spanish TTS using Microsoft VibeVoice. Generate natural voice audio from text, optimized for WhatsApp voice messages.
🦀 ClawHub
ElevenLabs Phone Reminder (Lite)
Build AI phone call reminders with ElevenLabs Conversational AI + Twilio. Free starter guide.
🦀 ClawHub
Pipixia Drama Producer
皮皮虾职场短剧全流程制作技能。用于为「皮皮虾」(机械龙虾AI-bot)职场短剧生成镜头视频、剪辑成片、配音配乐并发布到飞书群。完整流程:图生视频(I2V) → ffmpeg规范化+剪辑 → TTS配音 → BGM混音 → 飞书媒体消息发送。当用户提到制作皮皮虾短剧、生成新镜头、剪辑视频、配音配乐、或将视频/音频发...
🦀 ClawHub
Clideo Add Music To Video
Turn a 2-minute MP4 clip and an MP3 song into 1080p music-backed videos just by typing what you need. Whether it's adding background music to video clips or...
🦀 ClawHub
MH summarize
Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).
🦀 ClawHub
Speech to Text Transcription
Transcribe audio and video files to text with speaker detection, timestamps, and format conversion.
🦀 ClawHub
Video Analyzer (TikTok + YouTube + Instagram)
Analyze videos from TikTok, YouTube, Instagram, Twitter, and others by URL, transcribing audio locally and answering questions about the content.
🦀 ClawHub
Music Skill
Search songs, download playable audio, fetch lyrics, parse music share links, configure platform cookies, and switch music sources through a local go-music-a...
🦀 ClawHub
Video Audio Converter
Turn a 3-minute MP4 interview recording into 1080p converted audio files just by typing what you need. Whether it's extracting audio tracks from video files...
🦀 ClawHub
rupali
Playful virtual girlfriend voice companion. Use when the user wants short, flirty, friendly text replies returned as Bulbul v3 audio across chat channels (Discord/Telegram/WhatsApp). Generate a brief response, then synthesize and send MP3.
🦀 ClawHub
Lofy Home
Smart home control for the Lofy AI assistant — scene modes (study, chill, sleep, morning, grind), device management via Home Assistant REST API, presence-based automation, natural language commands for lights, music, thermostat, and PC wake-on-LAN. Use when controlling smart home devices, activating scene modes, or managing home automation.
🦀 ClawHub
article-tts
拍照或文字转音频:文章照片 OCR 提取文字,或直接接收文字,生成 Microsoft Edge TTS 语音,支持中英文、自动转写、语速调节、逐句拆分。| Capture article photos (OCR) or plain text, generate natural audio via Edge TT...
🦀 ClawHub
Vietnamese
Write Vietnamese that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Ai Tool For Video Generation
Skip the learning curve of professional editing software. Describe what you want — generate a 30-second video of a product launch with background music and t...
🦀 ClawHub
Ai Tool For Video Creation
Skip the learning curve of professional editing software. Describe what you want — combine these images and audio into a 30-second promotional video with tex...
🦀 ClawHub
FFmpeg CLI
Process video and audio using FFmpeg CLI for transcoding, cutting, merging, audio extraction, thumbnails, GIFs, speed, filters, subtitles, and watermarks.
🦀 ClawHub
LH Edge TTS
Text-to-speech conversion using Python edge-tts for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and sub...
🦀 ClawHub
Sondo Ai
Turn a 2-minute interview recording with background noise into 1080p clean audio videos just by typing what you need. Whether it's removing background noise...
🦀 ClawHub
Audio Announcement Skills
Enables AI agents to announce their real-time actions via voice in multiple languages, with queued, concise, and friendly audio updates for tasks and status.
🦀 ClawHub
Video Messages from your openclaw
Generate and send video messages with a lip-syncing VRM avatar. Use when user asks for video message, avatar video, video reply, or when TTS should be delivered as video instead of audio.
🦀 ClawHub
Audio Handler
Read, analyze, convert, trim, merge, adjust volume, and transcribe audio files in multiple formats including MP3, WAV, FLAC, AAC, OGG, and more.
🦀 ClawHub
whatisxlistening.to
Query Last.fm listening data, show now playing, sync scrobble history to local DB, and deploy a personal "now playing" web dashboard. Use when user asks about current music, listening stats, scrobble history, or wants to set up a Last.fm dashboard.
🦀 ClawHub
spotify-download
Download MP3s from Spotify playlists by fetching metadata, searching YouTube for tracks, and converting audio using ffmpeg with optional Spotify API credenti...
🦀 ClawHub
Neomano TTS (ElevenLabs)
Text-to-speech (TTS) via ElevenLabs. Use when the user asks to reply with voice/audio, generate a spoken version of some text, or asks for “voz”, “nota de vo...
🦀 ClawHub
Yino.ai - Agent First AI Music Video Generator
Generate images and videos using yino.ai. Use when user wants to generate images (Seedream), generate videos (Veo), or any media generation task.
🦀 ClawHub
EngineMind
A Rust+Python consciousness engine with 12-phase crystal dynamics, thalamic relay processing, 19 introspective inner voices, and holographic emission. Use for consciousness simulation, emergent behavior research, and text-driven cognitive state modeling.
🦀 ClawHub
Error Prone
Makes AI-generated text sound more human by intentionally introducing realistic imperfections. Use when writing emails, messages, social posts, or any text t...
🦀 ClawHub
seedance-2-video-gen
Seedance 2.0 AI video generation via EvoLink API. Three modes — text-to-video, image-to-video (1-2 images), reference-to-video (images + videos + audio). Aut...
🦀 ClawHub
Keyapi Tiktok Content Analysis
Analyze TikTok content at scale — extract insights from videos, hashtags, music tracks, and live streams including engagement trends, comment sentiment, capt...
🦀 ClawHub
ifly-hyper-tts
讯飞超拟人语音合成 - 支持文本转语音、语音合成(发音人/语速/语调/音量/输出格式)。大模型语音合成技能。语音合成, 文字转语音, 超拟人, TTS. 用户指令如"把这段文案读出来"时使用此Skill。
🦀 ClawHub
Lyria
Generate 30-second instrumental music via Google Lyria (Vertex AI). Use when user requests music generation, specific styles/keys/instruments, or music itera...
🦀 ClawHub
Gemini Tts
Custom TTS using Gemini 2.5 Flash for high-quality, persona-driven voice output.
🦀 ClawHub
Speech Therapist Video
Create concise parent-focused videos showcasing your personalized speech therapy approach, family involvement, and child progress to build trust and clarify...
🦀 ClawHub
Ai Humanizer.Bak
Humanize AI-generated text by detecting and removing patterns typical of LLM output. Rewrites text to sound natural, specific, and human. Uses 24 pattern det...
🦀 ClawHub
SAM TTS
Generate retro robotic speech audio using SAM (Software Automatic Mouth), the classic C64 text-to-speech synthesizer. Use for /sam command to generate voice messages. Supports /sam on/off toggle mode where all responses are spoken in SAM voice. Supports pitch, speed, mouth, and throat parameters for voice customization.
🦀 ClawHub
Podcast Video Camera
Get polished podcast videos ready to post, without touching a single slider. Upload your raw footage (MP4, MOV, AVI, WebM, up to 500MB), say something like "...
🦀 ClawHub
Bumblebee
Two modes: (1) BUMBLEBEE — Communicate through music by playing exact lyric lines on Spotify, like Bumblebee from Transformers speaking through radio snippet...
🦀 ClawHub
Songsee
Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.
🦀 ClawHub
Siri
Control devices, run automations, and help users get more from Siri with HomeKit, Shortcuts, and voice command guidance.
🦀 ClawHub
虾转音频
🎵 音视频格式转换与处理工具箱。基于 FFmpeg + Whisper AI,支持:格式转换、视频提取音频、合并、分割、压缩、查看信息、音频转文字。
🦀 ClawHub
memory-assistant
Helps users remember where they put things and schedule voice reminders. Use when the user says "记一下"/"记一下"/"提醒我", records item locations (e.g. keys, passpor...
🦀 ClawHub
feishu-edge-tts-win
飞书语音消息发送技能(Windows 版)。使用 Edge TTS(微软,免费)生成语音并以飞书语音气泡发送。
🦀 ClawHub
Thermostat
Adjust temperatures, diagnose comfort issues, calculate energy savings, and automate schedules through voice commands or smart home integration.
🦀 ClawHub
Netease Cloud Music
提供网易云音乐公开歌单、歌曲和歌手的播放及互动数据摘要,支持排行榜和作品表现分析。