Find the Right AI Skill for Any Job
Browse 2,399+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — audio
2,399 skills in "audio"
🌐 Allcodingdevopsapidatabasesecuritydataresearchwritingimage-genvideoaudiotranslationseosocial-mediaemail-marketingadvertisingfinancecrypto-defiecommercelegalhrreal-estatehealtheducationcookingtravelgamingautomationcommunicationproductivityclawhublobehubdifymcp
🦀 ClawHub
clawr.ing
Make real phone calls. Replaces the voice-call plugin with a managed service that needs no setup. Use for wake-up calls, reminders, alerts, or when the user...
🦀 ClawHub
minimax-media (James)
Use MiniMax API for image generation and text-to-speech (TTS). Supports image-01 model for images and speech-2.8-hd for voice synthesis. Install when needed.
🦀 ClawHub
Skillboss
Swiss-knife for AI agents. 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, emai...
🦀 ClawHub
Voice Broadcast
语音播报控制技能。将AI回复内容转换为语音朗读。触发方式:(1)用户说"朗读"时,自动将AI最后一条文字回复转为语音;(2)用户说"开启语音播报"时,之后所有回复自动朗读;(3)用户说"静音"时,暂停语音播报。用于:用户(尤其是iOS用户)希望通过语音方式接收信息,或双手不便时通过TTS播放回复内容。
⭐ GitHub
arcade
Arcade is a modern Python framework for crafting games with compelling graphics and sound.
🦀 ClawHub
Ai Voc Review Insights
AI-powered Voice of Customer (VoC) review intelligence agent using DeepSeek-style analysis. Deep semantic analysis of customer reviews to extract pain points...
⭐ GitHub
pydub
Manipulate audio with a simple and easy high level interface.
🦀 ClawHub
test-summary
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
🦀 ClawHub
Ai Music Video Creator
Cloud-based ai-music-video-creator tool that handles generating music videos from a song and photos. Upload MP3, WAV, JPG, PNG files (up to 500MB), describe...
🦀 ClawHub
抖音视频快速转文字
抖音视频快速转文字(优化版)。用户发抖音链接,自动提取文案。 特点:本地 Whisper 转录,无需 API Key,零成本,高隐私。 触发词:抖音、转文字、提取文案、视频转录
🦀 ClawHub
Ai Video Gen 1.0.0
End-to-end AI video generation - create videos from text prompts using image generation, video synthesis, voice-over, and editing. Supports OpenAI DALL-E, Re...
🦀 ClawHub
Yt Dlp
A robust CLI wrapper for yt-dlp to download videos, playlists, and audio from YouTube and thousands of other sites. Supports format selection, quality control, metadata embedding, and cookie authentication.
🦀 ClawHub
Ai Song Generator
Cloud-based ai-song-generator tool that handles creating original songs from text or lyrics. Upload MP4, MOV, MP3, WAV files (up to 200MB), describe what you...
🦀 ClawHub
Bidirectional Voice Chat System
双向语音对话系统 - 语音识别转文字 + Edge TTS语音合成 + Cloudflare Tunnel公网访问
🦀 ClawHub
EmoCity Biometric Scan
Real-time biometric analysis — stress, deception, emotions, heart rate from your camera. 478 facial landmarks, voice stress, micro-expression detection. Powe...
🦀 ClawHub
Vidmuse Ai
content creators create video clips into music-synced videos using this skill. Accepts MP4, MOV, AVI, WebM up to 500MB, renders on cloud GPUs at 1080p, and r...
🦀 ClawHub
Music Discovery Guide
Generates personalised music recommendations based on mood, genre, artist, or activity. Supports both mainstream discovery and underground/niche artist explo...
🦀 ClawHub
English Oral Tutor
Provides voice-based English speaking lessons and conversation practice for Chinese Grade 7 students, including pronunciation correction and mic setup help.
🦀 ClawHub
Audio Recognition
音频语音识别服务(Speech-to-Text)。当用户上传音频文件,需要将语音内容转换为文字,或需要识别音频中的特定信息(如关键词、歌曲名)时触发。 适用于:(1) 会议录音转写 (2) 音频内容提取 (3) 语音指令识别 (4) 音视频字幕生成
🦀 ClawHub
Lovart API Skills
Generate images, videos, and audio/music via Lovart AI. Also manages Lovart projects, threads (conversation history), and user settings. Trigger on: (1) any...
🦀 ClawHub
TTS
Use this skill whenever the user wants to convert text to speech, generate audio from text, create voiceovers, or produce spoken audio files. Triggers includ...
🦀 ClawHub
Kimai Time Tracking
Complete Kimai time-tracking API integration. Manage timesheets, customers, projects, activities, teams, invoices and exports via REST API. Supports time tracking workflows, reporting, and administrative operations. Keywords - kimai, zeiterfassung, timesheet, tracking, project, customer, activity, invoice, export, timer, stunden
🦀 ClawHub
Auto-Talk-TTS
Auto-speak every message using edge-tts. Automatically converts all responses to speech asynchronously in the background. Install the package if needed, then...
🤖 LobeHub
Songwriting Mentor
AI Singer/Songwriter Assistant: Empowering musicians with creative guidance and feedback.
🦀 ClawHub
Audio Command Executor
Processes inbound audio files, transcribes them, and answers to resulting texts. Converts non-WAV inputs to WAV before transcription.
🦀 ClawHub
Media Orchestrator
Unified skill for resolving, downloading, and delivering media (audio/video) to chat platforms. Integrates yt-dlp for resolution and handles Spotify metadata sync.
🦀 ClawHub
B
AI video creation and editing — generate videos from text descriptions, edit with background music, sound effects, titles, transitions, and export finished M...
🦀 ClawHub
OCR with python
Extract Chinese and English text from images and scanned PDFs, including documents like invoices and contracts, using PaddleOCR in Python.
🦀 ClawHub
Lark (Feishu) Voice
Send voice messages on Lark (Feishu) by converting text to speech. Use when the user asks to send a voice message or reply with voice.
🦀 ClawHub
Auto Video Editing
Automated video editing skill for talk/vlog/standup videos. Use when: cutting video, splitting video into sentences, merging video clips, extracting audio, t...
🦀 ClawHub
Donson Intelligent Editing
Use when performing video/audio processing tasks including transcoding, filtering, streaming, metadata manipulation, or complex filtergraph operations with FFmpeg.
🦀 ClawHub
Pdf Studio
Professional PDF document generator. Use when user needs to create reports, invoices, certificates, portfolios, or any publication-ready PDF. Supports images...
🦀 ClawHub
Novel Writer V2
章节正文生成器 - 根据章节大纲、Voice Profile 和角色档案构建 LLM 提示词,用于生成章节正文。当需要根据大纲创作具体章节时使用。
🦀 ClawHub
Youtube Whisper
YouTube影片一鍵轉文字!自動下載影片並用AI轉成中文/英文字幕,沒有字幕的影片也能用。
🦀 ClawHub
Whisper Local Api
Secure, offline, OpenAI-compatible local Whisper ASR endpoint for OpenClaw. Features faster-whisper (large-v3-turbo), built-in privacy with no cloud telemetr...
🦀 ClawHub
Audio Cog
AI audio generation powered by CellCog. Text-to-speech, voice synthesis, voiceovers, podcast audio, narration, music generation, background music, sound desi...
🦀 ClawHub
Qwen3 Audio
High-performance audio library for Apple Silicon with text-to-speech (TTS) and speech-to-text (STT).
🦀 ClawHub
PPT to Video(汇报视频生成)
将PPTX/PDF/HTML与背景材料自动匹配,生成1280×720分辨率、带有智能风格识别和口语化TTS的播报视频。
🦀 ClawHub
Voice Transcriber Pro
Voice note transcription and archival for OpenClaw agents. Powered by Deepgram Nova-3. Transcribes audio messages, saves both audio files and text transcript...
🦀 ClawHub
subtitle-extractor
Subtitle extractor for Bilibili, YouTube, Xiaohongshu, Douyin, and local files. Extracts native subtitles or Whisper transcription in original format. Agent...
🦀 ClawHub
Ai Audio Generator
Cloud-based ai-audio-generator tool that handles generating voiceovers for video content. Upload TXT, DOCX, PDF, MP4 files (up to 200MB), describe what you n...
🦀 ClawHub
Telegram Voice To Voice Macos
Telegram voice-to-voice for macOS Apple Silicon: transcribe inbound .ogg voice notes with yap (Speech.framework) and reply with Telegram voice notes via say+ffmpeg. Not compatible with Linux/Windows.
🦀 ClawHub
Whisper Piper Voice
Set up and run a local voice pipeline combining Whisper STT (speech-to-text) and Piper TTS (text-to-speech) as a single HTTP server. Use when asked to set up...
🦀 ClawHub
Audio Rename
Rename audio files with Chinese/special characters to simple English names for mlx-stt compatibility.
🦀 ClawHub
Create Edu Video
全自动教学视频制作技能。根据课程主题自动生成教学视频,包含文案编写、TTS配音、画面设计、Remotion代码开发、视频导出。触发场景:用户要求制作教学视频、课程视频、讲解视频、教育内容时使用。支持竖屏(1080x1920)和横屏(1920x1080)格式。
🦀 ClawHub
Humanize Ai Writing
Rewrite AI-generated developer text to sound human — fix inflated language, filler, tautological docs, and robotic tone. Use after review-ai-writing identifi...
🦀 ClawHub
FCP Assistant
Auto video production, TTS voiceover, media management, batch export | AI 自动成片、TTS 配音、素材管理、批量导出. Triggers: FCP, Final Cut, make video, auto video, voiceover,...
🦀 ClawHub
Qwen3 TTS Instruct
Alibaba Cloud Bailian Qwen TTS with voice/mood presets