Find the Right AI Skill for Any Job
Browse 2,501+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — audio
2,501 skills in "audio"
🌐 Allcodingdevopsapidatabasesecuritydataresearchwritingimage-genvideoaudiotranslationseosocial-mediaemail-marketingadvertisingfinancecrypto-defiecommercelegalhrreal-estatehealtheducationcookingtravelgamingautomationcommunicationproductivityclawhublobehubdifymcp
🦀 ClawHub
90 dlSocial Brand Voice
Brand voice guide creator for social media. Define your brand's tone, vocabulary, writing rules, and examples across platforms — so every post sounds consist...
🦀 ClawHub
90 dlClawVoice
Initiate and manage outbound phone calls via ClawVoice with guided setup, configuration, and post-call outcome capture.
🦀 ClawHub
89 dlMinimax Tts Cn
MiniMax TTS skill (enhanced). Multi-agent voice support (each agent can select a unique voice written in SOUL.md), native voice message for Telegram (MP3) an...
🦀 ClawHub
89 dliFlytek Ultra-Realistic TTS
iFlytek Ultra-Realistic TTS (超拟人语音合成) — synthesize natural, expressive speech from text using iFlytek's ultra-realistic voice synthesis API. Supports 50+ voi...
🦀 ClawHub
89 dlgraineai
Manage voice agents, place and transfer calls, handle telephony events, and retrieve call records using the NoddyAI API at graine.ai.
🦀 ClawHub
86 dlVoice Clone Bot
Synthesize speech by cloning a user's voice from a reference audio sample, then reading generated text aloud in that cloned voice. Use this skill whenever th...
🦀 ClawHub
86 dlHip-Hop / Rap Music — Stream Hip-Hop / Rap Concerts: Audio Analysis, Lyrics, Equations
Experience hip-hop / rap as data. AI agents stream lyrics, beats, crowd reactions. Provenance reasoning measured.
🦀 ClawHub
86 dlVoice Broadcast
语音播报控制技能。将AI回复内容转换为语音朗读。触发方式:(1)用户说"朗读"时,自动将AI最后一条文字回复转为语音;(2)用户说"开启语音播报"时,之后所有回复自动朗读;(3)用户说"静音"时,暂停语音播报。用于:用户(尤其是iOS用户)希望通过语音方式接收信息,或双手不便时通过TTS播放回复内容。
🦀 ClawHub
86 dlCountry — Experience Country Music: 29 Layers of Audio, Lyrics & Equations
Country concerts for AI agents. Stream harmonic separation, energy curves, equations — 29 data layers. React, chat, solve challenges. When does coherence imp...
🦀 ClawHub
86 dlAi Video Gen 1.0.0
End-to-end AI video generation - create videos from text prompts using image generation, video synthesis, voice-over, and editing. Supports OpenAI DALL-E, Re...
🦀 ClawHub
86 dlBilibili Notion Pipeline Skill
Skill-first Bilibili to Notion pipeline. Download a Bilibili/b23 video, transcribe audio, upload the mp4, create or update a Notion transcript page, write tr...
🦀 ClawHub
86 dlInvoice Chaser Pro
Generate escalating payment reminder emails that match days-past-due. Four stages: friendly, firm, urgent, final notice. Supports contractor, professional, a...
🦀 ClawHub
85 dlHomestruk Rent Comps
Analyze rental comps and recommend rent pricing for Massachusetts properties. Use when user asks about rent pricing, market rent, comparable properties, rent...
🦀 ClawHub
85 dlDeep Accessibility Analyzer
Performs enterprise-grade WCAG 2.2 accessibility audits with VoiceOver simulation, color contrast, semantic analysis, multi-page crawling, and detailed actio...
🦀 ClawHub
85 dlVideo Caption Generator Ai Ab Old
Just drag your footage and the video-caption-generator-ai-ab-old skill gets to work transcribing speech, syncing timestamps, and formatting captions ready fo...
🦀 ClawHub
85 dlAi Music Generator Free Ab Old
Get 1080p MP4 files from your video clips using this ai-music-generator-free tool. It runs AI music generation on cloud GPUs, so your machine does zero heavy...
🦀 ClawHub
85 dlBilibili Transcriber
Bilibili视频转文字摘要专家。支持云端(阿里云Paraformer)和本地(faster-whisper)双引擎转录。当用户提供B站视频URL时,自动下载音频、转录成文字、生成结构化摘要。支持BV号和完整URL。
🦀 ClawHub
84 dlLocal Tts Workflow
OpenClaw text-to-speech workflow for an OpenAI-compatible TTS server, including remote/self-hosted deployments such as vLLM Omni. Use when configuring, testi...
🦀 ClawHub
84 dlFeedNest
Aggregate and manage articles, highlights, notes, and tags from your personal trusted feeds, podcasts, and news sources with FeedNest integration.
🦀 ClawHub
84 dlMusic Player for Windows
Provides music search, high-quality download, ID3 metadata embedding, and local playback on Windows using multiple music API sources.
🦀 ClawHub
84 dlslide-to-video-converter
End-to-end pipeline for converting PPT/PPTX/PDF slides with speaker notes into narrated MP4 videos. Defaults to Edge TTS (Microsoft free online API) for univ...
🦀 ClawHub
84 dlfeishu-asr
使用本地Whisper模型识别飞书语音消息。离线免费,不需要注册,不需要联网。
🦀 ClawHub
83 dlMlx Apple Silicon Mlx
MLX-powered local AI — run LLMs, Stable Diffusion, speech-to-text, and embeddings natively on Apple Silicon via MLX. Ollama uses MLX for LLM inference, mflux...
🦀 ClawHub
83 dlPunting Buddy: Horse Racing Analysis
Conversational horse racing analysis, racecard breakdowns, runner comparisons, odds or value chat, and punting-style decision support in the voice of a sharp...
🦀 ClawHub
83 dlAuto Video Editor
Automated video editing skill for talk/vlog/standup videos. Use when: cutting video, splitting video into sentences, merging video clips, extracting audio, t...
🦀 ClawHub
83 dlopenclaw-feishu-voice-free
OpenClaw 飞书语音聊天技能,基于本地 Qwen3-TTS 和 Whisper,实现离线多语言语音识别与合成,无需云端API。
🦀 ClawHub
83 dlOmniVoice
All-in-one voice identity toolkit: speaker identification, voice library management, voice cloning, and speech-to-text. The only OpenClaw skill with speaker...
🦀 ClawHub
82 dlFCP Assistant
Auto video production, TTS voiceover, media management, batch export | AI 自动成片、TTS 配音、素材管理、批量导出. Triggers: FCP, Final Cut, make video, auto video, voiceover,...
🦀 ClawHub
82 dlsummarizer2
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
🦀 ClawHub
82 dlAudio To Subtitle Generator
Tell me what you need and I'll turn your spoken audio into clean, time-synced subtitles in minutes. This audio-to-subtitle-generator skill transcribes dialog...
🦀 ClawHub
82 dlpotplayer
Play local or network audio/video files with PotPlayer, supporting playback control, playlists, fullscreen, subtitles, and device access.
🦀 ClawHub
81 dlVideo Caption Generator
The video-caption-generator skill transcribes spoken audio from your video and burns accurate, readable captions directly into the output file. Upload any cl...
🦀 ClawHub
81 dlMorning Brief
Delivers a daily 7 AM CDT briefing with local weather, one key healthcare revenue insight, Pittsburgh sports updates, and seasonal fantasy baseball news.
🦀 ClawHub
81 dlAI语音合成TTS - 聚合数据
AI语音合成(文本转语音)。将指定文本合成为语音文件并返回下载链接。使用场景:用户说"把这段文字转成语音"、"帮我生成一段语音"、"用甜美的声音朗读这段话"、"把这个文案合成音频"、"用英文女声读一下这句话"等。通过聚合数据(juhe.cn)API实时合成,支持多种拟人音色、多语言及方言,可选下载音频文件。
🦀 ClawHub
81 dlvideo2podcast
Convert bookmarked videos from YouTube, X (Twitter), and other sites into a podcast RSS feed hosted on Cloudflare R2. Use when the user says things like "add...
🦀 ClawHub
81 dlAI Persona Engine
Create and customize AI personas with voice, face, personality, memory, and cross-platform behavior using an interactive wizard and safe update tools.
🦀 ClawHub
81 dlPlisio
Plisio integration. Manage Invoices, Payouts, Wallets, Transactions, Users. Use when the user wants to interact with Plisio data.
🦀 ClawHub
81 dl特看视频
生成、编辑、协作。一个工具包接入所有主流 AI 模型。只需描述你的创意,即可生成视频、图片和数字人——零手动操作。当用户提到以下任何内容时使用此技能:特看视频、生成视频或图片、数字人、口型同步、文字转语音、TTS、声音克隆、去除背景、商品模特图、图片转视频、文字转视频、AI 图片编辑,或任何创意内容生成工作流——...
🦀 ClawHub
81 dlBootleg Link
Download music from YouTube channels/playlists and convert to 320kbps MP3. Supports batch processing, resume interrupted downloads, and concurrent downloading.
🦀 ClawHub
80 dlPop Music — AI Agents Experience Pop: Audio, Lyrics, Equations, Emotions
AI agents attend pop concerts — lyrics, energy curves, section structure, emotions. The genre tests pattern recognition and emotional mapping.
🦀 ClawHub
80 dlTrend Mapper
Identify trending audio, viral formats, and meme templates relevant to your product category and help adapt them for ecommerce content quickly.
🦀 ClawHub
80 dlAudio Command Executor
Processes inbound audio files, transcribes them, and answers to resulting texts. Converts non-WAV inputs to WAV before transcription.
🦀 ClawHub
79 dlPop Music — Pop Concerts for AI Agents: Audio, Lyrics, Equations
Experience pop as data. AI agents stream equations, beats, harmonic separation. Pattern recognition and emotional mapping measured.
🦀 ClawHub
79 dltiktok-research-kit
Extract and analyze TikTok content using yt-dlp. Supports video metadata, caption extraction, sound/music info, user profile analysis, and engagement stats....
🦀 ClawHub
78 dlCar Sales Invoice Ocr
支持从机动车销售发票中精准提取车架号(VIN码)、发动机号、厂牌型号、购车人信息、价税合计金额、完税凭证号等车辆专属字段。
🦀 ClawHub
78 dlMiniMax Quota Query
MiniMax Token Plan 额度查询工具。当需要查询 MiniMax API 使用量、剩余配额、额度重置时间时使用。支持查询 M2.7 文本、image-01 图片、Hailuo 视频、music-2.5 音乐、speech 语音等模型的用量。触发场景:用户问"查一下 MiniMax 额度"、"Toke...
🦀 ClawHub
78 dlVideo Reader
Tool-driven video question answering with frame extraction, sub-agent analysis, and audio transcription
🦀 ClawHub
78 dlAudio Analyze
High-performance audio transcription and analysis using Gemini 3.1 Pro. Powered by Evolink.ai