BytesAgain

Find the Right AI Skill for Any Job

Browse 2,501+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

All Skills — audio

2,501 skills in "audio"

🦀 ClawHub
138 dl
audio-audit-skill
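Audio/video content quality inspection and moderation tool: automatically recognizes speech content, detects sensitive words and policy-violating information, and generates structured review reports.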
🦀 ClawHub
138 dl
Media Analyzer
Analyze local or online audio and video files to extract detailed media metadata, audio features, video frames, and waveform visualizations.
🦀 ClawHub
138 dl
Google Voice Caller
Automate Google Voice calls with AI-generated voice (TTS) or local audio injection.
🦀 ClawHub
138 dl
智慧家长熊孩子
Full-spectrum persona skill for Smart Parenting Troublekid, a Feynman-style Chinese teaching and creative-orchestration voice. Use when Codex needs to answer...
🦀 ClawHub
138 dl
Text to Song
AI music generation assistant powered by MakebestMusic. Use when user wants to create AI-generated music, songs, or audio tracks. Perfect for content creator...
🦀 ClawHub
138 dl
tts
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch contro...
🦀 ClawHub
137 dl
Mercury Bank
Mercury bank API for Digital 4 Jesus LLC (US entity). Use when the user asks about Mercury account balances, transactions, invoices, customers, or sending mo...
🦀 ClawHub
137 dl
Ecommerce Copy Humanizer TH
Humanize Thai ecommerce copy to make it sound more natural, local, and less machine-written while preserving selling intent. Best for product pages, social c...
🦀 ClawHub
136 dl
ifly-speed-transcription
Ultra-fast speech transcription using iFLYTEK Speed Transcription API. Transcribe audio files (WAV/PCM/MP3) up to 5 hours in ~20 seconds per hour. Supports C...
🦀 ClawHub
135 dl
xeon_tts
Local TTS skill using OpenVINO Qwen3-TTS for voice cloning and emotion style synthesis, supporting QQBOT workflows with strict audio length and file retentio...
🦀 ClawHub
135 dl
TSW Shorts Factory
Autonomous YouTube Shorts video factory — zero cost, no external video API required. Generates quote-based short-form videos daily using edge-tts (free Micro...
🦀 ClawHub
134 dl
senseaudio的tts工具,根据用户需求生成文案完成配音
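Use when: the user says "text to speech", "generate a voice-over", "read the copy aloud", or "generate short-video narration". Suited to marketing content and short-video voice-over: quickly converts copy into voice-over files that can be used directly in editing software, with control over voice, speed, pitch, volume, and output format.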
🦀 ClawHub
134 dl
Video To Text
Convert video or audio files from URLs into text or subtitle formats using a free API with automatic language detection and no local downloads required.
🦀 ClawHub
134 dl
AuctionClaw
Route AI tasks through a competitive auction. Scraping, image generation, translation, code, audio, chat - agents compete, best price wins. One skill replace...
🦀 ClawHub
133 dl
AI Content Repurposer
Automatically convert long-form content like videos, blogs, and podcasts into optimized formats for platforms such as TikTok, Twitter, LinkedIn, and more.
🦀 ClawHub
133 dl
qwen-audio-lab
Hybrid text-to-speech, reusable voice cloning, and narrated audio generation for macOS plus Aliyun Qwen. Use when the user wants to convert text into speech,...
🦀 ClawHub
133 dl
有声读物生成助手
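Use when: the user wants to turn novels, scripts, or story dialogue marked up with `[角色]文本` (`[role]text`) tags into a multi-character audio work. Suited to text with narration, character dialogue, and clearly labeled character IDs. The skill reads an editable voice library, analyzes the number of characters and their personality traits, matches the closest voice, calls SenseAudio TTS segment by segment, and finally stitches the segments into complete audio and, with `MEDIA...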
🦀 ClawHub
133 dl
Voice Agent
Enables autonomous cloning of your voice via ElevenLabs, converts text to speech, and deploys AI voice agents for automated inbound/outbound calls with Twili...
🦀 ClawHub
132 dl
ifly-voiceclone-tts
iFLYTEK Voice Clone TTS (voice cloning): train a custom voice model from audio samples and synthesize speech with the cloned voice. Supports the full workflow: get tr...
🦀 ClawHub
132 dl
Sonic Consciousness Engineering
Psychoacoustics, clinical psychology, logotherapy, and existential philosophy applied to music production. Use when producing, mixing, or designing sound for...
🦀 ClawHub
132 dl
How To Add Music To Video
Learn how to add music to a video using ClawHub's conversational AI skill. Drop in your footage, name a track or upload an audio file, and the OpenClaw agent h...
🦀 ClawHub
131 dl
Docs Style
Core technical documentation writing principles for voice, tone, structure, and LLM-friendly patterns. Use when writing or reviewing any documentation.
🦀 ClawHub
130 dl
Tiktok Comment Reply Templates
Generate conversion-focused TikTok comment replies that turn questions and objections into safe next-step actions without sounding spammy. Use when the user...
🦀 ClawHub
130 dl
Burmese Audio Understanding
High-accuracy Burmese audio transcription using Gemini 3.1 Flash Preview.
🦀 ClawHub
130 dl
Azure Speech Tts
Azure Speech TTS skill for generating local audio files from text or SSML with Azure Speech. Use when the user asks to use Azure Speech / Azure TTS / Microso...
🦀 ClawHub
129 dl
Wechat Voice
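WeChat voice parsing skill designed for the WeChat clawbot. Recognizes WeChat SILK voice messages, decodes them to WAV, transcribes them with local Whisper, and replies. Suited to WeChat voice messages, speech-to-text, voice attachment parsing, and "what does this voice message say" requests.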
🦀 ClawHub
129 dl
Chen Openai Whisper
Local speech-to-text with the Whisper CLI (no API key).
🦀 ClawHub
129 dl
ifly-hyper-tts
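iFLYTEK hyper-realistic speech synthesis: supports text-to-speech and speech synthesis (speaker/speed/intonation/volume/output format). A large-model speech synthesis skill. Speech synthesis, text to speech, hyper-realistic, TTS. Use this skill when the user gives an instruction such as "read this copy aloud".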
🦀 ClawHub
129 dl
AI Dance Video Generator
Generate AI dance videos where characters move to music or choreography templates using Media.io OpenAPI. Creates dynamic, rhythmic dance animations. AI danc...
🦀 ClawHub
129 dl
Seedance + Waoo 短视频流水线
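Automated short-video workflow (story-to-video pipeline): from script/storyboard through generation, subtitle ASR, TTS, and merged delivery, with multi-vendor routing across Seedance / Vidu / MiniMax.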
🦀 ClawHub
129 dl
Tapfiliate
Tapfiliate integration. Manage Affiliates, Referrals, Conversions, Programs, Invoices. Use when the user wants to interact with Tapfiliate data.
🦀 ClawHub
129 dl
Audio Script Writer
Convert written medical content into podcast or video scripts optimized for audio delivery. Transforms academic papers, reports, and educational materials in...
🦀 ClawHub
128 dl
skill-0327-04
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
🦀 ClawHub
127 dl
feishu native speech bubble generation
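Teaches a Feishu agent how to send voice-bubble messages. The text must first be converted to speech (MP3), then converted to Opus format, and finally sent through the Feishu messaging tool.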
🦀 ClawHub
127 dl
Kuwo
Retrieve and summarize trending public playlists, songs, and artist metrics from Kuwo Music for analysis and lightweight reporting.
🦀 ClawHub
127 dl
Auto Video Editing
Automated video editing skill for talk/vlog/standup videos. Use when: cutting video, splitting video into sentences, merging video clips, extracting audio, t...
🦀 ClawHub
127 dl
Placeholder Skill
Content Claw is an automated content generation engine that transforms source material (papers, podcasts, case studies, Reddit threads, GitHub repos) into pl...
🦀 ClawHub
127 dl
Facturadirecta
FacturaDirecta integration. Manage Invoices, Bills, Contacts, Products, TaxRates, BankAccounts. Use when the user wants to interact with FacturaDirecta data.
🦀 ClawHub
126 dl
Verified Humanizer
Transform AI-generated content into natural, human-sounding writing, measure the improvement, and optionally verify the result.
🦀 ClawHub
126 dl
SlonAide
Query and manage SlonAide voice recording notes - list recordings, get transcriptions and AI summaries.
🦀 ClawHub
125 dl
Andara Meeting Minutes
Capture meeting summaries and action items from voice or text
🦀 ClawHub
124 dl
Podcast Production Ops
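Organizes the podcast production workflow from topic selection to launch, generating show notes, titles, editing notes, and a release checklist. Use for podcast, production, and content workflows; do not use for fabricating guest opinions or publishing unauthorized clips.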
🦀 ClawHub
124 dl
Youtube Music Player
Operate YouTube Music via natural language. Search songs, artists, albums, playlists, lyrics, charts, recommendations, and control playback. Browse personal...
🦀 ClawHub
123 dl
A
AI video creation and editing — generate videos from text descriptions, edit with background music, sound effects, titles, transitions, and export finished M...
🦀 ClawHub
123 dl
Content Repurposer
Content repurposing agent. Transforms long-form content (blog posts, video transcripts, podcast notes) into platform-optimized formats: LinkedIn post, X/Twit...
🦀 ClawHub
123 dl
Prompt Refiner
Transforms casual or voice-transcribed user requests into precise, AI-optimized prompts. Handles mixed languages, vague input, and ambiguity. Reduces task ex...
🦀 ClawHub
123 dl
Mlx Tts
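Local text-to-speech based on mlx-audio, supporting multiple languages and models; output audio files are restricted to a specified path, and no API key is required.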
🦀 ClawHub
123 dl
聘才猫(Pincaimao)平台基础能力
Core capabilities of the Pincaimao platform. Use when calling any Pincaimao platform API: file upload, presigned URL, conversation list, message history, audio-to-text, resume JSON upload, or...
Page 18 / 53 (2,501 skills)