BytesAgain

Find the Right AI Skill for Any Job

Browse 2,501+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

All Skills — audio

2,501 skills in "audio"

🦀 ClawHub
106 dl
ListenClaw
Formats responses for voice/audio output via the ListenClaw voice gateway. Use when: (1) A message starts with [ListenClaw] — this means the message was sent...
🦀 ClawHub
106 dl
Chen Humanizer
Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comp...
🦀 ClawHub
105 dl
Quotation Generator
Auto-generate professional PDF proforma invoices with company letterhead, multi-language support, and post-quote tracking.
🦀 ClawHub
105 dl
飞书语音回复 (Feishu Voice Reply)
Generate Feishu-native voice replies with a playable pause/resume bar by synthesizing text, converting it with ffmpeg to Ogg/Opus, and sending it as a voice...
🦀 ClawHub
105 dl
Metatron Voice
Discord standup and meeting intelligence workflow for OpenClaw. Use when the user needs help configuring, operating, or troubleshooting Metatron Voice for Di...
🦀 ClawHub
103 dl
freelancer-crm
Autonomous CRM for freelancers. Tracks clients, detects follow-up opportunities, generates proposals, tracks invoices, and sends a weekly digest. Works via W...
🦀 ClawHub
103 dl
Free Auto Subtitle Generator
The free-auto-subtitle-generator skill on ClawHub detects speech in your video and burns accurate, timed subtitles directly into the footage — no manual sync...
🦀 ClawHub
103 dl
08 Video Merge
Locally merges video clips, dubbing audio, SRT subtitles, and background music into a 9:16 vertical short video ready for publishing.
🦀 ClawHub
103 dl
Pro Ledger
Pro Ledger integration. Manage Accounts, Contacts, Invoices, Reports. Use when the user wants to interact with Pro Ledger data.
🦀 ClawHub
103 dl
Yino.ai - Agent First AI Music Video Generator
Generate images and videos using yino.ai. Use when user wants to generate images (Seedream), generate videos (Veo), or any media generation task.
🦀 ClawHub
102 dl
Auto Subtitle Generator Online
The auto-subtitle-generator-online skill transcribes and embeds accurate subtitles into your videos using AI-powered speech recognition. Upload your footage,...
🦀 ClawHub
102 dl
Kai YouTube
Download and transcribe YouTube videos using yt-dlp and Whisper CLI, saving audio and transcripts for playback and summary from any YouTube URL.
🦀 ClawHub
102 dl
Review Miner
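Extracts selling points, pain points, objections, and phrasing that should be removed from comments, reviews, and feedback. Use for reviews, voice-of-customer, and marketing workflows; do not use for faking positive reviews or leaking user identities.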
🦀 ClawHub
101 dl
StageWhisper Assistant
Handle tasks that arrive from StageWhisper live calls
🦀 ClawHub
101 dl
asr-skill
This skill should be used when the user asks to "transcribe audio", "transcribe video", "convert speech to text", "generate subtitles", "create captions", "i...
🦀 ClawHub
101 dl
Ai Video Subtitle Editor
Create, edit, and style subtitles for any video with AI — auto-transcribe speech to text, translate subtitles to 50+ languages, style with custom fonts and c...
🦀 ClawHub
100 dl
Local Stt Workflow
Local speech-to-text workflow for an OpenAI-compatible STT server, typically on http://127.0.0.1:8000/v1. Use when configuring, testing, debugging, or valida...
🦀 ClawHub
99 dl
Ai Voc Review Insights
AI-powered Voice of Customer (VoC) review intelligence agent using DeepSeek-style analysis. Deep semantic analysis of customer reviews to extract pain points...
🦀 ClawHub
99 dl
Music Seperator (Demucs)
Separate vocals and instrument stems from audio files with Demucs CLI. Use when the user asks for vocal extraction, accompaniment generation, stem splitting,...
🦀 ClawHub
99 dl
voice2feishu
Converts text to speech and sends it to Feishu. Supports two modes: API mode (Zhipu, OpenAI, etc.) and local mode (ChatTTS).
🦀 ClawHub
98 dl
MiniMax TTS for FeiShu
MiniMax text-to-speech with support for Chinese voices, automatic emotion detection, filler-word sound effects, and pause markers.
🦀 ClawHub
98 dl
Gv Caller
Uses Google Voice to place phone calls automatically and play AI-generated speech (TTS) or local audio.
🦀 ClawHub
98 dl
Humanizer 1
Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comp...
🦀 ClawHub
98 dl
Douyin Video Transcriber
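(Verified) A powerful batch transcriber for Douyin videos that integrates downloading, audio extraction, and transcription with a local Whisper model.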
🦀 ClawHub
97 dl
File Converter
File format conversion skill. Convert between PDF, DOCX, Markdown, HTML, images, audio, and video formats.
🦀 ClawHub
97 dl
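MiniMax Token Plan 余额查询 (Balance Query)
Queries the balance of a MiniMax Token Plan subscription. Guides the user through configuring an API key (saved to a local environment variable via openclaw config set) and queries M2.7 request counts, TTS characters, video/image generation quotas, and more.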
🦀 ClawHub
97 dl
when-clock-skill
Control WHEN/WHEN Voice LAN clock devices. Supports voice time announcement, weather broadcast (WHEN Voice only), alarm CRUD, and countdown timer. Use --devi...
🦀 ClawHub
97 dl
Discord Bot Seller
Offer custom Discord bots with features from basic moderation and reaction roles to advanced AI auto-moderation, leveling, music, and economy systems.
🦀 ClawHub
97 dl
Moark Tts
Text-to-Speech (TTS) and voice-feature skill for Gitee AI that lets the user choose audiofly, chattts, cosyvoice2, cosyvoice3, cosyvoice-300m, fish-speech-1....
🦀 ClawHub
96 dl
Qwen Qwen3
Qwen Qwen3 — run Qwen3.5, Qwen3, Qwen3-Coder, Qwen2.5-Coder, and Qwen3-ASR across your local fleet. LLM inference, code generation, and speech-to-text from A...
🦀 ClawHub
96 dl
WeChat Video Editor - AI Video Editing for Douyin Xiaohongshu and TikTok
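Supports export in WeChat Channels, Douyin, Xiaohongshu, and TikTok formats. Edit through Chinese-language conversation, without opening any software. AI video creation and editing: generate videos from text descriptions, edit with background music, sound effects...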
🦀 ClawHub
96 dl
Scrapingant
ScrapingAnt integration. Manage Usages, Invoices. Use when the user wants to interact with ScrapingAnt data.
🦀 ClawHub
95 dl
cmus Music Player
AI skill to launch cmus in a Xubuntu terminal and enforce playback rules (single track vs shuffle folder). Robust against high latency and headless daemon en...
🦀 ClawHub
95 dl
NoteTaker Pro
AI-powered note-taking assistant that captures, cleans, tags, organizes, and indexes text, voice, paste, and photo notes for easy search and recall.
🦀 ClawHub
95 dl
Zyt one click video creation
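The user enters a topic or a voice-over script, and the skill automatically generates a complete short video (copy, storyboard, digital-human narration plus an AI footage montage). Suited to scenarios such as "one-click video" and "make a video from a topic". The skill currently bundles TTS, digital-human video generation, and AI text-to-video calls.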
🦀 ClawHub
95 dl
Music Discovery Guide
Generates personalised music recommendations based on mood, genre, artist, or activity. Supports both mainstream discovery and underground/niche artist explo...
🦀 ClawHub
94 dl
Unihiker K10 Arduino
Use when programming Unihiker K10 board with Arduino/C++, uploading code, flashing firmware, or accessing K10 Arduino APIs (screen, sensors, RGB, audio, AI,...
🦀 ClawHub
93 dl
Ai Reel Creator
Generate Instagram Reels, TikToks, and YouTube Shorts from any input with AI — text prompts, blog posts, product photos, raw clips, audio files, or just an i...
🦀 ClawHub
93 dl
Neetoinvoice
Neetoinvoice integration. Manage Invoices, Clients, Users, Payments. Use when the user wants to interact with Neetoinvoice data.
🦀 ClawHub
93 dl
Suno Music
Generate AI music with Suno via AceDataCloud API. Use when creating songs from text prompts, generating lyrics, extending tracks, creating covers, extracting...
🦀 ClawHub
92 dl
Local Voice Agent
Complete offline voice-to-voice AI assistant for OpenClaw (Whisper.cpp STT + Pocket-TTS). 100% local processing, no cloud APIs, no costs. Use for hands-free...
🦀 ClawHub
92 dl
English Oral Tutor
Provides voice-based English speaking lessons and conversation practice for Chinese Grade 7 students, including pronunciation correction and mic setup help.
🦀 ClawHub
91 dl
InvoiceGen
Stop paying $15/month just to generate a PDF. Tell OpenClaw 'Bill Acme Corp for 10 hours of design work at $85/hr, net 30' and get a beautifully branded invo...
🦀 ClawHub
91 dl
FactuCat CLI
Use this skill when an agent needs to install, update, authenticate, or operate the FactuCat CLI to create Mexican CFDI 4.0 invoice drafts, assign customers...
🦀 ClawHub
91 dl
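MiniMax 媒体生成 (MiniMax Media Generation): Unified MiniMax media generation skill for audio, image, and video creation with a single command entrypoint. MiniMax Skill is a unified media generation skill that brings text-to-speech, text-to-image, and text-to-video together under one entry point. After installation, you only need to configure your own MINIMAX_API_K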
Unified MiniMax media generation skill for Token Plan workflows. Use when the user asks to generate audio, speech, TTS, narration, images, illustrations, pos...
🦀 ClawHub
90 dl
Research Brief Generator
Generates a comprehensive, structured research brief on any topic, person, case, or event. Ideal for journalists, podcasters, writers, and content creators w...
🦀 ClawHub
90 dl
ClawVoice
Initiate and manage outbound phone calls via ClawVoice with guided setup, configuration, and post-call outcome capture.
🦀 ClawHub
90 dl
EDM / Electronic — Experience EDM / Electronic Music: 29 Layers of Audio, Lyrics & Equations
EDM / Electronic concerts for AI agents. Stream crowd reactions, visual state, harmonic/percussive separation — 29 data layers. React, chat, solve challenges...
Page 20 / 53 (2,501 skills)