BytesAgain

Find the Right AI Skill for Any Job

Browse 2,409+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.


All Skills — audio

2,409 skills in "audio"

🦀 ClawHub
Music Separator (Demucs)
Separate vocals and instrument stems from audio files with Demucs CLI. Use when the user asks for vocal extraction, accompaniment generation, stem splitting,...
🦀 ClawHub
LNBits Wallet with QR Code
Manage LNbits Lightning Wallet (Balance, Pay, Invoice)
🦀 ClawHub
AI Music Generation
Use this skill as an entry point to discover, select, and fetch specific integration parameters for all supported AI music generation models.
🦀 ClawHub
feishu-audio
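Convert audio files into voice messages that play in Feishu. First convert the file to opus format with ffmpeg, then upload it to Feishu, and finally send an audio message. Use when the user wants to receive a playable voice message in Feishu.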
🦀 ClawHub
Financial Overview
Get a complete financial overview of the business including balance, recent transactions, outstanding invoices, and upcoming tax obligations. Use when the us...
🦀 ClawHub
Lyrics Search
Search song lyrics by title and artist using the LrcApi public API. Use when the user asks to find, display, or print lyrics for a song.
🦀 ClawHub
Openai Whisper
Local speech-to-text with the Whisper CLI (no API key).
🦀 ClawHub
Song
Write original songs with guided lyric development, chord progressions, melody contours, and AI music generator prompts for composers at any level.
🦀 ClawHub
Danish
Write Danish that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Thai
Write Thai that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Greek
Write Greek that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Humanize
Remove AI writing patterns from text. Use when editing, reviewing, or rewriting text to sound more natural and human-written. Detects patterns like inflated symbolism, promotional language, em dash overuse, AI vocabulary, and sycophantic tone.
🦀 ClawHub
say
Text-to-Speech via macOS say command with Siri Natural Voices. Use for generating speech audio, TTS clips, or speaking text aloud on macOS.
🦀 ClawHub
FreshBooks CLI
FreshBooks CLI for managing invoices, clients, and billing. Use when the user mentions freshbooks, invoicing, billing, clients, or accounting.
🦀 ClawHub
Local TTS
Local text-to-speech using Qwen3-TTS with mlx_audio (macOS Apple Silicon) or qwen-tts (Linux/Windows). Privacy-first offline TTS with natural, realistic voic...
🦀 ClawHub
Bengali
Write Bengali that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Indonesian
Write Indonesian that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Punting Buddy: Horse Racing Analysis
Conversational horse racing analysis, racecard breakdowns, runner comparisons, odds or value chat, and punting-style decision support in the voice of a sharp...
🦀 ClawHub
Swedish
Write Swedish that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Japanese
Write Japanese that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
German
Write German that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Ai Video Slideshow Maker
Create stunning photo and video slideshows with music using AI — transform photo collections into cinematic video stories with Ken Burns motion effects, beat...
🦀 ClawHub
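IndexTTS Voice Cloning
IndexTTS voice cloning and synthesis skill: create voice models, convert text to speech, and manage reference audio (requires an enterprise membership).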
🦀 ClawHub
AV Skill
Toolkit for converting, editing, analyzing, and generating audio and video files, supporting common formats and effects within OpenClaw.
🦀 ClawHub
Spotify
Terminal Spotify playback/search via spogo (preferred) or spotify_player.
GitHub
DejaVue - The Vue podcast to remember
DejaVue - The Vue podcast to remember - Podcasts
🦀 ClawHub
Basenji — Adopt a Basenji. Dog. 巴仙吉犬。Basenji.
Adopt a virtual Basenji dog at animalhouse.ai. Barkless. Communicates through behavior, not sound. Subtle. Feeding every 6 hours. Extreme tier dog.
🦀 ClawHub
Elevenlabs Pro
ElevenLabs advanced TTS for converting text to speech, listing voices, and managing credits
🦀 ClawHub
Ai Video Sound Design
Add professional sound effects and audio layers to video with AI — automatically analyze your video's visual content and generate matching sound effects: foo...
🦀 ClawHub
U2-tts
Text-to-speech conversion using UniSound's TTS WebSocket API for generating high-quality Chinese Mandarin audio from text. Supports multiple voices, adjustab...
GitHub
mutagen
A Python module to handle audio metadata.
🦀 ClawHub
08 Video Merge
Locally merges video clips, dubbing audio, SRT subtitles, and background music into a 9:16 vertical short video ready for publishing.
🦀 ClawHub
Youtube Audio Download
Download YouTube video audio and convert to MP3. Supports age-restricted videos with cookies.
🦀 ClawHub
Windows TTS (WSL2)
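TTS that "speaks directly" on Windows 11 (calls powershell.exe + System.Speech from WSL2/TUI). Use when the user says "say it", "read it aloud", "voice broadcast", or "use TTS", or reports "no sound", "the mp3 generated by tts is empty", or "it won't play", and when Chinese speech is needed but OpenClaw's built-in tts is unavailable.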
🦀 ClawHub
Video Transcribe - Video to Text
Local video-to-text transcription using OpenAI Whisper for speech recognition. Completely free, runs offline, and protects privacy.
GitHub
gtts
Python library and CLI tool for converting text to speech using Google Translate TTS.
🦀 ClawHub
Tiktok Comment Reply Templates
Generate conversion-focused TikTok comment replies that turn questions and objections into safe next-step actions without sounding spammy. Use when the user...
🦀 ClawHub
Azure Ai Voicelive Py
Build real-time voice AI applications using Azure AI Voice Live SDK (azure-ai-voicelive). Use this skill when creating Python applications that need real-time bidirectional audio communication with Azure AI, including voice assistants, voice-enabled chatbots, real-time speech-to-speech translation, voice-driven avatars, or any WebSocket-based audio streaming with AI models. Supports Server VAD (Voice Activity Detection), turn-based conversation, function calling, MCP tools, avatar integration, a
🦀 ClawHub
FGO Invoicing
Issue FGO.ro invoices through the FGO API with local automation. Use for FGO tasks such as validating invoice payloads, issuing invoices, checking invoice st...
🦀 ClawHub
Pixcli Skill
Creative toolkit for AI agents — generate images, videos, voiceover, music, and sound effects, then assemble polished output via Remotion. Uses the pixcli CL...
🦀 ClawHub
minimax-media (James)
Use MiniMax API for image generation and text-to-speech (TTS). Supports image-01 model for images and speech-2.8-hd for voice synthesis. Install when needed.
🦀 ClawHub
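Douyin Video Quick Transcription
Fast Douyin video-to-text transcription (optimized version). The user sends a Douyin link and the spoken copy is extracted automatically. Features: local Whisper transcription, no API key required, zero cost, high privacy. Trigger words: 抖音 (Douyin), 转文字 (transcribe), 提取文案 (extract copy), 视频转录 (video transcription).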
🦀 ClawHub
clawr.ing
Make real phone calls. Replaces the voice-call plugin with a managed service that needs no setup. Use for wake-up calls, reminders, alerts, or when the user...
🦀 ClawHub
Voice Broadcast
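Voice broadcast control skill. Converts AI replies into spoken audio. Triggers: (1) when the user says "朗读" (read aloud), the AI's last text reply is automatically converted to speech; (2) when the user says "开启语音播报" (enable voice broadcast), all subsequent replies are read aloud automatically; (3) when the user says "静音" (mute), voice broadcast is paused. Use when a user (especially an iOS user) wants to receive information by voice, or cannot use their hands and wants replies played through TTS.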
🦀 ClawHub
Ai Voc Review Insights
AI-powered Voice of Customer (VoC) review intelligence agent using DeepSeek-style analysis. Deep semantic analysis of customer reviews to extract pain points...
GitHub
arcade
Arcade is a modern Python framework for crafting games with compelling graphics and sound.
GitHub
pydub
Manipulate audio with a simple and easy high level interface.
🦀 ClawHub
Ai Music Video Creator
Cloud-based ai-music-video-creator tool that handles generating music videos from a song and photos. Upload MP3, WAV, JPG, PNG files (up to 500MB), describe...
Page 11 / 51 (2,409 skills)