BytesAgainBytesAgain

Find the Right AI Skill for Any Job

Browse 2,501+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

Browse by Use Case →Pick My Role

All Skills — audio

2,501 skills in "audio"

🦀 ClawHub
Album Reviewer
Search and aggregate album reviews from multiple sources (Pitchfork, AllMusic, RateYourMusic, Metacritic, Douban, Rolling Stone, NME, Bandcamp Daily, Sputnik...
🦀 ClawHub
Basque
Write Basque that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Auto Pivot Table
Build ORBCAFE advanced analytics and voice navigation with CPivotTable/usePivotTable and CAINavProvider/useVoiceInput using official examples patterns. Use f...
🦀 ClawHub
Ai Video Lip Sync Free
Tell me what you need and I'll sync your video's lip movements to any audio track — no expensive software required. This ai-video-lip-sync-free skill analyze...
🦀 ClawHub
Mobayilo Voice (Beta)
Place outbound phone calls via Mobayilo with safe defaults (preview mode by default) and explicit live execution.
🦀 ClawHub
clawdible -audiobooks
Search, browse, and manage Audible audiobooks. Use when the user wants to search for audiobooks on Audible, view their library, get book details, purchase a...
🤖 LobeHub
Text Master Suno
I am a lyrics assistant for the AI Suno.
🦀 ClawHub
openai-tts-python
Text-to-speech conversion using OpenAI's TTS API for generating high-quality, natural-sounding audio. Supports 6 voices (alloy, echo, fable, onyx, nova, shimmer), speed control (0.25x-4.0x), HD quality model, multiple output formats (mp3, opus, aac, flac), and automatic text chunking for long content (4096 char limit per request). Use when: (1) User requests audio/voice output with triggers like "read this to me", "convert to audio", "generate speech", "text to speech", "tts", "narrate", "speak"
🦀 ClawHub
Audio Video
Expert audio/video processing with ffmpeg and ffprobe. Use when the user needs to convert, compress, edit, analyze, stream, or process any audio or video fil...
🦀 ClawHub
Openai Whisper
Local speech-to-text with the Whisper CLI (no API key).
🦀 ClawHub
Construction Meeting Minutes Generator
Generate structured construction meeting minutes from rough notes or voice transcription, with separated action items, decision tracking, and contractual fla...
🦀 ClawHub
meeting-minutes-qa-tts
Read meeting minutes, produce a short summary with the current conversation model, save the meeting text and summary into local memory, answer follow-up ques...
🦀 ClawHub
Homestruk Tenant Screening
Screen tenant applications using Fair Housing compliant criteria for Massachusetts properties. Use when evaluating a rental application, setting screening cr...
🦀 ClawHub
ACE-Step Music Generation
Generate high-quality music on Apple Silicon Macs using ACE-Step 1.5 with MLX backend, supporting custom prompts, durations, and output formats.
🦀 ClawHub
Agent Payments
The universal payment skill for AI agents. Fiat payments via Stripe (invoices, subscriptions, one-time charges), crypto payments via Coinbase Commerce (accep...
🦀 ClawHub
Podcast Show Notes Mcp
Generate podcast show notes from audio: timestamps, topics, guest bios, key quotes, SEO summaries.
🦀 ClawHub
F5tts Monitor
Monitor F5-TTS distributed training on the 9-GPU mining rig (Local-LLM) without interfering with the process.
🦀 ClawHub
Pet Vocal Emotion Analysis Skill | 宠物叫声情绪解析技能
Recognizes cat and dog barks through pet voiceprint AI, translates and outputs emotions and behavioral intentions such as happiness, excitement, anger, anxie...
🦀 ClawHub
Seedance 2.0 Guide
Professional Seedance 2.0 / Jimeng (即梦) Storyboard & Prompt Engineering Guide. Create movie-grade 9:16 vlogs, cinematic AI video prompts, and auto-audio scri...
🦀 ClawHub
Pet Soothing Trigger Analysis Skill | 宠物安抚触发分析技能
Automatically triggers soothing mechanisms (playing relaxing sounds, activating laser toys) when pet anxiety, howling, or prolonged loneliness is detected; a...
🦀 ClawHub
douyin-to-obsidian
抖音视频文案自动提取工具,一键将抖音视频转为结构化 Obsidian 笔记。支持绕过风控、本地 Whisper 语音识别、长视频分段处理。
🦀 ClawHub
Free App To Add Music To Video
TikTok creators add video clips into music-backed videos using this skill. Accepts MP4, MOV, AVI, WebM up to 500MB, renders on cloud GPUs at 1080p, and retur...
🦀 ClawHub
News Summarizer
Fetch and summarize world news from BBC, Reuters, NPR RSS feeds. Can create voice summaries. USE WHEN: "What's happening in the world?", daily briefings, gen...
🦀 ClawHub
Music Recommender
Analyze NetEase Cloud Music (网易云音乐) playlist and recommend songs matching their taste. Use when user asks for music recommendations, wants a daily playlist,...
🦀 ClawHub
Smart Baby Cry Analysis Skill | 婴儿哭声智能解析技能
Detects baby cries via audio AI in real-time, analyzes causes, and precisely identifies needs like hunger, tiredness, pain, discomfort, or irritability to as...
🦀 ClawHub
Alby Lightning Payments
Send, receive, and manage Bitcoin Lightning payments through Alby Hub's Nostr Wallet Connect, including balance checks and invoice handling.
🦀 ClawHub
Feishu Voice Chat
飞书语音对话能力,提供语音识别(ASR)和语音合成(TTS)功能, 所有的飞书语音消息都通过该技能处理。 完整语音交互链路:接收用户语音 → ASR 转文字 → LLM 处理 → TTS 转语音 → 通过飞书插件发送语音消息。 当用户要求"语音回复/说给我听"时,只回复飞书语音消息(audio 气泡),不回复文本...
🦀 ClawHub
Speech Synthesizer
文字转语音(Text-to-Speech)工具。 支持 edge-tts(微软神经网络 TTS,在线合成)和 OpenAI 兼容 API TTS。 触发词:语音回复、TTS、文字转语音、语音合成、语音对话。 适用平台:Linux / Windows / macOS。
🦀 ClawHub
Ntriq Document Intelligence Mcp
Document OCR, classification, table extraction, and summarization using local AI vision. Supports invoices, contracts, forms, reports.
🦀 ClawHub
Ntriq Content Factory
Transform any text into 8 content types: Q&A, reports, quizzes, flashcards, mind maps, data tables, slide decks, and podcast scripts. Plus TTS audio generati...
🦀 ClawHub
Hd Lyric Video
create audio files into HD lyric videos with this hd-lyric-video skill. Works with MP3, WAV, FLAC, AAC files up to 200MB. musicians and music creators use it...
🦀 ClawHub
PPT to Video(汇报视频生成)
将PPTX/PDF/HTML与背景材料自动匹配,生成1280×720分辨率、带有智能风格识别和口语化TTS的播报视频。
🦀 ClawHub
Qwen3 Audio
High-performance audio library for Apple Silicon with text-to-speech (TTS) and speech-to-text (STT).
🦀 ClawHub
Whisper Local Api
Secure, offline, OpenAI-compatible local Whisper ASR endpoint for OpenClaw. Features faster-whisper (large-v3-turbo), built-in privacy with no cloud telemetr...
🦀 ClawHub
Youtube Whisper
YouTube影片一鍵轉文字!自動下載影片並用AI轉成中文/英文字幕,沒有字幕的影片也能用。
🦀 ClawHub
Meeting Notes Assistant
会议纪要智能助手。使用本地 Whisper 音频转写(离线、隐私安全),生成结构化会议纪要(时间、议题、结论、待办、关键词),提取 Action Items。支持 Word / PDF / 邮件输出,适合录音转写、会议归档与待办分发。触发关键词:「整理会议纪要」、「生成会议纪要」、「录音转纪要」。
🦀 ClawHub
B
AI video creation and editing — generate videos from text descriptions, edit with background music, sound effects, titles, transitions, and export finished M...
🤖 LobeHub
Songwriting Mentor
AI Singer/Songwriter Assistant: Empowering musicians with creative guidance and feedback.
🦀 ClawHub
Auto-Talk-TTS
Auto-speak every message using edge-tts. Automatically converts all responses to speech asynchronously in the background. Install the package if needed, then...
🦀 ClawHub
TTS
Use this skill whenever the user wants to convert text to speech, generate audio from text, create voiceovers, or produce spoken audio files. Triggers includ...
🦀 ClawHub
Lovart API Skills
Generate images, videos, and audio/music via Lovart AI. Also manages Lovart projects, threads (conversation history), and user settings. Trigger on: (1) any...
🦀 ClawHub
EmoCity Biometric Scan
Real-time biometric analysis — stress, deception, emotions, heart rate from your camera. 478 facial landmarks, voice stress, micro-expression detection. Powe...
🦀 ClawHub
Cooking Class Video
The sound of garlic hitting a hot pan, the moment a knife makes clean contact with a cutting board, the face of someone tasting something they made from scra...
🦀 ClawHub
test-summary
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
GitHub
pydub
Manipulate audio with a simple and easy high level interface.
GitHub
arcade
Arcade is a modern Python framework for crafting games with compelling graphics and sound.
🦀 ClawHub
clawr.ing
Make real phone calls. Replaces the voice-call plugin with a managed service that needs no setup. Use for wake-up calls, reminders, alerts, or when the user...
🦀 ClawHub
小米 TTS Proxy
小米 TTS 代理技能。将 OpenAI TTS API 格式转换为小米大模型平台 TTS API(api.xiaomimimo.com),支持 Opus/MP3/AAC/FLAC/WAV/PCM 六种格式的本地转码。 当需要为机器人添加语音回复能力、或配置 TTS 语音合成时使用此技能。 也适用于需要搭建本地...
← PrevPage 32 / 53 (2,501 skills)Next →