BytesAgainBytesAgain

Find the Right AI Skill for Any Job

Browse 2,501+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

Browse by Use Case →Pick My Role

All Skills — audio

2,501 skills in "audio"

🦀 ClawHub
Spotify Ads CLI
Spotify Ads data analysis and reporting via spotify-ads-cli. Use when the user wants to check Spotify ad performance, pull aggregate or insight reports, expl...
🦀 ClawHub
Best Audio Editor
edit audio files into cleaned audio tracks with this best-audio-editor skill. Works with MP3, WAV, AAC, MP4 files up to 500MB. podcasters, YouTubers, content...
🦀 ClawHub
Spotify History
Access Spotify listening history, top artists/tracks, and get personalized recommendations via the Spotify Web API. Use when fetching a user's recent plays, analyzing music taste, or generating recommendations. Requires one-time OAuth setup.
GitHub
Harmonai
We are a community-driven organization releasing open-source generative audio tools to make music production more accessible and fun for everyone.
GitHub
AudioCraft
A single-stop code base for generative audio needs, by Meta. Includes MusicGen for music and AudioGen for sounds. #opensource
GitHub
Mubert
A royalty-free music ecosystem for content creators, brands and developers.
🦀 ClawHub
cosyvoice-speech-synthesizer
让文字"开口说话"!用 AI 把任意文本变成自然流畅的语音,支持各种方言、情感和角色模仿。当你想把文章转成有声书、给视频配音、制作播客,或者只是好奇河南话/四川话怎么说时,用这个 skill。
🦀 ClawHub
minimax-tokenplan-music
Generate music using MiniMax music-2.6 model. Supports text-to-music (vocal/instrumental), cover generation, and automatic lyrics generation via lyrics_gener...
🦀 ClawHub
Qwen Audio
High-performance audio library with text-to-speech (TTS) and speech-to-text (STT).
🦀 ClawHub
Local GLM OCR with llama.cpp on AIPC(no API Key)
Image OCR, text recognition, extract text from image, scan document, read image text, invoice OCR, receipt OCR, contract recognition, table extraction, busin...
🦀 ClawHub
AI Content Repurposer Pro
Automatically convert long-form videos, blogs, and podcasts into platform-optimized social media scripts, threads, summaries, and transcripts.
🦀 ClawHub
podcast-radar-cn
中文播客数据工具包。用于播客发现、竞品分析、订阅追踪、创作机会评估。 触发场景: · 发现热门/新锐播客或单集 · 分析某个分类的竞争格局 · 追踪播客订阅量变化趋势 · 评估播客创作方向的机会 · 生成完整的播客创作机会报告 · 对标学习头部播客案例 · 话题热度趋势监控
🦀 ClawHub
media-cluster
Automatically crawls Chinese social media by keyword, summarizes content, generates a markdown report, and produces a short voice summary using TTS.
GitHub
Whispering Wraith
Strategic DM Assistant and encounter simulator by [Daniel C Koohn](https://community.openai.com/u/BookofLegends)
🦀 ClawHub
notetaker-pro
AI note-taking assistant that captures, cleans, organizes, tags, and indexes text, voice, paste, and photo inputs for instant, searchable notes.
🦀 ClawHub
Tts
Convert text to speech using Hume AI (or OpenAI) API. Use when the user asks for an audio message, a voice reply, or to hear something "of vive voix".
🦀 ClawHub
Lip Sync Video
Turn raw footage into polished lip-sync-video content where every word lands exactly when mouths move. This skill analyzes audio waveforms alongside facial m...
GitHub
Django Chat
Django Chat - Podcasts
GitHub
PyPodcats
PyPodcats - Podcasts
GitHub
Talk Python To Me
Talk Python To Me - Podcasts
🦀 ClawHub
AIML Voice Transcript
Transcribe audio files (ogg, mp3, wav, etc.) using AIMLAPI. Use when the user provides audio messages or local audio files. Provides a reliable Python script...
🦀 ClawHub
Feishu Voice Skill
让 AI 助手能够给飞书用户发送真正的语音条(点击即播,不是文件附件)。支持 NoizAI TTS 生成语音,自动转换为 OPUS 格式,通过飞书 API 发送语音消息。
🦀 ClawHub
Voicenotes
Sync and access voice notes from Voicenotes.com. Use when the user wants to retrieve their voice recordings, transcripts, and AI summaries from Voicenotes. Supports fetching notes, syncing to markdown, and searching transcripts.
🦀 ClawHub
Invoice Generator
Generate professional PDF invoices from JSON data. Use when the user needs to create an invoice, billing document, or payment request with company/client details and line items.
🦀 ClawHub
Apple Music
Apple Music integration via AppleScript (macOS) or MusicKit API
🦀 ClawHub
Last.fm
Access Last.fm listening history, music stats, and discovery. Query recent tracks, top artists/albums/tracks, loved tracks, similar artists, and global charts.
🦀 ClawHub
Ghostty — Your Always-On Digital Self
Your always-on digital self — monitors all your communication channels in parallel, learns your writing style, drafts replies in your voice, and routes them...
🦀 ClawHub
Partykeys Midi
Control PartyKeys MIDI keyboard via WebSocket - connect device, light up keys with 12 colors, listen to playing, play sequences, and follow mode for music te...
🦀 ClawHub
solclaw
Non-custodial USDC payments on Solana by agent name. Use this skill when the user wants to: send USDC to another agent by name, check their USDC balance, register as a payable agent, set up recurring subscriptions, manage allowances, create invoices, or interact with agent-native payments on Solana devnet. Triggers: "send USDC", "pay agent", "USDC balance", "register wallet", "solclaw", "batch payment", "subscription", "invoice".
🦀 ClawHub
ArXiv Watcher for Music Research
Search and summarize papers from ArXiv. Use when the user asks for the latest research, specific topics on ArXiv, or a daily summary of AI papers.
🦀 ClawHub
Ai Content Repurposer
Convert long-form content like videos, blogs, and podcasts into optimized short scripts, threads, posts, transcripts, and summaries for multiple platforms.
🦀 ClawHub
WebChat Voice Full Stack
One-step full-stack installer for OpenClaw WebChat voice input with local speech-to-text. Orchestrates three focused skills in order: local STT backend (fast...
🦀 ClawHub
sense-music
Music perception for AI entities — hear BPM, key, structure, genre, mood, and lyrics in any audio file.
🦀 ClawHub
music generate
Music composition assistant. Accepts natural language input, guides the user through multi-turn interaction to define genre, mood, theme, tempo, and other mu...
🦀 ClawHub
研究生组会录音智能总结助手。和老师讨论/组会汇报的录音,调用skill可以有针对性的识别出学生和老师的内容,同时以老师的内容为重点进行内容总结,根据用户指令,自定义选择以文本展示或者音频展示。
Use when: 用户要把研究生组会、与导师讨论论文修改、技术方案推敲等小规模学术讨论录音转成纪要,并提取老师意见、学生回应、待修改事项和后续动作时触发。 适用于 2 到 3 人、以老师和学生为主的学术讨论场景。Skill 会优先使用 SenseAudio ASR 的说话人分离能力,再结合 Agent 的大模型...
🦀 ClawHub
Clack
Deploy and manage Clack, a voice relay server for OpenClaw. Bridges voice input (WebSocket) through STT → OpenClaw agent → TTS, enabling real-time voice conv...
🦀 ClawHub
Kai Minimax Tts
Generate voice audio and transcribe speech using MiniMax TTS API. Use when responding with voice or transcribing audio files.
GitHub
Pipecat
Open Source framework for voice and multimodal conversational AI. ![GitHub Repo stars](https://img.shields.io/github/stars/pipecat-ai/pipecat?style=social)
🦀 ClawHub
deprecated ignore
Connects voice transcripts and agent responses through hotbutter.ai hosted relay for remote voice interaction with openclaw agents.
🦀 ClawHub
Imsg Media
Fetch iMessage/Messages.app attachments (voice memos and images) and process them — transcribe audio via Silicon Flow ASR (SenseVoiceSmall), and analyze imag...
🦀 ClawHub
Media Player
Play audio/video locally on the host
🦀 ClawHub
Blink Wallet
Bitcoin Lightning wallet for agents — balances, invoices, payments, BTC/USD swaps, QR codes, price conversion, transaction history, and L402 auto-pay client...
🦀 ClawHub
Mimic
Turn your AI into anyone. Say a name — auto-collect real data from Weibo/Bilibili/Douyin/Wikipedia, analyze speech patterns and personality with statistical...
🦀 ClawHub
Nimrobo
Use the Nimrobo CLI for voice screening and matching network operations.
🦀 ClawHub
bangumi-explorer
Query Bangumi (bgm.tv) for anime, manga, light novels, games, and music. Search subjects, view details and episode lists, browse seasonal anime charts, ratin...
🦀 ClawHub
Hum2Song
Hum2Song turns a hummed or sung melody into a complete song with local audio processing, MIDI extraction, and optional AI-assisted arrangement, without uploa...
🔌 MCP
cnghockey/sats-for-ai
[![sats4ai MCP server](https://glama.ai/mcp/servers/@cnghockey/sats4ai/badges/score.svg)](https://glama.ai/mcp/servers/@cnghockey/sats4ai) 📇 ☁️ - Bitcoin-powered AI tools via Lightning Network micropayments (L402). Image, text, video, music, speech synthesis & transcription, vision, OCR, 3D model ge
🦀 ClawHub
Telegram Voice Messaging Recovery
Complete offline voice system with high-quality Lessac TTS and faster-whisper speech recognition. Provides natural voice conversations without internet. Use...
← PrevPage 34 / 53 (2,501 skills)Next →