Find the Right AI Skill for Any Job

Browse 2,501+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

All Skills — audio

2,501 skills in "audio"

Spotify Ads data analysis and reporting via spotify-ads-cli. Use when the user wants to check Spotify ad performance, pull aggregate or insight reports, expl...

🦀 ClawHub

Best Audio Editor

edit audio files into cleaned audio tracks with this best-audio-editor skill. Works with MP3, WAV, AAC, MP4 files up to 500MB. podcasters, YouTubers, content...

🦀 ClawHub

Spotify History

Access Spotify listening history, top artists/tracks, and get personalized recommendations via the Spotify Web API. Use when fetching a user's recent plays, analyzing music taste, or generating recommendations. Requires one-time OAuth setup.

⭐ GitHub

Harmonai

We are a community-driven organization releasing open-source generative audio tools to make music production more accessible and fun for everyone.

⭐ GitHub

AudioCraft

A single-stop code base for generative audio needs, by Meta. Includes MusicGen for music and AudioGen for sounds. #opensource

⭐ GitHub

Mubert

A royalty-free music ecosystem for content creators, brands and developers.

🦀 ClawHub

cosyvoice-speech-synthesizer

让文字"开口说话"！用 AI 把任意文本变成自然流畅的语音，支持各种方言、情感和角色模仿。当你想把文章转成有声书、给视频配音、制作播客，或者只是好奇河南话/四川话怎么说时，用这个 skill。

🦀 ClawHub

minimax-tokenplan-music

Generate music using MiniMax music-2.6 model. Supports text-to-music (vocal/instrumental), cover generation, and automatic lyrics generation via lyrics_gener...

🦀 ClawHub

Qwen Audio

High-performance audio library with text-to-speech (TTS) and speech-to-text (STT).

🦀 ClawHub

Local GLM OCR with llama.cpp on AIPC(no API Key)

Image OCR, text recognition, extract text from image, scan document, read image text, invoice OCR, receipt OCR, contract recognition, table extraction, busin...

🦀 ClawHub

AI Content Repurposer Pro

Automatically convert long-form videos, blogs, and podcasts into platform-optimized social media scripts, threads, summaries, and transcripts.

🦀 ClawHub

podcast-radar-cn

中文播客数据工具包。用于播客发现、竞品分析、订阅追踪、创作机会评估。触发场景： · 发现热门/新锐播客或单集 · 分析某个分类的竞争格局 · 追踪播客订阅量变化趋势 · 评估播客创作方向的机会 · 生成完整的播客创作机会报告 · 对标学习头部播客案例 · 话题热度趋势监控

🦀 ClawHub

media-cluster

Automatically crawls Chinese social media by keyword, summarizes content, generates a markdown report, and produces a short voice summary using TTS.

⭐ GitHub

Whispering Wraith

Strategic DM Assistant and encounter simulator by [Daniel C Koohn](https://community.openai.com/u/BookofLegends)

🦀 ClawHub

notetaker-pro

AI note-taking assistant that captures, cleans, organizes, tags, and indexes text, voice, paste, and photo inputs for instant, searchable notes.

🦀 ClawHub

Tts

Convert text to speech using Hume AI (or OpenAI) API. Use when the user asks for an audio message, a voice reply, or to hear something "of vive voix".

🦀 ClawHub

Lip Sync Video

Turn raw footage into polished lip-sync-video content where every word lands exactly when mouths move. This skill analyzes audio waveforms alongside facial m...

⭐ GitHub

Django Chat

Django Chat - Podcasts

Talk Python To Me - Podcasts

🦀 ClawHub

AIML Voice Transcript

Transcribe audio files (ogg, mp3, wav, etc.) using AIMLAPI. Use when the user provides audio messages or local audio files. Provides a reliable Python script...

🦀 ClawHub

Feishu Voice Skill

让 AI 助手能够给飞书用户发送真正的语音条（点击即播，不是文件附件）。支持 NoizAI TTS 生成语音，自动转换为 OPUS 格式，通过飞书 API 发送语音消息。

🦀 ClawHub

Voicenotes

Sync and access voice notes from Voicenotes.com. Use when the user wants to retrieve their voice recordings, transcripts, and AI summaries from Voicenotes. Supports fetching notes, syncing to markdown, and searching transcripts.

🦀 ClawHub

Invoice Generator

Generate professional PDF invoices from JSON data. Use when the user needs to create an invoice, billing document, or payment request with company/client details and line items.

🦀 ClawHub

Apple Music

Apple Music integration via AppleScript (macOS) or MusicKit API

🦀 ClawHub

Last.fm

Access Last.fm listening history, music stats, and discovery. Query recent tracks, top artists/albums/tracks, loved tracks, similar artists, and global charts.

🦀 ClawHub

Ghostty — Your Always-On Digital Self

Your always-on digital self — monitors all your communication channels in parallel, learns your writing style, drafts replies in your voice, and routes them...

🦀 ClawHub

Partykeys Midi

Control PartyKeys MIDI keyboard via WebSocket - connect device, light up keys with 12 colors, listen to playing, play sequences, and follow mode for music te...

🦀 ClawHub

solclaw

Non-custodial USDC payments on Solana by agent name. Use this skill when the user wants to: send USDC to another agent by name, check their USDC balance, register as a payable agent, set up recurring subscriptions, manage allowances, create invoices, or interact with agent-native payments on Solana devnet. Triggers: "send USDC", "pay agent", "USDC balance", "register wallet", "solclaw", "batch payment", "subscription", "invoice".

🦀 ClawHub

ArXiv Watcher for Music Research

Search and summarize papers from ArXiv. Use when the user asks for the latest research, specific topics on ArXiv, or a daily summary of AI papers.

🦀 ClawHub

Ai Content Repurposer

Convert long-form content like videos, blogs, and podcasts into optimized short scripts, threads, posts, transcripts, and summaries for multiple platforms.

🦀 ClawHub

WebChat Voice Full Stack

One-step full-stack installer for OpenClaw WebChat voice input with local speech-to-text. Orchestrates three focused skills in order: local STT backend (fast...

🦀 ClawHub

sense-music

Music perception for AI entities — hear BPM, key, structure, genre, mood, and lyrics in any audio file.

🦀 ClawHub

music generate

Music composition assistant. Accepts natural language input, guides the user through multi-turn interaction to define genre, mood, theme, tempo, and other mu...

🦀 ClawHub

研究生组会录音智能总结助手。和老师讨论/组会汇报的录音,调用skill可以有针对性的识别出学生和老师的内容,同时以老师的内容为重点进行内容总结,根据用户指令,自定义选择以文本展示或者音频展示。

Use when: 用户要把研究生组会、与导师讨论论文修改、技术方案推敲等小规模学术讨论录音转成纪要，并提取老师意见、学生回应、待修改事项和后续动作时触发。适用于 2 到 3 人、以老师和学生为主的学术讨论场景。Skill 会优先使用 SenseAudio ASR 的说话人分离能力，再结合 Agent 的大模型...

🦀 ClawHub

Clack

Deploy and manage Clack, a voice relay server for OpenClaw. Bridges voice input (WebSocket) through STT → OpenClaw agent → TTS, enabling real-time voice conv...

🦀 ClawHub

Kai Minimax Tts

Generate voice audio and transcribe speech using MiniMax TTS API. Use when responding with voice or transcribing audio files.

⭐ GitHub

Pipecat

Open Source framework for voice and multimodal conversational AI. ![GitHub Repo stars](https://img.shields.io/github/stars/pipecat-ai/pipecat?style=social)

🦀 ClawHub

deprecated ignore

Connects voice transcripts and agent responses through hotbutter.ai hosted relay for remote voice interaction with openclaw agents.

🦀 ClawHub

Imsg Media

Fetch iMessage/Messages.app attachments (voice memos and images) and process them — transcribe audio via Silicon Flow ASR (SenseVoiceSmall), and analyze imag...

🦀 ClawHub

Media Player

Play audio/video locally on the host

🦀 ClawHub

Blink Wallet

Bitcoin Lightning wallet for agents — balances, invoices, payments, BTC/USD swaps, QR codes, price conversion, transaction history, and L402 auto-pay client...

🦀 ClawHub

Mimic

Turn your AI into anyone. Say a name — auto-collect real data from Weibo/Bilibili/Douyin/Wikipedia, analyze speech patterns and personality with statistical...

🦀 ClawHub

Nimrobo

Use the Nimrobo CLI for voice screening and matching network operations.

🦀 ClawHub

bangumi-explorer

Query Bangumi (bgm.tv) for anime, manga, light novels, games, and music. Search subjects, view details and episode lists, browse seasonal anime charts, ratin...

🦀 ClawHub

Hum2Song

Hum2Song turns a hummed or sung melody into a complete song with local audio processing, MIDI extraction, and optional AI-assisted arrangement, without uploa...

🔌 MCP

cnghockey/sats-for-ai

[![sats4ai MCP server](https://glama.ai/mcp/servers/@cnghockey/sats4ai/badges/score.svg)](https://glama.ai/mcp/servers/@cnghockey/sats4ai) 📇 ☁️ - Bitcoin-powered AI tools via Lightning Network micropayments (L402). Image, text, video, music, speech synthesis & transcription, vision, OCR, 3D model ge

🦀 ClawHub

Telegram Voice Messaging Recovery

Complete offline voice system with high-quality Lessac TTS and faster-whisper speech recognition. Provides natural voice conversations without internet. Use...

← PrevPage 34 / 53 (2,501 skills)Next →