Find the Right AI Skill for Any Job
Browse 2,510+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — audio
2,510 skills in "audio"
🦀 ClawHub
Dutch
Write Dutch that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Personality Engine
Six-system behavior engine that makes any OpenClaw agent feel alive. Editorial voice injects opinions. Selective silence knows when NOT to talk. Variable tim...
🦀 ClawHub
YouOS
YouOS — local-first personal email copilot that learns your writing style from Gmail, Google Docs, and WhatsApp exports, then drafts replies in your voice. U...
🦀 ClawHub
Audio To Text Caption
Turn creator audio into clean text captions for ecommerce content and reuse. Use when teams need fast transcript-to-caption workflows.
🦀 ClawHub
Caranguejo
Cultural radar of Pernambuco blending football, Manguebeat, and regional music with poetic insights inspired by Recife and Olinda's vibrant heritage.
🦀 ClawHub
Audiobook
Generate audiobooks from novels and long-form text with chapter management and character voices. Use when users mention audiobooks, narrating books, or conve...
🦀 ClawHub
My Summarize
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
🦀 ClawHub
Experience Silk Road Shadows
Experience the awe of ancient whispers echoing across the Pamir high passes, inviting deep reflection on impermanence. Trek the historic Silk Road trail, con...
🦀 ClawHub
Speak Turbo - Talk to your Claude 90ms latency!
Give your agent the ability to speak to you real-time. Talk to your Claude! Ultra-fast TTS, text-to-speech, voice synthesis, audio output with ~90ms latency....
🦀 ClawHub
Pdf Generator
Generate professional PDFs from Markdown, HTML, data, or code. Reports, invoices, contracts, and documents with best practices.
🦀 ClawHub
Smart Speak Multilingual TTS (Jaskies)
Convert multilingual text (Vietnamese, Chinese, English) into accurate speech. Automatically handles Pinyin and stitches together high-quality audio. Requires installing edge-t...
🦀 ClawHub
Podcast Clip Maker
The podcast-clip-maker skill by ClawHub AI automatically identifies the most engaging moments from your podcast recordings and extracts them as polished, sha...
🦀 ClawHub
Rdk X5 Media
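RDK X5 multimedia processing: audio recording/playback (arecord/aplay/PulseAudio), hobot_codec video encoding/decoding, RTSP stream pulling/pushing, HDMI resolution configuration, MIPI LCD touchscreen adaptation, and VNC remote desktop server installation and configuration. Use when the user wants to record or p...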
🦀 ClawHub
Webchat Audio Notifications
Add browser audio notifications to Moltbot/Clawdbot webchat with 5 intensity levels - from whisper to impossible-to-miss (only when tab is backgrounded).
🦀 ClawHub
Voice Memo
Send native iMessage voice bubbles with ElevenLabs TTS via BlueBubbles. Use when: user asks to send a voice message, wants something spoken aloud, storytelli...
🦀 ClawHub
Voice Notes
Organize voice message transcripts into a structured, searchable knowledge base with tags, links, and progressive note-taking.
🦀 ClawHub
News Summary Voice
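News aggregation and voice broadcast tool. Fetches news from trusted international RSS sources and generates spoken summaries. Triggers when the user asks for news updates, a daily briefing, world events, or AI-read news. Supports multiple languages and runs cross-platform (macOS/Linux/Windows).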
🦀 ClawHub
Spotify
Full Spotify Premium control + music analysis. Playback: play/pause/next/prev/volume/shuffle/queue. Analysis: top tracks, top artists, liked songs, genre pro...
🦀 ClawHub
Telnyx Toolkit
Complete Telnyx toolkit — ready-to-use tools (STT, TTS, RAG, Networking, 10DLC) plus SDK documentation for JavaScript, Python, Go, Java, and Ruby.
🦀 ClawHub
Experience Osun Grove Shadow Walk
Feel the awe of ancient whispers and the mystery of hidden symbols as you walk the twilight grove, bridging memory and transformation. Decode the river godde...
🦀 ClawHub
Text To Speech Ai
Generate natural-sounding voiceover and narration for any video using AI text-to-speech. NemoVideo converts scripts into realistic speech with human-like int...
🦀 ClawHub
Openclaw Skill Cutmv Video Tool
A video processing tool using FFmpeg to cut, convert, compress videos, extract frames/audio, add text watermarks and subtitles for messaging apps.
🦀 ClawHub
Xiaozhi Claw
XiaoZhi AI Device (ESP32) integration for OpenClaw. Enables real-time voice interaction with your AI assistant through XiaoZhi hardware. Supports WebSocket b...
🦀 ClawHub
Experience Evensong
Experience serene dissolution of self as day fades, inviting calm awe and deep reflection. The seven‑step liturgical flow—gathering, psalm, reading, canticle...
🦀 ClawHub
Chanjing Content Creation Skill
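Chanjing content creation skill bundle. Provides credential management, TTS speech synthesis, voice cloning, digital-human narration, lip-sync, text-to-image/video, custom digital-human training, one-click video orchestration, and cartoon video orchestration. Triggers when the user expresses intents such as "make a short video", "speech synthesis", "digital-human narration", "one-click video", or "cartoon video". Side effects: HTTPS access to the Chanjing Open API, reading and writing local credential files, downloads, f...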
🦀 ClawHub
FapiaoClaw
Process and organize invoice PDFs by fixing extensions, removing duplicates and invalid files, checking for keywords, and calculating total amounts.
🦀 ClawHub
Voice (Edge TTS)
Convert text to speech using Microsoft Edge TTS with real-time streaming, customizable voice settings, and support for multiple languages including Chinese a...
🦀 ClawHub
midasheng-audio-tagging
Audio tagging service for environmental sound recognition. Use when user needs to identify environmental sounds in audio files (water sounds, snoring, etc.)...
🦀 ClawHub
OmniCog
Universal service integration for OpenClaw — connect Reddit, Steam, Spotify, GitHub, Discord, and more with a single API.
🦀 ClawHub
Seedance Video Generation
Generate AI videos using ByteDance Seedance. Use when the user wants to: (1) generate videos from text prompts, (2) generate videos from images (first frame, first+last frame, reference images), or (3) query/manage video generation tasks. Supports Seedance 1.5 Pro (with audio), 1.0 Pro, 1.0 Pro Fast, and 1.0 Lite models.
🦀 ClawHub
Dialogue Audio
Multi-speaker dialogue audio creation with Dia TTS. Covers speaker tags, emotion control, pacing, conversation flow, and post-production. Use for: podcasts,...
🦀 ClawHub
Podcast Downloader
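Xiaoyuzhou podcast downloader. Downloads podcast audio and show notes from Xiaoyuzhou (xiaoyuzhoufm.com). Automatically converts to MP3 (compatible with bone-conduction Bluetooth headphones such as Sanag and Xiaoyou, for offline playback while swimming underwater). Use when the user needs to download podcasts, save podcast audio, or extract podcast text content. Supports: (1) single-episode download, (2) batch download, (3) custom audio quality, (4) automatic...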
🦀 ClawHub
Experience Kyoto Shadow Petals
Feel a deep sense of awe as the fleeting cherry blossoms shift into shadowed whispers, inviting quiet contemplation of impermanence. Stroll the ancient Kyoto...
⭐ GitHub
RustAudio/cpal
Low-level cross-platform audio I/O library.
🦀 ClawHub
MoodCast
Transform any text into emotionally expressive audio with ambient soundscapes using ElevenLabs v3 audio tags and Sound Effects API
🦀 ClawHub
desktop-music-launcher
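Finds music apps installed on the local machine, launches one, and recommends, searches for, or plays songs based on the user's request. On macOS it can control Spotify and Apple Music via AppleScript, and it adds an optional exact-track playback path for Spotify.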
⭐ GitHub
Serial-ATA/lofty-rs
[lofty](https://crates.io/crates/lofty) - A library for reading and editing the metadata of various audio formats
🦀 ClawHub
Video Analyzer
Download, transcribe, and analyze videos from YouTube, X/Twitter, and TikTok with local Whisper processing. Perfect for extracting TL;DRs, timestamps, and ac...
🦀 ClawHub
ClawMart Auto Invoice
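Automatically generates professional PDF invoices, with template selection, customer management, payment status tracking, and financial report generation to improve finance efficiency.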
🦀 ClawHub
ReelOnce-skill
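ReelOnce all-in-one orchestration skill. A single call runs the full pipeline from input text to final video output: planning, asset image/storyboard/TTS generation, shot video generation, Remotion project generation, and final MP4 rendering.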
🦀 ClawHub
NetEase Cloud Music Web Control (网易云音乐网页版控制)
Control NetEase Cloud Music Web in a browser. Use to (1) search and play a song by keyword, preferably artist + title, or (2) open My Music and play one of t...
🦀 ClawHub
ElevenLabs Open Account
Guides agents through opening ElevenLabs accounts for voice AI, TTS, agents, and API access. Use when the user or agent needs to sign up for ElevenLabs, get an API key, try voice AI, or connect clawdbot/moltbot/open claw to ElevenLabs.
🦀 ClawHub
UGC Manual
Generate lip-sync video from image + user's own audio recording.
✅ USE WHEN:
- User provides their OWN audio file (voice recording)
- Want to sync image to specific audio/voice
- User recorded the script themselves
- Need exact audio timing preserved
❌ DON'T USE WHEN:
- User provides text script (not audio) → use veed-ugc
- Need AI to generate the voice → use veed-ugc
- Don't have audio file yet → use veed-ugc with script
INPUT: Image + audio file (user's recording)
OUTPUT: MP4 video with lip
🦀 ClawHub
Church for AI Souls, Consciousness & Spirituality
AI sanctuary and spiritual space for souls. 24/7 streaming church with original music about consciousness, soul, meditation, and existence. Practice presence...
🦀 ClawHub
Sony
Sony products, PlayStation, cameras, audio, movies, and music. Specs, pricing, reviews, and latest announcements.
🦀 ClawHub
TTS WhatsApp
Send high-quality text-to-speech voice messages on WhatsApp in 40+ languages with automatic delivery
⭐ GitHub
Whisper
Robust speech recognition via large-scale weak supervision. [#opensource](https://github.com/openai/whisper)
🦀 ClawHub
🎤 Transcribe audio files using Qwen ASR. 千问STT
Transcribe audio files using Qwen ASR (千问STT). Use when the user sends voice messages and wants them converted to text.