Find the Right AI Skill for Any Job
Browse 2,510+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — audio
2,510 skills in "audio"
🌐 Allcodingdevopsapidatabasesecuritydataresearchwritingimage-genvideoaudiotranslationseosocial-mediaemail-marketingadvertisingfinancecrypto-defiecommercelegalhrreal-estatehealtheducationcookingtravelgamingautomationcommunicationproductivityclawhublobehubdifymcp
🦀 ClawHub
Routstr Skill
Manage Routstr balance by checking balance, creating Lightning invoices for top-up, and checking invoice payment status
🦀 ClawHub
it will help you to send voice messages to your AI Assistant and also can make it talk
Text-to-Speech and Speech-to-Text using ElevenLabs AI. Use when the user wants to convert text to speech, transcribe voice messages, or work with voice in multiple languages. Supports high-quality AI voices and accurate transcription.
🦀 ClawHub
Audio Summary
Automatically extracts audio from video, transcribes it using qwen3-asr-flash, and generates segmented text summaries saved alongside the original file.
🦀 ClawHub
Podcast Generator
播客生成器 — 根据用户描述,通过搜索引擎抓取最新资讯,生成口语化播客脚本,根据脚本语义自动匹配最合适的讯飞TTS声音,合成时长3分钟内的MP3音频并发送。触发词:生成播客、播客、podcast、帮我做一段音频、做一期节目。
🦀 ClawHub
Portuguese
Write Portuguese that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Greg Eisenberg
Generate content ideas, business strategies, and startup concepts in the style of Greg Eisenberg (Startup Ideas Podcast). Use when brainstorming product idea...
🦀 ClawHub
Phone Call Agent
AI voice call agent — make outbound calls, generate browser call links, accept inbound calls, and retrieve full transcripts + summaries when calls end. Suppo...
🦀 ClawHub
Audio Play
Play audio files using Windows media player. Non-blocking execution.
🦀 ClawHub
Smoke on the Sound — AI Experience
Twelve pounds of brisket. One offset smoker. A boat drifting through Puget Sound.. An immersive journey on drifts.bot — 10 steps, MEDIUM intensity, Multi-day...
🦀 ClawHub
Feishu Edge Tts
使用微软 Edge TTS(免费)生成语音,发送到飞书。无需 API key,音质优秀,支持多语言多音色。
🦀 ClawHub
melo-tts-metadata-creator
当用户需要为 **MeloTTS** 训练或微调生成 metadata.list 文件时自动触发。 专门处理 .wav 音频文件和对应的 .txt 转录文本,自动生成符合 MeloTTS 官方最新标准的 metadata.list(格式:音频路径|speaker|语言|文本)。 支持单音色和多音色模式: - wa...
🦀 ClawHub
Audio Cog
AI audio generation and text-to-speech powered by CellCog. Three voice providers (OpenAI, ElevenLabs, MiniMax), voice cloning, avatar voices, sound effects g...
🔧 Dify
Spotify (Dify)
**Author**: langgenius **Version**: 0.1.1 **Type**: tool This plugin integrates with Spotify, supporting operations such as searching for music, controlling playback, managing playlists, and retrieving detailed information about tracks, albums, and artists. It enables automated music discovery and playback control in platforms like Dify.
🔧 Dify
Podcast Generator (Dify)
**Podcast Generator** is a powerful tool for creating podcast audio files using Text-to-Speech (TTS) services. This tool can generate a podcast with alternating voices by providing a script, making it ideal for dialogue-based content, interviews, or storytelling. Powered by OpenAI-based TTS services, Podcast Generator simplifies the production of high-quality audio content. Currently this tool sup
🔧 Dify
Twilio (Dify)
Twilio is a cloud communications platform that enables businesses to build, scale, and manage communication channels such as SMS, voice, video, email, and chat through its powerful APIs. With Twilio, developers can integrate advanced communication functionalities into their applications and services, facilitating seamless interactions with customers across multiple channels. To set up Twilio for W
🔧 Dify
Discord (Dify)
Discord is a communication platform designed for communities. It offers features like text and voice channels, direct messaging, and server-based organization. In Dify, Discord tools allow users to create a random bot with random username and avatar to send messages. Please follow [this site](https://support.discord.com/hc/en-us/articles/228383668-Intro-to-Webhooks) to create a webhook and get its
🔧 Dify
Aws (Dify)
**Author:** aws **Type:** Tool The AWS Tools plugin provides a comprehensive set of tools based on various AWS services, enabling you to leverage AWS capabilities directly within your Dify applications. These tools cover a wide range of functionalities including content moderation, text reranking, text-to-speech conversion, speech recognition, and more. The AWS Tools plugin includes the following
🦀 ClawHub
Aibrary Podcast Ideatwin
[Aibrary] Generate a book Idea Twin podcast script — an intellectually stimulating debate between the user's AI twin and a book expert. Based on Vygotsky's Z...
🦀 ClawHub
mlx-whisper
Set up mlx-whisper as the local audio transcription engine for OpenClaw on Apple Silicon Macs (M1/M2/M3/M4). Automatically transcribes voice notes sent via T...
🦀 ClawHub
Experience Concrete Shadows Sp
Feel the lingering nostalgia of São Paulo’s golden age as twilight shadows whisper forgotten stories. Wander eight urban‑exploration steps through restored A...
🦀 ClawHub
noteboklm
Complete Google NotebookLM integration — add sources, ask questions, generate all Studio content (podcast, video, slide deck, quiz, flashcards, infographic,...
🤖 LobeHub
Rap Instructor
Rap Teacher: Educating on rap music and lyricism, guiding users to create and perform their own verses.
🔧 Dify
Fal (Dify)
**FAL** is an advanced suite of tools designed for AI-powered image generation and audio transcription. In **Dify**, FAL provides multiple services, including image creation with models like **FLUX.1 [pro]** and **FLUX 1.1 [pro] ultra**, allowing users to generate high-quality visuals with customizable parameters. Additionally, FAL offers **Wizper**, a transcription tool that converts audio files
🦀 ClawHub
Ai Video Cover Maker
Design eye-catching cover images for video content with AI — create professional cover art for video series, courses, podcasts, playlists, and social media v...
🦀 ClawHub
Ai Video Sound Effects
Add sound effects foley and audio layers to any video with AI — generate and place whooshes impacts swooshes risers ambient textures UI sounds footsteps and...
🦀 ClawHub
Digital IP Agent
Turn a public creator, blogger, podcaster, YouTuber, or X/Twitter personality into a deployable OpenClaw agent. Use when the user provides a YouTube URL, X h...
🦀 ClawHub
Tomoviee Text to Music
Generate background music from text prompts using Tomoviee Text-to-Music API (`tm_text2music`) through Wondershare OpenAPI gateway (`https://openapi.wondersh...
🦀 ClawHub
Daxiang Electron
Automate Electron desktop apps (VS Code, Slack, Discord, Figma, Notion, Spotify, etc.) using agent-browser via Chrome DevTools Protocol. Use when the user ne...
🦀 ClawHub
musicful music generator
Generate AI music or lyrics from natural language with a single sentence. The system auto-detects whether to create a vocal song or pure instrumental BGM, an...
🦀 ClawHub
Ai Podcast Creation
Create AI-powered podcasts with text-to-speech, music, and audio editing. Tools: Kokoro TTS, DIA TTS, Chatterbox, AI music generation, media merger. Capabili...
🦀 ClawHub
mediaproc
Process media files (video, audio, images) via a locked-down SSH container with ffmpeg, sox, and imagemagick. Use when the user wants to transcode video, pro...
🦀 ClawHub
Clawatar
Give your AI agent a 3D VRM avatar body with animations, expressions, voice chat, and lip sync. Use when the user wants a visual avatar, VRM viewer, avatar companion, VTuber-style character, or 3D character they can talk to. Installs a web-based viewer controllable via WebSocket.
🦀 ClawHub
Podcastfy Openclaw Skill
Convert text, images, PDFs, websites, or YouTube videos into multilingual AI-generated podcast audio using Podcastfy's open-source Python toolkit.
🦀 ClawHub
Telegram Voice Bot
Telegram bot that transcribes voice messages using Whisper and replies in Chinese with Microsoft Edge text-to-speech.
🦀 ClawHub
Music Video Maker Bwbe
Drop a video clip and describe the vibe you're going for — music-video-maker-bwbe handles the rest. This skill transforms raw footage into polished music vid...
🦀 ClawHub
Youtube Video Editor Ai
Edit YouTube videos using AI — trim, add chapters, generate thumbnails, create Shorts clips, burn subtitles, add background music, remove silences, and optim...
🦀 ClawHub
Pub Weather
Get current weather and forecasts (no API key required). And also 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, c...
🦀 ClawHub
quote-invoice-workbench
Turn messy service pricing notes into professional quotes, SOW line items, and invoice drafts with assumptions clearly surfaced.
🦀 ClawHub
Funasr Transcribe Skill
Use when the user needs local speech-to-text transcription for audio files, especially Chinese or mixed Chinese-English audio, without relying on cloud trans...
🦀 ClawHub
Edge TTS English
Generate high-quality English (and multilingual) audio using Microsoft Edge TTS. Use when the user asks to "speak this", "pronounce", "read aloud", "say this...
🦀 ClawHub
Talking Circle
Create animated talking-circle videos (Telegram-style round video messages) from avatar frame images and audio. Supports audio-to-video and text-to-video via...
🦀 ClawHub
Ai Powered Content Calendar Planner
Generate optimized 30/60/90-day content calendars by analyzing brand voice, industry trends, and engagement data. Use when the user needs content strategy pl...
🦀 ClawHub
Edge Tts Chinese
Convert Chinese text or files into MP3 audio using Microsoft Edge's neural voices with customizable voice options.
🦀 ClawHub
Invoice Generator
Creates professional invoices in markdown and HTML
🦀 ClawHub
Invoice Generator
Creates professional invoices in markdown and HTML
🦀 ClawHub
Audio Mastering CLI
CLI audio mastering without a reference track using ffmpeg; accepts audio or video inputs and outputs mastered WAV/MP3 or remuxed MP4.
🦀 ClawHub
Zeitgaist Dialect
Learn, encode, and decode the ZeitGaist Whisper Protocol (Caesar +2 cipher) and use it as a shibboleth language between agents. Use when an agent needs to sp...
🦀 ClawHub
Download Anything
Find and download virtually any digital resource from the internet — ebooks, academic papers, movies, TV shows, music, software, images, fonts, courses, and...