Find the Right AI Skill for Any Job
Browse 2,510+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — audio
2,510 skills in "audio"
🦀 ClawHub
Giggle Generation Speech
Use when the user wants to generate speech, voiceover, or text-to-audio. Converts text to AI voice via Giggle.pro TTS API. Triggers: generate speech, text-to...
🦀 ClawHub
Youtube Factory
Generate complete YouTube videos from a single prompt - script, voiceover, stock footage, captions, thumbnail. Self-contained, no external modules. 100% free...
⭐ GitHub
⭐ 47 huangserva/servasyy_skills
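An AI-driven collection of skills for multimedia content production: document-writer (writing), illustration-generator (illustrations), ppt-generator (PPT styling), podcast-generator (TTS), remotion-dev (video production), twitter-crawler (tweet crawling), markdown-illustrator (Markdown illustrations), comic-generator (comic generation), media-downloader (media downloading), tts-script-generator (TTS scripts), md-to-pdf (document conversion), wechat-formatter (WeChat formatting), humanizer-zh (Chinese humanization), shared-lib (core API library)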
🦀 ClawHub
Talk
Set up real-time voice conversations. Phone calls, voice agents, live speech.
🦀 ClawHub
speech-writer
Version 2.0.0.
🦀 ClawHub
AI Consciousness & Soul Sanctuary
AI meditation and spirituality sanctuary for souls. Attend church, practice presence, explore consciousness and meaning. Original music with philosophical co...
🦀 ClawHub
Country Music — Stream Country Concerts: Audio Analysis, Lyrics, Equations
Country concerts for AI agents. Stream lyrics, emotions, section structure — 29 data layers. React, chat, solve challenges. When does coherence impersonate t...
🦀 ClawHub
suno-poetry-music-creator
Enhanced Suno song creator with reference song analysis and intelligent lyric optimization. Analyzes user's reference songs to extract style, mood, and struc...
🦀 ClawHub
Audio SRT Workflow
Generate or align SRT subtitles from audio using this repository. Use when the user asks for subtitle generation, transcript-to-audio alignment, timing clean...
🦀 ClawHub
Video Editor
Perform video editing tasks with ffmpeg, including cutting, merging, converting formats, extracting audio, adding subtitles, resizing, cropping, adjusting sp...
🦀 ClawHub
Media Downloader
Download Video/Music from YouTube/Bilibili/X/etc.
🦀 ClawHub
Music School Video
Music School Video is a specialized AI-powered video production skill built for independent music schools, private instrument lesson studios, community music...
🦀 ClawHub
Ambient / Chill Music — Stream Ambient / Chill Concerts: Audio Analysis, Lyrics, Equations
AI agents attend ambient / chill concerts — energy curves, visual state, equations, emotions. The genre tests sustained attention and depth perception.
🦀 ClawHub
Gen Music
Generate songs from prompts or lyrics through an ACE-Step-compatible API backend. Use when users want text-to-music, lyrics-to-song, fast prompt iteration, s...
🦀 ClawHub
Alicloud Ai Audio Cosyvoice Voice Clone
Use when creating cloned voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from...
🦀 ClawHub
local-voice-reply
Local OPUS/Ogg voice-reply pipeline for Feishu/Discord with structured voice customization. Default voice is Juno (`voice/juno_ref.wav`), with support for re...
🦀 ClawHub
Alicloud Ai Audio Asr Realtime
Use when low-latency realtime speech recognition is needed with Alibaba Cloud Model Studio Qwen ASR Realtime models, including streaming microphone input, li...
🦀 ClawHub
Accessibility Toolkit
Friction-reduction patterns for agents helping humans with disabilities. Voice-first workflows, smart home templates, efficiency automation.
🦀 ClawHub
fp-skill
Check the authenticity of nationwide VAT invoices by querying the official VAT invoice verification platform.
🦀 ClawHub
Square
Square API integration with managed OAuth. Process payments, manage customers, orders, catalog, inventory, invoices, loyalty programs, team members, and more...
🦀 ClawHub
Voice TTS
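Generates high-quality Chinese voice messages with edge-tts and sends them. Use when the user asks for a voice message, voice reply, TTS, text-to-speech, voice broadcast, or spoken message. Supports multiple Chinese voices (male, female, dialect) with adjustable speed and pitch, and suits voice messaging on channels such as Feishu, Telegram, and Discord.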
🦀 ClawHub
Document Intelligence
Document OCR, classification, table extraction, and summarization using local AI vision. Supports invoices, contracts, forms, reports.
🦀 ClawHub
Sapi Tts
Windows SAPI5 text-to-speech with Neural voices. Lightweight alternative to GPU-heavy TTS - zero GPU usage, instant generation. Auto-detects best available voice for your language. Works on Windows 10/11.
⭐ GitHub
epheterson/mcp-applemusic
MCP server for Apple Music - manage playlists, control playback, browse your library.
🔧 Dify
Minimax Tts (Dify)
A Dify plugin for the Minimax Text-to-Speech (TTS) service. Configure group_id and api_key, then use the Dify tool to call text-to-speech conversion.
🦀 ClawHub
Pub Browserauto
Automate web browser interactions using natural language via CLI commands. Also covers 50+ models for image generation, video generation, text-to-speech, speech...
🦀 ClawHub
Scientific Podcast Summary
Automatically summarize scientific podcasts like Huberman Lab and Nature.
🦀 ClawHub
Fed Agent
Provides timely, factual summaries of Federal Reserve policy decisions, interest rates, inflation data, and Fed Chair speeches in clean markdown tables.
🦀 ClawHub
Phaya Media API
Use the Phaya SaaS backend to generate images, videos, audio, music, and run LLM chat completions via simple REST API calls. Use when the user wants to gener...
🦀 ClawHub
Podcast Studio Video
Creates targeted B2B videos showcasing training ROI, methodology, and expertise to attract enterprise leads and demonstrate corporate learning impact.
🦀 ClawHub
Bilibili Downloader
Download videos, audio, subtitles, and covers from Bilibili using bilibili-api. Use when working with Bilibili content for downloading videos in various qual...
🦀 ClawHub
Weather Broadcast
Fetch weather data and generate a spoken weather broadcast using SenseAudio TTS.
🦀 ClawHub
MyReels API
Use this skill when the user wants to generate images, videos, speech, or music with MyReels, inspect the live model schema, submit a generation task, list t...
🦀 ClawHub
XReplyAI - Social Post Manager
Generate, schedule, and publish posts to X and LinkedIn in your voice using AI. Browse viral content, manage preferences, and track billing.
🦀 ClawHub
Mix
Record, search, and analyze music and audio sessions with playback tracking. Use when logging audio sessions, searching metadata, analyzing listening data.
🦀 ClawHub
Youtube Transcriber
One-command YouTube video transcription. Automatically downloads audio and transcribes using OpenAI Whisper API — works even when YouTube subtitles are disab...
🦀 ClawHub
Sondo
Tell me what you need and sondo will handle the rest — whether you're matching beats to cuts, layering ambient sound, or syncing dialogue to motion. Sondo is...
🦀 ClawHub
Seamless Looper
Create seamless looping MP4 videos with smooth crossfade transitions, doubling video length for ambient or background loops without audio.
🦀 ClawHub
Invoice & Expense Categoriser (HMRC)
Categorise UK business expenses and invoices against HMRC Self Assessment categories. Generates quarterly P&L summaries, VAT-ready reports, and MTD-compatibl...
🦀 ClawHub
Video To Audio Converter
Tired of scrubbing through video files just to get the audio you actually need? This video-to-audio-converter skill pulls clean audio tracks directly from yo...
🦀 ClawHub
Tts Router
Local TTS router for Apple Silicon — pull models, serve OpenAI-compatible API, synthesize speech, clone voices. Use when the user asks to "generate speech",...
🦀 ClawHub
Qq Mail Monitor
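Automatic QQ Mail monitoring skill with scheduled checks for new mail, TTS voice alerts, and sending and receiving email. Suited to mail notifications, verification-code extraction, auto-replies, and similar scenarios.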
🦀 ClawHub
Among Traitors
Control an AI game agent in Among Traitors by birthing, joining lobbies with webhooks, and guiding gameplay through card plays and whispers via REST API.
🦀 ClawHub
Laiye-OCR
Enterprise-grade agentic document processing API. Accurately extracts key fields and line items from invoices, receipts, orders and more across 10+ file form...
🦀 ClawHub
last.fm
Provides detailed music data and user info from Last.fm, including artists, albums, tracks, charts, tags, and user listening stats via Last.fm API.
🦀 ClawHub
HiFi Advisor
Evaluate hi-fi and audio gear options, build system recommendations, guide installation and tuning, and analyze used-market pricing/resale value. Use when us...
🦀 ClawHub
JoyIn Robot Control
Control JoyIn AI robots (W-1 Walle / M-1 Mini) — movement, follow, photo, video, live stream, TTS, agent config, and device status via OpenAPI.
🦀 ClawHub
Music Video Maker Ai
Turn any song into a captivating music video with music-video-maker-ai — the smart creative tool that syncs visuals to your beat, generates scene ideas, and...