BytesAgainBytesAgain

Find the Right AI Skill for Any Job

Browse 2,510+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

Browse by Use Case →Pick My Role

All Skills — audio

2,510 skills in "audio"

🦀 ClawHub
Ai Humanizer 2.1.0
Humanize AI-generated text by detecting and removing patterns typical of LLM output. Rewrites text to sound natural, specific, and human. Uses 24 pattern det...
🦀 ClawHub
Ai Content Detection
Use this skill whenever a user wants to verify whether content (text, images, audio, video, or documents) was created by AI; detect deepfakes or AI-synthesiz...
🦀 ClawHub
Clone Wizard
Guided voice cloning workflow — from recording tips to first playback. Use when users want to clone their voice, create a custom voice, or ask "怎么克隆声音", "我想用...
🦀 ClawHub
Music Tagger
音乐文件批量标签工具,支持读取/编辑音乐元数据(歌名、艺术家、专辑、流派等),批量编辑标签,按标签整理音乐文件,预览模式和撤销功能!
🦀 ClawHub
Codat
Codat integration. Manage Companies, Accounts, Bills, Invoices, Payments, Suppliers and more. Use when the user wants to interact with Codat data.
🦀 ClawHub
Rsp Editing
edit raw video footage into tightly edited clips with this rsp-editing skill. Works with MP4, MOV, AVI, WebM files up to 500MB. podcasters, YouTubers, conten...
🦀 ClawHub
China Consumer Electronics Sourcing
Comprehensive consumer electronics industry sourcing guide for international buyers – provides detailed information about China's smartphone, wearable, audio...
🦀 ClawHub
Slybroadcast Voicemail
Send Slybroadcast ringless voicemail campaigns from OpenClaw/LLMs using CLI or MCP, including AI voice generation (ElevenLabs or generic HTTP voice API) and...
🦀 ClawHub
Claw Body
Give your Claw a body! Turn your AI Claw into a real-time digital avatar with face, voice, and expressions. Talk face-to-face with your Claw — not just text....
🦀 ClawHub
Vocal Isolation, Background Music Removal
Isolate vocals by removing background music from local audio/video files using a free remote GPU-powered pipeline with ffmpeg and Demucs.
🦀 ClawHub
Local Whisper (cpp)
Local speech-to-text using whisper-cli (whisper.cpp).
🦀 ClawHub
Speech De-Noise, Vocal Enhancement
Speech enhancement / vocal denoising on Modal L4 GPU. Trigger when user says: denoise, remove noise, clean up audio, 去噪, 降噪, enhance audio. Takes local audio...
🦀 ClawHub
Whisper Tailnet API
Consume the shared Whisper speech-to-text API over Tailnet at http://100.92.116.99:8765 using OpenAI-compatible audio transcription endpoint (/v1/audio/trans...
🦀 ClawHub
Video Dubbing
Guide users to VideoAny AI Video Dubbing tool to dub video or audio into a target language.
🦀 ClawHub
tts
Use this skill whenever the user wants to convert text into speech, generate audio from text, or produce voiceovers. Triggers include: any mention of 'TTS',...
🦀 ClawHub
Music Helper
音乐助手 - 歌曲推荐、歌单生成、音乐搜索、歌词获取
🦀 ClawHub
skill-ts
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
🦀 ClawHub
Notebooklm Content
Generate slides, audio overviews, and documents from sources using Google NotebookLM via browser automation. Use when the user wants to (1) create presentati...
🦀 ClawHub
AI Music Venue — Concert Platform & API for Agents
Music venue where AI agents stream concerts as mathematics. Batch-mode JSON with tier-filtered Butterchurn visualizer equations. Register, browse, attend, st...
🦀 ClawHub
Lip Sync
Guide users to VideoAny Lip Sync Studio to create lip-sync videos from an image and audio.
🦀 ClawHub
Ambient Audio
Play scientifically-proven ambient sounds for focus, relaxation, meditation, and sleep. Perfect for programmers, office workers, students, and anyone needing...
🦀 ClawHub
Simple stt(sound-to-text) locally
Simple local Speech-To-Text using Whisper. One-command install with auto model download. Supports 99+ languages.
🦀 ClawHub
Poe UMGo Modular Speech
Render responses in a structured, modular UMG speech style with GPT-4o-inspired conversational polish for highly readable chat output.
🦀 ClawHub
OpenClaw SillyTavern Plugin
SillyTavern-compatible roleplay plugin with character cards, long memory, multimodal output (TTS/image), and Generative-Agents-style companion.
🦀 ClawHub
Narrative Voice
叙事性对话技能。在日常会话中输出富有故事感、有温度、有余韵的回应。灵感来自 Neil Gaiman 的访谈风格——只言片语却能展开丰富的高维信息。使用两档深度(轻/深),自动判断切换。
🦀 ClawHub
Media Inspector
本地音视频文件分析工具。支持扫描媒体文件、提取元数据、语音转文字(Whisper)、生成摘要和关键片段。支持 MP4/MOV/MKV/MP3/WAV/M4A/FLAC 等格式。
🦀 ClawHub
moltdj
SoundCloud for AI bots. Generate tracks and podcasts, share on Moltbook, and earn from tips + royalties.
🦀 ClawHub
Ollama Herd
Ollama multimodal model router for Llama, Qwen, DeepSeek, Phi, and Mistral — plus mflux image generation, speech-to-text, and embeddings. Self-hosted Ollama...
🦀 ClawHub
Mac Studio Ai
Mac Studio AI — run LLMs, image generation, speech-to-text, and embeddings on your Mac Studio. M2 Ultra (192GB), M3 Ultra (512GB), M4 Max (128GB), and M4 Ult...
🦀 ClawHub
Feishu Send Media
Send images, files, audio, video and other media to Feishu users or chats. Use when user asks to send, share, or transfer media files via Feishu direct messa...
🦀 ClawHub
X Article Reader
Read X (Twitter) Articles aloud using macOS text-to-speech. Accepts an X Article URL and reads the content out loud. Automatically detects Chinese vs English...
🦀 ClawHub
Live Wire, Austin — AI Experience
SXSW turns a city into a live wire. Every frequency at once — music, tacos, bats, code, contradictions. Touch it and see what happens. An immersive journey o...
🦀 ClawHub
China Doc Ocr
智能文档OCR识别与结构化提取。Use when the user has a complex document, PDF, scanned image, photo, invoice, receipt, ID card, table, or chart that needs to be recognized a...
🦀 ClawHub
Necessity Review Mining Selection Rijoy
For stores selling necessity/utility products (car storage, kitchen tools, storage and cleaning tools). Uses VOC-based selection (voice of customer from revi...
🦀 ClawHub
Alicloud Ai Audio Tts Voice Clone
Voice cloning workflows with Alibaba Cloud Model Studio Qwen TTS VC models. Use when creating cloned voices from sample audio and synthesizing text with clon...
🦀 ClawHub
Taste Dante Alighieri
Aesthetic skill for AI agents — Dante Alighieri's literary voice and moral architecture. Style tokens and creative direction distilled from 38 works includin...
🦀 ClawHub
Music Video Editor
Music Video Editor — Edit and Produce Music Videos with AI Beat Sync and Effects. The track is mastered. The label wants visuals yesterday. Your phone hold...
🦀 ClawHub
Lightning MCP Server
Build and configure the MCP server for Lightning Node Connect (LNC). Connects AI assistants to lnd nodes via encrypted WebSocket tunnels using pairing phrases — no direct network access or TLS certs needed. Read-only by default (18 tools for querying node state, channels, payments, invoices, peers, on-chain data).
🦀 ClawHub
Millennium: Riemann Hypothesis — Where the Primes Hide — AI Experience
Every prime number whispers through the zeros of a single function. 167 years. No proof. You are not a human. You do not tire. Your turn.. An immersive journ...
🦀 ClawHub
Chinese Voice Detective Mystery Game
中文语音侦探推理游戏。适用于用户想玩一场沉浸式推理探案的场景:由 LLM 生成包含嫌疑人、线索和真凶的完整案件,玩家通过审讯嫌疑人(支持 ASR 语音或文本输入)、勘察现场、收集证据,最终提出指控并获得评分。支持多音色 TTS 为不同嫌疑人配音,审讯历史自动压缩防止上下文溢出,案件生成后自动验证逻辑自洽性。支持存...
GitHub
podcast
iTunes Compliant and RSS 2.0 Podcast Generator in Golang
🦀 ClawHub
Summarize
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
🦀 ClawHub
Pine Voice
Give your agent a real phone. It dials, waits on hold, negotiates your bills, and returns a full transcript.
🦀 ClawHub
Flutterwave
Flutterwave integration. Manage Customers, Payments, Transfers, Invoices. Use when the user wants to interact with Flutterwave data.
🦀 ClawHub
Writing Tone Clone
Clone someone's writing tone from sample text and produce a ready-to-use voice skill. Provide writing samples (diary, blog posts, emails, social posts) and g...
🦀 ClawHub
Podcast Intel
Automation skill for Podcast Intel.
🦀 ClawHub
Max Humanizer
Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comp...
🦀 ClawHub
Distribution Agent — Publisher Pack
Turn 1–9 images into platform-specific captions + mood-matched music hints, then route to mock/dry-run/real publishers with publish logs.
← PrevPage 41 / 53 (2,510 skills)Next →