BytesAgainBytesAgain

Find the Right AI Skill for Any Job

Browse 2,510+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

Browse by Use Case →Pick My Role

All Skills — audio

2,510 skills in "audio"

🦀 ClawHub
Clonev
Clone any voice and generate speech using Coqui XTTS v2. SUPER SIMPLE - provide a voice sample (6-30 sec WAV) and text, get cloned voice audio. Supports 14+ languages. Use when the user wants to (1) Clone their voice or someone else's voice, (2) Generate speech that sounds like a specific person, (3) Create personalized voice messages, (4) Multi-lingual voice cloning (speak any language with cloned voice).
🦀 ClawHub
senseaudio-let-claw-talkv1
当用户希望把 AudioClaw 变成一个持续监听、开口就说、停顿就回答的本机语音助手时使用。这个 skill 支持 macOS 和 Windows 两个平台:优先尝试 Python 录音链路,macOS 上再提供原生 Swift 录音兜底;用户语音通过 SenseAudio ASR 转文字,再发给 audioc...
🦀 ClawHub
Shorts Editor
Edit short-form vertical videos with AI — trim, cut, add captions, transitions, music, effects, text overlays, and speed changes for YouTube Shorts, TikTok,...
🦀 ClawHub
Chord Analyzer
Analyze music audio files to extract chord progressions, key signature, tempo, and song structure. Use when user wants to identify chords, analyze a song's h...
🦀 ClawHub
Openai
OpenAI API integration — chat completions, embeddings, image generation, audio transcription, file management, fine-tuning, and assistants via the OpenAI RES...
🦀 ClawHub
fal
Search, explore, and run fal.ai generative AI models (image generation, video, audio, 3D). Use when user wants to generate images, videos, or other media with AI models.
🦀 ClawHub
Best Podcast Video
convert audio or video files into polished podcast videos with this skill. Works with MP3, MP4, WAV, MOV files up to 500MB. podcasters use it for converting...
🦀 ClawHub
Free Video Audio Replace
Get re-audited video files ready to post, without touching a single slider. Upload your video with audio (MP4, MOV, AVI, WebM, up to 500MB), say something li...
🦀 ClawHub
News Summarizer Official
Fetch and summarize global news from BBC, Reuters, NPR RSS feeds into concise text or voice briefings covering major current events.
🦀 ClawHub
Ai Video Music Lesson Video
Learn any instrument or music skill through clear video instruction with AI — generate music lesson videos covering instrument technique, music theory, ear t...
🦀 ClawHub
wittiot-device-skill
WittIoT气象站数据查询,支持WittStation系列气象站,提供实时温湿度、气压、光照、风速风向、降雨量等传感器数据查询,以及24小时/7天/30天历史趋势查询。也支持通过设备短码(shortcode)免登录查询公开气象站数据。
🦀 ClawHub
Music Lyric Video
Describe your song and NemoVideo creates the lyric video. Word-for-word animated lyrics, karaoke style, minimalist type on color, or cinematic lyric reveal —...
🦀 ClawHub
Ai Video Narrator
Add professional AI narration and voiceover to any video — generate natural-sounding narration from text or scripts, match voice tone to video mood, synchron...
GitHub
Fliki
Create text to video and text to speech content with ai powered voices in minutes.
🦀 ClawHub
Cyber Horn
Turn text into spoken Feishu (Lark) voice messages. Use when the agent should speak in a Feishu group, send voice alerts or announcements, or reply with a pl...
🦀 ClawHub
Tomoviee Text to Sound Effects
Generate sound effects from text prompts using Tomoviee Text-to-Sound-Effect API (`tm_text2sfx`) through Wondershare OpenAPI gateway (`https://openapi.wonder...
🦀 ClawHub
Xiaomi-MiMo-V2-TTS
小米 MiMo V2 TTS 文字转语音模型(官网目前免费)。支持中文/英文,内置情感风格(开心/悲伤/生气)、角色扮演(孙悟空/林黛玉)、方言(东北话/四川话/粤语/河南话)、语速控制及唱歌能力。mp3/opus 格式可直接发送至微信/飞书。 **配置(必需)**:在 `openclaw.json` 的 `sk...
🦀 ClawHub
Humanizer
Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comp...
🦀 ClawHub
Voiceover App
Turn silent footage into compelling, broadcast-ready content with the voiceover-app skill. Built for content creators, educators, and video producers, this s...
🦀 ClawHub
poocr vatinvoice2excel
使用 poocr 库识别发票并导出 Excel。当用户需要识别增值税发票、批量处理发票文件或提取发票信息到 Excel 时调用此技能。
🦀 ClawHub
Zyt video compose
Use Chanjing video synthesis APIs to create digital human videos from text or audio, with optional background upload, task polling, and explicit download whe...
🦀 ClawHub
Sonos Music Search Skill
Search and play music on Sonos speakers using Brave Search to find Spotify tracks
🦀 ClawHub
FaceTime Auto Call
Make FaceTime audio/video calls via AppleScript. Automatically handles notification clicking with multi-depth fallback. Use when user wants to call someone o...
🦀 ClawHub
headache-relief-asmr
This skill provides ASMR audio relief recommendations for users experiencing headaches. It matches users to appropriate audio resources based on their gender...
🦀 ClawHub
Veed Fabric
Generate talking head videos from a photo using VEED Fabric 1.0. Triggers on mentions of "veed", "fabric", or "talking video". Turns a headshot + audio or te...
🦀 ClawHub
Voice Note Polisher
将语音转录文本整理成目标格式的书面内容。凡是输入内容明显是口语化的——包括语音转录、口述笔记、随口说的想法、会议后的口头复盘——都应触发此 skill。支持的触发短语包括"帮我整理成…""帮我把接下来我说的话整理成…",目标格式包括备忘录、任务清单、中文邮件回复、英文邮件回复、英文口语、消息草稿、公众号写作提纲。...
🦀 ClawHub
Clawriosity
Daily curiosity feed from AIgneous Million Whys — query "why" questions by topic or semantic search, delivered as quizzes, articles, or podcast scripts. Try...
🦀 ClawHub
Clawshier
Process receipt or invoice images into structured expenses and log them to Google Sheets. Use when the user wants to scan, log, track, or record an expense f...
🦀 ClawHub
Best Unified Video Lyrics
add video with audio into lyrics-synced videos with this skill. Works with MP4, MOV, AVI, WebM files up to 500MB. musicians and content creators use it for a...
🦀 ClawHub
Elevenlabs Transcribe
Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files.
🦀 ClawHub
Mutinynet CLI
Interact with the Mutinynet Bitcoin testnet faucet. Get testnet bitcoin on-chain, pay lightning invoices, open lightning channels, and generate bolt11 invoic...
🦀 ClawHub
Chords Fetcher
Fetch clean guitar chords and lyrics from popular sites (mychords.net, amdm.ru, ultimate-guitar.com). Strips tabs, fixes formatting.
🦀 ClawHub
Fritz Connection
Dieser Skill ermöglicht die Abfrage von Statusinformationen und die Steuerung einer AVM FRITZ!Box über die TR-064 Schnittstelle. Er bietet Funktionen für Sta...
🦀 ClawHub
Music Identify
Identify songs from audio clips using AudD API and optionally queue them to Spotify. Triggers on /songsearch command, voice messages with song identification...
🦀 ClawHub
Sag Andy27725
ElevenLabs text-to-speech with mac-style say UX.
🦀 ClawHub
PPT to Speech Skill
将 PPT/PPTX 文件转换为结构化演讲稿。当用户说"帮我整理这份PPT"、"把这个PPT转成演讲稿/文章"、"提取PPT内容"、"生成演讲稿"、"PPT转markdown"、"分析这份幻灯片",或提供了 .pptx 文件路径并要求处理时,立即使用此 skill。无需用户配置任何 API Key,由 Agent...
🦀 ClawHub
Firm Platform Audit Pack
Platform alignment audit pack for OpenClaw 2026.2. Secrets v2, agent routing, voice security, trust model, autoupdate, plugin SDK, content boundaries, and sq...
🦀 ClawHub
espeak-ng
TTS with espeak-ng
🦀 ClawHub
Lipsyncvideo Ai
Match audio tracks to lip movements in your videos. lipsyncvideo-ai uploads your clip to a cloud GPU, syncs the audio you provide to the speaker's mouth, and...
🦀 ClawHub
Claw Use Android
Control and interact with real Android phones via HTTP and CLI without ADB or root, supporting screen reading, taps, typing, apps, calls, and voice.
🦀 ClawHub
Claw Use — Device Control for AI Agents
Control physical devices over HTTP with unified commands for screen reading, input actions, app launch, navigation, and audio output using the Claw Use proto...
🦀 ClawHub
Odoo Reporting
Query Odoo data including salesperson performance, customer analytics, orders, invoices, CRM, accounting, VAT, inventory, and AR/AP. Generates WhatsApp cards...
🦀 ClawHub
HN Podcast Archive
Automate podcast archiving by detecting new HN episodes from RSS, downloading audio, transcribing locally with Whisper, and generating markdown archives with...
🦀 ClawHub
Text To Podcast
将文本转换为播客音频(使用 TTS)
🦀 ClawHub
Podfetcher Tools
Search podcasts, browse episodes, and fetch podcast transcripts from Podfetcher using the bundled Node.js CLI, SDK, or MCP server.
🦀 ClawHub
Bilibili Transcript
Transcribe Bilibili videos to text with high accuracy using Whisper medium model. Use when the user provides a Bilibili video URL (BVxxxxx) and wants to: (1)...
🦀 ClawHub
Book Music Lessons
Book music-lessons services through Lokuli MCP. Use when user needs to find and book music-lessons. Triggers on requests like "book a music-lessons", "find music-lessons near me", or any music-lessons service request.
🦀 ClawHub
Invoice Scan
AI-powered invoice OCR, scanning, and data extraction. Use when: (1) user needs OCR or text extraction from invoice images, scanned documents, or PDFs, (2) s...
← PrevPage 43 / 53 (2,510 skills)Next →