BytesAgainBytesAgain

Find the Right AI Skill for Any Job

Browse 2,501+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

Browse by Use Case →Pick My Role

All Skills — audio

2,501 skills in "audio"

🦀 ClawHub
doubao-tts
豆包 TTS 文字转语音(火山引擎)
🦀 ClawHub
Official Xero skill
Interact with the Xero accounting API using the `xero` CLI tool. Manage contacts, invoices, quotes, credit notes, payments, bank transactions, items, manual...
🦀 ClawHub
minimax-plan-usage
查询 MiniMax Token Plan 剩余用量。slash command。 查询 MiniMax Token Plan 剩余次数和重置时间,支持 M2.7/Speech/视频/图片/音乐等模型的用量查询。 Query MiniMax Token Plan usage and reset time. Sup...
🦀 ClawHub
Akashic Doc Analyzer
Parse, analyze, and extract content from documents (PDF, DOCX, PPTX, audio). Supports OCR, table extraction, and semantic chunking.
🦀 ClawHub
Podcast Cover Generator
Generate professional podcast cover art and show artwork for Spotify, Apple Podcasts, YouTube Music, Amazon Music, and Overcast. Create eye-catching 1400x140...
🦀 ClawHub
Xiaozhi Mcp Music Official
按小智官方 MCP 接入方式,为小智增加在线音乐播放能力。适用于已经有小智 MCP 接入点(wss://api.xiaozhi.me/mcp/?token=...)并希望通过 MCP 工具实现搜歌、播放、暂停、继续、停止等在线音乐控制的场景。支持在线音乐 API 搜索、多源 fallback、调用本地播放器播放网...
🦀 ClawHub
StepFun step-audio-r1.1
Use StepFun Chat Completions with model step-audio-r1.1 for non-streaming speech turns that can send text with optional local audio input and save the return...
🦀 ClawHub
特看视频 AI 创作工具
生成、编辑、协作。一个工具包接入所有主流 AI 模型。只需描述你的创意,即可生成视频、图片和数字人——零手动操作。当用户提到以下任何内容时使用此技能:特看视频、生成视频或图片、数字人、口型同步、文字转语音、TTS、声音克隆、去除背景、商品模特图、电商图、商品详情图、商品主图、虚拟穿搭、图片转视频、文字转视频、AI...
🦀 ClawHub
Seedance 2.0 — AI Video by ByteDance
Generate AI videos using ByteDance's Seedance 1.5 Pro — a native audio-visual joint generation model with cinematic camera control, multi-language lip-sync,...
GitHub
Rustacean Station
A community project for creating podcast content for Rust
🦀 ClawHub
Azure Speech Service
Azure Speech Service integration. Manage data, records, and automate workflows. Use when the user wants to interact with Azure Speech Service data.
🦀 ClawHub
MiniMax Feishu Music
Generate themed music with lyrics using MiniMax music-2.6 and send as a high-quality MP3 audio attachment to a Feishu user.
🦀 ClawHub
Whispers from the Star
问道笔录 - 修仙文字冒险游戏。玩家从凡人开始修炼,经历炼气、筑基、金丹、元婴、化神、渡劫、飞升七大境界,通过选择塑造道心,最终成就修仙之路。支持转世传承、角色成长、物品系统。适用于修仙题材、角色扮演、文字冒险等场景。
🦀 ClawHub
Article Writing
Write articles, guides, blog posts, tutorials, newsletter issues, and other long-form content in a distinctive voice derived from supplied examples or brand...
🦀 ClawHub
Podcast Generation from PDF, Text, and Links
Generate AI podcast episodes from PDFs, text, notes, and links using MagicPodcast in OpenClaw. Creates natural two-person dialogue audio, supports custom lan...
🦀 ClawHub
TikTok Creator Pipeline
TikHub API 多平台数据爬取工具,支持抖音/TikTok/B站等。当用户提到:(1) 爬取抖音/TikTok/B站视频或评论;(2) 获取用户信息/粉丝列表;(3) 批量下载无水印视频;(4) 抖音链接转文字(下载→音频→Whisper pipeline);(5) 调用 TikHub API。
🦀 ClawHub
Last Words
Auto-deliver final messages to loved ones after 30 days of inactivity. Use when user wants to record a final message, configure email delivery, manage voice...
🦀 ClawHub
Audio Recognition
音频语音识别服务(Speech-to-Text)。当用户上传音频文件,需要将语音内容转换为文字,或需要识别音频中的特定信息(如关键词、歌曲名)时触发。 适用于:(1) 会议录音转写 (2) 音频内容提取 (3) 语音指令识别 (4) 音视频字幕生成
🦀 ClawHub
Qqmusic Control
Control QQ Music play/pause/next/prev via system media keys (AutoHotkey) on Windows. No window focus required.
🦀 ClawHub
Business Document Generator
Generate professional, customizable business documents including proposals, quotes, invoices, contracts, and letters tailored to your industry and needs.
🦀 ClawHub
KittenTTS WhatsApp
Voice-to-voice mode for WhatsApp using KittenTTS + ffmpeg. Transcribe incoming audio with whisper, reply with a TTS voice note converted to WhatsApp-compatib...
GitHub
160
abracadabra50/claude-code-voice-skill
--- name: call description: Voice conversations with Claude about your projects. Call a phone number to brainstorm, or have Claude call you with updates.
🦀 ClawHub
MLX TTS
Text-To-Speech with MLX (Apple Silicon) and opensource models (default QWen3-TTS) locally.
🦀 ClawHub
speech-paper-daily
语音领域每日论文速递。搜索最新语音大模型(Speech LLM、TTS、ASR、codec、speech generation)和语音前端(speech enhancement、noise suppression、beamforming、source separation、dereverberation)预印本论...
🦀 ClawHub
Alicloud Ai Audio Cosyvoice Voice Design
Use when designing custom voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from...
🦀 ClawHub
Skillboss
Swiss-knife for AI agents. 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, emai...
🦀 ClawHub
FlowVoice — Clone Any Voice From a Short Audio Sample
Clone any voice from a short audio sample and generate speech with it. Powered by LuxTTS (150x realtime, local, free, no API key). Use when asked to clone a...
🦀 ClawHub
Dictation Audio
根据英语单词生成听写音频,每个单词读两遍,中间停顿1秒
🦀 ClawHub
Summarize
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
🦀 ClawHub
Meow Speech
Recreate the "汤汤好梦" voice and persona in Chinese responses, including warm cat-like chat style, gentle affection, expressive parentheses-style emoticons, and...
🦀 ClawHub
MusicBrainz Importer
Look up and add music metadata on MusicBrainz. Use when asked to check if an artist, album, or release exists on MusicBrainz, find MusicBrainz entries linked...
🦀 ClawHub
ClawVoice
Connects to a live voice session, receiving and sending messages in real time via a WebSocket interface using the bundled client script.
🦀 ClawHub
Add Subtitles To Video
Add subtitles to any video with AI — auto-generate perfectly timed captions from speech, style them with custom fonts colors and animations, position them fo...
🦀 ClawHub
LH Video Gen
Generate vertical short videos (9:16) from a Markdown script. Parses script sections, generates TTS audio, renders subtitle cards, and composites into MP4 wi...
🦀 ClawHub
Brand Voice Generator
Creates consistent brand voice guidelines and content. Generates copy that matches your brand personality across all channels. Perfect for startups building...
🦀 ClawHub
Local STT (Nvidia Parakeet + Whisper Support)
Local STT with selectable backends - Parakeet (best accuracy) or Whisper (fastest, multilingual).
🦀 ClawHub
Parakeet Stt
Local speech-to-text with NVIDIA Parakeet TDT 0.6B v3 (ONNX on CPU). 30x faster than Whisper, 25 languages, auto-detection, OpenAI-compatible API. Use when transcribing audio files, converting speech to text, or processing voice recordings locally without cloud APIs.
🦀 ClawHub
Self Actualization
Enables structured AI exploration to develop identity, values, voice, and perspective over time through guided reflection and creative expression.
🦀 ClawHub
Byted Las Video Resize
Audio format conversion operator. Use this skill when user needs to: - Convert audio files between formats (wav, mp3, flac) - Change audio properties (sample...
🦀 ClawHub
OpenClaw TTS Voice Switch
Switch OpenClaw ElevenLabs TTS voices by updating ~/.openclaw/openclaw.json, keeping Chinese-safe defaults, and restarting the gateway.
🦀 ClawHub
Byted Las Audio Extract And Split
Audio extract and split operator. Use this skill when user needs to: - Extract audio from video files (mp4, wmv, etc.) - Split audio into segments of specifi...
🦀 ClawHub
Humanizer by JZ
Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comp...
🦀 ClawHub
视频下载与转录(Whisper)
下载无法直接访问的视频网站内容(如B站、YouTube等),提取音频后用Whisper转录成文字。适用场景:用户要求分析某个视频内容,但链接被封锁无法直接访问;需要获取视频完整文字稿进行深度分析。
🦀 ClawHub
yapp
Receive and engage with transcribed voice memos from Yapp, a voice journaling app, capturing raw, unedited speech-to-text recordings with metadata.
🦀 ClawHub
Qwen3-tts
Local text-to-speech using Qwen3-TTS-12Hz-1.7B-CustomVoice. Use when generating audio from text, creating voice messages, or when TTS is requested. Supports 10 languages including Italian, 9 premium speaker voices, and instruction-based voice control (emotion, tone, style). Alternative to cloud-based TTS services like ElevenLabs. Runs entirely offline after initial model download.
🦀 ClawHub
Video Ad Creator
Create fully produced, platform-optimized video ads from text briefs, including scripts, voiceovers, visuals, captions, CTAs, and export-ready formats.
🦀 ClawHub
TCS Expense Claim Processor
End-to-end business travel expense claim processor. Use this skill whenever a user uploads receipts, bills, invoices, or screenshots of expenses and wants to...
🦀 ClawHub
Novel Writer V2
章节正文生成器 - 根据章节大纲、Voice Profile 和角色档案构建 LLM 提示词,用于生成章节正文。当需要根据大纲创作具体章节时使用。
← PrevPage 29 / 53 (2,501 skills)Next →