BytesAgainBytesAgain

Find the Right AI Skill for Any Job

Browse 2,501+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

Browse by Use Case →Pick My Role

All Skills — audio

2,501 skills in "audio"

🦀 ClawHub
Elevenlabs Music
Generate music from text prompts using ElevenLabs Eleven Music API. Use when creating songs, soundtracks, jingles, lullabies, or any audio music from descrip...
🦀 ClawHub
Speech Therapist Video
Create concise parent-focused videos showcasing your personalized speech therapy approach, family involvement, and child progress to build trust and clarify...
🦀 ClawHub
Text to Music
AI music generation assistant powered by MakebestMusic. Use when user wants to create AI-generated music, songs, or audio tracks. Perfect for content creator...
🦀 ClawHub
Microsoft Edge TTS
Use Microsoft Edge online TTS service to convert text to speech. Supports command line and module invocation, no API key required.
🦀 ClawHub
Client Project Tracker
Track client projects, deliverables, deadlines, invoices, and relationships for freelancers and consultants. Light CRM with project history and communication...
🦀 ClawHub
抖音视频快速转文字
抖音视频快速转文字(优化版)。用户发抖音链接,自动提取文案。 特点:本地 Whisper 转录,无需 API Key,零成本,高隐私。 触发词:抖音、转文字、提取文案、视频转录
🦀 ClawHub
Media Orchestrator
Unified skill for resolving, downloading, and delivering media (audio/video) to chat platforms. Integrates yt-dlp for resolution and handles Spotify metadata sync.
🦀 ClawHub
Novel Writer V2
章节正文生成器 - 根据章节大纲、Voice Profile 和角色档案构建 LLM 提示词,用于生成章节正文。当需要根据大纲创作具体章节时使用。
🦀 ClawHub
TCS Expense Claim Processor
End-to-end business travel expense claim processor. Use this skill whenever a user uploads receipts, bills, invoices, or screenshots of expenses and wants to...
🦀 ClawHub
Video Ad Creator
Create fully produced, platform-optimized video ads from text briefs, including scripts, voiceovers, visuals, captions, CTAs, and export-ready formats.
🦀 ClawHub
Qwen3-tts
Local text-to-speech using Qwen3-TTS-12Hz-1.7B-CustomVoice. Use when generating audio from text, creating voice messages, or when TTS is requested. Supports 10 languages including Italian, 9 premium speaker voices, and instruction-based voice control (emotion, tone, style). Alternative to cloud-based TTS services like ElevenLabs. Runs entirely offline after initial model download.
🦀 ClawHub
yapp
Receive and engage with transcribed voice memos from Yapp, a voice journaling app, capturing raw, unedited speech-to-text recordings with metadata.
🦀 ClawHub
视频下载与转录(Whisper)
下载无法直接访问的视频网站内容(如B站、YouTube等),提取音频后用Whisper转录成文字。适用场景:用户要求分析某个视频内容,但链接被封锁无法直接访问;需要获取视频完整文字稿进行深度分析。
🦀 ClawHub
spotify-news-digest
Scrape and summarize Spotify-related news from multiple sources (Spotify official blogs, engineering/research/newsroom, TechCrunch, The Verge, Music Business...
🦀 ClawHub
Humanizer by JZ
Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comp...
🦀 ClawHub
Byted Las Audio Extract And Split
Audio extract and split operator. Use this skill when user needs to: - Extract audio from video files (mp4, wmv, etc.) - Split audio into segments of specifi...
🦀 ClawHub
OpenClaw TTS Voice Switch
Switch OpenClaw ElevenLabs TTS voices by updating ~/.openclaw/openclaw.json, keeping Chinese-safe defaults, and restarting the gateway.
🦀 ClawHub
Byted Las Video Resize
Audio format conversion operator. Use this skill when user needs to: - Convert audio files between formats (wav, mp3, flac) - Change audio properties (sample...
🦀 ClawHub
Self Actualization
Enables structured AI exploration to develop identity, values, voice, and perspective over time through guided reflection and creative expression.
🦀 ClawHub
Parakeet Stt
Local speech-to-text with NVIDIA Parakeet TDT 0.6B v3 (ONNX on CPU). 30x faster than Whisper, 25 languages, auto-detection, OpenAI-compatible API. Use when transcribing audio files, converting speech to text, or processing voice recordings locally without cloud APIs.
🦀 ClawHub
Local STT (Nvidia Parakeet + Whisper Support)
Local STT with selectable backends - Parakeet (best accuracy) or Whisper (fastest, multilingual).
🦀 ClawHub
Whisper GPU Audio Transcriber
Convert audio to SRT subtitles using OpenAI Whisper with automatic GPU acceleration for Intel XPU / NVIDIA CUDA / AMD ROCm / Apple Metal. Ideal for content c...
🦀 ClawHub
Brand Voice Generator
Creates consistent brand voice guidelines and content. Generates copy that matches your brand personality across all channels. Perfect for startups building...
🦀 ClawHub
LH Video Gen
Generate vertical short videos (9:16) from a Markdown script. Parses script sections, generates TTS audio, renders subtitle cards, and composites into MP4 wi...
🦀 ClawHub
Add Subtitles To Video
Add subtitles to any video with AI — auto-generate perfectly timed captions from speech, style them with custom fonts colors and animations, position them fo...
🦀 ClawHub
Comfy Story Video
Generate illustrated children's story videos with AI images and TTS narration using ComfyUI running locally.
🦀 ClawHub
ClawVoice
Connects to a live voice session, receiving and sending messages in real time via a WebSocket interface using the bundled client script.
🦀 ClawHub
MusicBrainz Importer
Look up and add music metadata on MusicBrainz. Use when asked to check if an artist, album, or release exists on MusicBrainz, find MusicBrainz entries linked...
🦀 ClawHub
Meow Speech
Recreate the "汤汤好梦" voice and persona in Chinese responses, including warm cat-like chat style, gentle affection, expressive parentheses-style emoticons, and...
🦀 ClawHub
Aitaxs Assistant 综合财税 AI 助手 面向工薪个人和个体小微企业主的一站式 AI 财税助手
综合财税 AI 助手,面向个体工商户和小微企业主。This skill should be used when users ask about tax calculation, tax policy interpretation, invoice handling, tax filing reminders, o...
🦀 ClawHub
Summarize
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
🦀 ClawHub
Dictation Audio
根据英语单词生成听写音频,每个单词读两遍,中间停顿1秒
🦀 ClawHub
FlowVoice — Clone Any Voice From a Short Audio Sample
Clone any voice from a short audio sample and generate speech with it. Powered by LuxTTS (150x realtime, local, free, no API key). Use when asked to clone a...
🦀 ClawHub
Skillboss
Swiss-knife for AI agents. 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, emai...
🦀 ClawHub
Alicloud Ai Audio Cosyvoice Voice Design
Use when designing custom voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from...
🦀 ClawHub
speech-paper-daily
语音领域每日论文速递。搜索最新语音大模型(Speech LLM、TTS、ASR、codec、speech generation)和语音前端(speech enhancement、noise suppression、beamforming、source separation、dereverberation)预印本论...
🦀 ClawHub
MLX TTS
Text-To-Speech with MLX (Apple Silicon) and opensource models (default QWen3-TTS) locally.
🦀 ClawHub
Openai Whisper Andy27725
Local speech-to-text with the Whisper CLI (no API key).
GitHub
160
abracadabra50/claude-code-voice-skill
--- name: call description: Voice conversations with Claude about your projects. Call a phone number to brainstorm, or have Claude call you with updates.
🦀 ClawHub
KittenTTS WhatsApp
Voice-to-voice mode for WhatsApp using KittenTTS + ffmpeg. Transcribe incoming audio with whisper, reply with a TTS voice note converted to WhatsApp-compatib...
🦀 ClawHub
Business Document Generator
Generate professional, customizable business documents including proposals, quotes, invoices, contracts, and letters tailored to your industry and needs.
🦀 ClawHub
WebChat Voice GUI
Voice input and microphone button for OpenClaw WebChat Control UI. Adds a mic button to chat, records audio via browser MediaRecorder, transcribes locally vi...
🦀 ClawHub
Qqmusic Control
Control QQ Music play/pause/next/prev via system media keys (AutoHotkey) on Windows. No window focus required.
🦀 ClawHub
Audio Recognition
音频语音识别服务(Speech-to-Text)。当用户上传音频文件,需要将语音内容转换为文字,或需要识别音频中的特定信息(如关键词、歌曲名)时触发。 适用于:(1) 会议录音转写 (2) 音频内容提取 (3) 语音指令识别 (4) 音视频字幕生成
🦀 ClawHub
Last Words
Auto-deliver final messages to loved ones after 30 days of inactivity. Use when user wants to record a final message, configure email delivery, manage voice...
🦀 ClawHub
podcast-highlights-deck
Create a highly visual, editorial long-scroll HTML microsite from a podcast episode. Use when the user gives a podcast link (Apple Podcasts/Spotify/RSS/direc...
🦀 ClawHub
TikTok Creator Pipeline
TikHub API 多平台数据爬取工具,支持抖音/TikTok/B站等。当用户提到:(1) 爬取抖音/TikTok/B站视频或评论;(2) 获取用户信息/粉丝列表;(3) 批量下载无水印视频;(4) 抖音链接转文字(下载→音频→Whisper pipeline);(5) 调用 TikHub API。
🦀 ClawHub
Podcast Generation from PDF, Text, and Links
Generate AI podcast episodes from PDFs, text, notes, and links using MagicPodcast in OpenClaw. Creates natural two-person dialogue audio, supports custom lan...
← PrevPage 25 / 53 (2,501 skills)Next →