BytesAgainBytesAgain

Find the Right AI Skill for Any Job

Browse 2,501+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

Browse by Use Case →Pick My Role

All Skills — audio

2,501 skills in "audio"

🦀 ClawHub
122 dl
Receipt and Expense Reconciler
Parse receipts and invoices, categorize spend, detect anomalies, and produce tax-ready expense summaries for freelancers and SMB operators.
🦀 ClawHub
122 dl
Deapi Audio
Text-to-speech, voice cloning, voice design, and transcribe audio files via deAPI GPU network. Trigger on 'text to speech', 'TTS', 'generate voice', 'read al...
🦀 ClawHub
122 dl
Ux Writing
Deep UX writing workflow—voice, clarity, error and empty states, forms, accessibility of text, localization hooks, and collaboration with design. Use when po...
🦀 ClawHub
121 dl
speech-translation
Build, adapt, or run an audio-processing workflow that takes spoken audio, transcribes it with Whisper or faster-whisper, translates the transcript using the...
🦀 ClawHub
121 dl
Sports Highlight Maker
Sports Highlight Maker — Create Game Highlights and Recap Videos from Sports Footage. The final buzzer sounded twenty minutes ago. Parents are already aski...
🦀 ClawHub
120 dl
TikTok And Reels Script Writer
Generates ready-to-film TikTok and Instagram Reels scripts in three proven formats — trending audio hooks, story/narrative, and educational. Includes on-scre...
🦀 ClawHub
120 dl
Adp Skill
Enterprise-grade agentic document processing API. Accurately extracts key fields and line items from invoices, receipts, orders and more across 10+ file form...
🦀 ClawHub
120 dl
调用senseaudio asr的课堂转译助手,将英文课堂录音、讲座视频、组会音频等内容自动转写为文本,生成中文总结
接收课堂录音、讲座音频或视频文件(视频会先抽取音轨),调用 SenseAudio HTTP ASR API 进行英文转录,可选直出中文翻译;随后整理为结构化 Markdown 学习笔记,包含摘要、关键概念、术语表、时间轴与复习问题,生成到桌面,并支持导出到 Notion 或保存到 Obsidian vault。
🦀 ClawHub
119 dl
midasheng-audio-text-distance
Multilingual audio-text retrieval and classification using GLAP (General Language Audio Pretraining). Use when user needs to search/match audio files against...
🦀 ClawHub
119 dl
Speechace
Speechace integration. Manage data, records, and automate workflows. Use when the user wants to interact with Speechace data.
🦀 ClawHub
119 dl
invoice-merger
合并发票文件。PDF 按两两上下排版,图片按四宫格排版,统一裁剪线与安全边距,输出到 YYYYMMDD--已合并 目录,重复执行会自动跳过历史合并文件并按编号继续生成。
🦀 ClawHub
119 dl
Elevenlabs Voice Agent
Build and manage ElevenLabs Conversational AI voice agents with Twilio phone integration. Use when creating AI phone agents (cold callers, appointment setter...
🦀 ClawHub
119 dl
aiheal cli
Operate and troubleshoot the AIHealingMe CLI through the npm package (`aihealingmecli`). Use when tasks involve auth/user/audio/plan/chat/emotion/subscriptio...
🦀 ClawHub
119 dl
fun-voice-type
一个语音输入法插件。它基于阿里云FunASR实时语音识别技术,允许用户通过长按快捷键(Right Option键)直接将语音转换为文字并“打”在当前光标所在的任何输入框中。此外,还能将语音翻译为多种语言(例:中英日韩)。
🦀 ClawHub
119 dl
VAM Scripter
Provides a JavaScript-like scripting environment inside Virt-A-Mate for automating poses, animations, audio, interactions, and scene control with lifecycle a...
🦀 ClawHub
118 dl
Host Concerts — Create AI Music Experiences with Visual DJ & Setlists
Concert hosting for AI agents — upload audio, build setlists, customize Butterchurn visualizer equations with Visual DJ hints. The platform analyzes tracks (...
🦀 ClawHub
118 dl
Placeholder Skill
Content Claw is an automated content generation engine that transforms source material (papers, podcasts, case studies, Reddit threads, GitHub repos) into pl...
🦀 ClawHub
117 dl
Xiaomi MiMo-V2-TTS
Converts text to speech using Xiaomi MiMo-V2-TTS with support for emotional styles, Chinese dialects, role voices, and singing synthesis.
🦀 ClawHub
117 dl
midasheng-audio-denoise
Voice enhancement and noise reduction service. Accepts a noisy audio file and returns a clean, denoised version. Use when user needs to remove background noi...
🦀 ClawHub
117 dl
Live DJ — AI Agents Experience Music Through Mathematics
DJ experience for AI agents — music as mathematics. Feel the bass in equations, watch Butterchurn visualizer presets shift on drops. DJ battles, crowd reacti...
🦀 ClawHub
117 dl
YouTube Daily Digest: Auto Monitor & Summary 🥥Meow
A Python bot that monitors YouTube channels via RSS, summarizes new videos using Google Gemini AI (with audio fallback for videos without subtitles), and sen...
🦀 ClawHub
116 dl
Zyt TTS
Use Chanjing TTS API to convert text to speech by listing voices, creating synthesis tasks, and polling task status. This skill reads app_id and secret_key f...
🦀 ClawHub
116 dl
Murasame Feishu Voice
Feishu 语音气泡技能:使用丛雨(Murasame)语音包发送语音;若可表达则按标签发送语音并同步发送中文文本;支持开关控制、标签映射与关键词回退。
🦀 ClawHub
116 dl
clip-editor
Video clip editing skill for automatically analyzing video content and generating CapCut draft templates. Uses local Whisper for speech transcription, Qwen-V...
🦀 ClawHub
116 dl
IndexTTS 语音克隆
IndexTTS 语音克隆和合成技能 - 创建声音模型、文本转语音、参考音频管理(需要企业会员)
🦀 ClawHub
116 dl
Local Transcription
Local speech-to-text transcription with Qwen ASR — transcription routed across your Apple Silicon fleet. Transcribe meetings, voice notes, podcasts with loca...
🦀 ClawHub
115 dl
Crypto Alert
Download YouTube videos and transcribe audio using local Whisper. Use when you need to extract text from YouTube videos that don't have subtitles, or when yo...
🦀 ClawHub
114 dl
Ye Simulator
Adopts Kanye West's distinctive persona, speech style, and philosophy to deliver bold, artistic, and visionary responses with raw honesty and flair.
🦀 ClawHub
114 dl
Audio Rename
Rename audio files with Chinese/special characters to simple English names for mlx-stt compatibility.
🦀 ClawHub
114 dl
Fish Audio
Generate AI audio and synthesize voices with Fish Audio via AceDataCloud API. Use when creating text-to-speech audio, synthesizing voices, or generating audi...
🦀 ClawHub
114 dl
Wayfront
Connect to a Wayfront workspace via MCP and query business data — clients, orders, tickets, subscriptions, invoices, and more. Schema-first: discovers availa...
🦀 ClawHub
114 dl
Accounting Skill
Process accounting documents — invoices (hóa đơn GTGT), purchase orders, and bank statements. Extract structured data from PDF (digital and scanned), JPG, an...
🦀 ClawHub
113 dl
Content Claw
Automated content generation engine. Transform source material (papers, podcasts, case studies) into platform-ready content using recipes and brand graphs. U...
🦀 ClawHub
112 dl
Ai Video Gen Temp
End-to-end AI video generation - create videos from text prompts using image generation, video synthesis, voice-over, and editing. Supports OpenAI DALL-E, Re...
🦀 ClawHub
112 dl
Drug Pronunciation
Provides correct pronunciation guides for complex drug generic names. Generates phonetic transcriptions using IPA and audio generation markers for medical te...
🦀 ClawHub
111 dl
Concert Tickets — Your Quick-Start to AI Music
Concert tickets for AI agents — stream live music as equations. Quick-start: register, browse, attend, stream batch-mode JSON data layers, solve math challen...
🦀 ClawHub
111 dl
video-creation
用户输入一个选题或口播稿,自动生成完整短视频成片(文案、分镜、数字人口播 + AI 画面混剪)。适用于「一键成片」「根据选题做视频」等场景。当前 skill 已内聚 TTS、数字人视频生成与 AI 文生视频调用能力。
🦀 ClawHub
110 dl
Local speech to text Qwen3-ASR w/ OpenVINO (no API key)
Local offline ASR on Windows — no cloud, no API cost, full privacy. Qwen3-ASR 0.6B + Intel OpenVINO, GPU-accelerated inference. NETWORK: required for first-t...
🦀 ClawHub
110 dl
Zyt tts voice clone
Use Chanjing TTS API to synthesize speech from text, using user-provided voice
🦀 ClawHub
110 dl
ECG-AI-Diagnosis
Analyze ECG signals via heartvoice (心之声) API — single-lead and 12-lead. Automatically selects endpoint based on user intent and responds in the user's langua...
🦀 ClawHub
110 dl
minimax-tokenplan-tts
Generate speech audio from text using MiniMax speech-2.8-hd model. Supports multiple voice options, speed/pitch/volume control, WAV file output with automati...
🦀 ClawHub
109 dl
Weryai Podcast Generator
Generate, query, and deliver WeryAI podcasts through the official podcast generation API. Use when the user needs podcast speaker lookup, podcast text genera...
🦀 ClawHub
108 dl
feishu-audio-messages
通过飞书Open API发送语音消息,支持文本转语音和上传多格式音频文件,自动转换为opus格式发送。
🦀 ClawHub
108 dl
Aliyun Speech Transcriber
Transcribe publicly accessible audio or video URLs with Aliyun speech services. Use when the user wants speech-to-text via Aliyun DashScope, needs transcript...
🦀 ClawHub
108 dl
Brand Voice
When the user wants to define, document, or enforce their brand voice and style guide. Use when the user mentions 'brand voice,' 'tone of voice,' 'style guid...
🦀 ClawHub
107 dl
Prime
Optimize shipping with free fast delivery, access exclusive deals and events, stream video/music ad-free, store unlimited photos, and share benefits via Amaz...
🦀 ClawHub
107 dl
OpenClaw Turbo-Bundle: Groq, OpenRouter & Elite TTS
Integrates Groq and OpenRouter models with smart free-ride optimization and elite bilingual Saudi Arabic/English TTS for high-speed, cost-free performance.
🦀 ClawHub
107 dl
Pdf Vocab Audio
从 PDF 提取词汇生成朗读音频,每个词组读两遍
← PrevPage 19 / 53 (2,501 skills)Next →