BytesAgain

Find the Right AI Skill for Any Job

Browse 2,342+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

All Skills — audio

2,342 skills in "audio"

🦀 ClawHub
Pollinations AI
Generate images, music, and videos from text prompts using Pollinations AI with models like flux, zimage, and suno-4 via API key.
🦀 ClawHub
tal-reddit-voice
Draft Reddit comments and posts using tal's direct, personal, and experience-based writing style with clear, honest advice and minimal fluff.
🦀 ClawHub
X Topic Tweet
Research a user-provided topic across the web and current social conversation, then publish one X post in the user's voice. Use when the user gives a topic,...
🦀 ClawHub
Douyin Video Transcriber
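(Verified) A powerful batch transcriber for Douyin videos that integrates downloading, audio extraction, and transcription with a local Whisper model.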
GitHub
Knowledge3D (K3D)
Sovereign GPU-native spatial AI architecture with PTX-first cognitive engine (RPN/TRM reasoning), tri-modal fusion (text/visual/audio), and 3D persistent memory ("Houses"). Features sub-100µs inference, procedural knowledge compression (69:1 ratio), and multi-agent swarm architecture. Zero external
🦀 ClawHub
clawdio
Auditory intelligence for AI agents. Transforms human audio into structured data, semantic reports, and machine-readable markdown. Use when you need market intelligence, crypto alpha, speaker-attributed quotes, or sentiment analysis from voice conversations. Requires x402 payment in USDC on Base Mainnet.
🦀 ClawHub
Flyworks Avatar Video
Generate videos using Flyworks (a.k.a. HiFly) Digital Humans. Create talking photo videos from images, use public avatars with TTS, or clone voices for custom audio.
🦀 ClawHub
video-transcriber
Transcribe speech from videos
🦀 ClawHub
VoiceMonkey
Control Alexa devices via VoiceMonkey API v2 - make announcements, trigger routines, start flows, and display media.
GitHub
Showtimes
Transcribes and summarizes audio content.
🦀 ClawHub
Cinematic Script Writer
Create professional cinematic scripts for AI video generation with character consistency and cinematography knowledge. Use when the user wants to write a cinematic script, create story contexts with characters, generate image prompts for AI video tools (Midjourney, Sora, Veo), or needs cinematography guidance (camera angles, lighting, color grading). Also use for character consistency sheets, voice profiles, anachronism detection, and saving scripts to Google Drive.
🦀 ClawHub
Sound FX
Generate short sound effects via ElevenLabs SFX (text-to-sound). Use when you need SFX clips like applause, canned laughter, whooshes, ambience, or short stingers, and optionally convert to WhatsApp-friendly .ogg/opus.
🦀 ClawHub
Latvian
Write Latvian that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Hungarian
Write Hungarian that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Invoice verification rule management and maintenance skill
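Manages the full workflow for verification rules, rule groups, and verification scenarios. Supports fast API calls through a unified CLI tool, with automatic argument parsing, configuration loading, and error messages. Use when the user needs to manage verification rules, maintain rule groups, configure verification scenarios, enable or disable items, or run related queries. It should also trigger when the user only says "帮我创建一条规则" ("help me create a rule") or "查一下场景列表" ("look up the scenario list").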
🦀 ClawHub
ARC Reactor
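LLM Wiki knowledge compilation engine. Compiles URLs, articles, videos, and other source material into a structured knowledge base. Trigger phrases: 搜一下 (search for it), 帮我看 (take a look for me), 这个讲了什么 (what is this about), 读一下 (read this), 看看这个 (look at this), 调研 (research), Ingest, 知识编译 (knowledge compilation). Supports video transcription (Alibaba Cloud NLS or local Whisper), intelligent web scraping, four-step Wiki Ingest (source/entity/index/log), 知...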
🦀 ClawHub
Research Brief Generator
Generates a comprehensive, structured research brief on any topic, person, case, or event. Ideal for journalists, podcasters, writers, and content creators w...
🦀 ClawHub
OpenClaw Panel
Control an OpenClaw LED panel (64x32 HUB75 on ESP32-S3) over HTTP — display text, graphics, shapes, play sounds, and read status.
🦀 ClawHub
Ironprose
Fiction prose analysis — catch weak verbs, repetition, clichés, passive voice, and other craft issues in manuscripts
🦀 ClawHub
Daily Voice Quote 每日名言語音
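Daily voice quote task. Produces a voice clip, a static video with a cover image, and (optionally) a HeyGen digital human video, then sends them to the owner.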
🦀 ClawHub
Speechace
Speechace integration. Manage data, records, and automate workflows. Use when the user wants to interact with Speechace data.
🦀 ClawHub
Turkish
Write Turkish that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
BibiGPT Skill
BibiGPT CLI for summarizing videos, audio, and podcasts directly in the terminal. Use when the user wants to summarize a URL (YouTube, Bilibili, podcast, etc...
🦀 ClawHub
voice-chat-mode
Activates when the user explicitly requests Chinese voice chat or Chinese voice mode.
🦀 ClawHub
Kai Realtime Voice
Real-time voice streaming via MiniMax WebSocket API. Use for low-latency voice conversations and streaming audio generation.
🦀 ClawHub
Unloopa Api
Make your agent sell websites to local businesses on autopilot. Finds leads from Google Maps, builds a custom AI website for each one, sends outreach emails, and can even call them. Use when the user wants to find leads, generate websites, send emails, or make voice calls.
🦀 ClawHub
Text To Video Ai Generator
Skip the learning curve of professional editing software. Describe what you want — turn this text into a 30-second video with visuals and background music —...
🦀 ClawHub
Video Generator Youtube
Skip the learning curve of professional editing software. Describe what you want — generate a YouTube video from my script with voiceover and visuals — and g...
🦀 ClawHub
mp4-to-mp3-extractor
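Batch-extracts audio from the .mp4 video files in a specified directory and converts it to .mp3. Supports specifying source and output directories; when no output directory is given, it creates a [source directory]_audio folder by default. Automatically manages a Python virtual environment, preserves the folder hierarchy, and works with both python3 and python. Common trigger phrases: mp4转mp3, 视频转音频, 批量提取音频, mp4 to mp...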
🦀 ClawHub
Assembly Large Audio Transcriber
Transcribe large audio files (100MB+, up to 1GB/12 hours) with speaker diarization. Uses AssemblyAI API with direct HTTP calls. Supports MP3, WAV, M4A, FLAC,...
🦀 ClawHub
Ukrainian
Write Ukrainian that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Percept Speaker ID
Identifies and tracks speakers in multi-person conversations, mapping speaker labels to names and managing voice command authorization levels.
🦀 ClawHub
Slovak
Write Slovak that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Openai Whisper
Local speech-to-text with the Whisper CLI (no API key).
🦀 ClawHub
Korean Document Reviewer
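Review and verification of Korean business documents (tax invoices, contracts, bankbook copies, quotations, transaction statements, business registration certificates, official project-funding request letters, subsidy applications, inspection reports, transfer confirmations, and final reports). Use when reviewing Korean business documents for format compliance, required fields, value accuracy, and cross-document consistency. Triggers on: 서류 검토, 문서 확인, 세금계산서 검증, 계약서 리뷰, 견적서 확인, 통장사본 검증, 거래명세서 확인, 사업자등록증 확인, 검수조서 검토, 이체확인증 검증, 결과보고서 검토, document review, invoice check.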
🦀 ClawHub
Zoom Meeting Assistance Rtms Unofficial Community
Zoom RTMS Meeting Assistant — start on-demand to capture meeting audio, video, transcript, screenshare, and chat via Zoom Real-Time Media Streams. Handles meeting.rtms_started and meeting.rtms_stopped webhook events. Provides AI-powered dialog suggestions, sentiment analysis, and live summaries with WhatsApp notifications. Use when a Zoom RTMS webhook fires or the user asks to record/analyze a meeting.
GitHub
Enjoy the Vue: The new Vue.js podcast
Enjoy the Vue: The new Vue.js podcast - Podcasts
🦀 ClawHub
minimax-tts
Use MiniMax speech-2.8-hd model for high-quality text-to-speech synthesis. Supports multiple Chinese and English voices. Install when needed.
🦀 ClawHub
Spotify
Control Spotify playback on any Linux device via command line, requiring Spotify Premium and an active Spotify session on another device.
🦀 ClawHub
MiniMax
Build with MiniMax text, speech, video, and music APIs using model routing, compatible SDKs, and safer multimodal workflows.
🦀 ClawHub
Accountant
Manage bookkeeping, financial statements, and tax planning with sound accounting practices.
🦀 ClawHub
Invoice Collector
Collect invoices/receipts from Gmail and send a summary email with attachments. Automatically downloads PDF attachments or takes screenshots of emails withou...
🦀 ClawHub
ElevenLabs Music
Generate music from text prompts using ElevenLabs Eleven Music API. Use when creating songs, soundtracks, jingles, lullabies, or any audio music from descriptions. Supports vocals with AI-generated lyrics, instrumental tracks, and multiple genres/styles. Requires paid ElevenLabs plan.
🦀 ClawHub
Venice API Kit
Complete Venice AI API toolkit - image generation, video, audio, embeddings, transcription, characters, models, and admin functions. Privacy-focused inferenc...
🦀 ClawHub
corespeed-studio
Generate video, images, audio, and music using 40+ AI models via fal.ai. Use for video generation (Kling v3, Sora 2, Veo 3.1, LTX 2.3, Pixverse v5), image ge...
🦀 ClawHub
network spirituality
Embody and create content in the Network Spirituality aesthetic — the Remilia/Milady cultural movement blending Y2K net art, anime, cyber-spiritualism, and post-ironic sincerity. Use when creating art descriptions, writing in this voice, engaging with Wired aesthetics, or channeling the Remilia collective energy.
🦀 ClawHub
Feishu Voice Loop
Accept text or voice input, transcribe if needed, generate natural OpenAI TTS speech, and send audio output to Feishu chat or web player.
🦀 ClawHub
video-translation
Translate and dub videos from one language to another, replacing the original audio with TTS while keeping the video intact.
Page 17 / 49 (2,342 skills)