BytesAgainBytesAgain

Find the Right AI Skill for Any Job

Browse 2,510+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

Browse by Use Case →Pick My Role

All Skills — audio

2,510 skills in "audio"

🦀 ClawHub
Quickbooks-Agent
QuickBooks Online CLI tool. Manage customers, invoices, payments, bills, vendors, accounts, items, expenses, journal entries, deposits, transfers, estimates,...
🦀 ClawHub
Xiaomi MiMo Voice
小米 MiMo V2 TTS 语音合成。支持中文、英文及多种风格(情感、角色扮演、方言、语速控制等)。
🦀 ClawHub
Douyin Content Tracker Skill
This skill should be used when the user wants to scrape Douyin (TikTok China) creator content, download audio, and transcribe it with Whisper. Covers first-t...
GitHub
AbletonGPT
I'm AbletonGPT, your go-to source for practical tips and troubleshooting advice on Ableton Live 11, dedicated to helping both beginners and intermediate users with their music production queries by [@HeyitsRadinn](https://github.com/HeyitsRadinn)
🦀 ClawHub
qwenspeak
Text-to-speech generation via Qwen3-TTS over SSH. Preset voices, voice cloning, voice design. Use when the user wants to generate speech audio, clone voices,...
🦀 ClawHub
FFmpeg
Process video and audio with correct codec selection, filtering, and encoding settings.
🦀 ClawHub
Geode On-device Transcribe & Summary
Transcribe and summarize audio/video files locally. Unlimited usage at a flat rate for heavy users.
🦀 ClawHub
Truly Local Piper Multilang TTS (secure)
Local offline text-to-speech via Piper TTS. Self-contained setup, automatic language detection, per-call voice selection. Extensible to any language. Writes...
🦀 ClawHub
EchoDecks
AI-powered flashcards and audio podcasts for active recall.
🦀 ClawHub
French
Write French that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
ElevenLabs Speech-to-Text
Transcribe audio files using ElevenLabs Speech-to-Text (Scribe v2).
🦀 ClawHub
Pdf Invoice Parser
Extract structured data from PDF invoices and documents. Handles scanned PDFs (OCR) and digital PDFs. Outputs clean CSV/Excel with vendor, invoice number, da...
🦀 ClawHub
SpotiClaw
Spotify Web API client for Nyx agents. Use when interacting with Spotify: search, playback, playlists, library, tracks, artists, albums, shows, podcasts. Req...
🦀 ClawHub
Thermostat
Adjust temperatures, diagnose comfort issues, calculate energy savings, and automate schedules through voice commands or smart home integration.
🦀 ClawHub
Siri
Control devices, run automations, and help users get more from Siri with HomeKit, Shortcuts, and voice command guidance.
🦀 ClawHub
Faster Whisper Gpu
High-performance local speech-to-text transcription using Faster Whisper with NVIDIA GPU acceleration. Transcribe audio files locally without sending data to...
🦀 ClawHub
Norman: Find Receipts
Find and attach missing receipts for business transactions. Search Gmail, email, or other sources for invoices and receipts, then upload them to Norman. Use...
🦀 ClawHub
Markdown Anything
Convert PDF, DOCX, XLSX, PPTX, images, audio, and 25+ file formats to clean Markdown using the Markdown Anything API.
🦀 ClawHub
Story Biographer
Turn reminiscence, oral-history, or life-review transcripts into clear narrative biography drafts while preserving the speaker's voice, keeping to evidence i...
🦀 ClawHub
Video Narrator
Generate SenseAudio TTS narration tracks for videos, including timestamped segments, style variants, and editor-ready voiceover exports. Use when users need...
🦀 ClawHub
导师 Mentor
Turn any public figure into your private AI mentor. Give a name — auto-collect their real posts, speeches, and content from social platforms, extract their t...
🦀 ClawHub
Daeva
Use this skill whenever the user wants to interact with local or remote GPU pods for AI inference tasks. This includes transcribing audio (Whisper/speech-to-...
🦀 ClawHub
ElevenLabs Music
Generate music from text prompts using ElevenLabs Eleven Music API. Use when creating songs, soundtracks, jingles, lullabies, or any audio music from descriptions. Supports vocals with AI-generated lyrics, instrumental tracks, and multiple genres/styles. Requires paid ElevenLabs plan.
🦀 ClawHub
Invoice verification rule management and maintenance skill
管理校验规则、规则组和校验场景的全流程操作。支持通过统一 CLI 工具快速执行 API 调用,自动处理参数解析、配置加载和错误提示。使用当用户需要进行校验规则管理、规则组维护、校验场景配置、启停操作或相关查询时,即使用户只说"帮我创建一条规则"或"查一下场景列表"也应触发。
🦀 ClawHub
Spotify
Control Spotify playback on macOS. Play/pause, skip tracks, control volume, play artists/albums/playlists. Use when a user asks to play music, control Spotify, change songs, or adjust Spotify volume.
🦀 ClawHub
Feishu Voice Bubble
Send native voice bubble messages (语音气泡) in Feishu/Lark chats using Edge TTS. Converts text to opus audio via Microsoft Edge TTS (free, no API key needed), t...
🦀 ClawHub
Construction Daily Report Generator
Generate a structured daily site progress report from unstructured input such as voice transcription, rough notes, or conversational messages.
🦀 ClawHub
VibeVoice TTS
Local Spanish TTS using Microsoft VibeVoice. Generate natural voice audio from text, optimized for WhatsApp voice messages.
🦀 ClawHub
Clideo Add Music To Video
Turn a 2-minute MP4 clip and an MP3 song into 1080p music-backed videos just by typing what you need. Whether it's adding background music to video clips or...
🦀 ClawHub
MH summarize
Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).
🦀 ClawHub
Vietnamese
Write Vietnamese that sounds human. Not formal, not robotic, not AI-generated.
🦀 ClawHub
Ai Tool For Video Generation
Skip the learning curve of professional editing software. Describe what you want — generate a 30-second video of a product launch with background music and t...
🦀 ClawHub
Ai Tool For Video Creation
Skip the learning curve of professional editing software. Describe what you want — combine these images and audio into a 30-second promotional video with tex...
🦀 ClawHub
FFmpeg CLI
Process video and audio using FFmpeg CLI for transcoding, cutting, merging, audio extraction, thumbnails, GIFs, speed, filters, subtitles, and watermarks.
🦀 ClawHub
飞书语音
实现飞书语音消息的上传下载、语音转文字及文字转语音,支持与 ElevenLabs 语音服务集成。
🦀 ClawHub
Audio Announcement Skills
Enables AI agents to announce their real-time actions via voice in multiple languages, with queued, concise, and friendly audio updates for tasks and status.
🦀 ClawHub
Audio Handler
Read, analyze, convert, trim, merge, adjust volume, and transcribe audio files in multiple formats including MP3, WAV, FLAC, AAC, OGG, and more.
🦀 ClawHub
whatisxlistening.to
Query Last.fm listening data, show now playing, sync scrobble history to local DB, and deploy a personal "now playing" web dashboard. Use when user asks about current music, listening stats, scrobble history, or wants to set up a Last.fm dashboard.
🦀 ClawHub
spotify-download
Download MP3s from Spotify playlists by fetching metadata, searching YouTube for tracks, and converting audio using ffmpeg with optional Spotify API credenti...
🦀 ClawHub
Neomano TTS (ElevenLabs)
Text-to-speech (TTS) via ElevenLabs. Use when the user asks to reply with voice/audio, generate a spoken version of some text, or asks for “voz”, “nota de vo...
🦀 ClawHub
seedance-2-video-gen
Seedance 2.0 AI video generation via EvoLink API. Three modes — text-to-video, image-to-video (1-2 images), reference-to-video (images + videos + audio). Aut...
🦀 ClawHub
Lyria
Generate 30-second instrumental music via Google Lyria (Vertex AI). Use when user requests music generation, specific styles/keys/instruments, or music itera...
🦀 ClawHub
Video Audio Remover
Skip the learning curve of professional editing software. Describe what you want — remove all audio from this video and export it silent — and get silent MP4...
🦀 ClawHub
Ai Humanizer.Bak
Humanize AI-generated text by detecting and removing patterns typical of LLM output. Rewrites text to sound natural, specific, and human. Uses 24 pattern det...
🦀 ClawHub
Podcast Video Camera
Get polished podcast videos ready to post, without touching a single slider. Upload your raw footage (MP4, MOV, AVI, WebM, up to 500MB), say something like "...
🦀 ClawHub
Bumblebee
Two modes: (1) BUMBLEBEE — Communicate through music by playing exact lyric lines on Spotify, like Bumblebee from Transformers speaking through radio snippet...
🦀 ClawHub
虾转音频
🎵 音视频格式转换与处理工具箱。基于 FFmpeg + Whisper AI,支持:格式转换、视频提取音频、合并、分割、压缩、查看信息、音频转文字。
🦀 ClawHub
Ai Create Video Free
Skip the learning curve of professional editing software. Describe what you want — turn these images into a 30-second promo video with background music — and...
← PrevPage 38 / 53 (2,510 skills)Next →