BytesAgain

Find the Right AI Skill for Any Job

Browse 48+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

All Skills — audio

48 skills in "audio" matching "Workflows"

🦀 ClawHub
everything to markdown
Convert almost anything (PDF, DOCX, PPTX, XLSX, images, audio, YouTube, etc.) to Markdown using Microsoft MarkItDown. Optimized for AGENT and LLM workflows.
🦀 ClawHub
Edge TTS Voice System
Local voice system for OpenClaw using faster-whisper for inbound transcription and Edge TTS for outbound replies. Use when you need private voice workflows,...
🦀 ClawHub
Book Writing
Plan, draft, and revise complete books with chapter architecture, voice consistency, and finish-ready revision workflows.
🦀 ClawHub
Azure Speech Service
Azure Speech Service integration. Manage data, records, and automate workflows. Use when the user wants to interact with Azure Speech Service data.
🦀 ClawHub
Voiceflow
Voiceflow integration. Manage data, records, and automate workflows. Use when the user wants to interact with Voiceflow data.
🦀 ClawHub
Zoho Invoice
Zoho Invoice integration. Manage data, records, and automate workflows. Use when the user wants to interact with Zoho Invoice data.
🦀 ClawHub
Elevenlabs
ElevenLabs integration. Manage data, records, and automate workflows. Use when the user wants to interact with ElevenLabs data.
🔧 Dify
Plivo Verify (Dify)
OTP (One-Time Password) verification plugin for Dify using [Plivo's Verify API](https://www.plivo.com/verify/). This plugin enables phone number verification in your Dify workflows by sending OTP codes via SMS or voice call and validating user-entered codes. 1. A [Plivo account](https://console.plivo.com/accounts/register/) 2. Your Plivo Auth ID and Auth Token (found in the [Plivo Console](https:/
🦀 ClawHub
BookMorph Magic
Orchestrate book-to-content workflows to generate video, audio, cover images, and a manifest for episode or campaign packages.
🦀 ClawHub
Google Gemini Media
Use the Gemini API (Nano Banana image generation, Veo video, Gemini TTS speech and audio understanding) to deliver end-to-end multimodal media workflows and code templates for "generation + understanding".
GitHub
LargeModGames/spotatui
[[spotatui](https://crates.io/crates/spotatui)] - A Spotify terminal client with native streaming, synced lyrics, and real-time audio visualization
🦀 ClawHub
Review Miner
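Extract selling points, pain points, objections, and messaging that should be cut from reviews, ratings, and feedback. Use for reviews, voice-of-customer, marketing workflows; do not use for faking positive reviews or exposing user identities.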
🦀 ClawHub
Customer Voice Synthesizer
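Aggregate verbatim customer quotes from support, sales, reviews, and interviews, and organize them by JTBD/stage. Use for customer-voice, jtbd, research workflows; do not use for leaking user private data or selectively ignoring negative feedback.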
🦀 ClawHub
Mayar Payment Integration
Integrate Mayar.id payments to create invoices, generate payment links, track Indonesian payment methods, manage subscriptions, and automate payment workflows.
🦀 ClawHub
Meeting Summarizer
Transcribe meetings with SenseAudio ASR speaker diarization, timestamps, and meeting-note extraction workflows. Use when users need meeting transcription, me...
🦀 ClawHub
Rapper
Create and debug SenseAudio rap, hip-hop, or vocal song generation workflows using the `/v1/song/lyrics/create`, `/v1/song/lyrics/pending/:task_id`, `/v1/son...
🦀 ClawHub
Audio To Text Caption
Turn creator audio into clean text captions for ecommerce content and reuse. Use when teams need fast transcript-to-caption workflows.
🦀 ClawHub
Kimai Time Tracking
Complete Kimai time-tracking API integration. Manage timesheets, customers, projects, activities, teams, invoices and exports via REST API. Supports time tracking workflows, reporting, and administrative operations. Keywords - kimai, zeiterfassung, timesheet, tracking, project, customer, activity, invoice, export, timer, stunden
GitHub
RustAudio/cpal
Low-level cross-platform audio I/O library.
GitHub
Serial-ATA/lofty-rs
[[lofty](https://crates.io/crates/lofty)] - A library for reading and editing the metadata of various audio formats
🦀 ClawHub
Podwise
Podcast knowledge workflows powered by Podwise CLI: search podcasts and episodes by keyword, monitor followed shows for new releases, find popular episodes,...
🦀 ClawHub
baml-codegen
Use when generating BAML code for type-safe LLM extraction, classification, RAG, or agent workflows - creates complete .baml files with types, functions, clients, tests, and framework integrations from natural language requirements. Queries official BoundaryML repositories via MCP for real-time patterns. Supports multimodal inputs (images, audio), Python/TypeScript/Ruby/Go, 10+ frameworks, 50-70% token optimization, 95%+ compilation success.
🦀 ClawHub
Speechace
Speechace integration. Manage data, records, and automate workflows. Use when the user wants to interact with Speechace data.
GitHub
Spotifyd
An open source Spotify client running as a UNIX daemon.
🦀 ClawHub
MiniMax
Build with MiniMax text, speech, video, and music APIs using model routing, compatible SDKs, and safer multimodal workflows.
🦀 ClawHub
Elevenlabs AI
Access ElevenLabs APIs for text-to-speech, speech-to-speech, realtime speech-to-text, voice/model management, and dialogue workflows with direct HTTP calls.
🦀 ClawHub
Alicloud Ai Audio Tts Voice Clone
Voice cloning workflows with Alibaba Cloud Model Studio Qwen TTS VC models. Use when creating cloned voices from sample audio and synthesizing text with clon...
🦀 ClawHub
Groq API Inference
Build and debug Groq API chat and speech workflows with low-latency routing, structured outputs, and production-safe patterns.
🦀 ClawHub
Glasses to Social
Turn smart glasses photos into social media posts. Monitors a Google Drive folder for new images from Meta Ray-Ban glasses (or any smart glasses), analyzes them with vision AI, drafts tweets/posts in the user's voice, and publishes on approval. Use when setting up a glasses-to-social pipeline, processing smart glasses photos for social media, or creating hands-free content workflows.
🦀 ClawHub
GoHighLevel
Connect your AI assistant to GoHighLevel CRM via the official API v2. Manage contacts, conversations, calendars, pipelines, invoices, payments, workflows, an...
🦀 ClawHub
VectorClaw MCP
MCP tools for Anki Vector: speech, motion, camera, sensors, and automation workflows.
🦀 ClawHub
Accessibility Toolkit
Friction-reduction patterns for agents helping humans with disabilities. Voice-first workflows, smart home templates, efficiency automation.
🦀 ClawHub
Fliz AI Video Generator
Complete integration guide for the Fliz REST API - an AI-powered video generation platform that transforms text content into professional videos with voiceovers, AI-generated images, and subtitles. Use this skill when: - Creating integrations with Fliz API (WordPress, Zapier, Make, n8n, custom apps) - Building video generation workflows via API - Implementing webhook handlers for video completion notifications - Developing automation tools that create, manage, or translate videos - Troubleshoot
🦀 ClawHub
Highlevel 1.0.7
Connect your AI assistant to GoHighLevel CRM via the official API v2. Manage contacts, conversations, calendars, pipelines, invoices, payments, workflows, an...
🦀 ClawHub
Podcast Production Ops
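Organize the podcast production process from topic selection to launch, generating show notes, titles, editing notes, and a release checklist. Use for podcast, production, content workflows; do not use for fabricating guest opinions or publishing unauthorized clips.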
🦀 ClawHub
openlesson
Interact with the openLesson tutoring API to generate learning plans, start audio-based sessions, analyze reasoning gaps, and manage tutoring workflows.
🦀 ClawHub
xeon_tts
Local TTS skill using OpenVINO Qwen3-TTS for voice cloning and emotion style synthesis, supporting QQBOT workflows with strict audio length and file retentio...
GitHub
Festival
A local music player/server/client
GitHub
ncspot
Cross-platform ncurses Spotify client, inspired by ncmpc and the likes.
GitHub
aschey/stream-download-rs
[[stream-download](https://crates.io/crates/stream-download)] - A library for streaming audio, video, and other media content
🦀 ClawHub
Dental Ai Receptionist
Complete AI voice receptionist system for dental practices. 12 workflows covering inbound call routing, appointment booking, reminders, no-show followup, can...
🦀 ClawHub
Ecomm Ai Voice Agent
Complete AI voice agent system for eCommerce order confirmation, customer support, and outbound campaigns. 12 production-ready n8n workflows with Vapi AI voi...
🦀 ClawHub
HomePod
Set up, troubleshoot, and optimize HomePod and HomeKit audio workflows with reliable Siri control and room-aware playback tuning.
🦀 ClawHub
Alicloud Ai Audio Tts Voice Design
Voice design workflows with Alibaba Cloud Model Studio Qwen TTS VD models. Use when creating custom synthetic voices from text descriptions and using them fo...
🦀 ClawHub
Audio
Process, enhance, and convert audio files with noise removal, normalization, format conversion, transcription, and podcast workflows.
🦀 ClawHub
Accessibility Toolkit 1.0.0
Friction-reduction patterns for agents helping humans with disabilities. Voice-first workflows, smart home templates, efficiency automation.
🦀 ClawHub
podwise-podcast-copilot
Podcast copilot workflows with podwise CLI: search podcasts or episodes by keyword, monitor followed shows for new releases, find current popular episodes, a...