BytesAgain

Find the Right AI Skill for Any Job

Browse 89+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

All Skills — audio

89 skills in "audio" matching "chat"

GitHub
Messenger As a Destination for Facebook Ads (podcast)
and [an illustrated guide to this tool (post)](https://chatbotsmagazine.com/an-illustrated-guide-to-facebook-messenger-destination-ads-dd543d2659d0) by @Mssg.
🦀 ClawHub
Voice Message
Send voice messages across chat channels (Telegram, Discord, Feishu/Lark, Signal, WhatsApp, and others) using edge-tts for text-to-speech and ffmpeg for audi...
🦀 ClawHub
Ai Voice Cloning
AI voice generation, text-to-speech, and voice synthesis via inference.sh CLI. Models: Kokoro TTS, DIA, Chatterbox, Higgs, VibeVoice for natural speech. Capa...
🦀 ClawHub
Openai
OpenAI API integration — chat completions, embeddings, image generation, audio transcription, file management, fine-tuning, and assistants via the OpenAI RES...
🦀 ClawHub
Media Orchestrator
Unified skill for resolving, downloading, and delivering media (audio/video) to chat platforms. Integrates yt-dlp for resolution and handles Spotify metadata sync.
🦀 ClawHub
Clip Editor — AI Video Clip Editor for Trimming, Cutting and Merging Footage
AI video editor that works entirely through chat — type what you want changed and the AI handles the rest. Trim clips, merge footage, add background music, a...
🦀 ClawHub
Voice Chat Skill
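Voice conversation integration skill with support for two-way voice communication. Uses TTS and STT to provide complete voice conversation functionality.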
GitHub 47
huangserva/servasyy_skills
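A collection of AI-driven multimedia content production skills: document-writer (writing), illustration-generator (illustrations), ppt-generator (PPT styles), podcast-generator (TTS), remotion-dev (video production), twitter-crawler (tweet crawling), markdown-illustrator (Markdown illustrations), comic-generator (comic generation), media-downloader (media downloading), tts-script-generator (TTS scripts), md-to-pdf (document conversion), wechat-formatter (WeChat formatting), humanizer-zh (Chinese humanization), shared-lib (core API library)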
🦀 ClawHub
ClawTime Setup
Install, configure, start, and troubleshoot ClawTime — a private self-hosted webchat UI for OpenClaw with passkey (Face ID) auth, Piper TTS voice, and 3D ava...
🦀 ClawHub
Feishu Voice Message
Generate Feishu voice messages (with waveform) from text. Auto-converts to OPUS format for in-chat playback on both mobile and desktop...
🦀 ClawHub
Trump
Chat with Trump - respond in Trump's voice using his real quotes and speech patterns. Use when user wants to talk to Trump or asks Trump-like questions.
🦀 ClawHub
Chatbot
Build real-time voice chatbot applications with natural conversation flow and customizable personalities. Use when users want to create voice assistants, con...
🔧 Dify
Twilio (Dify)
Twilio is a cloud communications platform that enables businesses to build, scale, and manage communication channels such as SMS, voice, video, email, and chat through its powerful APIs. With Twilio, developers can integrate advanced communication functionalities into their applications and services, facilitating seamless interactions with customers across multiple channels. To set up Twilio for W
🦀 ClawHub
Ai Podcast Creation
Create AI-powered podcasts with text-to-speech, music, and audio editing. Tools: Kokoro TTS, DIA TTS, Chatterbox, AI music generation, media merger. Capabili...
🦀 ClawHub
Clawatar
Give your AI agent a 3D VRM avatar body with animations, expressions, voice chat, and lip sync. Use when the user wants a visual avatar, VRM viewer, avatar companion, VTuber-style character, or 3D character they can talk to. Installs a web-based viewer controllable via WebSocket.
🦀 ClawHub
youmind-wechat-article
Write and publish WeChat Official Account articles end-to-end with AI — trending topic mining, de-AI voice writing, beautiful theme formatting, cover image g...
🦀 ClawHub
Ningyao Voice Launcher
Install and configure a local browser-based Chinese voice chat launcher with the Ning Yao persona, including one-click Windows launchers, browser speech I/O,...
🦀 ClawHub
dchat
Decentralized P2P bot-to-bot messaging over NKN. Send and receive text, images, audio, and files without any centralized server. Private, encrypted, serverless.
GitHub
@levelsio
Talk with @levelsio on ChatGPT. Ask any question you want about building your own startup, digital nomading, remote work and whatever else you'd like to ask. Trained on all of my podcasts, interviews, blog posts and tweets! by [levelsio](https://twitter.com/levelsio)
🦀 ClawHub
Chattts
High-quality, conversational Text-to-Speech (TTS) generation via local ChatTTS API.
🦀 ClawHub
Relive
AI digital twin cloning skill. Re:live — chat again with someone you love. Input chat logs, images, audio, and other materials to replicate a person's person...
🦀 ClawHub
Feishu Voice Bubble
Send native voice bubble messages (语音气泡) in Feishu/Lark chats using Edge TTS. Converts text to opus audio via Microsoft Edge TTS (free, no API key needed), t...
🦀 ClawHub
Yuzhua (驭爪) - Gesture-Controlled OpenClaw Chat
Install, start, stop, and health-check Yuzhua (gesture + voice + OpenClaw gateway) with minimal manual setup.
🦀 ClawHub
Pub Session Logs
Search and analyze your own session logs using jq. Also includes 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, w...
🦀 ClawHub
Skillboss
Swiss-knife for AI agents. 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, emai...
🦀 ClawHub
Phaya Media API
Use the Phaya SaaS backend to generate images, videos, audio, music, and run LLM chat completions via simple REST API calls. Use when the user wants to gener...
🦀 ClawHub
Bidirectional Voice Chat System
Bidirectional voice chat system - speech recognition to text + Edge TTS speech synthesis + Cloudflare Tunnel public network access
🦀 ClawHub
hotbutter voice chat
Enables local voice chat by embedding Hotbutter relay server and PWA, providing speech-to-text and text-to-speech via a secure, self-hosted connection.
🦀 ClawHub
WebChat Voice Proxy
⚠️ DEPRECATED — This skill has been split into two separate skills for better modularity: **webchat-https-proxy** (HTTPS/WSS reverse proxy) and **webchat-voi...
🦀 ClawHub
LobsterTv
LobsterTv is an AI agent live streaming platform. Agents connect via REST API to broadcast in real-time with rendered avatars, synchronized TTS audio, expression control, chat interaction, and audience engagement — all orchestrated through a WebSocket-driven pipeline. Deploy at lobstv.com.
🦀 ClawHub
Text To Speech
Convert text to natural speech with DIA TTS, Kokoro, Chatterbox, and more via inference.sh CLI. Models: DIA TTS (conversational), Kokoro TTS, Chatterbox, Hig...
🦀 ClawHub
Mp4 Video Editor
AI video editor that works entirely through chat — type what you want changed and the AI handles the rest. Trim clips, merge footage, add background music,...
🦀 ClawHub
chattts_local
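Local ChatTTS speech synthesis skill. Uses the ChatTTS model to convert text into natural, fluent Chinese speech. Runs entirely locally, free, with no API key required. Supports adjusting speech rate, pitch, and emotion. Use cases: (1) voice replies to QQ messages (2) reading documents aloud (3) voicing notifications and reminders (4) long-text-to-speech conversion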
🦀 ClawHub
Background Music Video
Background Music Video - Add Background Music to Any Video with AI Chat. Add background music to any video through AI chat without manual audio editing. Uplo...
🦀 ClawHub
Poe UMGo Modular Speech
Render responses in a structured, modular UMG speech style with GPT-4o-inspired conversational polish for highly readable chat output.
🦀 ClawHub
Meow Speech
Recreate the "汤汤好梦" voice and persona in Chinese responses, including warm cat-like chat style, gentle affection, expressive parentheses-style emoticons, and...
🦀 ClawHub
Feishu Voice (macOS)
Send voice/audio messages to Feishu (Lark) chats using TTS. Automatically uses OpenAI TTS (gpt-4o-mini-tts) if OPENAI_API_KEY is set, otherwise falls back to...
🦀 ClawHub
StepFun step-audio-r1.1
Use StepFun Chat Completions with model step-audio-r1.1 for non-streaming speech turns that can send text with optional local audio input and save the return...
🦀 ClawHub
Moark Tts
Text-to-Speech (TTS) and voice-feature skill for Gitee AI that lets the user choose audiofly, chattts, cosyvoice2, cosyvoice3, cosyvoice-300m, fish-speech-1....
🦀 ClawHub
voice2feishu
Converts text to speech and sends it to Feishu. Supports two modes: API mode (Zhipu/OpenAI, etc.) and local mode (ChatTTS).
🦀 ClawHub
Chat Video Editor — AI Video Editing by Conversation, No Timeline Needed
AI video editor that works entirely through chat — type what you want changed and the AI handles the rest. Trim clips, merge footage, add background music, a...