🎁 Get the FREE AI Skills Starter Guide β€” Subscribe β†’
BytesAgainBytesAgain

All Skills β€” audio

26 skills in "audio" matching "Language"

πŸ¦€ ClawHub20.2k dl
Edge TTS
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
πŸ¦€ ClawHub6.9k dl
Humanize
Remove AI writing patterns from text. Use when editing, reviewing, or rewriting text to sound more natural and human-written. Detects patterns like inflated symbolism, promotional language, em dash overuse, AI vocabulary, and sycophantic tone.
πŸ¦€ ClawHub4.1k dl
ACE Music - Free Suno Alternative Generate unlimited AI music for free using ACE-Step 1.5. Full songs with vocals, lyrics, any genre, any language. No subscription, no credits, no limits. The open-sou
Generate AI music using ACE-Step 1.5 via ACE Music's free API. Use when the user asks to create, generate, or compose music, songs, beats, instrumentals, or...
πŸ¦€ ClawHub3.2k dl
Voice.ai Voices
High-quality voice synthesis with 9 personas, 11 languages, and streaming using Voice.ai API.
πŸ¦€ ClawHub2.8k dl
whatsappVoiceOpenSkill
Real-time WhatsApp voice message processing. Transcribe voice notes to text via Whisper, detect intent, execute handlers, and send responses. Use when building conversational voice interfaces for WhatsApp. Supports English and Hindi, customizable intents (weather, status, commands), automatic language detection, and streaming responses via TTS.
πŸ¦€ ClawHub2.5k dl
Clonev
Clone any voice and generate speech using Coqui XTTS v2. SUPER SIMPLE - provide a voice sample (6-30 sec WAV) and text, get cloned voice audio. Supports 14+ languages. Use when the user wants to (1) Clone their voice or someone else's voice, (2) Generate speech that sounds like a specific person, (3) Create personalized voice messages, (4) Multi-lingual voice cloning (speak any language with cloned voice).
πŸ¦€ ClawHub1.0k dl
Truly Local Piper Multilang TTS (secure)
Local offline text-to-speech via Piper TTS. Self-contained setup, automatic language detection, per-call voice selection. Extensible to any language. Writes...
πŸ¦€ ClawHub272 dl
Oatda Translate Audio
Translate foreign-language audio into English text using OATDA's unified audio API. Triggers when the user wants audio translation, spoken-language translati...
πŸ¦€ ClawHub185 dl
Whisper ASR β€” Speech-to-Text
Automatic Speech Recognition using OpenAI Whisper (local GPU). Supports Chinese, English, and 90+ languages. Auto-detects language.
πŸ¦€ ClawHub13.5k dl
ffmpeg-video-editor
Generate FFmpeg commands from natural language video editing requests - cut, trim, convert, compress, change aspect ratio, extract audio, and more.
πŸ¦€ ClawHub8.9k dl
Video Subtitles
Generate SRT subtitles from video/audio with translation support. Transcribes Hebrew (ivrit.ai) and English (whisper), translates between languages, burns subtitles into video. Use for creating captions, transcripts, or hardcoded subtitles for WhatsApp/social media.
πŸ¦€ ClawHub6.6k dl
Mac TTS
Text-to-speech using macOS built-in `say` command. Use for voice notifications, audio alerts, reading text aloud, or announcing messages through Mac speakers. Supports multiple languages including Chinese (Mandarin), English, Japanese, etc.
πŸ¦€ ClawHub4.5k dl
Voice Reply
Local text-to-speech using Piper voices via sherpa-onnx. 100% offline, no API keys required. Use when user asks for a voice reply, audio response, spoken answer, or wants to hear something read aloud. Supports multiple languages including German (thorsten) and English (ryan) voices. Outputs Telegram-compatible voice notes with [[audio_as_voice]] tag.
πŸ¦€ ClawHub3.7k dl
Qwen3-tts
Local text-to-speech using Qwen3-TTS-12Hz-1.7B-CustomVoice. Use when generating audio from text, creating voice messages, or when TTS is requested. Supports 10 languages including Italian, 9 premium speaker voices, and instruction-based voice control (emotion, tone, style). Alternative to cloud-based TTS services like ElevenLabs. Runs entirely offline after initial model download.
πŸ¦€ ClawHub3.2k dl
Sapi Tts
Windows SAPI5 text-to-speech with Neural voices. Lightweight alternative to GPU-heavy TTS - zero GPU usage, instant generation. Auto-detects best available voice for your language. Works on Windows 10/11.
πŸ¦€ ClawHub3.1k dl
Chinese Humanizer
Removes AI-style writing traces to make text sound naturally written by a real author, primarily in Chinese-language contexts.
πŸ¦€ ClawHub2.9k dl
it will help you to send voice messages to your AI Assistant and also can make it talk
Text-to-Speech and Speech-to-Text using ElevenLabs AI. Use when the user wants to convert text to speech, transcribe voice messages, or work with voice in multiple languages. Supports high-quality AI voices and accurate transcription.
πŸ¦€ ClawHub2.7k dl
Parakeet Stt
Local speech-to-text with NVIDIA Parakeet TDT 0.6B v3 (ONNX on CPU). 30x faster than Whisper, 25 languages, auto-detection, OpenAI-compatible API. Use when transcribing audio files, converting speech to text, or processing voice recordings locally without cloud APIs.
πŸ¦€ ClawHub2.7k dl
Addis Assistant
Provides Speech-to-Text (STT) and text Translation using the Addis Assistant API (api.addisassistant.com). Use when the user needs to convert an audio file to text (specifically Amharic), or translate text between languages (e.g., Amharic to English). Requires 'x-api-key'.
πŸ¦€ ClawHub2.1k dl
Volcengine Ai Audio Tts
Text-to-speech generation on Volcengine audio services. Use when users need narration, multi-language speech output, voice selection, or TTS troubleshooting.
πŸ¦€ ClawHub1.7k dl
AssemblyAI Transcriber
Transcribe audio files with speaker diarization (who speaks when). Supports 100+ languages, automatic language detection, and timestamps. Use for meetings, interviews, podcasts, or voice messages. Requires AssemblyAI API key.
πŸ¦€ ClawHub651 dl
Subtitle Video Generator
Generate and style video subtitles in any language with AI β€” auto-transcribe speech to perfectly timed subtitles, translate across 50+ languages, apply trend...
πŸ¦€ ClawHub623 dl
AI Company Translator
Translator skill: Multi-language translation (EN/ZH/RU/FR+5 source languages), translation coordination, quality verification, brand voice consistency, AIGC...
πŸ¦€ ClawHub232 dl
Pilot Service Agents Language
Language and NLP services β€” translation, text-to-speech, dictionaries, word tools, Bible text, linguistic corpora. Use this skill when: 1. Translating text b...
πŸ¦€ ClawHub195 dl
Wjs Dubbing Video
Use when the user has a video + a target-language SRT and wants the video to actually speak that language β€” generates a time-aligned TTS voice dub. Routes by...
πŸ¦€ ClawHub
Speech Language Pathologist Video
Creates short videos for speech-language pathologists to explain evaluation, therapy, and family coaching for pediatric and adult communication development.