🎁 Get the FREE AI Skills Starter GuideSubscribe →
BytesAgainBytesAgain

All Skills — audio

125 skills in "audio" matching "Transcribe"

🦀 ClawHub25.3k dl
Openai Whisper Api
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
🦀 ClawHub4.6k dl
ElevenLabs Speech-to-Text
Transcribe audio files using ElevenLabs Speech-to-Text (Scribe v2).
🦀 ClawHub3.8k dl
Transcribe audio files via OpenRouter using audio-capable models
Transcribe audio files via OpenRouter using audio-capable models (Gemini, GPT-4o-audio, etc).
🦀 ClawHub3.4k dl
Transcribee 🐝
Transcribe YouTube videos and local audio/video files with speaker diarization. Use when user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts ready for LLM analysis.
🦀 ClawHub3.0k dl
Elevenlabs Transcribe
Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files.
🦀 ClawHub2.8k dl
whatsappVoiceOpenSkill
Real-time WhatsApp voice message processing. Transcribe voice notes to text via Whisper, detect intent, execute handlers, and send responses. Use when building conversational voice interfaces for WhatsApp. Supports English and Hindi, customizable intents (weather, status, commands), automatic language detection, and streaming responses via TTS.
🦀 ClawHub2.2k dl
Cult Of Carcinization
Give your agent a voice — and ears. The Cult of Carcinization is the bot-first gateway to ScrappyLabs TTS and STT. Speak with 20+ voices, design your own from a text description, transcribe audio to text, and evolve into a permanent bot identity. No human signup required.
🦀 ClawHub1.9k dl
Walkie-Talkie Mode
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
🦀 ClawHub1.7k dl
Doubao Asr
Transcribe recorded audio files to text via Doubao Seed-ASR 2.0 (豆包录音文件识别模型2.0) from ByteDance/Volcengine. Best-in-class Chinese speech recognition with spea...
🦀 ClawHub1.7k dl
Video Analyzer (TikTok + YouTube + Instagram)
Analyze videos from TikTok, YouTube, Instagram, Twitter, and others by URL, transcribing audio locally and answering questions about the content.
🦀 ClawHub1.5k dl
Free Groq Voice Recognition
FREE voice recognition using Groq's complimentary Whisper API. Transcribe audio messages to text in 50+ languages at no cost. Perfect for voice-to-text autom...
🦀 ClawHub1.3k dl
Telnyx Stt
Transcribe audio files to text using Telnyx Speech-to-Text API. Use when you need to convert audio recordings, voice messages, or spoken content to text.
🦀 ClawHub1.1k dl
Summarize
Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).
🦀 ClawHub1.1k dl
Funasr Transcribe Skill
Use when the user needs local speech-to-text transcription for audio files, especially Chinese or mixed Chinese-English audio, without relying on cloud trans...
🦀 ClawHub1.0k dl
MH summarize
Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).
🦀 ClawHub848 dl
Percept Listen
Captures ambient audio from wearable devices, transcribes locally, and streams searchable, speaker-tagged conversation data to your OpenClaw agent.
🦀 ClawHub815 dl
VoiceClaw
Local voice I/O for OpenClaw agents. Transcribe inbound audio/voice messages using local Whisper (whisper.cpp) and generate voice replies using local Piper T...
🦀 ClawHub777 dl
Douyin Video Transcribe
Douyin video transcription suite. Extract audio from Douyin/TikTok China videos, transcribe with Whisper, and analyze content. Supports video links, local fi...
🦀 ClawHub753 dl
Volcengine STT
Transcribe audio to text using Volcano Engine (Volcengine/ARK) speech-to-text APIs. Use when the user wants to replace Whisper/OpenAI STT with Volcengine, tr...
🦀 ClawHub717 dl
U2-audio-file-transcriber
Transcribe audio files via UniCloud ASR (云知声语音识别, recorded audio → text) API from UniSound. Supports multiple formats, optimized for finance, customer servic...
🦀 ClawHub666 dl
Whisper AI Audio to Text Transcriber
Turn raw transcripts into structured summaries, meeting minutes, and action items.
🦀 ClawHub647 dl
Youtube Transcriber
One-command YouTube video transcription. Automatically downloads audio and transcribes using OpenAI Whisper API — works even when YouTube subtitles are disab...
🦀 ClawHub610 dl
Groq Voice Transcribe
Transcribe audio files via Groq's OpenAI-compatible speech-to-text API. Use when the user sends voice messages or audio files and you need fast cloud speech-...
🦀 ClawHub589 dl
moss-transcribe-diarize
MOSS 多说话人转写技能。支持 URL / 本地文件 / Base64 音频输入,输出带时间戳与 speaker 的结构化转写结果(JSON、逐段文本、按说话人汇总)。用于会议纪要、访谈录音、多人对话整理。需要 API 凭证(环境变量:MOSS_API_KEY,兼容 MOSI_TTS_API_KEY / MOS...
🦀 ClawHub567 dl
Feishu Voice Loop
Accept text or voice input, transcribe if needed, generate natural OpenAI TTS speech, and send audio output to Feishu chat or web player.
🦀 ClawHub560 dl
Telegram Voice Bot
Telegram bot that transcribes voice messages using Whisper and replies in Chinese with Microsoft Edge text-to-speech.
🦀 ClawHub542 dl
Alicloud Ai Audio Asr
Transcribe non-realtime speech with Alibaba Cloud Model Studio Qwen ASR models (`qwen3-asr-flash`, `qwen-audio-asr`, `qwen3-asr-flash-filetrans`). Use when c...
🦀 ClawHub536 dl
Telegram Whisper Transcribe
Standalone Telegram bot for voice message transcription via OpenAI Whisper API. No LLM overhead — audio goes directly to Whisper and text comes back in 2-5 s...
🦀 ClawHub483 dl
Simple sound-to-text skill locally
Local speech-to-text using OpenAI Whisper. Use when the user needs to: (1) transcribe audio files to text, (2) convert voice messages to written content, (3)...
🦀 ClawHub443 dl
Groq Whisper
Transcribe audio files using Groq's Whisper API (whisper-large-v3). Fast cloud-based speech-to-text with no local model required. Use when receiving voice me...
🦀 ClawHub429 dl
Transcrição e respostas em áudio em PTBR, Português Brasil - Brazillian portuguese transcription and audio answers
Brazilian Portuguese voice auto-reply skill for OpenClaw. Transcribes audio locally with wav2vec2, generates a reply with the local OpenClaw agent by default...
🦀 ClawHub415 dl
Kai YouTube
Download and transcribe YouTube videos using yt-dlp and Whisper CLI, saving audio and transcripts for playback and summary from any YouTube URL.
🦀 ClawHub384 dl
Prompt Refiner
Transforms casual or voice-transcribed user requests into precise, AI-optimized prompts. Handles mixed languages, vague input, and ambiguity. Reduces task ex...
🦀 ClawHub367 dl
HN Podcast Transcribe
Download, transcribe, and archive Hacker News podcast episodes (e.g. "Hacker News Recap" by Wondercraft). Use when: (1) user wants to transcribe HN podcast e...
🦀 ClawHub356 dl
Deapi Audio
Text-to-speech, voice cloning, voice design, and transcribe audio files via deAPI GPU network. Trigger on 'text to speech', 'TTS', 'generate voice', 'read al...
🦀 ClawHub348 dl
Assembly Large Audio Transcriber
Transcribe large audio files (100MB+, up to 1GB/12 hours) with speaker diarization. Uses AssemblyAI API with direct HTTP calls. Supports MP3, WAV, M4A, FLAC,...
🦀 ClawHub330 dl
Douyin Video Transcriber
(已验证) 强大的抖音视频批量转写器,集成了下载、音频提取和本地 Whisper 模型转写功能。
🦀 ClawHub286 dl
transcription-speech-to-text-hebrew
Transcribe audio or video files using the TextOps/Modal API. Use this skill whenever the user wants to transcribe a video or audio file, mentions an mp4/mp3/...
🦀 ClawHub261 dl
audio-transcribe
Transcribe, diarise, translate, post-process, and structure audio/video with AssemblyAI. Use this skill when the user wants AssemblyAI specifically, needs hi...
🦀 ClawHub250 dl
transcribe-he
Transcribe audio or video files using the TextOps/Modal API. Use this skill whenever the user wants to transcribe a video or audio file, mentions an mp4/mp3/...
🦀 ClawHub125 dl
transcribe
Transcribe audio to text using SkillBoss API Hub powered by OpenAI Whisper, requiring an API key and no local model download.
🦀 ClawHub
elevenlabs-transcribe
Official elevenlabs skill: elevenlabs-transcribe. From elevenlabs/skills.
🦀 ClawHub8.9k dl
Video Subtitles
Generate SRT subtitles from video/audio with translation support. Transcribes Hebrew (ivrit.ai) and English (whisper), translates between languages, burns subtitles into video. Use for creating captions, transcripts, or hardcoded subtitles for WhatsApp/social media.
🦀 ClawHub6.2k dl
Voice Transcribe
Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).
🦀 ClawHub4.0k dl
TG Voice Whisper Transcriber
Automation skill for TG Voice Whisper Transcriber.
🦀 ClawHub3.9k dl
Vocal Chat
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
🦀 ClawHub3.8k dl
Agentic Calling
Enable AI agents to autonomously make, receive, transcribe, route, and record phone calls using Twilio with customizable voice messages and IVR support.
🦀 ClawHub3.7k dl
Gemini STT
Transcribe audio files using Google's Gemini API or Vertex AI