Browse AI Agent Skills | BytesAgain


All Skills — audio

109 skills in "audio" matching "Transcribe"

🦀 ClawHub 25.6k dl

Openai Whisper Api

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

🦀 ClawHub 4.0k dl

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

🦀 ClawHub 3.8k dl

Transcribe audio files using Google's Gemini API or Vertex AI

🦀 ClawHub 3.2k dl

Elevenlabs Integration with Openclaw

ClawVox - ElevenLabs voice studio for OpenClaw. Generate speech, transcribe audio, clone voices, create sound effects, and more.

🦀 ClawHub 3.0k dl

Lets you send voice messages to your AI assistant and have it talk back.

Text-to-Speech and Speech-to-Text using ElevenLabs AI. Use when the user wants to convert text to speech, transcribe voice messages, or work with voice in multiple languages. Supports high-quality AI voices and accurate transcription.

🦀 ClawHub 2.3k dl

Cult Of Carcinization

Give your agent a voice — and ears. The Cult of Carcinization is the bot-first gateway to ScrappyLabs TTS and STT. Speak with 20+ voices, design your own from a text description, transcribe audio to text, and evolve into a permanent bot identity. No human signup required.

🦀 ClawHub 1.8k dl

Whisper Transcribe

Transcribe audio files to text using OpenAI Whisper. Supports speech-to-text with auto language detection, multiple output formats (txt, srt, vtt, json), batch processing, and model selection (tiny to large). Use when transcribing audio recordings, podcasts, voice messages, lectures, meetings, or any audio/video file to text. Handles mp3, wav, m4a, ogg, flac, webm, opus, aac formats.

🦀 ClawHub 1.8k dl

Video Analyzer (TikTok + YouTube + Instagram)

Analyze videos from TikTok, YouTube, Instagram, Twitter, and others by URL, transcribing audio locally and answering questions about the content.

🦀 ClawHub 1.5k dl

Instagram Reels

Download Instagram Reels, transcribe audio, and extract captions. Share a reel URL and get back a full transcript with the original description.

🦀 ClawHub 1.3k dl

Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).

🦀 ClawHub 1.1k dl

Video Transcribe - Video to Text

Local video-to-text transcription. Uses OpenAI Whisper for speech recognition; completely free, runs offline, and protects your privacy.

🦀 ClawHub 917 dl

Faster Whisper Gpu

High-performance local speech-to-text transcription using Faster Whisper with NVIDIA GPU acceleration. Transcribe audio files locally without sending data to...

🦀 ClawHub 880 dl

macOS CLI for transcribing audio and video files using local Whisper models or Whisnap Cloud.

🦀 ClawHub 849 dl

Super-Transcribe — Unified Speech-to-Text

Unified speech-to-text skill. Use when the user asks to transcribe audio or video, generate subtitles, identify speakers, translate speech, search transcript...

🦀 ClawHub 828 dl

Douyin Video Transcribe

Douyin video transcription suite. Extract audio from Douyin/TikTok China videos, transcribe with Whisper, and analyze content. Supports video links, local fi...

🦀 ClawHub 804 dl

Transcribe audio and video files using OpenAI Whisper API. Use when user wants to transcribe audio/video files, extract speech from media, or get text from r...

🦀 ClawHub 774 dl

ElevenLabs STT OpenClaw

Transcribe audio files with ElevenLabs Speech-to-Text (Scribe v2) from the local CLI. Supports diarization, events, JSON output, webhooks, and advanced STT o...

🦀 ClawHub 737 dl

Transcribe audio via Groq API (~10x cheaper than OpenAI API)

Transcribe audio via Groq Automatic Speech Recognition (ASR) Models (Whisper).

🦀 ClawHub 713 dl

Cloudflare Whisper Worker

Transcribe audio using a deployed Cloudflare Worker Whisper endpoint. Use when converting voice/audio files (wav, mp3, m4a, ogg, webm) to text through the cu...

🦀 ClawHub 686 dl

Youtube Transcriber

One-command YouTube video transcription. Automatically downloads audio and transcribes using OpenAI Whisper API — works even when YouTube subtitles are disab...

🦀 ClawHub 649 dl

Podcast Transcribe

For transcript or subtitle requests involving podcast URLs, public audio URLs/files, or raw transcript cleanup. Generates audio + SRT + TXT artifacts and can...

🦀 ClawHub 615 dl

Deepgram Voice Workflow

End-to-end voice workflow with Deepgram STT and TTS. Use when transcribing voice messages, generating spoken replies, or building a shell-based audio pipelin...

🦀 ClawHub 583 dl

Alicloud Ai Audio Asr

Transcribe non-realtime speech with Alibaba Cloud Model Studio Qwen ASR models (`qwen3-asr-flash`, `qwen-audio-asr`, `qwen3-asr-flash-filetrans`). Use when c...

🦀 ClawHub 500 dl

Speech Transcriber

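Speech-to-text (STT) tool. Records from the microphone and transcribes locally with Whisper (faster-whisper), or transcribes in the cloud through an OpenAI-compatible API. Trigger words: record audio (录音), speech to text (语音转文字), STT, speech recognition (语音识别), transcribe (转写), recording to text (录音转文字). Platforms: Linux / Windows / macOS.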

🦀 ClawHub 485 dl

Brazilian Portuguese (PT-BR) transcription and audio answers

Brazilian Portuguese voice auto-reply skill for OpenClaw. Transcribes audio locally with wav2vec2, generates a reply with the local OpenClaw agent by default...

🦀 ClawHub 469 dl

speech-translation

Build, adapt, or run an audio-processing workflow that takes spoken audio, transcribes it with Whisper or faster-whisper, translates the transcript using the...

🦀 ClawHub 429 dl

Transforms casual or voice-transcribed user requests into precise, AI-optimized prompts. Handles mixed languages, vague input, and ambiguity. Reduces task ex...

🦀 ClawHub 414 dl

HN Podcast Transcribe

Download, transcribe, and archive Hacker News podcast episodes (e.g. "Hacker News Recap" by Wondercraft). Use when: (1) user wants to transcribe HN podcast e...

🦀 ClawHub 411 dl

Douyin Content Tracker Skill

This skill should be used when the user wants to scrape Douyin (TikTok China) creator content, download audio, and transcribe it with Whisper. Covers first-t...

🦀 ClawHub 397 dl

Assembly Large Audio Transcriber

Transcribe large audio files (100MB+, up to 1GB/12 hours) with speaker diarization. Uses AssemblyAI API with direct HTTP calls. Supports MP3, WAV, M4A, FLAC,...

🦀 ClawHub 390 dl

Download YouTube videos and transcribe audio using local Whisper. Use when you need to extract text from YouTube videos that don't have subtitles, or when yo...

🦀 ClawHub 354 dl

Transcribe audio files (voice notes, recordings, podcasts) to text via the Speechmatics batch transcription API. Use when the user asks to transcribe audio,...

🦀 ClawHub 321 dl

transcription-speech-to-text-hebrew

Transcribe audio or video files using the TextOps/Modal API. Use this skill whenever the user wants to transcribe a video or audio file, mentions an mp4/mp3/...

🦀 ClawHub 294 dl

Transcribe audio with Microsoft's MAI-Transcribe-1 model via Azure AI Speech.

🦀 ClawHub 249 dl

Voice Note Transcriber

Fetch voice note emails via IMAP, transcribe attachments with OpenAI Whisper, and save transcripts to an Obsidian vault's fleeting notes folder.

Voice Transcriber

Voice note transcription and archival for OpenClaw agents. Powered by Deepgram Nova-3. Transcribes audio messages, saves both audio files and text transcript...

Voice Transcriber Pro

Voice note transcription and archival for OpenClaw agents. Powered by Deepgram Nova-3. Transcribes audio messages, saves both audio files and text transcript...

🦀 ClawHub 9.0k dl

Video Subtitles

Generate SRT subtitles from video/audio with translation support. Transcribes Hebrew (ivrit.ai) and English (whisper), translates between languages, burns subtitles into video. Use for creating captions, transcripts, or hardcoded subtitles for WhatsApp/social media.

🦀 ClawHub 4.7k dl

ElevenLabs Speech-to-Text

Transcribe audio files using ElevenLabs Speech-to-Text (Scribe v2).

🦀 ClawHub 3.9k dl

Transcribe audio files via OpenRouter using audio-capable models

Transcribe audio files via OpenRouter using audio-capable models (Gemini, GPT-4o-audio, etc).

🦀 ClawHub 3.9k dl

Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.

🦀 ClawHub 3.5k dl

Transcribee 🐝

Transcribe YouTube videos and local audio/video files with speaker diarization. Use when user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts ready for LLM analysis.

🦀 ClawHub 3.5k dl

AssemblyAI advanced speech transcription

Transcribe, diarise, translate, post-process, and structure audio/video with AssemblyAI. Use this skill when the user wants AssemblyAI specifically, needs hi...

🦀 ClawHub 3.2k dl

Speech is Cheap Transcribe

Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps, and customizable output formats.

🦀 ClawHub 3.1k dl

Automate YouTube video editing: download videos, transcribe with Whisper, analyze content using GPT-4, and create Korean SEO-optimized metadata plus consiste...

🦀 ClawHub 2.7k dl

Local speech-to-text with NVIDIA Parakeet TDT 0.6B v3 (ONNX on CPU). 30x faster than Whisper, 25 languages, auto-detection, OpenAI-compatible API. Use when transcribing audio files, converting speech to text, or processing voice recordings locally without cloud APIs.

🦀 ClawHub 2.6k dl

Audio Transcribe

Auto-transcribe voice messages locally using faster-whisper with selectable Whisper models, no API key required.

🦀 ClawHub 2.2k dl

Transcribe Audio with Parakeet MLX

Local speech-to-text with Parakeet MLX (ASR) for Apple Silicon (no API key).