BytesAgainBytesAgain

Find the Right AI Skill for Any Job

Browse 2+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

Browse by Use Case →Pick My Role

All Skills — automation

2 skills in "automation" matching "Transcribe"

🦀 ClawHub3.5k dl
Vocal Chat
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
🦀 ClawHub2.6k dl
Speech is Cheap Transcribe
Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps, and customizable output formats.
🦀 ClawHub2.5k dl
Walkie-Talkie Mode
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
🦀 ClawHub2.1k dl
whatsappVoiceOpenSkill
Real-time WhatsApp voice message processing. Transcribe voice notes to text via Whisper, detect intent, execute handlers, and send responses. Use when building conversational voice interfaces for WhatsApp. Supports English and Hindi, customizable intents (weather, status, commands), automatic language detection, and streaming responses via TTS.
🦀 ClawHub1.7k dl
Walkie-Talkie Mode
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
🦀 ClawHub1.5k dl
Walkie-Talkie Mode
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
🦀 ClawHub1.3k dl
Whisper Transcribe
Transcribe audio files to text using OpenAI Whisper. Supports speech-to-text with auto language detection, multiple output formats (txt, srt, vtt, json), batch processing, and model selection (tiny to large). Use when transcribing audio recordings, podcasts, voice messages, lectures, meetings, or any audio/video file to text. Handles mp3, wav, m4a, ogg, flac, webm, opus, aac formats.
🦀 ClawHub1.3k dl
AssemblyAI Transcriber
Transcribe audio files with speaker diarization (who speaks when). Supports 100+ languages, automatic language detection, and timestamps. Use for meetings, interviews, podcasts, or voice messages. Requires AssemblyAI API key.
🦀 ClawHub1.3k dl
Speechall command-line tool for fast speech-to-text transcription using multiple providers
Install and use the speechall CLI tool for speech-to-text transcription. Use when the user wants to: (1) transcribe audio or video files to text, (2) install speechall on macOS or Linux, (3) list available STT models and their capabilities, (4) use speaker diarization, subtitles, or other transcription features from the terminal. Triggers on mentions of speechall, audio transcription CLI, or speech-to-text from the command line.
🦀 ClawHub798 dl
Zhipu Asr
Automatic Speech Recognition (ASR) using Zhipu AI (BigModel) GLM-ASR model. Use when you need to transcribe audio files to text. Supports Chinese audio trans...
🦀 ClawHub580 dl
YouTube Transcript Pipeline Lite
Run a lightweight YouTube transcript workflow: transcribe, attribution cleanup, translation, and packaging with minimal tooling. Use for repeatable transcrip...
🦀 ClawHub544 dl
Voice
Voice communication via Telegram. Automatically transcribes incoming voice messages using faster-whisper and replies with TTS voice. Use for all voice-relate...
🦀 ClawHub400 dl
Youtube Transcribe Skill
Extract subtitles/transcripts from YouTube videos. Triggers: "youtube transcript", "extract subtitles", "video captions", "视频字幕", "字幕提取", "YouTube转文字", "提取字幕".
🦀 ClawHub294 dl
Whisper Transcriber
Offline speech-to-text (ASR) using whisper.cpp (whisper-cli) + ffmpeg. Supports batch transcription, timestamps, SRT/TXT/JSON outputs, and model download. Cr...
🦀 ClawHub275 dl
Groq Voice Transcriber
Automatically transcribes Telegram voice messages using Groq Whisper API and replies with text generated by an LLM.
🦀 ClawHub243 dl
Voice Memos
Transcribe and organize voice memos with automatic categorization and information extraction. Use when users have voice notes, audio memos, or spoken notes t...
🦀 ClawHub214 dl
Meeting Summarizer
Transcribe meetings with SenseAudio ASR speaker diarization, timestamps, and meeting-note extraction workflows. Use when users need meeting transcription, me...
🦀 ClawHub205 dl
ListenHub Asr
Transcribe audio files to text using local speech recognition. Triggers on: "转录", "transcribe", "语音转文字", "ASR", "识别音频", "把这段音频转成文字".
🦀 ClawHub147 dl
Coze Asr
Automatic Speech Recognition (ASR) using Coze API. Use when you need to transcribe audio files to text. Supports Chinese audio transcription via Coze's speec...
🦀 ClawHub142 dl
Instagram Video Caption
Automatically generate and burn captions into Instagram videos — Reels, Stories, and IGTV. NemoVideo transcribes speech with word-level timing, styles captio...
🦀 ClawHub123 dl
Auto Subtitle Video
Add subtitles to any video automatically — just upload and NemoVideo transcribes, times, styles, and burns captions directly into your footage. No manual typ...
🦀 ClawHub122 dl
Deapi Audio
Text-to-speech, voice cloning, voice design, and transcribe audio files via deAPI GPU network. Trigger on 'text to speech', 'TTS', 'generate voice', 'read al...
🦀 ClawHub121 dl
speech-translation
Build, adapt, or run an audio-processing workflow that takes spoken audio, transcribes it with Whisper or faster-whisper, translates the transcript using the...
🦀 ClawHub86 dl
Bilibili Notion Pipeline Skill
Skill-first Bilibili to Notion pipeline. Download a Bilibili/b23 video, transcribe audio, upload the mp4, create or update a Notion transcript page, write tr...
🦀 ClawHub76 dl
Auto Quotation System OpenClaw
Build a reusable quotation workflow for software projects from markdown requirements, feature outlines, or mind-map screenshots that have been transcribed in...
🦀 ClawHub73 dl
Auto Subtitle Generator Free Ab2n 0330
Drop a video and watch captions appear automatically — no subscriptions, no watermarks, no hassle. The auto-subtitle-generator-free skill transcribes spoken...
🦀 ClawHub72 dl
Douyin Transcriber
Transcribe speech from audio or video files, automatically extracting audio and converting to text using Docker Whisper ASR for Douyin/TikTok media.
🦀 ClawHub71 dl
Video Caption Generator Free Ab Old
Turn raw footage into fully captioned videos without spending a dime. This video-caption-generator-free skill automatically transcribes speech and burns accu...
🦀 ClawHub
Ai Video Caption Generator
The ai-video-caption-generator skill brings accurate, AI-powered captioning to your video workflow through a simple conversational interface. Transcribe spee...
🦀 ClawHub
Whisper GPU Audio Transcriber
Convert audio to SRT subtitles using OpenAI Whisper with automatic GPU acceleration for Intel XPU / NVIDIA CUDA / AMD ROCm / Apple Metal. Ideal for content c...
🦀 ClawHub
Elevenlabs Transcribe
Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files.
🦀 ClawHub
SenseVoice Transcribe
Transcribe audio files (WAV/MP3/M4A/FLAC) to timestamped text using SenseVoice-Small + FSMN-VAD. Supports single-file and batch mode with VAD-anchored per-se...
🦀 ClawHub
Youtube Transcriber
One-command YouTube video transcription. Automatically downloads audio and transcribes using OpenAI Whisper API — works even when YouTube subtitles are disab...
🦀 ClawHub
TG Voice Whisper Transcriber
Automation skill for TG Voice Whisper Transcriber.
🦀 ClawHub
Audio Summary
Automatically extracts audio from video, transcribes it using qwen3-asr-flash, and generates segmented text summaries saved alongside the original file.
🦀 ClawHub
mlx-whisper
Set up mlx-whisper as the local audio transcription engine for OpenClaw on Apple Silicon Macs (M1/M2/M3/M4). Automatically transcribes voice notes sent via T...
🦀 ClawHub
YoinkIt
Search, analyze, and transcribe content across 13 social platforms — trending topics, video transcripts, post metadata, and multi-platform research workflows.
🦀 ClawHub
Telegram Multilingual Voice Reply
Smart Telegram reply workflow for OpenClaw: if the user sends text, reply with text; if the user sends a voice note/audio, transcribe locally using the insta...
🦀 ClawHub
MCBAI Douyin Dubber
Auto-dub Douyin/TikTok videos into any language using a fully local pipeline: download with Playwright Chromium + Douyin cookie → transcribe with Whisper → t...
🦀 ClawHub
Youtube Transcript Api
Extract, transcribe, and translate YouTube video transcripts using the YouTubeTranscript.dev V2 API. Supports captions, ASR audio transcription, batch proces...
🦀 ClawHub
Youtube Editor
Automate YouTube video editing: download videos, transcribe with Whisper, analyze content using GPT-4, and create Korean SEO-optimized metadata plus consiste...
🦀 ClawHub
Transcribe audio via Groq API (~10x cheaper than OpenAI API)
Transcribe audio via Groq Automatic Speech Recognition (ASR) Models (Whisper).
🦀 ClawHub
video-to-srt
Generate timecoded SRT subtitles from local video or audio files. Use when a user wants a local low-cost subtitle workflow, asks to transcribe local media in...
🦀 ClawHub
Asr Claw
Speech recognition CLI for AI agent automation. Transcribe audio from stdin, files, or URLs.