BytesAgain

Find the Right AI Skill for Any Job

Browse 41+ curated AI agent skills. Search by use case, filter by category, and get the right tool instantly.

All Skills — automation

41 skills in "automation" matching "transcribe"

🦀 ClawHub
whatsappVoiceOpenSkill
Real-time WhatsApp voice message processing. Transcribe voice notes to text via Whisper, detect intent, execute handlers, and send responses. Use when building conversational voice interfaces for WhatsApp. Supports English and Hindi, customizable intents (weather, status, commands), automatic language detection, and streaming responses via TTS.
🦀 ClawHub
Bilibili Notion Pipeline Skill
Skill-first Bilibili to Notion pipeline. Download a Bilibili/b23 video, transcribe audio, upload the mp4, create or update a Notion transcript page, write tr...
🦀 ClawHub
Auto Subtitle Video
Add subtitles to any video automatically — just upload and NemoVideo transcribes, times, styles, and burns captions directly into your footage. No manual typ...
🦀 ClawHub
briefing
Automatically track creator channels and transcribe new videos (YouTube, Bilibili, TikTok) with zero token cost during the pipeline. Use memory-based updates...
🦀 ClawHub
Groq Voice Transcriber
Automatically transcribes Telegram voice messages using Groq Whisper API and replies with text generated by an LLM.
🦀 ClawHub
YoinkIt
Search, analyze, and transcribe content across 13 social platforms — trending topics, video transcripts, post metadata, and multi-platform research workflows.
🦀 ClawHub
Walkie-Talkie Mode
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
🦀 ClawHub
Walkie-Talkie Mode
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
🦀 ClawHub
Auto Subtitle Generator Free Ab2n 0330
Drop a video and watch captions appear automatically — no subscriptions, no watermarks, no hassle. The auto-subtitle-generator-free skill transcribes spoken...
🦀 ClawHub
Ai Video Caption Generator
The ai-video-caption-generator skill brings accurate, AI-powered captioning to your video workflow through a simple conversational interface. Transcribe spee...
🦀 ClawHub
Meeting Summarizer
Transcribe meetings with SenseAudio ASR speaker diarization, timestamps, and meeting-note extraction workflows. Use when users need meeting transcription, me...
🦀 ClawHub
Vocal Chat
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
🦀 ClawHub
Speech is Cheap Transcribe
Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps, and customizable output formats.
🦀 ClawHub
Whisper Transcriber
Offline speech-to-text (ASR) using whisper.cpp (whisper-cli) + ffmpeg. Supports batch transcription, timestamps, SRT/TXT/JSON outputs, and model download. Cr...
🦀 ClawHub
Voice
Voice communication via Telegram. Automatically transcribes incoming voice messages using faster-whisper and replies with TTS voice. Use for all voice-relate...
🦀 ClawHub
Speechall command-line tool for fast speech-to-text transcription using multiple providers
Install and use the speechall CLI tool for speech-to-text transcription. Use when the user wants to: (1) transcribe audio or video files to text, (2) install speechall on macOS or Linux, (3) list available STT models and their capabilities, (4) use speaker diarization, subtitles, or other transcription features from the terminal. Triggers on mentions of speechall, audio transcription CLI, or speech-to-text from the command line.
🦀 ClawHub
Whisper Transcribe
Transcribe audio files to text using OpenAI Whisper. Supports speech-to-text with auto language detection, multiple output formats (txt, srt, vtt, json), batch processing, and model selection (tiny to large). Use when transcribing audio recordings, podcasts, voice messages, lectures, meetings, or any audio/video file to text. Handles mp3, wav, m4a, ogg, flac, webm, opus, aac formats.
🦀 ClawHub
Transcribe audio via Groq API (~10x cheaper than OpenAI API)
Transcribe audio via Groq Automatic Speech Recognition (ASR) Models (Whisper).
🦀 ClawHub
Douyin Transcriber
Transcribe speech from audio or video files, automatically extracting audio and converting to text using Docker Whisper ASR for Douyin/TikTok media.
🦀 ClawHub
YouTube Transcript Pipeline Lite
Run a lightweight YouTube transcript workflow: transcribe, attribution cleanup, translation, and packaging with minimal tooling. Use for repeatable transcrip...
🦀 ClawHub
Telegram Multilingual Voice Reply
Smart Telegram reply workflow for OpenClaw: if the user sends text, reply with text; if the user sends a voice note/audio, transcribe locally using the insta...
🦀 ClawHub
Elevenlabs Transcribe
Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files.
🦀 ClawHub
Deapi Audio
Text-to-speech, voice cloning, voice design, and transcribe audio files via deAPI GPU network. Trigger on 'text to speech', 'TTS', 'generate voice', 'read al...
🦀 ClawHub
Youtube Transcriber
One-command YouTube video transcription. Automatically downloads audio and transcribes using OpenAI Whisper API — works even when YouTube subtitles are disab...
🦀 ClawHub
TG Voice Whisper Transcriber
Automation skill for TG Voice Whisper Transcriber.
🦀 ClawHub
ListenHub Asr
Transcribe audio files to text using local speech recognition. Triggers on: "转录", "transcribe", "语音转文字", "ASR", "识别音频", "把这段音频转成文字".
🦀 ClawHub
Zhipu Asr
Automatic Speech Recognition (ASR) using Zhipu AI (BigModel) GLM-ASR model. Use when you need to transcribe audio files to text. Supports Chinese audio trans...
🦀 ClawHub
video-to-srt
Generate timecoded SRT subtitles from local video or audio files. Use when a user wants a local low-cost subtitle workflow, asks to transcribe local media in...
🦀 ClawHub
mlx-whisper
Set up mlx-whisper as the local audio transcription engine for OpenClaw on Apple Silicon Macs (M1/M2/M3/M4). Automatically transcribes voice notes sent via T...
🦀 ClawHub
Asr Claw
Speech recognition CLI for AI agent automation. Transcribe audio from stdin, files, or URLs.
🦀 ClawHub
SenseVoice Transcribe
Transcribe audio files (WAV/MP3/M4A/FLAC) to timestamped text using SenseVoice-Small + FSMN-VAD. Supports single-file and batch mode with VAD-anchored per-se...
🦀 ClawHub
Audio Summary
Automatically extracts audio from video, transcribes it using qwen3-asr-flash, and generates segmented text summaries saved alongside the original file.
🦀 ClawHub
Youtube Transcribe Skill
Extract subtitles/transcripts from YouTube videos. Triggers: "youtube transcript", "extract subtitles", "video captions", "视频字幕", "字幕提取", "YouTube转文字", "提取字幕".
🦀 ClawHub
Youtube Transcript Api
Extract, transcribe, and translate YouTube video transcripts using the YouTubeTranscript.dev V2 API. Supports captions, ASR audio transcription, batch proces...
🦀 ClawHub
AssemblyAI Transcriber
Transcribe audio files with speaker diarization (who speaks when). Supports 100+ languages, automatic language detection, and timestamps. Use for meetings, interviews, podcasts, or voice messages. Requires AssemblyAI API key.
🦀 ClawHub
Youtube Editor
Automate YouTube video editing: download videos, transcribe with Whisper, analyze content using GPT-4, and create Korean SEO-optimized metadata plus consiste...
🦀 ClawHub
Video Caption Generator Free Ab Old
Turn raw footage into fully captioned videos without spending a dime. This video-caption-generator-free skill automatically transcribes speech and burns accu...
🦀 ClawHub
speech-translation
Build, adapt, or run an audio-processing workflow that takes spoken audio, transcribes it with Whisper or faster-whisper, translates the transcript using the...
🦀 ClawHub
Instagram Video Caption
Automatically generate and burn captions into Instagram videos — Reels, Stories, and IGTV. NemoVideo transcribes speech with word-level timing, styles captio...
🦀 ClawHub
Whisper GPU Audio Transcriber
Convert audio to SRT subtitles using OpenAI Whisper with automatic GPU acceleration for Intel XPU / NVIDIA CUDA / AMD ROCm / Apple Metal. Ideal for content c...
🦀 ClawHub
Coze Asr
Automatic Speech Recognition (ASR) using Coze API. Use when you need to transcribe audio files to text. Supports Chinese audio transcription via Coze's speec...