Find the Right AI Skill for Any Job
Browse 1+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills: writing
Skills in "writing" matching "Transcribe"
ClawHub · 19.3k dl
YouTube Transcript
Fetch and summarize YouTube video transcripts. Use when asked to summarize, transcribe, or extract content from YouTube videos. Handles transcript fetching via residential IP proxy to bypass YouTube's cloud IP blocks.
ClawHub · 7.7k dl
Video Subtitles
Generate SRT subtitles from video/audio with translation support. Transcribes Hebrew (ivrit.ai) and English (whisper), translates between languages, burns subtitles into video. Use for creating captions, transcripts, or hardcoded subtitles for WhatsApp/social media.
ClawHub · 4.4k dl
TubeScribe
YouTube video summarizer with speaker detection, formatted documents, and audio output. Works out of the box with macOS built-in TTS. Optional recommended tools (pandoc, ffmpeg, mlx-audio) enhance quality. Requires internet for YouTube access. No paid APIs or subscriptions. Use when user sends a YouTube URL or asks to summarize/transcribe a YouTube video.
ClawHub · 4.2k dl
Transcript
Get transcripts from any YouTube video for summarization, research, translation, quoting, or content analysis. Use when the user shares a video link or asks "what did they say", "get the transcript", "transcribe this video", "summarize this video", or wants to analyze spoken content.
ClawHub · 3.1k dl
Transcribee
Transcribe YouTube videos and local audio/video files with speaker diarization. Use when user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts ready for LLM analysis.
ClawHub · 3.1k dl
AudioPod
Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise reduction, speech-to-text transcription, speaker separation, and media extraction. Use when the user needs to generate music/songs/rap from text, split a song into stems/vocals/instruments, generate speech from text, clean up noisy audio, transcribe audio/video, or extract audio from YouTube/URLs. Requires AUDIOPOD_API
ClawHub · 2.9k dl
AssemblyAI advanced speech transcription
Transcribe, diarise, translate, post-process, and structure audio/video with AssemblyAI. Use this skill when the user wants AssemblyAI specifically, needs hi...
ClawHub · 2.6k dl
Speech is Cheap Transcribe
Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps, and customizable output formats.
ClawHub · 2.5k dl
Speech To Text
Transcribe audio to text with Whisper models via inference.sh CLI. Models: Fast Whisper Large V3, Whisper V3 Large. Capabilities: transcription, translation,...
ClawHub · 1.9k dl
Cult Of Carcinization
Give your agent a voice, and ears. The Cult of Carcinization is the bot-first gateway to ScrappyLabs TTS and STT. Speak with 20+ voices, design your own from a text description, transcribe audio to text, and evolve into a permanent bot identity. No human signup required.
ClawHub · 1.3k dl
Speechall command-line tool for fast speech-to-text transcription using multiple providers
Install and use the speechall CLI tool for speech-to-text transcription. Use when the user wants to: (1) transcribe audio or video files to text, (2) install speechall on macOS or Linux, (3) list available STT models and their capabilities, (4) use speaker diarization, subtitles, or other transcription features from the terminal. Triggers on mentions of speechall, audio transcription CLI, or speech-to-text from the command line.
ClawHub · 1.1k dl
Whisper STT
Free local speech-to-text transcription using OpenAI Whisper. Transcribe audio files (mp3, wav, m4a, ogg, etc.) to text without API costs. Use when: (1) User...
ClawHub · 939 dl
Faster Whisper Transcription
Transcribes local voice messages to text using Faster Whisper models for fast, privacy-focused speech recognition on audio files.
ClawHub · 857 dl
Instagram Reels
Download Instagram Reels, transcribe audio, and extract captions. Share a reel URL and get back a full transcript with the original description.
ClawHub · 717 dl
Youtube Transcription Generator
Use VLM Run (vlmrun) to generate transcriptions from YouTube videos. Download a video with yt-dlp, then run vlmrun to transcribe with optional timestamps. VLMRUN_API_KEY must be in .env; follow vlmrun-cli-skill for CLI setup and options.
ClawHub · 702 dl
Speech to Text Transcription
Transcribe audio and video files to text with speaker detection, timestamps, and format conversion.
ClawHub · 620 dl
openclaw-voice
Transcribe audio to text and generate spoken AI responses using Whisper and ElevenLabs via CLI with transcript storage and search.
ClawHub · 580 dl
YouTube Transcript Pipeline Lite
Run a lightweight YouTube transcript workflow: transcribe, attribution cleanup, translation, and packaging with minimal tooling. Use for repeatable transcrip...
ClawHub · 428 dl
Facticity.AI Complete Integration
Complete Facticity.AI integration - fact-check claims, extract claims from content, transcribe links, check link reliability, check credits, and monitor task...
ClawHub · 421 dl
MH openai-whisper-api
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
ClawHub · 416 dl
Video Transcribe
Use when the user wants to transcribe, caption, or get the text content of a video or audio file, e.g. "transcribe this video", "get the transcript", "what...
ClawHub · 400 dl
Youtube Transcribe Skill
Extract subtitles/transcripts from YouTube videos. Triggers: "youtube transcript", "extract subtitles", "video captions", "视频字幕" (video subtitles), "字幕提取" (subtitle extraction), "YouTube转文字" (YouTube to text), "提取字幕" (extract subtitles).
ClawHub · 338 dl
Whisper AI Audio to Text Transcriber
Turn raw transcripts into structured summaries, meeting minutes, and action items.
ClawHub · 298 dl
Gladia YouTube Transcription (Free)
Transcribe speech from YouTube videos or audio URLs into text using Gladia API with up to 10 free hours of monthly transcription. Use when: you need to summa...
ClawHub · 294 dl
Whisper Transcriber
Offline speech-to-text (ASR) using whisper.cpp (whisper-cli) + ffmpeg. Supports batch transcription, timestamps, SRT/TXT/JSON outputs, and model download. Cr...
ClawHub · 292 dl
Voice Transcriber
Voice note transcription and archival for OpenClaw agents. Powered by Deepgram Nova-3. Transcribes audio messages, saves both audio files and text transcript...
ClawHub · 292 dl
OpenRouter Audio
Audio transcription and text-to-speech generation using OpenRouter API. Use when the user needs to transcribe audio files to text or generate speech/audio fr...
ClawHub · 280 dl
Summarize
Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for "transcribe this YouTube/video").
ClawHub · 274 dl
YouTube Transcribe
Transcribe YouTube videos with smart fallback: extracts captions first (fast, free), falls back to local Whisper transcription when no captions available. Au...
ClawHub · 214 dl
Meeting Summarizer
Transcribe meetings with SenseAudio ASR speaker diarization, timestamps, and meeting-note extraction workflows. Use when users need meeting transcription, me...
ClawHub · 209 dl
Podcast Transcribe
For transcript or subtitle requests involving podcast URLs, public audio URLs/files, or raw transcript cleanup. Generates audio + SRT + TXT artifacts and can...
ClawHub · 199 dl
Voice To Protocol Transcriber
Record experimental procedures and observations via voice commands during lab work. Real-time transcription for structured experiment documentation.
ClawHub · 199 dl
Simple sound-to-text skill locally
Local speech-to-text using OpenAI Whisper. Use when the user needs to: (1) transcribe audio files to text, (2) convert voice messages to written content, (3)...
ClawHub · 187 dl
musa-torch-coding
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
ClawHub · 147 dl
Coze Asr
Automatic Speech Recognition (ASR) using Coze API. Use when you need to transcribe audio files to text. Supports Chinese audio transcription via Coze's speec...
ClawHub · 140 dl
Telegram Whisper Transcribe
Standalone Telegram bot for voice message transcription via OpenAI Whisper API. No LLM overhead: audio goes directly to Whisper and text comes back in 2-5 s...
ClawHub · 136 dl
ifly-speed-transcription
Ultra-fast speech transcription using iFLYTEK Speed Transcription API. Transcribe audio files (WAV/PCM/MP3) up to 5 hours in ~20 seconds per hour. Supports C...
ClawHub · 131 dl
Finance OCR Pro
Use this skill when the user asks to OCR, transcribe, extract, or convert the contents of a scanned PDF, image, or office document into Markdown, HTML, DOCX,...
ClawHub · 121 dl
speech-translation
Build, adapt, or run an audio-processing workflow that takes spoken audio, transcribes it with Whisper or faster-whisper, translates the transcript using the...
ClawHub · 116 dl
Local Transcription
Local speech-to-text transcription with Qwen ASR, with transcription routed across your Apple Silicon fleet. Transcribe meetings, voice notes, podcasts with loca...
ClawHub · 108 dl
Aliyun Speech Transcriber
Transcribe publicly accessible audio or video URLs with Aliyun speech services. Use when the user wants speech-to-text via Aliyun DashScope, needs transcript...
ClawHub · 102 dl
Kai YouTube
Download and transcribe YouTube videos using yt-dlp and Whisper CLI, saving audio and transcripts for playback and summary from any YouTube URL.
ClawHub · 86 dl
Bilibili Notion Pipeline Skill
Skill-first Bilibili to Notion pipeline. Download a Bilibili/b23 video, transcribe audio, upload the mp4, create or update a Notion transcript page, write tr...
ClawHub · 80 dl
Best Video To Text Converter
Content creators, journalists, and students can convert video files into transcribed text files using this skill. Accepts MP4, MOV, AVI, WebM up to 500MB, renders on...
ClawHub · 80 dl
Audio Command Executor
Processes inbound audio files, transcribes them, and responds to the resulting text. Converts non-WAV inputs to WAV before transcription.
ClawHub · 73 dl
Auto Subtitle Generator Free Ab2n 0330
Drop a video and watch captions appear automatically: no subscriptions, no watermarks, no hassle. The auto-subtitle-generator-free skill transcribes spoken...
ClawHub · 73 dl
Douyin Content Tracker Skill
Scrapes Douyin creator videos, downloads audio (Playwright+ffmpeg with yt-dlp fallback), and transcribes with Whisper. Covers setup, daily tracking, cookie m...
ClawHub · 72 dl
Auto Subtitle Generator Free Ab Old
Drop a video and watch captions appear in seconds: no subscriptions, no hidden fees. This auto-subtitle-generator-free skill transcribes spoken audio and bu...