Find the Right AI Skill for Any Job
Browse 1+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — communication
1 skills in "communication" matching "Transcribe"
🌐 All · coding · devops · api · database · security · data · research · writing · image-gen · video · audio · translation · seo · social-media · email-marketing · advertising · finance · crypto-defi · ecommerce · legal · hr · real-estate · health · education · cooking · travel · gaming · automation · communication · productivity · clawhub · lobehub · dify · mcp
🦀 ClawHub
19.3k dl · YouTube Transcript
Fetch and summarize YouTube video transcripts. Use when asked to summarize, transcribe, or extract content from YouTube videos. Handles transcript fetching via residential IP proxy to bypass YouTube's cloud IP blocks.
🦀 ClawHub
7.7k dl · Video Subtitles
Generate SRT subtitles from video/audio with translation support. Transcribes Hebrew (ivrit.ai) and English (whisper), translates between languages, burns subtitles into video. Use for creating captions, transcripts, or hardcoded subtitles for WhatsApp/social media.
🦀 ClawHub
4.2k dl · Transcript
Get transcripts from any YouTube video — for summarization, research, translation, quoting, or content analysis. Use when the user shares a video link or asks "what did they say", "get the transcript", "transcribe this video", "summarize this video", or wants to analyze spoken content.
🦀 ClawHub
3.1k dl · Transcribee 🐝
Transcribe YouTube videos and local audio/video files with speaker diarization. Use when user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts ready for LLM analysis.
🦀 ClawHub
3.1k dl · AudioPod
Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise reduction, speech-to-text transcription, speaker separation, and media extraction. Use when the user needs to generate music/songs/rap from text, split a song into stems/vocals/instruments, generate speech from text, clean up noisy audio, transcribe audio/video, or extract audio from YouTube/URLs. Requires AUDIOPOD_API
🦀 ClawHub
2.9k dl · AssemblyAI advanced speech transcription
Transcribe, diarise, translate, post-process, and structure audio/video with AssemblyAI. Use this skill when the user wants AssemblyAI specifically, needs hi...
🦀 ClawHub
2.6k dl · Speech is Cheap Transcribe
Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps, and customizable output formats.
🦀 ClawHub
2.5k dl · Speech To Text
Transcribe audio to text with Whisper models via inference.sh CLI. Models: Fast Whisper Large V3, Whisper V3 Large. Capabilities: transcription, translation,...
🦀 ClawHub
2.1k dl · whatsappVoiceOpenSkill
Real-time WhatsApp voice message processing. Transcribe voice notes to text via Whisper, detect intent, execute handlers, and send responses. Use when building conversational voice interfaces for WhatsApp. Supports English and Hindi, customizable intents (weather, status, commands), automatic language detection, and streaming responses via TTS.
🦀 ClawHub
1.6k dl · Telegram Voice To Voice Macos
Telegram voice-to-voice for macOS Apple Silicon: transcribe inbound .ogg voice notes with yap (Speech.framework) and reply with Telegram voice notes via say+ffmpeg. Not compatible with Linux/Windows.
🦀 ClawHub
1.3k dl · Whisper Transcribe
Transcribe audio files to text using OpenAI Whisper. Supports speech-to-text with auto language detection, multiple output formats (txt, srt, vtt, json), batch processing, and model selection (tiny to large). Use when transcribing audio recordings, podcasts, voice messages, lectures, meetings, or any audio/video file to text. Handles mp3, wav, m4a, ogg, flac, webm, opus, aac formats.
🦀 ClawHub
1.3k dl · AssemblyAI Transcriber
Transcribe audio files with speaker diarization (who speaks when). Supports 100+ languages, automatic language detection, and timestamps. Use for meetings, interviews, podcasts, or voice messages. Requires AssemblyAI API key.
🦀 ClawHub
1.3k dl · Speechall command-line tool for fast speech-to-text transcription using multiple providers
Install and use the speechall CLI tool for speech-to-text transcription. Use when the user wants to: (1) transcribe audio or video files to text, (2) install speechall on macOS or Linux, (3) list available STT models and their capabilities, (4) use speaker diarization, subtitles, or other transcription features from the terminal. Triggers on mentions of speechall, audio transcription CLI, or speech-to-text from the command line.
🦀 ClawHub
1.1k dl · Whisper STT
Free local speech-to-text transcription using OpenAI Whisper. Transcribe audio files (mp3, wav, m4a, ogg, etc.) to text without API costs. Use when: (1) User...
🦀 ClawHub
939 dl · Faster Whisper Transcription
Transcribes local voice messages to text using Faster Whisper models for fast, privacy-focused speech recognition on audio files.
🦀 ClawHub
857 dl · Instagram Reels
Download Instagram Reels, transcribe audio, and extract captions. Share a reel URL and get back a full transcript with the original description.
🦀 ClawHub
717 dl · Youtube Transcription Generator
Use VLM Run (vlmrun) to generate transcriptions from YouTube videos. Download a video with yt-dlp, then run vlmrun to transcribe with optional timestamps. VLMRUN_API_KEY must be in .env; follow vlmrun-cli-skill for CLI setup and options.
🦀 ClawHub
705 dl · Pocket AI Integration
Transcribe, index, and semantically search all voice recordings, extracting action items and meeting insights for comprehensive conversation intelligence.
🦀 ClawHub
702 dl · Speech to Text Transcription
Transcribe audio and video files to text with speaker detection, timestamps, and format conversion.
🦀 ClawHub
647 dl · Bili Summary
Download Bilibili videos, extract or transcribe subtitles, and generate AI-powered detailed summaries using Gemini 2.5 Flash.
🦀 ClawHub
620 dl · openclaw-voice
Transcribe audio to text and generate spoken AI responses using Whisper and ElevenLabs via CLI with transcript storage and search.
🦀 ClawHub
580 dl · YouTube Transcript Pipeline Lite
Run a lightweight YouTube transcript workflow: transcribe, attribution cleanup, translation, and packaging with minimal tooling. Use for repeatable transcrip...
🦀 ClawHub
549 dl · B站视频转文字&总结神器-Bilibili video transcribe&summary
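Use when the user provides a Bilibili video link, BV ID, or b23.tv short link and wants to transcribe, extract subtitles from, summarize, or analyze the video. First check the Node.js environment and SILICONFLOW_API_KEY, and try the official subtitles first; if there are no subtitles, fetch the anonymous audio URL, download it as .m4s, and rename it directly to .mp3 with no transcoding; when an API key is available, call 硅基...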
🦀 ClawHub
544 dl · Voice
Voice communication via Telegram. Automatically transcribes incoming voice messages using faster-whisper and replies with TTS voice. Use for all voice-relate...
🦀 ClawHub
464 dl · Telegram Voice Transcribe
Transcribe Telegram voice messages and audio notes into text using the OpenAI Whisper API. Use when (1) a user sends a voice message or audio note via Telegr...
🦀 ClawHub
421 dl · MH openai-whisper-api
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
🦀 ClawHub
416 dl · Video Transcribe
Use when the user wants to transcribe, caption, or get the text content of a video or audio file — e.g. "transcribe this video", "get the transcript", "what...
🦀 ClawHub
400 dl · Youtube Transcribe Skill
Extract subtitles/transcripts from YouTube videos. Triggers: "youtube transcript", "extract subtitles", "video captions", "视频字幕", "字幕提取", "YouTube转文字", "提取字幕".
🦀 ClawHub
393 dl · Voice Note Transcriber Cn V1.1
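Voice note to text tool v1.1 | New: real-time subtitles, multilingual translation, voice tagging, audio clipping, SRT export. Supports real-time transcription and meeting-minutes generation.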
🦀 ClawHub
338 dl · Whisper AI Audio to Text Transcriber
Turn raw transcripts into structured summaries, meeting minutes, and action items.
🦀 ClawHub
298 dl · Gladia YouTube Transcription (Free)
Transcribe speech from YouTube videos or audio URLs into text using Gladia API with up to 10 free hours of monthly transcription. Use when: you need to summa...
🦀 ClawHub
294 dl · Whisper Transcriber
Offline speech-to-text (ASR) using whisper.cpp (whisper-cli) + ffmpeg. Supports batch transcription, timestamps, SRT/TXT/JSON outputs, and model download. Cr...
🦀 ClawHub
292 dl · OpenRouter Audio
Audio transcription and text-to-speech generation using OpenRouter API. Use when the user needs to transcribe audio files to text or generate speech/audio fr...
🦀 ClawHub
292 dl · Voice Transcriber
Voice note transcription and archival for OpenClaw agents. Powered by Deepgram Nova-3. Transcribes audio messages, saves both audio files and text transcript...
🦀 ClawHub
280 dl · Summarize
Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).
🦀 ClawHub
275 dl · Groq Voice Transcriber
Automatically transcribes Telegram voice messages using Groq Whisper API and replies with text generated by an LLM.
🦀 ClawHub
274 dl · YouTube Transcribe
Transcribe YouTube videos with smart fallback: extracts captions first (fast, free), falls back to local Whisper transcription when no captions available. Au...
🦀 ClawHub
243 dl · Voice Memos
Transcribe and organize voice memos with automatic categorization and information extraction. Use when users have voice notes, audio memos, or spoken notes t...
🦀 ClawHub
233 dl · Speech to Text
Transcribe or translate audio files to text using a public Hugging Face Whisper Space over Gradio. Use when the user sends voice notes, audio attachments, me...
🦀 ClawHub
214 dl · Meeting Summarizer
Transcribe meetings with SenseAudio ASR speaker diarization, timestamps, and meeting-note extraction workflows. Use when users need meeting transcription, me...
🦀 ClawHub
209 dl · Podcast Transcribe
For transcript or subtitle requests involving podcast URLs, public audio URLs/files, or raw transcript cleanup. Generates audio + SRT + TXT artifacts and can...
🦀 ClawHub
199 dl · Voice To Protocol Transcriber
Record experimental procedures and observations via voice commands during lab work. Real-time transcription for structured experiment documentation.
🦀 ClawHub
187 dl · musa-torch-coding
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
🦀 ClawHub
147 dl · Coze Asr
Automatic Speech Recognition (ASR) using Coze API. Use when you need to transcribe audio files to text. Supports Chinese audio transcription via Coze's speec...
🦀 ClawHub
140 dl · Telegram Whisper Transcribe
Standalone Telegram bot for voice message transcription via OpenAI Whisper API. No LLM overhead — audio goes directly to Whisper and text comes back in 2-5 s...
🦀 ClawHub
136 dl · ifly-speed-transcription
Ultra-fast speech transcription using iFLYTEK Speed Transcription API. Transcribe audio files (WAV/PCM/MP3) up to 5 hours in ~20 seconds per hour. Supports C...
🦀 ClawHub
121 dl · speech-translation
Build, adapt, or run an audio-processing workflow that takes spoken audio, transcribes it with Whisper or faster-whisper, translates the transcript using the...
🦀 ClawHub
116 dl · Local Transcription
Local speech-to-text transcription with Qwen ASR — transcription routed across your Apple Silicon fleet. Transcribe meetings, voice notes, podcasts with loca...