BytesAgainBytesAgain

Find the Right AI Skill for Any Job

Browse 87+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

Browse by Use Case β†’Pick My Role

All Skills β€” writing

87 skills in "writing" matching "transcribe"

πŸ¦€ ClawHub613 dl
Gemini Video Analyzer
Native video analysis using Google Gemini API. Upload and analyze video files β€” describe scenes, extract text/UI, answer questions about content, transcribe...
πŸ¦€ ClawHub
Bilibili Notion Pipeline Skill
Skill-first Bilibili to Notion pipeline. Download a Bilibili/b23 video, transcribe audio, upload the mp4, create or update a Notion transcript page, write tr...
πŸ¦€ ClawHub
YouTube Transcript
Fetch and summarize YouTube video transcripts. Use when asked to summarize, transcribe, or extract content from YouTube videos. Handles transcript fetching via residential IP proxy to bypass YouTube's cloud IP blocks.
πŸ¦€ ClawHub
Video Subtitles
Generate SRT subtitles from video/audio with translation support. Transcribes Hebrew (ivrit.ai) and English (whisper), translates between languages, burns subtitles into video. Use for creating captions, transcripts, or hardcoded subtitles for WhatsApp/social media.
πŸ¦€ ClawHub
AudioPod
Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise reduction, speech-to-text transcription, speaker separation, and media extraction. Use when the user needs to generate music/songs/rap from text, split a song into stems/vocals/instruments, generate speech from text, clean up noisy audio, transcribe audio/video, or extract audio from YouTube/URLs. Requires AUDIOPOD_API
πŸ¦€ ClawHub
Local Transcription
Local speech-to-text transcription with Qwen ASR β€” transcription routed across your Apple Silicon fleet. Transcribe meetings, voice notes, podcasts with loca...
πŸ¦€ ClawHub
Telnyx Stt
Transcribe audio files to text using Telnyx Speech-to-Text API. Use when you need to convert audio recordings, voice messages, or spoken content to text.
πŸ¦€ ClawHub
Audio Command Executor
Processes inbound audio files, transcribes them, and answers to resulting texts. Converts non-WAV inputs to WAV before transcription.
πŸ¦€ ClawHub
Whisper STT
Free local speech-to-text transcription using OpenAI Whisper. Transcribe audio files (mp3, wav, m4a, ogg, etc.) to text without API costs. Use when: (1) User...
πŸ¦€ ClawHub
Finance OCR Pro
Use this skill when the user asks to OCR, transcribe, extract, or convert the contents of a scanned PDF, image, or office document into Markdown, HTML, DOCX,...
πŸ¦€ ClawHub
Voice-to-Protocol Transcriber
Record experimental procedures and observations via voice commands during lab work. Real-time transcription for structured experiment documentation.
πŸ¦€ ClawHub
Audio Intelligence Mcp
Transcribe, summarize, and analyze audio files using local Whisper + Qwen. Returns transcript, segments, and action items.
πŸ¦€ ClawHub
YoinkIt
Search, analyze, and transcribe content across 13 social platforms β€” trending topics, video transcripts, post metadata, and multi-platform research workflows.
πŸ¦€ ClawHub
Openai Whisper Api
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
πŸ¦€ ClawHub
Auto Subtitle Generator Free Ab2n 0330
Drop a video and watch captions appear automatically β€” no subscriptions, no watermarks, no hassle. The auto-subtitle-generator-free skill transcribes spoken...
πŸ¦€ ClawHub
OpenRouter Audio
Audio transcription and text-to-speech generation using OpenRouter API. Use when the user needs to transcribe audio files to text or generate speech/audio fr...
πŸ¦€ ClawHub
musa-torch-coding
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
πŸ¦€ ClawHub
Youtube Video To Text
Transcribe any YouTube video to text using AI β€” get a full transcript, timestamped SRT captions, chapter summaries, and key-point extraction from any YouTube...
πŸ¦€ ClawHub
TubeScribe
YouTube video summarizer with speaker detection, formatted documents, and audio output. Works out of the box with macOS built-in TTS. Optional recommended tools (pandoc, ffmpeg, mlx-audio) enhance quality. Requires internet for YouTube access. No paid APIs or subscriptions. Use when user sends a YouTube URL or asks to summarize/transcribe a YouTube video.
πŸ¦€ ClawHub
Meeting Summarizer
Transcribe meetings with SenseAudio ASR speaker diarization, timestamps, and meeting-note extraction workflows. Use when users need meeting transcription, me...
πŸ¦€ ClawHub
Speech is Cheap Transcribe
Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps, and customizable output formats.
πŸ¦€ ClawHub
Funasr Transcribe Skill
Use when the user needs local speech-to-text transcription for audio files, especially Chinese or mixed Chinese-English audio, without relying on cloud trans...
πŸ¦€ ClawHub
Whisper Transcriber
Offline speech-to-text (ASR) using whisper.cpp (whisper-cli) + ffmpeg. Supports batch transcription, timestamps, SRT/TXT/JSON outputs, and model download. Cr...
πŸ¦€ ClawHub
Speechall command-line tool for fast speech-to-text transcription using multiple providers
Install and use the speechall CLI tool for speech-to-text transcription. Use when the user wants to: (1) transcribe audio or video files to text, (2) install speechall on macOS or Linux, (3) list available STT models and their capabilities, (4) use speaker diarization, subtitles, or other transcription features from the terminal. Triggers on mentions of speechall, audio transcription CLI, or speech-to-text from the command line.
πŸ¦€ ClawHub
Douyin Content Tracker Skill
This skill should be used when the user wants to scrape Douyin (TikTok China) creator content, download audio, and transcribe it with Whisper. Covers first-t...
πŸ¦€ ClawHub
Kai YouTube
Download and transcribe YouTube videos using yt-dlp and Whisper CLI, saving audio and transcripts for playback and summary from any YouTube URL.
πŸ¦€ ClawHub
MH summarize
Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for β€œtranscribe this YouTube/video”).
πŸ¦€ ClawHub
Speech to Text Transcription
Transcribe audio and video files to text with speaker detection, timestamps, and format conversion.
πŸ¦€ ClawHub
Aliyun Speech Transcriber
Transcribe publicly accessible audio or video URLs with Aliyun speech services. Use when the user wants speech-to-text via Aliyun DashScope, needs transcript...
πŸ¦€ ClawHub
Facticity.AI Complete Integration
Complete Facticity.AI integration - fact-check claims, extract claims from content, transcribe links, check link reliability, check credits, and monitor task...
πŸ¦€ ClawHub
Voice Transcriber
Voice note transcription and archival for OpenClaw agents. Powered by Deepgram Nova-3. Transcribes audio messages, saves both audio files and text transcript...
⭐ GitHub
Showtimes
Transcribes and summarizes audio content.
πŸ¦€ ClawHub
Transcript
Get transcripts from any YouTube video β€” for summarization, research, translation, quoting, or content analysis. Use when the user shares a video link or asks "what did they say", "get the transcript", "transcribe this video", "summarize this video", or wants to analyze spoken content.
πŸ¦€ ClawHub
MH openai-whisper-api
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
πŸ¦€ ClawHub
it will help you to send voice messages to your AI Assistant and also can make it talk
Text-to-Speech and Speech-to-Text using ElevenLabs AI. Use when the user wants to convert text to speech, transcribe voice messages, or work with voice in multiple languages. Supports high-quality AI voices and accurate transcription.
πŸ¦€ ClawHub
TL;DX
Extract, transcribe, clean, segment, and analyze long-form content from URLs, local media files, existing transcripts, and pasted text. Use when a user provi...
πŸ¦€ ClawHub
Transcribe
Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.
πŸ¦€ ClawHub
Bilibili Transcript
Transcribe Bilibili videos to text with high accuracy using Whisper medium model. Use when the user provides a Bilibili video URL (BVxxxxx) and wants to: (1)...
πŸ¦€ ClawHub
Simple sound-to-text skill locally
Local speech-to-text using OpenAI Whisper. Use when the user needs to: (1) transcribe audio files to text, (2) convert voice messages to written content, (3)...
πŸ¦€ ClawHub
Youtube Transcription Generator
Use VLM Run (vlmrun) to generate transcriptions from YouTube videos. Download a video with yt-dlp, then run vlmrun to transcribe with optional timestamps. VLMRUN_API_KEY must be in .env; follow vlmrun-cli-skill for CLI setup and options.
πŸ¦€ ClawHub
Gladia YouTube Transcription (Free)
Transcribe speech from YouTube videos or audio URLs into text using Gladia API with up to 10 free hours of monthly transcription. Use when: you need to summa...
πŸ¦€ ClawHub
YouTube Transcribe
Transcribe YouTube videos with smart fallback: extracts captions first (fast, free), falls back to local Whisper transcription when no captions available. Au...
πŸ¦€ ClawHub
YouTube Transcript Pipeline Lite
Run a lightweight YouTube transcript workflow: transcribe, attribution cleanup, translation, and packaging with minimal tooling. Use for repeatable transcrip...
πŸ¦€ ClawHub
Super-Transcribe β€” Unified Speech-to-Text
Unified speech-to-text skill. Use when the user asks to transcribe audio or video, generate subtitles, identify speakers, translate speech, search transcript...
πŸ¦€ ClawHub
Elevenlabs Transcribe
Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files.
πŸ¦€ ClawHub
Transcribee 🐝
Transcribe YouTube videos and local audio/video files with speaker diarization. Use when user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts ready for LLM analysis.
πŸ¦€ ClawHub
AIML Voice Transcript
Transcribe audio files (ogg, mp3, wav, etc.) using AIMLAPI. Use when the user provides audio messages or local audio files. Provides a reliable Python script...
πŸ¦€ ClawHub
AssemblyAI advanced speech transcription
Transcribe, diarise, translate, post-process, and structure audio/video with AssemblyAI. Use this skill when the user wants AssemblyAI specifically, needs hi...
Page 1 / 2 (87 skills)Next β†’