BytesAgainBytesAgain

Find the Right AI Skill for Any Job

Browse 90+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

Browse by Use Case β†’Pick My Role

All Skills β€” clawhub

90 skills in "clawhub" matching "transcription"

πŸ¦€ ClawHub
it will help you to send voice messages to your AI Assistant and also can make it talk
Text-to-Speech and Speech-to-Text using ElevenLabs AI. Use when the user wants to convert text to speech, transcribe voice messages, or work with voice in multiple languages. Supports high-quality AI voices and accurate transcription.
πŸ¦€ ClawHub
Argmax Transcription and TTS
On-device speech-to-text (Whisper) + text-to-speech (Qwen3-TTS) CLI. Runs on the Apple Neural Engine (ANE), Apple's low power, dedicated ML inference chip. M...
πŸ¦€ ClawHub
Youtube Transcription Generator
Use VLM Run (vlmrun) to generate transcriptions from YouTube videos. Download a video with yt-dlp, then run vlmrun to transcribe with optional timestamps. VLMRUN_API_KEY must be in .env; follow vlmrun-cli-skill for CLI setup and options.
πŸ¦€ ClawHub
Gladia YouTube Transcription (Free)
Transcribe speech from YouTube videos or audio URLs into text using Gladia API with up to 10 free hours of monthly transcription. Use when: you need to summa...
πŸ¦€ ClawHub
YouTube Transcribe
Transcribe YouTube videos with smart fallback: extracts captions first (fast, free), falls back to local Whisper transcription when no captions available. Au...
πŸ¦€ ClawHub
Qcut Video Edit
Run QCut's native TypeScript pipeline CLI for AI content generation, video analysis, transcription, YAML pipelines, ViMax agentic video production, and proje...
πŸ¦€ ClawHub
Video Captions
Generate professional captions and subtitles with multi-engine transcription, word-level timing, styling presets, and burn-in.
πŸ¦€ ClawHub
Openai
OpenAI API integration β€” chat completions, embeddings, image generation, audio transcription, file management, fine-tuning, and assistants via the OpenAI RES...
πŸ¦€ ClawHub
Yt Assemblyai Monitor
YouTube channel monitor and video transcription using AssemblyAI cloud API. Pure Python + requests only β€” no ffmpeg, no Whisper, no extra tools needed. Monit...
πŸ¦€ ClawHub
Elevenlabs Transcribe
Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files.
πŸ¦€ ClawHub
subtitle-extractor
Subtitle extractor for Bilibili, YouTube, Xiaohongshu, Douyin, and local files. Extracts native subtitles or Whisper transcription in original format. Agent...
πŸ¦€ ClawHub
Local Whisper
Local speech-to-text using OpenAI Whisper. Runs fully offline after model download. High quality transcription with multiple model sizes.
πŸ¦€ ClawHub
AssemblyAI advanced speech transcription
Transcribe, diarise, translate, post-process, and structure audio/video with AssemblyAI. Use this skill when the user wants AssemblyAI specifically, needs hi...
πŸ¦€ ClawHub
Link Transcriber Skill Public
Use this skill when a user wants to submit a Douyin or Xiaohongshu link to the linkTranscriber transcription API, optionally provide cookie when available, w...
πŸ¦€ ClawHub
YouTube Long Video Transcript
YouTube long video (>1 hour) full verbatim transcription and translation workflow. Use when user needs to (1) Extract subtitles from YouTube videos, (2) Translate English transcripts to Chinese, (3) Handle long videos that exceed session limits, (4) Process DownSub API responses and generate formatted documents.
πŸ¦€ ClawHub
Youtube Transcriber
One-command YouTube video transcription. Automatically downloads audio and transcribes using OpenAI Whisper API β€” works even when YouTube subtitles are disab...
πŸ¦€ ClawHub
Parakeet Local Asr
Install and operate local NVIDIA Parakeet ASR for OpenClaw with an OpenAI-compatible transcription API on Ubuntu/Linux and macOS (Intel/Apple Silicon). Use w...
πŸ¦€ ClawHub
clip-editor
Video clip editing skill for automatically analyzing video content and generating CapCut draft templates. Uses local Whisper for speech transcription, Qwen-V...
πŸ¦€ ClawHub
Video Summary
Video summarization for Bilibili, Xiaohongshu, Douyin, and YouTube. Extract insights from video content through transcription and summarization.
πŸ¦€ ClawHub
mlx-whisper
Set up mlx-whisper as the local audio transcription engine for OpenClaw on Apple Silicon Macs (M1/M2/M3/M4). Automatically transcribes voice notes sent via T...
πŸ¦€ ClawHub
Qwen ASR
Local speech-to-text using Qwen3-ASR (CPU-only, no API key, no cloud). Use when: (1) a voice message or audio file needs transcription, (2) user asks to tran...
πŸ¦€ ClawHub
Meta Video Ad Analyzer
Extract and analyze content from video ads using Gemini Vision AI. Supports frame extraction, OCR text detection, audio transcription, and AI-powered scene analysis. Use when analyzing video creative content, extracting text overlays, or generating scene-by-scene descriptions.
πŸ¦€ ClawHub
Play Music from YouTube
Play music on YouTube via browser automation with playwright-cli. Use when the user wants to: (1) play a specific song (e.g. 'play Money Money Money by ABBA') (2) play songs by an artist as a playlist or mix (e.g. 'play Jay Chou's songs') (3) play genre or mood-based music (e.g. 'play relaxing spa music', 'play 60s Chinese oldies') (4) control playback β€” next, pause, resume, stop, skip ad, change song, close the player. Also handles song/artist name corrections from voice transcription erro
πŸ¦€ ClawHub
case.dev
case.dev β€” a legal AI platform with encrypted document vaults, OCR, audio transcription, and legal search. This skill installs the casedev CLI and provides s...
πŸ¦€ ClawHub
ifly-speed-transcription
Ultra-fast speech transcription using iFLYTEK Speed Transcription API. Transcribe audio files (WAV/PCM/MP3) up to 5 hours in ~20 seconds per hour. Supports C...
πŸ¦€ ClawHub
Aliyun Asr
Pure Aliyun ASR skill for voice message transcription, supports multiple channels including Feishu
πŸ¦€ ClawHub
Telegram Whisper Transcribe
Standalone Telegram bot for voice message transcription via OpenAI Whisper API. No LLM overhead β€” audio goes directly to Whisper and text comes back in 2-5 s...
πŸ¦€ ClawHub
Voice To Protocol Transcriber
Record experimental procedures and observations via voice commands during lab work. Real-time transcription for structured experiment documentation.
πŸ¦€ ClawHub
transcription
Transcribe audio and video files using OpenAI Whisper API. Use when user wants to transcribe audio/video files, extract speech from media, or get text from r...
πŸ¦€ ClawHub
Douyin Video Transcribe
Douyin video transcription suite. Extract audio from Douyin/TikTok China videos, transcribe with Whisper, and analyze content. Supports video links, local fi...
πŸ¦€ ClawHub
Nex Voice
Voice note transcription and intelligent action item extraction for capture and organization of verbal communication. Record and transcribe voice notes, voic...
πŸ¦€ ClawHub
Faster Whisper Transcription
Transcribes local voice messages to text using Faster Whisper models for fast, privacy-focused speech recognition on audio files.
πŸ¦€ ClawHub
Faster Whisper Gpu
High-performance local speech-to-text transcription using Faster Whisper with NVIDIA GPU acceleration. Transcribe audio files locally without sending data to...
πŸ¦€ ClawHub
Venice API Kit
Complete Venice AI API toolkit - image generation, video, audio, embeddings, transcription, characters, models, and admin functions. Privacy-focused inferenc...
πŸ¦€ ClawHub
Youtube Transcript Api
Extract, transcribe, and translate YouTube video transcripts using the YouTubeTranscript.dev V2 API. Supports captions, ASR audio transcription, batch proces...
πŸ¦€ ClawHub
Voice Transcriber Pro
Voice note transcription and archival for OpenClaw agents. Powered by Deepgram Nova-3. Transcribes audio messages, saves both audio files and text transcript...
πŸ¦€ ClawHub
acestep-lyrics-transcription
Transcribe audio to timestamped lyrics using OpenAI Whisper or ElevenLabs Scribe API. Outputs LRC, SRT, or JSON with word-level timestamps. Use when users want to transcribe songs, generate LRC files, or extract lyrics with timestamps from audio.
πŸ¦€ ClawHub
Audio
Process, enhance, and convert audio files with noise removal, normalization, format conversion, transcription, and podcast workflows.
πŸ¦€ ClawHub
Speech To Text
Transcribe audio to text with Whisper models via inference.sh CLI. Models: Fast Whisper Large V3, Whisper V3 Large. Capabilities: transcription, translation,...
πŸ¦€ ClawHub
Ai Video Transcription
Transcribe video speech to text with 98%+ accuracy using AI β€” convert spoken audio from any video into perfectly timed text transcripts, searchable documents...
πŸ¦€ ClawHub
protocal-agent
Generate structured execution plans for medical and molecular biology protocols such as RNA extraction, reverse transcription, qPCR, cell culture, CRISPR, or...
πŸ¦€ ClawHub
Coze Asr
Automatic Speech Recognition (ASR) using Coze API. Use when you need to transcribe audio files to text. Supports Chinese audio transcription via Coze's speec...
← PrevPage 2 / 2 (90 skills)