Find the Right AI Skill for Any Job
Browse 166+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — audio
166 skills in "audio" matching "transcribe"
🌐 Allcodingdevopsapidatabasesecuritydataresearchwritingimage-genvideoaudiotranslationseosocial-mediaemail-marketingadvertisingfinancecrypto-defiecommercelegalhrreal-estatehealtheducationcookingtravelgamingautomationcommunicationproductivityclawhublobehubdifymcp
🦀 ClawHub
ElevenLabs STT OpenClaw
Transcribe audio files with ElevenLabs Speech-to-Text (Scribe v2) from the local CLI. Supports diarization, events, JSON output, webhooks, and advanced STT o...
🦀 ClawHub
Douyin Content Tracker Skill
This skill should be used when the user wants to scrape Douyin (TikTok China) creator content, download audio, and transcribe it with Whisper. Covers first-t...
🦀 ClawHub
Kai YouTube
Download and transcribe YouTube videos using yt-dlp and Whisper CLI, saving audio and transcripts for playback and summary from any YouTube URL.
🦀 ClawHub
ElevenLabs Speech-to-Text
Transcribe audio files using ElevenLabs Speech-to-Text (Scribe v2).
🦀 ClawHub
Transcribe audio via Groq API (~10x cheaper than OpenAI API)
Transcribe audio via Groq Automatic Speech Recognition (ASR) Models (Whisper).
🦀 ClawHub
MH summarize
Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).
🦀 ClawHub
Speech to Text Transcription
Transcribe audio and video files to text with speaker detection, timestamps, and format conversion.
🦀 ClawHub
Aliyun Speech Transcriber
Transcribe publicly accessible audio or video URLs with Aliyun speech services. Use when the user wants speech-to-text via Aliyun DashScope, needs transcript...
🦀 ClawHub
Voice Transcriber
Voice note transcription and archival for OpenClaw agents. Powered by Deepgram Nova-3. Transcribes audio messages, saves both audio files and text transcript...
🦀 ClawHub
Audio Handler
Read, analyze, convert, trim, merge, adjust volume, and transcribe audio files in multiple formats including MP3, WAV, FLAC, AAC, OGG, and more.
🦀 ClawHub
Audio Transcribe
Auto-transcribe voice messages locally using faster-whisper with selectable Whisper models, no API key required.
⭐ GitHub
Showtimes
Transcribes and summarizes audio content.
🦀 ClawHub
Kai Minimax Tts
Generate voice audio and transcribe speech using MiniMax TTS API. Use when responding with voice or transcribing audio files.
🦀 ClawHub
Feishu Voice Loop
Accept text or voice input, transcribe if needed, generate natural OpenAI TTS speech, and send audio output to Feishu chat or web player.
🦀 ClawHub
Voice Note Transcriber Cn
语音笔记转文字工具 v2.1 | Voice Note Transcriber. 支持多语言识别、实时转写、说话人识别、智能摘要、音频降噪、离线识别。触发词:转写、识别、语音。
🦀 ClawHub
Transcribe Audio with Parakeet MLX
Local speech-to-text with Parakeet MLX (ASR) for Apple Silicon (no API key).
🦀 ClawHub
MH openai-whisper-api
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
🦀 ClawHub
Telegram Voice To Voice Macos
Telegram voice-to-voice for macOS Apple Silicon: transcribe inbound .ogg voice notes with yap (Speech.framework) and reply with Telegram voice notes via say+ffmpeg. Not compatible with Linux/Windows.
🦀 ClawHub
Percept Listen
Captures ambient audio from wearable devices, transcribes locally, and streams searchable, speaker-tagged conversation data to your OpenClaw agent.
🦀 ClawHub
moss-transcribe-diarize
MOSS 多说话人转写技能。支持 URL / 本地文件 / Base64 音频输入,输出带时间戳与 speaker 的结构化转写结果(JSON、逐段文本、按说话人汇总)。用于会议纪要、访谈录音、多人对话整理。需要 API 凭证(环境变量:MOSS_API_KEY,兼容 MOSI_TTS_API_KEY / MOS...
🦀 ClawHub
it will help you to send voice messages to your AI Assistant and also can make it talk
Text-to-Speech and Speech-to-Text using ElevenLabs AI. Use when the user wants to convert text to speech, transcribe voice messages, or work with voice in multiple languages. Supports high-quality AI voices and accurate transcription.
🦀 ClawHub
Douyin Transcriber
Transcribe speech from audio or video files, automatically extracting audio and converting to text using Docker Whisper ASR for Douyin/TikTok media.
🦀 ClawHub
Transcribe
Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.
🦀 ClawHub
Video Transcriber
视频转写工作流,支持B站和YouTube视频。自动判断有字幕/无字幕,有字幕则获取字幕,无字幕则下载音频+whisper转写。触发场景:(1) 用户要求总结视频内容 (2) 用户要求获取视频字幕 (3) 用户要求转写视频 (4) 处理B站/YouTube视频
🦀 ClawHub
Bilibili Transcript
Transcribe Bilibili videos to text with high accuracy using Whisper medium model. Use when the user provides a Bilibili video URL (BVxxxxx) and wants to: (1)...
🦀 ClawHub
Simple sound-to-text skill locally
Local speech-to-text using OpenAI Whisper. Use when the user needs to: (1) transcribe audio files to text, (2) convert voice messages to written content, (3)...
🦀 ClawHub
Gladia YouTube Transcription (Free)
Transcribe speech from YouTube videos or audio URLs into text using Gladia API with up to 10 free hours of monthly transcription. Use when: you need to summa...
🦀 ClawHub
YouTube Transcribe
Transcribe YouTube videos with smart fallback: extracts captions first (fast, free), falls back to local Whisper transcription when no captions available. Au...
🦀 ClawHub
Super-Transcribe — Unified Speech-to-Text
Unified speech-to-text skill. Use when the user asks to transcribe audio or video, generate subtitles, identify speakers, translate speech, search transcript...
🦀 ClawHub
Volcengine STT
Transcribe audio to text using Volcano Engine (Volcengine/ARK) speech-to-text APIs. Use when the user wants to replace Whisper/OpenAI STT with Volcengine, tr...
🦀 ClawHub
Voice Memo Sync
Sync, transcribe, and intelligently organize voice memos, audio/video files, and URLs. 同步、转录、智能整理语音备忘录、音视频文件和视频链接。
🦀 ClawHub
K8s Self Hosted Whisper Api
Transcribe audio via the self-hosted Whisper ASR instance running on Kubernetes. Use this skill whenever the user wants to transcribe audio files, convert sp...
🦀 ClawHub
Gettr Transcribe
Download audio from a GETTR post or streaming page and transcribe it locally with MLX Whisper on Apple Silicon (with timestamps via VTT). Use when given a GE...
🦀 ClawHub
Voice Note Transcriber Cn V1.1
语音笔记转文字工具 v1.1 | 新增:实时字幕、多语言翻译、语音标记、音频剪辑、SRT导出。支持实时转写、会议纪要生成。
🦀 ClawHub
Telegram Multilingual Voice Reply
Smart Telegram reply workflow for OpenClaw: if the user sends text, reply with text; if the user sends a voice note/audio, transcribe locally using the insta...
🦀 ClawHub
salute speech
Transcribe audio files using Sber Salute Speech async API. Russian-first STT with support for ru-RU, en-US, kk-KZ, ky-KG, uz-UZ.
🦀 ClawHub
Speech to Text
Transcribe or translate audio files to text using a public Hugging Face Whisper Space over Gradio. Use when the user sends voice notes, audio attachments, me...
🦀 ClawHub
Auto Subtitle Generator Online
The auto-subtitle-generator-online skill transcribes and embeds accurate subtitles into your videos using AI-powered speech recognition. Upload your footage,...
🦀 ClawHub
Elevenlabs Transcribe
Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files.
🦀 ClawHub
Deapi Audio
Text-to-speech, voice cloning, voice design, and transcribe audio files via deAPI GPU network. Trigger on 'text to speech', 'TTS', 'generate voice', 'read al...
🦀 ClawHub
Transcribee 🐝
Transcribe YouTube videos and local audio/video files with speaker diarization. Use when user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts ready for LLM analysis.
🦀 ClawHub
AIML Voice Transcript
Transcribe audio files (ogg, mp3, wav, etc.) using AIMLAPI. Use when the user provides audio messages or local audio files. Provides a reliable Python script...
🦀 ClawHub
Gemini STT
Transcribe audio files using Google's Gemini API or Vertex AI
🦀 ClawHub
Step Asr
Transcribe audio files to text via Step ASR streaming API (HTTP SSE). Supports Chinese and English, multiple audio formats (PCM, WAV, MP3, OGG/OPUS), real-ti...
🦀 ClawHub
AssemblyAI advanced speech transcription
Transcribe, diarise, translate, post-process, and structure audio/video with AssemblyAI. Use this skill when the user wants AssemblyAI specifically, needs hi...
🦀 ClawHub
Youtube Transcriber
One-command YouTube video transcription. Automatically downloads audio and transcribes using OpenAI Whisper API — works even when YouTube subtitles are disab...
🦀 ClawHub
openclaw-voice
Transcribe audio to text and generate spoken AI responses using Whisper and ElevenLabs via CLI with transcript storage and search.
🦀 ClawHub
TG Voice Whisper Transcriber
Automation skill for TG Voice Whisper Transcriber.