Find the Right AI Skill for Any Job

Browse 210+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

All Skills

210 skills total matching "transcribe"

Transcribe meetings with SenseAudio ASR speaker diarization, timestamps, and meeting-note extraction workflows. Use when users need meeting transcription, me...

🦀 ClawHub

Vocal Chat

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

🦀 ClawHub

Speech is Cheap Transcribe

Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps, and customizable output formats.

🦀 ClawHub

Elevenlabs Integration with Openclaw

ClawVox - ElevenLabs voice studio for OpenClaw. Generate speech, transcribe audio, clone voices, create sound effects, and more.

🦀 ClawHub

Telegram Voice Bot

Telegram bot that transcribes voice messages using Whisper and replies in Chinese with Microsoft Edge text-to-speech.

🦀 ClawHub

Funasr Transcribe Skill

Use when the user needs local speech-to-text transcription for audio files, especially Chinese or mixed Chinese-English audio, without relying on cloud trans...

🦀 ClawHub

Whisper Transcriber

Offline speech-to-text (ASR) using whisper.cpp (whisper-cli) + ffmpeg. Supports batch transcription, timestamps, SRT/TXT/JSON outputs, and model download. Cr...

🦀 ClawHub

Voice

Voice communication via Telegram. Automatically transcribes incoming voice messages using faster-whisper and replies with TTS voice. Use for all voice-relate...

🦀 ClawHub

deAPI AI Media Suite (Community)

The cheapest AI media API on the market. Generate images (Flux), music (AceStep), speech with voice cloning, transcribe video/audio, OCR, video generation, b...

🦀 ClawHub

Speechall command-line tool for fast speech-to-text transcription using multiple providers

Install and use the speechall CLI tool for speech-to-text transcription. Use when the user wants to: (1) transcribe audio or video files to text, (2) install speechall on macOS or Linux, (3) list available STT models and their capabilities, (4) use speaker diarization, subtitles, or other transcription features from the terminal. Triggers on mentions of speechall, audio transcription CLI, or speech-to-text from the command line.

🦀 ClawHub

video-transcriber

Transcribe speech from videos

🦀 ClawHub

Whisper Transcribe

Transcribe audio files to text using OpenAI Whisper. Supports speech-to-text with auto language detection, multiple output formats (txt, srt, vtt, json), batch processing, and model selection (tiny to large). Use when transcribing audio recordings, podcasts, voice messages, lectures, meetings, or any audio/video file to text. Handles mp3, wav, m4a, ogg, flac, webm, opus, aac formats.

🦀 ClawHub

🎤 Transcribe audio files using Qwen ASR. 千问STT

Transcribe audio files using Qwen ASR (千问STT). Use when the user sends voice messages and wants them converted to text.

🦀 ClawHub

ElevenLabs STT OpenClaw

Transcribe audio files with ElevenLabs Speech-to-Text (Scribe v2) from the local CLI. Supports diarization, events, JSON output, webhooks, and advanced STT o...

🦀 ClawHub

Douyin Content Tracker Skill

This skill should be used when the user wants to scrape Douyin (TikTok China) creator content, download audio, and transcribe it with Whisper. Covers first-t...

🦀 ClawHub

Kai YouTube

Download and transcribe YouTube videos using yt-dlp and Whisper CLI, saving audio and transcripts for playback and summary from any YouTube URL.

🦀 ClawHub

ElevenLabs Speech-to-Text

Transcribe audio files using ElevenLabs Speech-to-Text (Scribe v2).

🦀 ClawHub

Transcribe audio via Groq API (~10x cheaper than OpenAI API)

Transcribe audio via Groq Automatic Speech Recognition (ASR) Models (Whisper).

🦀 ClawHub

MH summarize

Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).

🦀 ClawHub

Speech to Text Transcription

Transcribe audio and video files to text with speaker detection, timestamps, and format conversion.

🦀 ClawHub

Aliyun Speech Transcriber

Transcribe publicly accessible audio or video URLs with Aliyun speech services. Use when the user wants speech-to-text via Aliyun DashScope, needs transcript...

🦀 ClawHub

Facticity.AI Complete Integration

Complete Facticity.AI integration - fact-check claims, extract claims from content, transcribe links, check link reliability, check credits, and monitor task...

🦀 ClawHub

Voice Transcriber

Voice note transcription and archival for OpenClaw agents. Powered by Deepgram Nova-3. Transcribes audio messages, saves both audio files and text transcript...

🦀 ClawHub

Audio Handler

Read, analyze, convert, trim, merge, adjust volume, and transcribe audio files in multiple formats including MP3, WAV, FLAC, AAC, OGG, and more.

🦀 ClawHub

Audio Transcribe

Auto-transcribe voice messages locally using faster-whisper with selectable Whisper models, no API key required.

⭐ GitHub

YT transcriber

this transcribes a YT video from a single id by [swyx](https://x.com/swyx/)

⭐ GitHub

Showtimes

Transcribes and summarizes audio content.

🦀 ClawHub

Kai Minimax Tts

Generate voice audio and transcribe speech using MiniMax TTS API. Use when responding with voice or transcribing audio files.

🦀 ClawHub

Transcript

Get transcripts from any YouTube video — for summarization, research, translation, quoting, or content analysis. Use when the user shares a video link or asks "what did they say", "get the transcript", "transcribe this video", "summarize this video", or wants to analyze spoken content.

🦀 ClawHub

Feishu Voice Loop

Accept text or voice input, transcribe if needed, generate natural OpenAI TTS speech, and send audio output to Feishu chat or web player.

🦀 ClawHub

Voice Note Transcriber Cn

语音笔记转文字工具 v2.1 | Voice Note Transcriber. 支持多语言识别、实时转写、说话人识别、智能摘要、音频降噪、离线识别。触发词：转写、识别、语音。

🦀 ClawHub

Transcribe Audio with Parakeet MLX

Local speech-to-text with Parakeet MLX (ASR) for Apple Silicon (no API key).

🦀 ClawHub

MOSI Transcribe Diarize 多说话人转写

MOSS 多说话人转写技能。支持 URL / 本地文件 / Base64 音频输入，输出带时间戳与 speaker 的结构化转写结果（JSON、逐段文本、按说话人汇总）。用于会议纪要、访谈录音、多人对话整理。

🦀 ClawHub

MH openai-whisper-api

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

🦀 ClawHub

Telegram Voice To Voice Macos

Telegram voice-to-voice for macOS Apple Silicon: transcribe inbound .ogg voice notes with yap (Speech.framework) and reply with Telegram voice notes via say+ffmpeg. Not compatible with Linux/Windows.

🦀 ClawHub

Percept Listen

Captures ambient audio from wearable devices, transcribes locally, and streams searchable, speaker-tagged conversation data to your OpenClaw agent.

🦀 ClawHub

moss-transcribe-diarize

MOSS 多说话人转写技能。支持 URL / 本地文件 / Base64 音频输入，输出带时间戳与 speaker 的结构化转写结果（JSON、逐段文本、按说话人汇总）。用于会议纪要、访谈录音、多人对话整理。需要 API 凭证（环境变量：MOSS_API_KEY，兼容 MOSI_TTS_API_KEY / MOS...

🦀 ClawHub

Cult Of Carcinization

Give your agent a voice — and ears. The Cult of Carcinization is the bot-first gateway to ScrappyLabs TTS and STT. Speak with 20+ voices, design your own from a text description, transcribe audio to text, and evolve into a permanent bot identity. No human signup required.

🦀 ClawHub

it will help you to send voice messages to your AI Assistant and also can make it talk

Text-to-Speech and Speech-to-Text using ElevenLabs AI. Use when the user wants to convert text to speech, transcribe voice messages, or work with voice in multiple languages. Supports high-quality AI voices and accurate transcription.

🦀 ClawHub

Douyin Transcriber

Transcribe speech from audio or video files, automatically extracting audio and converting to text using Docker Whisper ASR for Douyin/TikTok media.

🦀 ClawHub

TL;DX

Extract, transcribe, clean, segment, and analyze long-form content from URLs, local media files, existing transcripts, and pasted text. Use when a user provi...

🦀 ClawHub

Transcribe

Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.

🦀 ClawHub

Voice Memos

Transcribe and organize voice memos with automatic categorization and information extraction. Use when users have voice notes, audio memos, or spoken notes t...

🦀 ClawHub

Video Transcriber

视频转写工作流，支持B站和YouTube视频。自动判断有字幕/无字幕，有字幕则获取字幕，无字幕则下载音频+whisper转写。触发场景：(1) 用户要求总结视频内容 (2) 用户要求获取视频字幕 (3) 用户要求转写视频 (4) 处理B站/YouTube视频

🦀 ClawHub

Bilibili Transcript

Transcribe Bilibili videos to text with high accuracy using Whisper medium model. Use when the user provides a Bilibili video URL (BVxxxxx) and wants to: (1)...

🦀 ClawHub

video-download-transcribe

多平台视频下载 + 本地转录 + 视频内容分析。 **触发词**：这个视频说了什么、视频内容是什么、帮我看这个视频、下载这个视频、视频转录、字幕提取、B站视频、抖音视频、bilibili、youtube视频、帮我转录 **支持平台**：B站/抖音/TikTok/YouTube/小红书/微博/快手 **下载**：y...

🦀 ClawHub

Simple sound-to-text skill locally

Local speech-to-text using OpenAI Whisper. Use when the user needs to: (1) transcribe audio files to text, (2) convert voice messages to written content, (3)...

🦀 ClawHub

whatsappVoiceOpenSkill

Real-time WhatsApp voice message processing. Transcribe voice notes to text via Whisper, detect intent, execute handlers, and send responses. Use when building conversational voice interfaces for WhatsApp. Supports English and Hindi, customizable intents (weather, status, commands), automatic language detection, and streaming responses via TTS.

← PrevPage 2 / 5 (210 skills)Next →