Find the Right AI Skill for Any Job

Browse 166+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

All Skills — audio

166 skills in "audio" matching "transcribe"

Transcribe audio files with ElevenLabs Speech-to-Text (Scribe v2) from the local CLI. Supports diarization, events, JSON output, webhooks, and advanced STT o...

🦀 ClawHub

Douyin Content Tracker Skill

This skill should be used when the user wants to scrape Douyin (TikTok China) creator content, download audio, and transcribe it with Whisper. Covers first-t...

🦀 ClawHub

Kai YouTube

Download and transcribe YouTube videos using yt-dlp and Whisper CLI, saving audio and transcripts for playback and summary from any YouTube URL.

🦀 ClawHub

ElevenLabs Speech-to-Text

Transcribe audio files using ElevenLabs Speech-to-Text (Scribe v2).

🦀 ClawHub

Transcribe audio via Groq API (~10x cheaper than OpenAI API)

Transcribe audio via Groq Automatic Speech Recognition (ASR) Models (Whisper).

🦀 ClawHub

MH summarize

Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).

🦀 ClawHub

Speech to Text Transcription

Transcribe audio and video files to text with speaker detection, timestamps, and format conversion.

🦀 ClawHub

Aliyun Speech Transcriber

Transcribe publicly accessible audio or video URLs with Aliyun speech services. Use when the user wants speech-to-text via Aliyun DashScope, needs transcript...

🦀 ClawHub

Voice Transcriber

Voice note transcription and archival for OpenClaw agents. Powered by Deepgram Nova-3. Transcribes audio messages, saves both audio files and text transcript...

🦀 ClawHub

Audio Handler

Read, analyze, convert, trim, merge, adjust volume, and transcribe audio files in multiple formats including MP3, WAV, FLAC, AAC, OGG, and more.

🦀 ClawHub

Audio Transcribe

Auto-transcribe voice messages locally using faster-whisper with selectable Whisper models, no API key required.

⭐ GitHub

Showtimes

Transcribes and summarizes audio content.

🦀 ClawHub

Kai Minimax Tts

Generate voice audio and transcribe speech using MiniMax TTS API. Use when responding with voice or transcribing audio files.

🦀 ClawHub

Feishu Voice Loop

Accept text or voice input, transcribe if needed, generate natural OpenAI TTS speech, and send audio output to Feishu chat or web player.

🦀 ClawHub

Voice Note Transcriber Cn

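Voice note to text tool v2.1 | Voice Note Transcriber. Supports multilingual recognition, real-time transcription, speaker identification, smart summaries, audio noise reduction, and offline recognition. Trigger words: 转写 (transcribe), 识别 (recognize), 语音 (voice).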

🦀 ClawHub

Transcribe Audio with Parakeet MLX

Local speech-to-text with Parakeet MLX (ASR) for Apple Silicon (no API key).

🦀 ClawHub

MH openai-whisper-api

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

🦀 ClawHub

Telegram Voice To Voice Macos

Telegram voice-to-voice for macOS Apple Silicon: transcribe inbound .ogg voice notes with yap (Speech.framework) and reply with Telegram voice notes via say+ffmpeg. Not compatible with Linux/Windows.

🦀 ClawHub

Percept Listen

Captures ambient audio from wearable devices, transcribes locally, and streams searchable, speaker-tagged conversation data to your OpenClaw agent.

🦀 ClawHub

moss-transcribe-diarize

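MOSS multi-speaker transcription skill. Accepts URL, local file, or Base64 audio input and outputs structured transcripts with timestamps and speaker labels (JSON, per-segment text, per-speaker summary). Intended for meeting minutes, interview recordings, and organizing multi-speaker conversations. Requires API credentials (environment variable: MOSS_API_KEY, also compatible with MOSI_TTS_API_KEY / MOS...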

🦀 ClawHub

it will help you to send voice messages to your AI Assistant and also can make it talk

Text-to-Speech and Speech-to-Text using ElevenLabs AI. Use when the user wants to convert text to speech, transcribe voice messages, or work with voice in multiple languages. Supports high-quality AI voices and accurate transcription.

🦀 ClawHub

Douyin Transcriber

Transcribe speech from audio or video files, automatically extracting audio and converting to text using Docker Whisper ASR for Douyin/TikTok media.

🦀 ClawHub

Transcribe

Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.

🦀 ClawHub

Video Transcriber

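Video transcription workflow for Bilibili and YouTube videos. Automatically detects whether subtitles exist: if so, fetches them; if not, downloads the audio and transcribes it with Whisper. Triggers: (1) the user asks for a summary of a video, (2) the user asks for a video's subtitles, (3) the user asks to transcribe a video, (4) handling Bilibili/YouTube videos.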

🦀 ClawHub

Bilibili Transcript

Transcribe Bilibili videos to text with high accuracy using Whisper medium model. Use when the user provides a Bilibili video URL (BVxxxxx) and wants to: (1)...

🦀 ClawHub

Simple sound-to-text skill locally

Local speech-to-text using OpenAI Whisper. Use when the user needs to: (1) transcribe audio files to text, (2) convert voice messages to written content, (3)...

🦀 ClawHub

Gladia YouTube Transcription (Free)

Transcribe speech from YouTube videos or audio URLs into text using Gladia API with up to 10 free hours of monthly transcription. Use when: you need to summa...

🦀 ClawHub

YouTube Transcribe

Transcribe YouTube videos with smart fallback: extracts captions first (fast, free), falls back to local Whisper transcription when no captions available. Au...

🦀 ClawHub

Super-Transcribe — Unified Speech-to-Text

Unified speech-to-text skill. Use when the user asks to transcribe audio or video, generate subtitles, identify speakers, translate speech, search transcript...

🦀 ClawHub

Volcengine STT

Transcribe audio to text using Volcano Engine (Volcengine/ARK) speech-to-text APIs. Use when the user wants to replace Whisper/OpenAI STT with Volcengine, tr...

🦀 ClawHub

Voice Memo Sync

Sync, transcribe, and intelligently organize voice memos, audio/video files, and video URLs.

🦀 ClawHub

K8s Self Hosted Whisper Api

Transcribe audio via the self-hosted Whisper ASR instance running on Kubernetes. Use this skill whenever the user wants to transcribe audio files, convert sp...

🦀 ClawHub

Gettr Transcribe

Download audio from a GETTR post or streaming page and transcribe it locally with MLX Whisper on Apple Silicon (with timestamps via VTT). Use when given a GE...

🦀 ClawHub

Voice Note Transcriber Cn V1.1

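Voice note to text tool v1.1 | New: real-time captions, multilingual translation, voice markers, audio clipping, SRT export. Supports real-time transcription and meeting-minutes generation.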

🦀 ClawHub

Telegram Multilingual Voice Reply

Smart Telegram reply workflow for OpenClaw: if the user sends text, reply with text; if the user sends a voice note/audio, transcribe locally using the insta...

🦀 ClawHub

salute speech

Transcribe audio files using Sber Salute Speech async API. Russian-first STT with support for ru-RU, en-US, kk-KZ, ky-KG, uz-UZ.

🦀 ClawHub

Speech to Text

Transcribe or translate audio files to text using a public Hugging Face Whisper Space over Gradio. Use when the user sends voice notes, audio attachments, me...

🦀 ClawHub

Auto Subtitle Generator Online

The auto-subtitle-generator-online skill transcribes and embeds accurate subtitles into your videos using AI-powered speech recognition. Upload your footage,...

🦀 ClawHub

Elevenlabs Transcribe

Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files.

🦀 ClawHub

Deapi Audio

Text-to-speech, voice cloning, voice design, and transcribe audio files via deAPI GPU network. Trigger on 'text to speech', 'TTS', 'generate voice', 'read al...

🦀 ClawHub

Transcribee 🐝

Transcribe YouTube videos and local audio/video files with speaker diarization. Use when user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts ready for LLM analysis.

🦀 ClawHub

AIML Voice Transcript

Transcribe audio files (ogg, mp3, wav, etc.) using AIMLAPI. Use when the user provides audio messages or local audio files. Provides a reliable Python script...

🦀 ClawHub

Gemini STT

Transcribe audio files using Google's Gemini API or Vertex AI

🦀 ClawHub

Step Asr

Transcribe audio files to text via Step ASR streaming API (HTTP SSE). Supports Chinese and English, multiple audio formats (PCM, WAV, MP3, OGG/OPUS), real-ti...

🦀 ClawHub

AssemblyAI advanced speech transcription

Transcribe, diarise, translate, post-process, and structure audio/video with AssemblyAI. Use this skill when the user wants AssemblyAI specifically, needs hi...

🦀 ClawHub

Youtube Transcriber

One-command YouTube video transcription. Automatically downloads audio and transcribes using OpenAI Whisper API — works even when YouTube subtitles are disab...

🦀 ClawHub

openclaw-voice

Transcribe audio to text and generate spoken AI responses using Whisper and ElevenLabs via CLI with transcript storage and search.

🦀 ClawHub

TG Voice Whisper Transcriber

Automation skill for TG Voice Whisper Transcriber.
