Find the Right AI Skill for Any Job
Browse 90+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills โ clawhub
90 skills in "clawhub" matching "transcription"
๐ Allcodingdevopsapidatabasesecuritydataresearchwritingimage-genvideoaudiotranslationseosocial-mediaemail-marketingadvertisingfinancecrypto-defiecommercelegalhrreal-estatehealtheducationcookingtravelgamingautomationcommunicationproductivityclawhublobehubdifymcp
๐ฆ ClawHub
Meeting Notes Generator
AI-powered meeting notes generator - automatic transcription, summary, action items extraction, and task assignment. Turns meeting recordings or text into pr...
๐ฆ ClawHub
Construction Daily Report Generator
Generate a structured daily site progress report from unstructured input such as voice transcription, rough notes, or conversational messages.
๐ฆ ClawHub
Construction Meeting Minutes Generator
Generate structured construction meeting minutes from rough notes or voice transcription, with separated action items, decision tracking, and contractual fla...
๐ฆ ClawHub
ๆ้ณๆๆก่งฃๆ
Call the coze-js-api Douyin transcription endpoint and return transcript-ready results from Douyin URLs or share-text. Use this skill whenever the user asks...
๐ฆ ClawHub
AudioPod
Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise reduction, speech-to-text transcription, speaker separation, and media extraction. Use when the user needs to generate music/songs/rap from text, split a song into stems/vocals/instruments, generate speech from text, clean up noisy audio, transcribe audio/video, or extract audio from YouTube/URLs. Requires AUDIOPOD_API
๐ฆ ClawHub
Edge TTS Voice System
Local voice system for OpenClaw using faster-whisper for inbound transcription and Edge TTS for outbound replies. Use when you need private voice workflows,...
๐ฆ ClawHub
Local Transcription
Local speech-to-text transcription with Qwen ASR โ transcription routed across your Apple Silicon fleet. Transcribe meetings, voice notes, podcasts with loca...
๐ฆ ClawHub
In Silico Perturbation Oracle
Virtual gene knockout simulation using foundation models to predict transcriptional changes
๐ฆ ClawHub
WayinVideo - Video Understanding & AI Clipping
WayinVideo AI video editing and analysis suite. Includes highlight extraction, natural language video search, content summarization, and transcription. Examp...
๐ฆ ClawHub
Audio Command Executor
Processes inbound audio files, transcribes them, and answers to resulting texts. Converts non-WAV inputs to WAV before transcription.
๐ฆ ClawHub
Whisper STT
Free local speech-to-text transcription using OpenAI Whisper. Transcribe audio files (mp3, wav, m4a, ogg, etc.) to text without API costs. Use when: (1) User...
๐ฆ ClawHub
Ai Sdk Core
Build backend AI with Vercel AI SDK v6 stable. Covers Output API (replaces generateObject/streamObject), speech synthesis, transcription, embeddings, MCP tools with security guidance. Includes v4โv5 migration and 15 error solutions with workarounds.
Use when: implementing AI SDK v5/v6, migrating versions, troubleshooting AI_APICallError, Workers startup issues, Output API errors, Gemini caching issues, Anthropic tool errors, MCP tools, or stream resumption failures.
๐ฆ ClawHub
In Silico Perturbation Oracle
Virtual gene knockout simulation using foundation models to predict transcriptional changes
๐ฆ ClawHub
MiMo Voice Assistant
End-to-end voice solution for OpenClaw agents. Xiaomi MiMo-V2-TTS with emotion-aware speech generation, MiMo-V2-Omni for voice transcription. Multi-platform...
๐ฆ ClawHub
Voice-to-Protocol Transcriber
Record experimental procedures and observations via voice commands during lab work. Real-time transcription for structured experiment documentation.
๐ฆ ClawHub
Azure Ai Transcription Py
Azure AI Transcription SDK for Python. Use for real-time and batch speech-to-text transcription with timestamps and diarization.
Triggers: "transcription", "speech to text", "Azure AI Transcription", "TranscriptionClient".
๐ฆ ClawHub
Markdown Converter
Convert documents and files to Markdown using markitdown. Use when converting PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML, images (with EXIF/OCR), audio (with transcription), ZIP archives, YouTube URLs, or EPubs to Markdown format for LLM processing or text analysis.
๐ฆ ClawHub
Faster Whisper
Local speech-to-text using faster-whisper. 4-6x faster than OpenAI Whisper with identical accuracy; GPU acceleration enables ~20x realtime transcription. SRT...
๐ฆ ClawHub
Openai Whisper Api
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
๐ฆ ClawHub
Video Reader
Tool-driven video question answering with frame extraction, sub-agent analysis, and audio transcription
๐ฆ ClawHub
Audio Analyze
High-performance audio transcription and analysis using Gemini 3.1 Pro. Powered by Evolink.ai
๐ฆ ClawHub
OpenRouter Audio
Audio transcription and text-to-speech generation using OpenRouter API. Use when the user needs to transcribe audio files to text or generate speech/audio fr...
๐ฆ ClawHub
musa-torch-coding
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
๐ฆ ClawHub
SlonAide
Query and manage SlonAide voice recording notes - list recordings, get transcriptions and AI summaries.
๐ฆ ClawHub
Subtitle Generator
Generate synchronized subtitles (SRT/VTT/ASS) from video audio with precise timestamps. Use when users need subtitles, captions, or video transcription with...
๐ฆ ClawHub
Meeting Summarizer
Transcribe meetings with SenseAudio ASR speaker diarization, timestamps, and meeting-note extraction workflows. Use when users need meeting transcription, me...
๐ฆ ClawHub
SenseAudio-ASR
Build and troubleshoot SenseAudio speech recognition integrations, including HTTP transcription (`/v1/audio/transcriptions`), realtime WebSocket ASR (`/ws/v1...
๐ฆ ClawHub
Meeting Assistant
็จไบๆๅปบๅๆๆฅ SenseAudio ไผ่ฎฎๅฉๆ๏ผ่ฆ็ๅฎๆถไผ่ฎฎ่ฝฌๅใ่ฏด่ฏไบบๅบๅใๅฎๆถ็ฟป่ฏใไผ่ฎฎ็บช่ฆ็ๆใ่กๅจ้กนๆๅไธ่ฝฌๅฝๅฏผๅบใBuild and troubleshoot SenseAudio meeting assistants for live meeting transcription, speaker-aw...
๐ฆ ClawHub
Speech is Cheap Transcribe
Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps, and customizable output formats.
๐ฆ ClawHub
Funasr Transcribe Skill
Use when the user needs local speech-to-text transcription for audio files, especially Chinese or mixed Chinese-English audio, without relying on cloud trans...
๐ฆ ClawHub
Whisper Transcriber
Offline speech-to-text (ASR) using whisper.cpp (whisper-cli) + ffmpeg. Supports batch transcription, timestamps, SRT/TXT/JSON outputs, and model download. Cr...
๐ฆ ClawHub
Local Voice (FluidAudio TTS/STT)
Local text-to-speech (TTS) and speech-to-text (STT) using FluidAudio on Apple Silicon. Sub-second voice synthesis and transcription running entirely on-device via the Apple Neural Engine. Use when setting up local voice capabilities, voice assistant integration, or replacing cloud TTS/STT services.
๐ฆ ClawHub
Speechall command-line tool for fast speech-to-text transcription using multiple providers
Install and use the speechall CLI tool for speech-to-text transcription. Use when the user wants to: (1) transcribe audio or video files to text, (2) install speechall on macOS or Linux, (3) list available STT models and their capabilities, (4) use speaker diarization, subtitles, or other transcription features from the terminal. Triggers on mentions of speechall, audio transcription CLI, or speech-to-text from the command line.
๐ฆ ClawHub
multimodal-parser
Unified multi-modal content parser for images, PDF, DOCX, audio, auto OCR/transcription, output structured text for LLM processing
๐ฆ ClawHub
DeepGram Speech platform
Command-line tool for fast, accurate speech-to-text transcription from local files, URLs, or live audio using Deepgramโs API with customizable options.
๐ฆ ClawHub
Speech to Text Transcription
Transcribe audio and video files to text with speaker detection, timestamps, and format conversion.
๐ฆ ClawHub
Voice Transcriber
Voice note transcription and archival for OpenClaw agents. Powered by Deepgram Nova-3. Transcribes audio messages, saves both audio files and text transcript...
๐ฆ ClawHub
Listen
Improve transcription accuracy over time. Learn corrections, configure STT.
๐ฆ ClawHub
yap
Fast on-device speech-to-text transcription on macOS 26+ using Apple Speech.framework, supporting multiple languages and output formats without model downloads.
๐ฆ ClawHub
Faster Whisper Local Service
OpenClaw local speech-to-text backend using faster-whisper over HTTP on 127.0.0.1:18790. Use when you want voice transcription without external APIs, without...
๐ฆ ClawHub
Timeless.day Meeting Notes
Query and manage Timeless meetings, rooms, transcripts, and AI documents. Capture podcast episodes and YouTube videos into Timeless for transcription. Use wh...
๐ฆ ClawHub
Whisper Tailnet API
Consume the shared Whisper speech-to-text API over Tailnet at http://100.92.116.99:8765 using OpenAI-compatible audio transcription endpoint (/v1/audio/trans...
๐ฆ ClawHub
openclaw-whisper-voice
Local Whisper speech-to-text for audio files and inbound voice notes on the OpenClaw Gateway host. Use when setting up local transcription for WhatsApp, Tele...
๐ฆ ClawHub
Drug Pronunciation
Provides correct pronunciation guides for complex drug generic names. Generates phonetic transcriptions using IPA and audio generation markers for medical te...
๐ฆ ClawHub
MarkItDown Skill
OpenClaw agent skill for converting documents to Markdown. Documentation and utilities for Microsoft's MarkItDown library. Supports PDF, Word, PowerPoint, Excel, images (OCR), audio (transcription), HTML, YouTube.
๐ฆ ClawHub
MH openai-whisper-api
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
๐ฆ ClawHub
ton
Ton namespace for Netsnek e.U. audio and media processing tools. Handles audio transcription, format conversion, waveform analysis, and podcast production wo...
๐ฆ ClawHub
Faster Whisper Local
Local speech-to-text using faster-whisper. High-performance transcription with GPU acceleration support. Includes word-level timestamps and distilled models....
Page 1 / 2 (90 skills)Next โ