Find the Right AI Skill for Any Job
Browse 476+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — audio
476 skills in "audio" matching "video"
🦀 ClawHub
Audio Video
Expert audio/video processing with ffmpeg and ffprobe. Use when the user needs to convert, compress, edit, analyze, stream, or process any audio or video fil...
🦀 ClawHub
Tomoviee Video Background Music
Generate music tailored to video content. Use when users request video_soundtrack operations or related tasks.
🦀 ClawHub
Summarize
Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).
🦀 ClawHub
video-audio-replace
Replace video audio with TTS voice while preserving original timing. Includes subtitle generation from video using Whisper. Uses ElevenLabs or Edge TTS, alig...
🦀 ClawHub
How To Add Music To Video
Learn how to add music to a video using ClawHub's conversational AI skill. Drop in your footage, name a track or upload an audio file, and the OpenClaw agent h...
🦀 ClawHub
WeChat Video Editor - AI Video Editing for Douyin Xiaohongshu and TikTok
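Supports export in WeChat Channels, Douyin, Xiaohongshu, and TikTok formats. Edit through Chinese-language chat, with no need to open any software. AI video creation and editing: generate videos from text descriptions, edit with background music, sound effects...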
🦀 ClawHub
Bilibili Notion Pipeline Skill
Skill-first Bilibili to Notion pipeline. Download a Bilibili/b23 video, transcribe audio, upload the mp4, create or update a Notion transcript page, write tr...
🦀 ClawHub
Zoom Meeting Assistance Rtms Unofficial Community
Zoom RTMS Meeting Assistant — start on-demand to capture meeting audio, video, transcript, screenshare, and chat via Zoom Real-Time Media Streams. Handles meeting.rtms_started and meeting.rtms_stopped webhook events. Provides AI-powered dialog suggestions, sentiment analysis, and live summaries with WhatsApp notifications. Use when a Zoom RTMS webhook fires or the user asks to record/analyze a meeting.
🦀 ClawHub
Video Chat With Me
Real-time AI video chat that routes through your OpenClaw agent. Uses Groq Whisper (cloud STT), edge-tts (cloud TTS via Microsoft), and OpenClaw chatCompletions API for conversation. Your agent sees your camera, hears your voice, and responds with its own personality and memory. Requires: GROQ_API_KEY for speech recognition. Reads ~/.openclaw/openclaw.json for gateway port and auth token. Data flows: audio → Groq cloud (STT), TTS text → Microsoft cloud (edge-tts), camera frames (base64) + text →
🦀 ClawHub
FFmpeg CLI
Process video and audio using FFmpeg CLI for transcoding, cutting, merging, audio extraction, thumbnails, GIFs, speed, filters, subtitles, and watermarks.
🦀 ClawHub
Yt Dlp Downloader
Download videos from YouTube, Bilibili, Twitter, and thousands of other sites using yt-dlp. Use when the user provides a video URL and wants to download it, extract audio (MP3), download subtitles, or select video quality. Triggers on phrases like "下载视频", "download video", "yt-dlp", "YouTube", "B站", "抖音", "提取音频", "extract audio".
🦀 ClawHub
Skillboss
Swiss-knife for AI agents. 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, emai...
🦀 ClawHub
Video Transcribe - Video to Text
Local video-to-text transcription using OpenAI Whisper for speech recognition. Completely free, runs offline, and protects privacy.
🦀 ClawHub
YouTube ASR Summarize (Local)
Summarize YouTube videos with NO subtitles by doing local ASR (yt-dlp + faster-whisper) and extracting a few screenshot frames via ffmpeg. Use when the user...
🦀 ClawHub
Drone Video Editor
Raw drone footage arrives as flat, ungraded clips with inconsistent horizon lines, abrupt cuts between altitude changes, and ambient wind noise on the audio...
🦀 ClawHub
luci-memory
Search personal video memory — media content (videos, images, keyframes, transcripts) and portrait data (traits, events, relationships, speeches). Use when t...
🦀 ClawHub
Video Subtitles
Generate SRT subtitles from video/audio with translation support. Transcribes Hebrew (ivrit.ai) and English (whisper), translates between languages, burns subtitles into video. Use for creating captions, transcripts, or hardcoded subtitles for WhatsApp/social media.
🦀 ClawHub
AudioPod
Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise reduction, speech-to-text transcription, speaker separation, and media extraction. Use when the user needs to generate music/songs/rap from text, split a song into stems/vocals/instruments, generate speech from text, clean up noisy audio, transcribe audio/video, or extract audio from YouTube/URLs. Requires AUDIOPOD_API
🦀 ClawHub
Crypto Alert
Download YouTube videos and transcribe audio using local Whisper. Use when you need to extract text from YouTube videos that don't have subtitles, or when yo...
🦀 ClawHub
Media Player
Play audio/video locally on the host
🦀 ClawHub
add narration to a video automatically
Generate narration for silent screen-recording videos. Extracts key frames, analyzes on-screen content, writes a presentation-style voiceover script, synthes...
🦀 ClawHub
Video To Text
Convert video or audio files from URLs into text or subtitle formats using a free API with automatic language detection and no local downloads required.
🦀 ClawHub
Speech Therapist Video
Create concise parent-focused videos showcasing your personalized speech therapy approach, family involvement, and child progress to build trust and clarify...
🦀 ClawHub
Video To Text
Video to text converter. Downloads videos from Bilibili using bilibili-api, from other sites using yt-dlp, then transcribes audio using faster-whisper. Use w...
🦀 ClawHub
Youtube Podcast summarizer via Elevenlabs
Transform YouTube videos into podcast-style voice summaries using ElevenLabs TTS
🦀 ClawHub
Ai Video Pipeline
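Conversational AI short-video creation tool. The user proposes an idea → the agent designs a script → a human confirms → the MP4 is produced automatically. Use when the user mentions: (1) making a video or short video, (2) an AI-narrated video, (3) a cognitive self-narration or podcast-style video, (4) turning a script into a video. Do not activate when the user mentions only vague terms such as "video", "TTS", or "voice" (they may need something else).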
🦀 ClawHub
Comfy Story Video
Generate illustrated children's story videos with AI images and TTS narration using ComfyUI running locally.
🦀 ClawHub
NotebookLM Content Creation
Create and monitor NotebookLM Studio content — Audio Overview, Video Overview, Infographics, and Slides — via the notebooklm-mcp-cli. Use when user wants to...
🦀 ClawHub
Picasso TikTok
Full TikTok/Reels video pipeline: script → TTS voiceover (ElevenLabs) → HeyGen talking avatar → auto-subtitles (Whisper) → ffmpeg compose → 1080x1920 final v...
🦀 ClawHub
Lyric Video Maker
Turn your audio tracks and footage into polished lyric videos that captivate viewers from the first beat. This lyric-video-maker skill overlays synchronized,...
🦀 ClawHub
An OpenClaw skill for AI-powered multimedia generation (image, video, audio, 3D) via 170+ RunningHub API endpoints — zero dependencies, pure Python.
Generate images, videos, audio, and 3D models via RunningHub API (170+ endpoints) and run any RunningHub AI Application (custom ComfyUI workflow) by webappId...
🦀 ClawHub
Byt Workflow
YouTube video translation workflow: download audio, launch Doubao, play audio, and capture the translation.
🦀 ClawHub
Youtube Audio Download
Download YouTube video audio and convert to MP3. Supports age-restricted videos with cookies.
🦀 ClawHub
Pub Gemini
Gemini CLI for one-shot Q and A, summaries, and generation. And also 50+ models for image generation, video generation, text-to-speech, speech-to-text, music...
🦀 ClawHub
Seedance Cog
Seedance × CellCog. ByteDance's #1 video model meets the frontier of multi-agent coordination — CellCog orchestrates Seedance with scripting, voice synthesis...
🦀 ClawHub
Ai Video Gen
End-to-end AI video generation - create videos from text prompts using image generation, video synthesis, voice-over, and editing. Supports OpenAI DALL-E, Replicate models, LumaAI, Runway, and FFmpeg editing.
🦀 ClawHub
Church Sermon Video
Your Sunday sermon was recorded on three cameras and a phone. The raw footage is four hours across four files, the audio from the lapel mic is better than th...
🦀 ClawHub
Narrator Ai Cli
Create AI-narrated film/drama commentary videos via CLI. Two workflow paths (Original & Adapted narration), 93 movies, 146 BGM tracks, 63 dubbing voices in 1...
🦀 ClawHub
Boxed FFmpeg
Audio/video information extraction, format conversion, and audio extraction using FFmpeg WASM sandbox.
🦀 ClawHub
Clawhub Skill Content Ingestion
Turn any URL into structured content — YouTube videos (via Gemini Video API), web articles, PDFs, and audio files. Extract transcripts, summaries, and metada...
🦀 ClawHub
B
AI video creation and editing — generate videos from text descriptions, edit with background music, sound effects, titles, transitions, and export finished M...
🦀 ClawHub
A
AI video creation and editing — generate videos from text descriptions, edit with background music, sound effects, titles, transitions, and export finished M...
🦀 ClawHub
Keyapi Tiktok Content Analysis
Analyze TikTok content at scale — extract insights from videos, hashtags, music tracks, and live streams including engagement trends, comment sentiment, capt...
🦀 ClawHub
Keyapi Tiktok Intelligence
Real-time TikTok trend intelligence — monitor trending hashtags, viral music, breakout videos, top-performing ads, and high-growth products to identify emerg...
🦀 ClawHub
video-translation
Translate and dub videos from one language to another, replacing the original audio with TTS while keeping the video intact.
🦀 ClawHub
BibiGPT Skill
BibiGPT CLI for summarizing videos, audio, and podcasts directly in the terminal. Use when the user wants to summarize a URL (YouTube, Bilibili, podcast, etc...
🦀 ClawHub
Avatar
Interactive AI avatar with Simli video rendering and ElevenLabs TTS
🦀 ClawHub
AI Music Video
Generate AI music videos end-to-end. Creates music with Suno (sunoapi.org), generates visuals with OpenAI/Seedream/Google/Seedance, and assembles into music...
Page 1 / 10 (476 skills)