BytesAgainBytesAgain

Find the Right AI Skill for Any Job

Browse 476+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

Browse by Use Case →Pick My Role

All Skills — audio

476 skills in "audio" matching "video"

🦀 ClawHub
Fcpx Assistant
Final Cut Pro X (FCPX) assistant — auto video production, TTS voiceover, media management, batch export | AI 自动成片、TTS 配音、素材管理、批量导出. Triggers: FCPX, FCP, Fina...
🦀 ClawHub
Skillboss
Swiss-knife for AI agents. 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, emai...
🦀 ClawHub
Ai Video Gen 1.0.0
End-to-end AI video generation - create videos from text prompts using image generation, video synthesis, voice-over, and editing. Supports OpenAI DALL-E, Re...
🦀 ClawHub
Ai Video Lip Sync Free
Tell me what you need and I'll sync your video's lip movements to any audio track — no expensive software required. This ai-video-lip-sync-free skill analyze...
🦀 ClawHub
Canva Ai Video Editor
Tired of spending hours cutting clips, writing captions, and hunting for the right music in Canva? The canva-ai-video-editor skill automates the tedious part...
🦀 ClawHub
Alibabacloud Video Translation
Alibaba Cloud IMS (Intelligent Media Services) based video translation Skill. Supports subtitle extraction (ASR/OCR), translation, and speech synthesis trans...
🦀 ClawHub
Speech Language Pathologist Video
Creates short videos for speech-language pathologists to explain evaluation, therapy, and family coaching for pediatric and adult communication development.
🦀 ClawHub
Vidu API comic strip short film generation capability, with built-in AI-generated videos, images, and TTS.
将用户创意或剧本转化为完整动漫成片,从剧本创作到自动拼接全流程使用 Vidu API 完成生图、生视频与 TTS,且禁止使用任何非 Vidu 模型。在用户需要制作动漫/动画短片、提供创意主题或详细剧本需求时使用;依赖 ffmpeg 与已配置的 Vidu API 凭证。
🦀 ClawHub
Video Ad Creator
Create fully produced, platform-optimized video ads from text briefs, including scripts, voiceovers, visuals, captions, CTAs, and export-ready formats.
🦀 ClawHub
BookMorph Magic
Orchestrate book-to-content workflows to generate video, audio, cover images, and a manifest for episode or campaign packages.
🦀 ClawHub
Podcast Video
Create 45-90 second podcast trailer and highlight videos that showcase key moments, guest insights, and your show's core topic to attract new listeners.
🦀 ClawHub
cutmv
Video processing tool using FFmpeg for cutting, format conversion, compression, frame/audio extraction, watermarking, and subtitle addition.
🦀 ClawHub
Listenhub
Explain anything — turn ideas into podcasts, explainer videos, or voice narration. Use when the user wants to "make a podcast", "create an explainer video",...
🦀 ClawHub
Google Gemini Media
Use the Gemini API (Nano Banana image generation, Veo video, Gemini TTS speech and audio understanding) to deliver end-to-end multimodal media workflows and code templates for "generation + understanding".
🦀 ClawHub
Aliyun Mps Video Translation
Use when creating or managing Alibaba Cloud IMS video translation jobs via OpenAPI (subtitle/voice/face). Use when you need API-based video translation, stat...
🦀 ClawHub
UGC Factory
AI-powered video and content generation pipeline with script writing, TikTok automation, YouTube analysis, media library, avatars, and voice synthesis — buil...
🦀 ClawHub
小红书视频下载器
Download and summarize Xiaohongshu (小红书/RedNote) videos. Produces a full resource pack with video, audio, subtitles, transcript, and AI summary. This skill s...
🦀 ClawHub
Aliyun Modelstudio Entry Test
Use when running a minimal test matrix for the Model Studio skills that exist in this repo, including image/video/audio, realtime speech, omni, visual reason...
🦀 ClawHub
AI UGC
Call the RawUGC API to generate AI videos/images/music, manage content (personas, products, styles, characters), schedule social media posts, research TikTok...
🦀 ClawHub
Music School Video
Helps music schools create short videos showcasing programs, outcomes, and testimonials to attract parents and students.
🦀 ClawHub
Aliyun Wan Digital Human
Use when generating talking, singing, or presentation videos from a single character image and audio with Alibaba Cloud Model Studio digital-human model `wan...
🦀 ClawHub
Aliyun Emo
Use when generating expressive portrait videos from a person image and speech audio with Alibaba Cloud Model Studio EMO (`emo-v1`). Use when creating non-Wan...
🦀 ClawHub
douyin-research-kit
Extract and analyze Douyin (抖音) content using yt-dlp. Supports video metadata, caption extraction, user profile analysis, music/sound info, and engagement st...
🦀 ClawHub
Aliyun Modelstudio Entry
Use when routing Alibaba Cloud Model Studio requests to the right local skill (Qwen text, coder, deep research, image, video, audio, search and multimodal sk...
🦀 ClawHub
AI UGC Videos
Generate fully produced UGC-style video ads with AI-driven scripts, real visuals, voiceovers, and campaign strategy for Facebook, TikTok, and Instagram.
🦀 ClawHub
Echoic Memory
Distill a beloved person who has left your life into an AI Skill. Import chat history, photos, videos, voice memos, and social media to preserve their person...
🦀 ClawHub
asr-skill
This skill should be used when the user asks to "transcribe audio", "transcribe video", "convert speech to text", "generate subtitles", "create captions", "i...
🦀 ClawHub
corespeed-studio
Generate video, images, audio, and music using 40+ AI models via fal.ai. Use for video generation (Kling v3, Sora 2, Veo 3.1, LTX 2.3, Pixverse v5), image ge...
🦀 ClawHub
vargai
Generate AI videos, images, speech, and music using varg. Use when creating videos, animations, talking characters, slideshows, product showcases, social con...
🦀 ClawHub
Lip Sync Video
Turn raw footage into polished lip-sync-video content where every word lands exactly when mouths move. This skill analyzes audio waveforms alongside facial m...
🦀 ClawHub
Ai Music Generator Free Ab Old
Tired of searching royalty-free music libraries only to find tracks that don't quite fit your video's mood? The ai-music-generator-free skill creates origina...
🦀 ClawHub
Augent
The audio & video layer for agents. 22 local MCP tools. No cloud, no API keys.
🦀 ClawHub
Ai Content Repurposer
Convert long-form content like videos, blogs, and podcasts into optimized short scripts, threads, posts, transcripts, and summaries for multiple platforms.
🦀 ClawHub
Ai Video Gen Temp
End-to-end AI video generation - create videos from text prompts using image generation, video synthesis, voice-over, and editing. Supports OpenAI DALL-E, Re...
🦀 ClawHub
Minimax Tools
Direct MiniMax API integration for speech synthesis (TTS), voice cloning, image generation, video generation, and music generation using local Python scripts...
🦀 ClawHub
Core Speed Art
Generate video, images, audio, and music using 40+ AI models via fal.ai. Use for video generation (Kling v3, Sora 2, Veo 3.1, LTX 2.3, Pixverse v5), image ge...
🦀 ClawHub
FFBox
FFBox multimedia transcoding tool integration. FFmpeg-based GUI for video/audio/image format conversion, compression, filtering, batch media processing with...
🦀 ClawHub
Video Reader
Tool-driven video question answering with frame extraction, sub-agent analysis, and audio transcription
🦀 ClawHub
AI Dance Video Generator
Generate AI dance videos where characters move to music or choreography templates using Media.io OpenAPI. Creates dynamic, rhythmic dance animations. AI danc...
🦀 ClawHub
IMA AI Text To Speech — seed-tts, DouBao
Convert text, scripts, and captions into natural voiceovers for videos, explainers, product demos, and social posts.
🔌 MCP
cnghockey/sats-for-ai
[![sats4ai MCP server](https://glama.ai/mcp/servers/@cnghockey/sats4ai/badges/score.svg)](https://glama.ai/mcp/servers/@cnghockey/sats4ai) 📇 ☁️ - Bitcoin-powered AI tools via Lightning Network micropayments (L402). Image, text, video, music, speech synthesis & transcription, vision, OCR, 3D model ge
🦀 ClawHub
Video Caption Generator
The video-caption-generator skill transcribes spoken audio from your video and burns accurate, readable captions directly into the output file. Upload any cl...
🦀 ClawHub
Veo Video Generator
Generates high-fidelity 1080p videos with synced audio using Google Veo 3.1. Use for creating cinematic clips from text descriptions.
🦀 ClawHub
Lip Sync
Guide users to VideoAny Lip Sync Studio to create lip-sync videos from an image and audio.
🦀 ClawHub
Audio Script Writer
Convert written medical content into podcast or video scripts optimized for audio delivery. Transforms academic papers, reports, and educational materials in...
🦀 ClawHub
Video Maker Free
Make videos for free using AI — combine photos, text, and video clips into polished content with transitions, music, voiceover, subtitles, and effects. NemoV...
🦀 ClawHub
Ai Video Slideshow Maker
Create stunning photo and video slideshows with music using AI — transform photo collections into cinematic video stories with Ken Burns motion effects, beat...
🦀 ClawHub
AI Music Video
Generate AI music videos end-to-end. Creates music with Suno (sunoapi.org), generates visuals with OpenAI/Seedream/Google/Seedance, and assembles into music...
← PrevPage 2 / 10 (476 skills)Next →