Find the Right AI Skill for Any Job
Browse 141+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.
All Skills — audio
141 skills in "audio" matching "Content"
🌐 Allcodingdevopsapidatabasesecuritydataresearchwritingimage-genvideoaudiotranslationseosocial-mediaemail-marketingadvertisingfinancecrypto-defiecommercelegalhrreal-estatehealtheducationcookingtravelgamingautomationcommunicationproductivityclawhublobehubdifymcp
🦀 ClawHub
Douyin Content Tracker Skill
This skill should be used when the user wants to scrape Douyin (TikTok China) creator content, download audio, and transcribe it with Whisper. Covers first-t...
🦀 ClawHub
Linkedin Writer 1.0.0
Writes LinkedIn posts that sound like a real person, not a content mill
🦀 ClawHub
Content Repurposer
Turn one piece of content into 10+ formats. Transform blog posts, podcasts, videos, or talks into tweets, LinkedIn posts, newsletters, carousels, and more.
🦀 ClawHub
Video Analyzer (TikTok + YouTube + Instagram)
Analyze videos from TikTok, YouTube, Instagram, Twitter, and others by URL, transcribing audio locally and answering questions about the content.
🦀 ClawHub
AI Content Repurposer Pro
Automatically convert long-form videos, blogs, and podcasts into platform-optimized social media scripts, threads, summaries, and transcripts.
🦀 ClawHub
Link Library
Personal knowledge base that captures web content (articles, tweets/threads, videos, podcasts, images, PDFs) and makes it retrievable for future conversation...
🦀 ClawHub
Edge TTS
Text-to-speech conversion using node-edge-tts npm package for generating audio from text.
Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation.
Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
🦀 ClawHub
Vibe Marketing
Run marketing campaigns with AI automation. Covers content generation, workflow automation, copy that sounds human, and rapid testing.
🦀 ClawHub
Content Claw
Automated content generation engine. Transform source material (papers, podcasts, case studies) into platform-ready content using recipes and brand graphs. U...
⭐ GitHub
Showtimes
Transcribes and summarizes audio content.
🦀 ClawHub
Podcast Generator
Convert articles, blog posts, or any text into professional podcast scripts and TTS audio. Use when a user wants to: (1) Transform written content into conve...
🦀 ClawHub
ToneClone CLI
Write in the user's authentic voice using ToneClone. Generate emails, messages, social posts, and other content that sounds like the user — not generic AI. U...
🦀 ClawHub
IMA Studio All-in-One — Image, Video, Music, SeeDream, Veo, Suno. Banana
Most comprehensive AI content creation platform with unified access to all leading models across images (SeeDream 4.5, Midjourney, Nano Banana 2, Nano Banana...
🦀 ClawHub
Ai Humanizer
Rewrites AI-generated content to sound natural, human, and undetectable. Removes robotic patterns, adds voice variety, and preserves meaning.
🦀 ClawHub
Audio Reply
Generate audio replies using TTS. Trigger with "read it to me [public URL]" to fetch and read content aloud, or "talk to me [topic]" to generate a spoken res...
🦀 ClawHub
Voice-Matched Content System
Extract someone's authentic writing voice from samples, build a complete Voice DNA profile, then generate content that sounds like them — not AI. Covers conf...
🦀 ClawHub
Ai Content Detection
Use this skill whenever a user wants to verify whether content (text, images, audio, video, or documents) was created by AI; detect deepfakes or AI-synthesiz...
🦀 ClawHub
Transcribe
Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.
🦀 ClawHub
Audio Content Generator
Generate audiobooks, podcasts, or educational audio content on demand. User provides an idea or topic, Claude AI writes a script, and ElevenLabs converts it to high-quality audio. Supports multiple formats (audiobook, podcast, educational), custom lengths, and voice effects. Use when asked to create audio content, make a podcast, generate an audiobook, or produce educational audio. Returns MP3 audio file via MEDIA token.
🦀 ClawHub
short-video-content-replicator
一键端到端短视频内容复制工作流。输入抖音/B站视频URL或本地视频目录,严格按6步顺序执行:1. link-resolver-engine 下载视频;2. mp4-to-mp3-extractor 提取MP3;3. purevocals-uvr-automator 提取干声;4. turbo-whisper-lo...
🦀 ClawHub
XRepl AI - Tweet Generator
Generate, schedule, and publish tweets in your voice using AI. Browse viral content, manage preferences, and track billing.
🦀 ClawHub
X Article Reader
Read X (Twitter) Articles aloud using macOS text-to-speech. Accepts an X Article URL and reads the content out loud. Automatically detects Chinese vs English...
🦀 ClawHub
Simple sound-to-text skill locally
Local speech-to-text using OpenAI Whisper. Use when the user needs to: (1) transcribe audio files to text, (2) convert voice messages to written content, (3)...
🦀 ClawHub
Text to Song
AI music generation assistant powered by MakebestMusic. Use when user wants to create AI-generated music, songs, or audio tracks. Perfect for content creator...
🦀 ClawHub
milady
Embody and create content in the Network Spirituality aesthetic — the Remilia/Milady cultural movement blending Y2K net art, anime, cyber-spiritualism, and post-ironic sincerity. Use when creating art descriptions, writing in this voice, engaging with Wired aesthetics, or channeling the Remilia collective energy.
🦀 ClawHub
Glasses to Social
Turn smart glasses photos into social media posts. Monitors a Google Drive folder for new images from Meta Ray-Ban glasses (or any smart glasses), analyzes them with vision AI, drafts tweets/posts in the user's voice, and publishes on approval. Use when setting up a glasses-to-social pipeline, processing smart glasses photos for social media, or creating hands-free content workflows.
⭐ GitHub
Fliki
Create text to video and text to speech content with ai powered voices in minutes.
🦀 ClawHub
Voiceover App
Turn silent footage into compelling, broadcast-ready content with the voiceover-app skill. Built for content creators, educators, and video producers, this s...
🦀 ClawHub
Firm Platform Audit Pack
Platform alignment audit pack for OpenClaw 2026.2. Secrets v2, agent routing, voice security, trust model, autoupdate, plugin SDK, content boundaries, and sq...
🦀 ClawHub
Firm Spec Compliance Pack
MCP 2025-11-25 specification compliance audit pack. Validates elicitation, tasks, resources/prompts, audio content, JSON Schema 2020-12, SSE transport, and i...
🦀 ClawHub
Akashic Doc Analyzer
Parse, analyze, and extract content from documents (PDF, DOCX, PPTX, audio). Supports OCR, table extraction, and semantic chunking.
🦀 ClawHub
Add Music To Video For Free
Turn raw, silent footage into polished, emotionally engaging content by using this skill to add-music-to-video-for-free — no subscriptions, no watermarks, no...
🦀 ClawHub
LinkedIn Writer
Writes LinkedIn posts that sound like a real person, not a content mill
🦀 ClawHub
Bilibili Downloader
Download videos, audio, subtitles, and covers from Bilibili using bilibili-api. Use when working with Bilibili content for downloading videos in various qual...
🦀 ClawHub
XReplyAI - Social Post Manager
Generate, schedule, and publish posts to X and LinkedIn in your voice using AI. Browse viral content, manage preferences, and track billing.
🦀 ClawHub
Content Humanizer
Makes AI-generated content sound genuinely human — not just cleaned up, but alive. Use when content feels robotic, uses too many AI clichés, lacks personalit...
🦀 ClawHub
Fliz AI Video Generator
Complete integration guide for the Fliz REST API - an AI-powered video generation platform that transforms text content into professional videos with voiceovers, AI-generated images, and subtitles.
Use this skill when:
- Creating integrations with Fliz API (WordPress, Zapier, Make, n8n, custom apps)
- Building video generation workflows via API
- Implementing webhook handlers for video completion notifications
- Developing automation tools that create, manage, or translate videos
- Troubleshoot
🦀 ClawHub
clawdio
Analyze Twitter Spaces and voice conversations to extract market intelligence, crypto alpha, sentiment analysis, and speaker-attributed insights. Transforms spoken audio into structured reports, full transcripts, and machine-readable metadata. Use when you need intelligence from Twitter Spaces, podcast discussions, or any long-form voice content — especially for crypto markets, AI trends, and expert commentary that only exists in audio.
🦀 ClawHub
clip-editor
Video clip editing skill for automatically analyzing video content and generating CapCut draft templates. Uses local Whisper for speech transcription, Qwen-V...
🦀 ClawHub
Human Voice Content Editor
Audit and rewrite content to remove AI-generated feel by stripping markdown artifacts, eliminating AI vocabulary patterns, flagging hallucination risks, and...
🦀 ClawHub
Ai Powered Content Repurposing
Transform blog posts, articles, and long-form content into multiple formats—videos, podcasts, social media posts, and summaries. Use when the user needs cont...
🦀 ClawHub
🗣️ Edge-TTS Skill using uvx
Text-to-speech conversion using `uvx edge-tts` for generating audio from text.
Use when:
(1) User requests audio/voice output with the "tts" trigger or keyword.
(2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking).
(3) User wants a specific voice, speed, pitch, or format for TTS output.
🦀 ClawHub
Research Brief Generator
Generates a comprehensive, structured research brief on any topic, person, case, or event. Ideal for journalists, podcasters, writers, and content creators w...
🦀 ClawHub
Douyin Content Tracker Skill
Scrapes Douyin creator videos, downloads audio (Playwright+ffmpeg with yt-dlp fallback), and transcribes with Whisper. Covers setup, daily tracking, cookie m...
🦀 ClawHub
Greg Eisenberg
Generate content ideas, business strategies, and startup concepts in the style of Greg Eisenberg (Startup Ideas Podcast). Use when brainstorming product idea...
🔧 Dify
Podcast Generator (Dify)
**Podcast Generator** is a powerful tool for creating podcast audio files using Text-to-Speech (TTS) services. This tool can generate a podcast with alternating voices by providing a script, making it ideal for dialogue-based content, interviews, or storytelling. Powered by OpenAI-based TTS services, Podcast Generator simplifies the production of high-quality audio content. Currently this tool sup
🔧 Dify
Aws (Dify)
**Author:** aws **Type:** Tool The AWS Tools plugin provides a comprehensive set of tools based on various AWS services, enabling you to leverage AWS capabilities directly within your Dify applications. These tools cover a wide range of functionalities including content moderation, text reranking, text-to-speech conversion, speech recognition, and more. The AWS Tools plugin includes the following
🦀 ClawHub
noteboklm
Complete Google NotebookLM integration — add sources, ask questions, generate all Studio content (podcast, video, slide deck, quiz, flashcards, infographic,...