BytesAgainBytesAgain

Find the Right AI Skill for Any Job

Browse 23+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

Browse by Use Case β†’Pick My Role

All Skills β€” audio

23 skills in "audio" matching "photo"

πŸ¦€ ClawHub
Echoic Memory
Distill a beloved person who has left your life into an AI Skill. Import chat history, photos, videos, voice memos, and social media to preserve their person...
πŸ¦€ ClawHub
NoteTaker Pro
AI-powered note-taking assistant that captures, cleans, tags, organizes, and indexes text, voice, paste, and photo notes for easy search and recall.
πŸ¦€ ClawHub
Video Maker Free
Make videos for free using AI β€” combine photos, text, and video clips into polished content with transitions, music, voiceover, subtitles, and effects. NemoV...
πŸ¦€ ClawHub
Ai Video Slideshow Maker
Create stunning photo and video slideshows with music using AI β€” transform photo collections into cinematic video stories with Ken Burns motion effects, beat...
πŸ¦€ ClawHub
JoyIn Robot Control
Control JoyIn AI robots (W-1 Walle / M-1 Mini) β€” movement, follow, photo, video, live stream, TTS, agent config, and device status via OpenAPI.
πŸ¦€ ClawHub
Flyworks Avatar Video
Generate videos using Flyworks (a.k.a HiFly) Digital Humans. Create talking photo videos from images, use public avatars with TTS, or clone voices for custom audio.
πŸ¦€ ClawHub
China Doc Ocr
智能文摣OCRθ―†εˆ«δΈŽη»“ζž„εŒ–ζε–γ€‚Use when the user has a complex document, PDF, scanned image, photo, invoice, receipt, ID card, table, or chart that needs to be recognized a...
πŸ¦€ ClawHub
Glasses to Social
Turn smart glasses photos into social media posts. Monitors a Google Drive folder for new images from Meta Ray-Ban glasses (or any smart glasses), analyzes them with vision AI, drafts tweets/posts in the user's voice, and publishes on approval. Use when setting up a glasses-to-social pipeline, processing smart glasses photos for social media, or creating hands-free content workflows.
πŸ¦€ ClawHub
Telegram Media
Send generated charts, photos, documents, and ElevenLabs TTS voice clips securely through Telegram using executed shell commands.
πŸ¦€ ClawHub
Article TTS
ζ‹η…§ζˆ–ζ–‡ε­—θ½¬ιŸ³ι’‘οΌšζ–‡η« η…§η‰‡ OCR ζε–ζ–‡ε­—οΌŒζˆ–η›΄ζŽ₯ζŽ₯ζ”Άζ–‡ε­—οΌŒη”Ÿζˆ Microsoft Edge TTS θ―­ιŸ³οΌŒζ”―ζŒδΈ­θ‹±ζ–‡γ€θ‡ͺεŠ¨θ½¬ε†™γ€θ―­ι€Ÿθ°ƒθŠ‚γ€ι€ε₯ζ‹†εˆ†γ€‚| Capture article photos (OCR) or plain text, generate natural audio via Edge TT...
πŸ¦€ ClawHub
Ai Talking Photo
Bring any still photo to life with ai-talking-photo, the skill that syncs facial animation to audio and makes portraits speak, sing, or narrate. Upload a fac...
πŸ¦€ ClawHub
Bean Whisperer
Generate espresso brew profiles for GaggiMate Pro on Rancilio Silvia. Use when the user provides a coffee bean (photo or name) and wants a brewing profile cr...
πŸ¦€ ClawHub
Craft Habit
Build sustainable creative practice routines for artistic skills. Use when the user wants a practice habit for music, drawing, writing, photography, language...
πŸ¦€ ClawHub
notetaker-pro
AI note-taking assistant that captures, cleans, organizes, tags, and indexes text, voice, paste, and photo inputs for instant, searchable notes.
πŸ¦€ ClawHub
Vision Recognition Ocr
Vehicle/animal/plant recognition plus OCR for screenshots, photos, invoices, and tables. Use when users ask θ―†εˆ«θ½¦εž‹/ηœ‹ε›Ύθ―†εˆ«/提取文字/OCR. Supports local path, URL, and...
πŸ¦€ ClawHub
Veed Fabric
Generate talking head videos from a photo using VEED Fabric 1.0. Triggers on mentions of "veed", "fabric", or "talking video". Turns a headshot + audio or te...
πŸ¦€ ClawHub
AI Influencer
Create an AI clone video (talking head) from a single reference photo, a text script, and a cloned voice. Automates the pipeline of image generation (Gemini)...
πŸ¦€ ClawHub
Farmos Observations
Query and create field observations and AI-processed captures. Photos, voice notes, and text notes from the field.
πŸ¦€ ClawHub
Ai Media
Generate photorealistic images, videos, talking heads, and natural TTS audio using GPU-accelerated AI models and scripts on a remote server.
πŸ¦€ ClawHub
Record screen, microphone or camera from macOS terminal
macOS CLI tool to record microphone audio, screen video or screenshot, and camera video or photo from the terminal with device listing and output control.
πŸ¦€ ClawHub
Ai Reel Creator
Generate Instagram Reels, TikToks, and YouTube Shorts from any input with AI β€” text prompts, blog posts, product photos, raw clips, audio files, or just an i...
πŸ¦€ ClawHub
Test Skill
Headless creative production studio for AI agents. Generate images, edit photos, create videos, produce voiceover/music/SFX, and assemble polished output via...
πŸ¦€ ClawHub
Test Skill 3
Headless creative production studio for AI agents. Generate images, edit photos, create videos, produce voiceover/music/SFX, and assemble polished output via...