BytesAgain

Find the Right AI Skill for Any Job

Browse 576+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

All Skills — image-gen

576 skills in "image-gen"

🦀 ClawHub
SAA Agent
Enables AI agents to generate images using the Character Select Stand Alone App (SAA) image generation backend via command-line interface.
🦀 ClawHub
Image Generator
AI image generation skill using DALL-E, Stable Diffusion, or Midjourney API. Generate, edit, and vary images from text prompts.
GitHub
Software Development Resources for Data Scientists
_Data scientists concentrate on making sense of data through exploratory analysis, statistics, and models. Software developers apply a separate set of knowledge with different tools. Although their focus may seem unrelated, data science teams can benefit from adopting software development best practices. Version control, automated testing, and other dev skills help create reproducible, production-ready code and tools._
🦀 ClawHub
vision-skill
Use this skill for computer vision tasks including image recognition (OCR, object detection) and image generation (text-to-image, image-to-image). Supports a...
🦀 ClawHub
corespeed-nanobanana
Generate and edit images using Google Gemini models via Corespeed AI Gateway. Supports text-to-image generation, image editing, multi-image input, and text r...
🔌 MCP
evalstate/mcp-hfspace
📇 ☁️ - Use HuggingFace Spaces directly from Claude. Use Open Source Image Generation, Chat, Vision tasks and more. Supports Image, Audio and text uploads/downloads.
🦀 ClawHub
Agent Selfie
AI agent self-portrait generator. Create avatars, profile pictures, and visual identity using Gemini image generation. Supports mood-based generation, season...
GitHub
Hugging Face Diffusion Models Course
Python materials for the online course on diffusion models by [@huggingface](https://github.com/huggingface).
🦀 ClawHub
Comfyui-Api
Connects to a ComfyUI server to generate images from prompts, auto-detects URLs, translates Chinese prompts, and supports REST and WebSocket APIs.
🦀 ClawHub
Nano Banana Pro (Morfeo)
Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K default, 2K, or 4K for high resolution). DO NOT read the image file
🦀 ClawHub
Phosor AI
Generate AI videos (text-to-video, image-to-video) with optional custom LoRA styles via the Phosor AI platform. Supports importing images and LoRA models fro...
🦀 ClawHub
comfyui-runner
Start/stop/status for a ComfyUI instance.
🦀 ClawHub
Comfyui anfrage
Send a workflow request to ComfyUI and return image results.
🦀 ClawHub
AI Image Generation
Use this skill as an entry point to discover, select, and fetch specific integration parameters for all supported AI image generation models.
GitHub
Stable Horde
A crowdsourced distributed cluster of Stable Diffusion workers.
GitHub
DiffusionDB
A list of all public apps, developer tools, guides and plugins for Stable Diffusion. [Airtable version](https://airtable.com/shr0HlBwbw3nZ8Ht3/tblxOCylXV8ynh7ti).
🦀 ClawHub
firstskill
Creating algorithmic art using p5.js with seeded randomness and interactive parameter exploration. Use this when users request creating art using code, gener...
🦀 ClawHub
Baoyu Danger Gemini Web
Generates images and text via reverse-engineered Gemini Web API. Supports text generation, image generation from prompts, reference images for vision input,...
🦀 ClawHub
Comfy UI Complete Toolkit
Portable ComfyUI workflow and API guidance for any install. Use when building, validating, or troubleshooting ComfyUI image/video workflows, discovering avai...
🦀 ClawHub
Ai Video Gen Temp
End-to-end AI video generation - create videos from text prompts using image generation, video synthesis, voice-over, and editing. Supports OpenAI DALL-E, Re...
🦀 ClawHub
Minimax Tools
Direct MiniMax API integration for speech synthesis (TTS), voice cloning, image generation, video generation, and music generation using local Python scripts...
🦀 ClawHub
Universe explorer
Provide clear, structured explanations of cosmic structures, celestial bodies, fundamental physics, space exploration, and the universe's future based on cur...
🦀 ClawHub
Multimodal Generate Image
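AI-powered image generation and editing tool for producing high-quality product images. Use when the user asks to generate images, create images, edit photos, do text-to-image or image-to-image, change backgrounds, change styles, replace objects in an image, composite a product into a scene, swap models, or create any kind of AI-generated visual content, AI drawing, image generation, text-to-image, image-to-i...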
🦀 ClawHub
LoRa CAD air scanner
LoRa Channel Activity Detection (CAD) scanner for LilyGo T3 v1.6 (ESP32-PICO-D4 + SX1276) with HackRF One support. Scans a configurable frequency range using...
🦀 ClawHub
Text to Image API
Generate AI images from text descriptions using Media.io OpenAPI. Provide a text prompt and receive a high-quality AI-generated image. Supports multiple mode...
🦀 ClawHub
krea
Generate images, videos, upscale images, and train LoRA styles with the Krea.ai API using customizable models and parameters.
🦀 ClawHub
Ecm Perchance Image
Generates images from text prompts using Perchance.org API with options for orientation and multiple image support.
🦀 ClawHub
bozo-writer
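A tool for expanding and imitating short-video voice-over scripts. Designed for self-media creators in the AI field: given AI-themed content or an article, it expands the material into a short-video voice-over script following an 8-step core structure (breakthrough opening → contrast setup → core twist → in-depth analysis → value elevation → key pain point → method offered → punchline ending). Suitable for content on AI fundamentals, Lora training, Comfyui usage, API calls, AI programming, agent building, and similar topics. T...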
🦀 ClawHub
Algorithmic Art.Blocked
Creating algorithmic art using p5.js with seeded randomness and interactive parameter exploration. Use this when users request creating art using code, gener...
🦀 ClawHub
NSFW Video Generation — Adult Creative AI Video Models
Generate AI videos for mature creative projects using Wan 2.2 Spicy (LoRA-tuned for NSFW, top recommended), Wan 2.6, Seedance 1.5, Vidu Q3-Pro, and other mod...
🦀 ClawHub
Lora Finetune
LoRA fine-tuning pipeline for Stable Diffusion on Apple Silicon — dataset prep, training, evaluation with LLM-as-judge scoring. Use when fine-tuning image ge...
🦀 ClawHub
Vultr Inference
Generate images and text using Vultr Inference API. Supports Flux image generation and various LLMs for text. Use when user wants to generate images, artwork...
🦀 ClawHub
Agent Selfie Backup
AI agent self-portrait generator. Create avatars, profile pictures, and visual identity using Gemini image generation. Supports mood-based generation, season...
🦀 ClawHub
Dogfood
Systematically explore and test a web application to find bugs, UX issues, and other problems. Use when asked to "dogfood", "QA", "exploratory test", "find i...
🦀 ClawHub
GLM-V-Prompt-Gen
Analyze images/videos and generate professional prompts for text-to-image and text-to-video AI tools (Midjourney, Stable Diffusion, DALL-E, Sora, Runway, Kli...
🦀 ClawHub
Yollomi AI Image & Video Generator
AI image generator skill (image, image generation). Multi-model image generator for Yollomi to generate AI images via one unified API endpoint. Requires YOLL...
🦀 ClawHub
Ai Image Skills
Build and execute skills.video image generation REST requests from OpenAPI specs. Use when user needs to create, debug, or document image generation calls on...
🦀 ClawHub
Imagen 4 AI Image Generator
Generate high-quality AI images using Google Imagen 4 via Media.io OpenAPI. Produces photorealistic, detailed images from text prompts with advanced text ren...
🦀 ClawHub
Nano Banana 2 Image Generator
Generate AI images using Nano Banana Pro via Media.io OpenAPI. State-of-the-art image quality with advanced reasoning, multi-image fusion, character consiste...
🦀 ClawHub
aesthetic-copilot
Use when the user wants to generate high-fidelity PROMPTS for Text-to-Image models (Flux, Ideogram, Midjourney) based on vague layout/content descriptions.
🦀 ClawHub
AI Image Generator
Generate AI images from text prompts — one API key for GPT Image, Gemini, Seedream, and 10+ models. No juggling subscriptions. Images saved to your YouMind k...
🦀 ClawHub
Chanjing Text To Digital Person
Use Chanjing text-to-digital-person APIs for AI portraits, talking videos, optional LoRA training, polling, and explicit downloads when requested.
🦀 ClawHub
Optimize text-to-image prompts for Grok and similar image models, especially for aviation posters, safety campaign visuals, official publicity images, and Chinese-language contest briefs.
Optimize text-to-image prompts for Grok and similar image models. Use when the user wants better image generation prompts, poster prompts, competition-grade...
🦀 ClawHub
SGLang-Diffusion Video Generation
Generate videos using a local SGLang-Diffusion server (Wan2.2, Hunyuan, FastWan, etc.). Use when: user asks to generate, create, or render a video with a loc...
🦀 ClawHub
Og Image Skill
Generate OG images with AI via the Neta AI image generation API (free trial at neta.art/open).
🦀 ClawHub
Midjourney Prompt Architect
Generate detailed, creative, and optimized prompts for Midjourney and other AI image generation tools (Stable Diffusion, DALL-E, Flux). Covers style specific...
🦀 ClawHub
Ai Image Generation Skills
AI video and image generation. 40+ models, including Sora, Veo 3, Kling, Seedance, GPT Image, Hailuo, and WAN. Text-to-video, image-to-video, text-to-image, image-to-image.
🦀 ClawHub
x402 Creative Resources
Access Xona's x402 creative resource APIs on api.xona-agent.com. Includes creative director (design research), image generation (nano-banana, seedream, grok-...
Page 3 / 12 (576 skills)