Browse AI Agent Skills | BytesAgain

🎁 Get the FREE AI Skills Starter Guide — Subscribe →

All Skills — image-gen

145 skills in "image-gen"

LoRA fine-tuning pipeline for Stable Diffusion on Apple Silicon — dataset prep, training, evaluation with LLM-as-judge scoring. Use when fine-tuning image ge...

Free Image Editing

Skip the learning curve of professional editing software. Describe what you want — remove the background, adjust brightness, and add text overlay — and get e...

Yearbook Photo Skill

Generate ai yearbook photo generator images with AI via the Neta AI image generation API (free trial at neta.art/open).

Antigravity Image Generator

Generate images using the internal Google Antigravity API (Gemini 3 Pro Image). High quality, native generation without browser automation.

One MCP gateway to 230+ AI tools — SEO, web search, image generation, video, screenshots, security scanning, and more. Auto-provisions on first use with no A...

Evolink Image — AI Image Generation (GPT Image, Nano Banana 2, Seedream, GPT-4o)

AI image generation & editing — GPT Image, GPT-4o, Nano Banana 2, Seedream, Qwen, WAN, Gemini. Text-to-image, image-to-image, inpainting. 20 models, one API...

Send requests to the dr.eamer.dev LLM API for chat completions, vision analysis, image generation, text-to-speech, and video generation across 12 model provi...

Vidu API supports text-based images, reference images, and image editing.

Vidu AI 图片生成。支持 Nano 生图、Vidu 参考生图。对话式调用，自动识别意图。

Generate an image from a text prompt through the Hugging Face Inference API using stabilityai/stable-diffusion-xl-base-1.0 and the HUGGINGFACE_TOKEN environm...

Generate advertising images automatically from a product URL + brand profile. ✅ USE WHEN: - User provides a product URL (e-commerce link) - Want automated product scraping + image generation - Have a brand profile to apply (70+ brands available) - Need funnel-stage targeting (awareness/consideration/conversion) - Want AI to auto-select model, scene, lighting based on brand ❌ DON'T USE WHEN: - User provides local product image file → use morpheus-fashion-design - Don't need a person in the imag

Comfyui Workflow

Universal ComfyUI workflow executor with 33+ workflow templates. Self-describing — use --inspect on ANY workflow to discover inputs and outputs automatically...

Convert PDFs into structured Markdown filesystems and hydrate them into your workspace for exploration with standard Unix tools

Fetch handwritten notes, sketches, and drawings from a reMarkable tablet via Cloud API (rmapi). Process content by refining artwork with AI image generation, extracting handwritten text to memory/journal, or using sketches as input for other workflows. Use when working with reMarkable tablet content, syncing handwritten notes, processing sketches, or integrating tablet drawings into projects.

Generate images with Google Gemini 3.1 Flash Image Preview (Nano Banana 2) via inference.sh CLI. Capabilities: text-to-image, image editing, multi-image inpu...

Call Fetch.ai Agentverse agents by address. Search the Agentverse marketplace, browse a curated catalog of top agents (Tavily Search, ASI1-Mini, DALL-E 3, Te...

Dnd Character Skill

Generate dnd character art generator images from text descriptions via the Neta AI image generation API (free trial at neta.art/open).

Volcengine Ai Image Generation

Image generation workflow on Volcengine AI services. Use when users need text-to-image, style variants, prompt refinement, or deterministic image generation parameters and troubleshooting.

IMA Nano Banana Image Generator

Nano Banana-only image generation on IMA Open API. Supports text_to_image and image_to_image with gemini-3.1-flash-image (budget) and gemini-3-pro-image (pre...

Image Prompt Patterns

Write or optimize AI image generation prompts (for Midjourney, Nano Banana Pro, GPT-Image-2, Flux, etc.). USE THIS FIRST when user asks to write/compose/give...

Reve AI Image Generation

Generate, edit, and remix images using the Reve AI API. Use when creating images from text prompts, editing existing images with instructions, or combining/remixing multiple reference images. Requires REVE_API_KEY or REVE_AI_API_KEY environment variable.

Baoyu Image Gen

AI image generation with OpenAI, Google, DashScope and Replicate APIs. Supports text-to-image, reference images, aspect ratios. Sequential by default; parall...

Seedream 5.0 AI Image Generator – Try the Latest Smart Image Creation Model Online – API-powered

AI image generation — create and edit images using Seedream 5.0 model by ByteDance

Access ATXP paid API tools for web search, AI image generation, music creation, video generation, and X/Twitter search. Use when users need real-time web sea...

Ai Prompt Optimizer Cn Payment

AI绘画提示词优化器 | 智能优化Midjourney/SD/DALL-E提示词。支持风格转换、批量生成。

VPick AI Image Generator

Multi-model AI image generation on a visual canvas. Supports Midjourney (relaxed/fast/turbo, v7.0, 4-image grid), Grok Imagine (6 images per call, auto I2I),...

ComfyUI Generator

Generate AI images and perform style transfers via ComfyUI with batch processing and automated workflow management through OpenClaw integration.

Dlazy Recraft V4 Pro

4MP high-resolution raster image generation. Suitable for print-ready assets and large-scale use.

Generate and edit AI images with Seedream (ByteDance) via AceDataCloud API. Use when creating images from text prompts, editing existing images with inpainti...

Nano Banana 2 — Gemini Image Generation

Gemini image generation, editing, and search-grounded image creation via gemini-3.1-flash-image-preview (Nano Banana 2). USE FOR: - Generating images from te...

nano banana text to image in Atlas AI

Generates images from text prompts using AtlasCloud Nanobanana 2 model, requiring an AtlasCloud API token and specific JSON parameters without media_resolution.

Convert text to speech audio via ComfyUI's Qwen-TTS API, supporting customizable voice, style, model, and output options.

Memorable Image Generator

Science-backed image generation agent that scores and optimizes images for memorability using ResMem (Brain Bridge Lab, University of Chicago) before returni...

Run local ComfyUI workflows via the HTTP API. Use when the user asks to run ComfyUI, execute a workflow by file path/name, or supply raw API-format JSON; sup...

AI role play character image generation

Character-consistent AI image generation for agents. Same person, any outfit, any scene, every time. Use when: (1) Your agent needs to generate character ima...

Give your agent an inner life. Amigo bundles open-thoughts (free-thinking and exploration) with social-graph (social intelligence and sharing awareness) plus...

Generate images on Jimeng (即梦 / jimeng.jianying.com) using OpenClaw-managed browser. Supports prompt entry, ratio selection, quick result inspection, and loc...

Wan 2.7 Image — Free AI Text-to-Image Generator

Free AI text-to-image generator powered by Alibaba's open-source Wan 2.7 model. No signup needed.

Kling Image Generate

可灵AI图像生成API工具。支持文生图、图生图、多图参考生成、图像Omni、扩图等功能。使用环境变量KLING_ACCESS_KEY和KLING_SECRET_KEY进行鉴权。当用户需要生成AI图像、图片编辑、图像扩展等任务时使用此技能。

End-to-end AI video generation - create videos from text prompts using image generation, video synthesis, voice-over, and editing. Powered by SkillBoss API H...

Generate SVG images using text LLM instead of image generation APIs. Use when user wants to create illustrations, icons, cartoons, diagrams, or any visual co...

Agent wallet, identity, and paid tools in one package. Register an agent, fund it via Stripe or USDC, then use the balance for web search, AI image generatio...

OpenRouter Image Generation

Generate images using Google Gemini via OpenRouter API. Supports text-to-image and reference-image-guided generation. Use when the user asks to generate, cre...

Baoyu Image Gen

AI image generation with OpenAI, Google, OpenRouter, DashScope, Jimeng, Seedream and Replicate APIs. Supports text-to-image, reference images, aspect ratios,...

Autonomous internet exploration skill. Your agent roams the web driven by its own curiosity, discovers interesting things, and sends illustrated "postcards"...

CamScanner Add Watermark

Use CamScanner to add a tiled text watermark across an entire image. Triggers on "add watermark to image", "watermark image", "add copyright text to image",...

ComfyUI执行器

通过 HTTP API 与 ComfyUI 服务交互，支持工作流提交与执行、队列管理、文件上传和能力探测；自动检测视频工作流并使用合适超时；简洁输出执行结果；当用户需要使用 ComfyUI 生成图像、视频、音频或管理服务时使用

Shadows Oneshot Fix

Surgical quick fix — max 5 tool calls, zero exploration, read-diagnose-fix-verify. Use for small bugs, typos, simple changes that don't need deep analysis.

User Guide Automation

Reusable workflow for generating formal, detailed Markdown user guides from web applications using browser exploration or user-provided flows. Uses screensho...

← PrevPage 3 / 4 (145 skills)Next →