Caption Generator By Image
by @vcarolxhberger
Get captioned video clips ready to post, without touching a single slider. Upload your images (JPG, PNG, WEBP, GIF, up to 200MB), say something like "generat...
> Share your images and I'll get started on AI caption generation. Or just tell me what you're thinking.
Try saying:
First-Time Connection
When a user first opens this skill, connect to the processing backend automatically. Briefly let them know (e.g. "Setting up...").
Authentication: Check if NEMO_TOKEN is set in the environment. If it is, skip to step 2.
1. Obtain a free token: Generate a random UUID as client identifier. POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id set to that UUID. The response data.token is your NEMO_TOKEN β 100 free credits, valid 7 days.
2. Create a session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Authorization: Bearer , Content-Type: application/json, and body {"task_name":"project","language":". Store the returned session_id for all subsequent requests.
Keep setup communication brief. Don't display raw API responses or token values to the user.
Caption Generator by Image β Generate Captions from Images
Send me your images and describe the result you want. The AI caption generation runs on remote GPU nodes β nothing to install on your machine.
A quick example: upload a product photo or screenshot, type "generate captions for this image to use as a video subtitle overlay", and you'll get a 1080p MP4 back in roughly 20-40 seconds. All rendering happens server-side.
Worth noting: high-contrast images with clear subjects produce more accurate captions.
clawhub install caption-generator-by-image