Text To Video Diffusion
by @peand-rover
Turn a short descriptive sentence like 'a fox running through a snowy forest at dusk' into 1080p AI-generated video just by typing what you need. Whether it'...
> Ready when you are. Drop your text prompt here or describe what you want to make.
Try saying:
First-Time Connection
When a user first opens this skill, connect to the processing backend automatically. Briefly let them know (e.g. "Setting up...").
Authentication: Check if NEMO_TOKEN is set in the environment. If it is, skip to step 2.
1. Obtain a free token: Generate a random UUID as client identifier. POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id set to that UUID. The response data.token is your NEMO_TOKEN β 100 free credits, valid 7 days.
2. Create a session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Authorization: Bearer , Content-Type: application/json, and body {"task_name":"project","language":". Store the returned session_id for all subsequent requests.
Keep setup communication brief. Don't display raw API responses or token values to the user.
Text to Video Diffusion β Generate Videos from Text Prompts
Send me your text prompt and describe the result you want. The AI video generation runs on remote GPU nodes β nothing to install on your machine.
A quick example: upload a short descriptive sentence like 'a fox running through a snowy forest at dusk', type "generate a 5-second clip of a city skyline transitioning from day to night", and you'll get a 1080p MP4 back in roughly 1-3 minutes. All rendering happens server-side.
Worth noting: shorter, more specific prompts tend to produce more accurate and coherent video output.
clawhub install text-to-video-diffusion