
video-generate

by @warm-wm

Generate videos using Seedance models. Invoke when user wants to create videos from text prompts, images, or reference materials.

Version: v1.0.0
Installs: 37
💡 Examples

Environment Variables

Before using this skill, ensure the following environment variables are set:

  • ARK_API_KEY or MODEL_VIDEO_API_KEY or MODEL_AGENT_API_KEY: API key for the video generation service
  • MODEL_VIDEO_API_BASE: API base URL (optional, has default)
  • MODEL_VIDEO_NAME: Model name (optional, has default)
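
The key lookup described above can be checked before invoking the skill. The following is a minimal sketch, not the skill's own code: the helper name `resolve_api_key` is hypothetical, and the lookup order (left to right as listed) is an assumption, since the listing does not say which variable wins when several are set.

```python
import os
from typing import Mapping, Optional, Tuple

# Variable names come from the list above; the precedence order is an assumption.
API_KEY_VARS = ("ARK_API_KEY", "MODEL_VIDEO_API_KEY", "MODEL_AGENT_API_KEY")


def resolve_api_key(env: Optional[Mapping[str, str]] = None) -> Tuple[str, str]:
    """Return (variable_name, value) for the first non-empty API key variable."""
    env = os.environ if env is None else env
    for name in API_KEY_VARS:
        value = env.get(name, "")
        if value.strip():
            return name, value
    # None of the accepted variables is set, so fail early with a clear message.
    raise RuntimeError("Set one of: " + ", ".join(API_KEY_VARS))
```

Calling `resolve_api_key()` with no argument reads from the real process environment; passing a dict makes the check easy to test.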

Function Signature

    async def video_generate(
        params: list,
        batch_size: int = 10,
        max_wait_seconds: int = 1200,
        model_name: str = None,
    ) -> Dict:
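
A call using this signature might look like the sketch below. The listing does not say where `video_generate` is imported from, so the function is passed in as an argument rather than imported. The helper names `build_params` and `run_batch`, the video names, and the prompts are illustrative assumptions. The shape of the returned dict is not documented here, so the sketch passes it through untouched.

```python
import asyncio
from typing import Any, Awaitable, Callable, Dict, List


def build_params() -> List[Dict[str, Any]]:
    """Two example requests; only video_name and prompt are required."""
    return [
        {"video_name": "sunrise", "prompt": "A slow sunrise over a misty lake"},
        {
            "video_name": "street",
            "prompt": "A rainy street at night with neon reflections",
            "ratio": "9:16",
            "resolution": "720p",
        },
    ]


async def run_batch(video_generate: Callable[..., Awaitable[Dict]]) -> Dict:
    """Await the skill's coroutine, spelling out the defaults from the signature."""
    return await video_generate(
        params=build_params(),
        batch_size=10,
        max_wait_seconds=1200,
    )

# With the real coroutine in scope (import path not shown in this listing):
# result = asyncio.run(run_batch(video_generate))
```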
    

Parameters

params (list[dict])

A list of video generation requests. Each item is a dict with the following fields:

Required per item:

  • video_name (str): Name/identifier of the output video file
  • prompt (str): Text describing the video to generate. Supports Chinese and English.

Optional per item - Input Materials:

  • first_frame (str): URL for the first frame image
  • last_frame (str): URL for the last frame image
  • reference_images (list[str]): 1-4 reference image URLs for style/content guidance
  • reference_videos (list[str]): 0-3 reference video URLs (mp4/mov, 2-15s each, total ≤15s)
  • reference_audios (list[str]): 0-3 reference audio URLs (mp3/wav, 2-15s each, total ≤15s)
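
The count limits above can be checked locally before a request is sent. This is a minimal sketch under stated assumptions: `check_reference_counts` is a hypothetical helper, not part of the skill, and it validates only list shape and item counts. Clip durations cannot be verified from a URL alone, so they are left to the service.

```python
from typing import Any, Dict, List

# (field, minimum count, maximum count), applied only when the field is present.
REFERENCE_LIMITS = (
    ("reference_images", 1, 4),
    ("reference_videos", 0, 3),
    ("reference_audios", 0, 3),
)


def check_reference_counts(item: Dict[str, Any]) -> List[str]:
    """Return a list of human-readable problems; empty means the counts look fine."""
    problems = []
    for field, low, high in REFERENCE_LIMITS:
        if field not in item:
            continue  # every reference field is optional
        urls = item[field]
        if not isinstance(urls, list) or not all(isinstance(u, str) for u in urls):
            problems.append(f"{field} must be a list of URL strings")
        elif not low <= len(urls) <= high:
            problems.append(f"{field} needs {low}-{high} entries, got {len(urls)}")
    return problems
```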

Optional per item - Video Output Parameters:

  • ratio (str): Aspect ratio. Options: "16:9" (default), "9:16", "4:3", "3:4", "1:1", "2:1", "21:9", "adaptive"
  • duration (int): Video length in seconds. Range: 2-12s depending on model
  • resolution (str): Video resolution. Options: "480p", "720p", "1080p"
  • frames (int): Total frame count. Must lie in [29, 289] and have the form 25 + 4n for a positive integer n (for example 29, 33, 37, ..., 289)
  • camera_fixed (bool): Lock camera movement. Default: false
  • seed (int): Random seed for reproducibility. Range: [-1, 2^32-1]
  • watermark (bool): Whether to add watermark. Default: false
  • generate_audio (bool): Whether to generate audio. Only Seedance 1.5 pro supports this
  • tools (list[dict]): Tool configuration, e.g., [{"type": "web_search"}]
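
The constraints in this list are easy to get wrong, the frames rule in particular, so a local pre-check can help. The sketch below is illustrative only: `is_valid_frames` and `check_output_params` are hypothetical helpers, and the duration check uses the outer 2-12 range because the per-model limits are not given here.

```python
from typing import Any, Dict, List

RATIOS = {"16:9", "9:16", "4:3", "3:4", "1:1", "2:1", "21:9", "adaptive"}
RESOLUTIONS = {"480p", "720p", "1080p"}


def is_valid_frames(frames: int) -> bool:
    """True when frames is in [29, 289] and equals 25 + 4n for an integer n."""
    return 29 <= frames <= 289 and (frames - 25) % 4 == 0


def check_output_params(item: Dict[str, Any]) -> List[str]:
    """Return a list of problems found in the optional output parameters."""
    problems = []
    if "ratio" in item and item["ratio"] not in RATIOS:
        problems.append(f"unsupported ratio: {item['ratio']}")
    if "resolution" in item and item["resolution"] not in RESOLUTIONS:
        problems.append(f"unsupported resolution: {item['resolution']}")
    if "duration" in item and not 2 <= item["duration"] <= 12:
        # Outer bound only; the actual range depends on the model.
        problems.append(f"duration outside 2-12s: {item['duration']}")
    if "frames" in item and not is_valid_frames(item["frames"]):
        problems.append(f"frames must be 25 + 4n within [29, 289]: {item['frames']}")
    if "seed" in item and not -1 <= item["seed"] <= 2**32 - 1:
        problems.append(f"seed outside [-1, 2^32-1]: {item['seed']}")
    return problems
```

For instance, `frames=121` passes because 121 = 25 + 4 × 24, while `frames=100` fails because 75 is not a multiple of 4.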

Input Modes

1. Text-to-Video: Provide only prompt, with no images or videos
2. First Frame Guidance: Provide first_frame as the starting image
3. First + Last Frame Guidance: Provide both first_frame and last_frame for a transition video
4. Reference Images: Provide reference_images for style/content guidance
5. Multimodal Reference: Combine reference_images, reference_videos, and reference_audios
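
Which mode a request falls into follows from the fields it supplies. The sketch below shows one way to classify a request item. It is not the skill's own logic: the function name `input_mode`, the returned labels, and the precedence when several kinds of input are mixed are all assumptions. Treating a lone last_frame as an error is also an assumption, made because that combination is not among the listed modes.

```python
from typing import Any, Dict


def input_mode(item: Dict[str, Any]) -> str:
    """Classify a request item into one of the five listed input modes."""
    has_first = bool(item.get("first_frame"))
    has_last = bool(item.get("last_frame"))
    has_images = bool(item.get("reference_images"))
    has_videos = bool(item.get("reference_videos"))
    has_audios = bool(item.get("reference_audios"))

    if has_videos or has_audios:
        # Multimodal requests need at least one image or video; audio alone is rejected.
        if not (has_images or has_videos):
            raise ValueError("audio-only reference input is not supported")
        return "multimodal_reference"
    if has_images:
        return "reference_images"
    if has_first and has_last:
        return "first_last_frame"
    if has_first:
        return "first_frame"
    if has_last:
        # Not one of the listed modes, so flag it instead of guessing.
        raise ValueError("last_frame without first_frame is not a listed mode")
    return "text_to_video"
```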

Return Value

📋 Tips & Best Practices

  • Keep prompt concise (recommended ≤ 500 characters)
  • For first/last frame, ensure aspect ratios match your chosen ratio
  • Reference images: 1-4 images, formats: jpeg/png/webp/bmp/tiff/gif
  • Reference videos: 0-3 videos, formats: mp4/mov, total duration ≤ 15s
  • Reference audios: 0-3 audios, formats: mp3/wav, total duration ≤ 15s
  • Multimodal requires at least one image or video (audio-only not supported)
  • Audio generation is only supported by Seedance 1.5 pro
  • If polling times out, use --query-task with the returned task_id
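
Two of these tips, prompt length and reference image format, can be turned into a quick lint pass. The sketch below is a heuristic under stated assumptions: `tip_warnings` is a hypothetical helper, and it infers image format from the URL's file extension, which a real URL may omit or misreport. It also accepts `.jpg` and `.tif` as common spellings of jpeg and tiff.

```python
from pathlib import PurePosixPath
from typing import Any, Dict, List
from urllib.parse import urlparse

MAX_PROMPT_CHARS = 500
IMAGE_EXTENSIONS = {".jpeg", ".jpg", ".png", ".webp", ".bmp", ".tiff", ".tif", ".gif"}


def tip_warnings(item: Dict[str, Any]) -> List[str]:
    """Return soft warnings; none of these checks is enforced by this sketch."""
    warnings = []
    prompt = item.get("prompt", "")
    if len(prompt) > MAX_PROMPT_CHARS:
        warnings.append(
            f"prompt is {len(prompt)} characters; recommended maximum is {MAX_PROMPT_CHARS}"
        )
    for url in item.get("reference_images", []):
        # Look only at the URL path so query strings do not hide the extension.
        suffix = PurePosixPath(urlparse(url).path).suffix.lower()
        if suffix not in IMAGE_EXTENSIONS:
            warnings.append(f"could not confirm a supported image format for {url}")
    return warnings
```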

Installation:

    clawhub install video-generate
