Skip to main content
With the Decart API, you can generate high-quality videos from text descriptions or bring still images to life with animation. Simply describe what you have in mind, and let the model create dynamic video content. If you want more creative input from the model, you can also keep your description broad and let Lucy fill in the details. Our models support a wide range of styles and motion types—from realistic movement to artistic animations.

Quick start

Here’s a simple example using Lucy-Pro, our highest quality video model.

Generate from text

curl -X POST https://api.decart.ai/v1/generate/lucy-pro-t2v \
  -H "X-API-KEY: $DECART_API_KEY" \
  -F "prompt=A serene ocean with dolphins jumping at sunset" \
  --output generated-video.mp4

Animate images

curl -X POST https://api.decart.ai/v1/generate/lucy-pro-i2v \
  -H "X-API-KEY: $DECART_API_KEY" \
  -F "data=@landscape-photo.jpg" \
  -F "prompt=Make the clouds drift slowly and add gentle wind to the trees" \
  --output animated-video.mp4

Parameters

Text to Video

  • prompt (required) — Text description of the video to generate.
  • resolution (optional) — Output resolution: 480p or 720p (default: 720p).

Image to Video

  • data (required) — Input image file to animate.
  • prompt (required) — Text description of how to animate the image.
  • resolution (optional) — Output resolution: 480p or 720p (default: 720p).
For complete API documentation including response formats and error codes, see the API Reference for Text to Video and Image to Video.

Available Models and Options

Decart provides different configurations to balance speed, quality, and cost depending on your use case. We currently offer two model tiers:
  • Lucy-Pro — our flagship model for generating diverse, high-quality videos.
  • Lucy-Dev — a faster model ideal for quick prototyping and simple animations.

Video specifications:

  • Duration: Fixed 5 seconds
  • Resolution:
    • 480p: 480×832 (portrait) or 832×480 (landscape)
    • 720p: 720×1280 (portrait) or 1280×720 (landscape)
  • Format: MP4 with H.264 encoding

Prompt Engineering

For Text-to-Video

Video generation prompts should describe both what happens (subjects, actions) and how it looks (camera movement, style, atmosphere). Because video generation involves temporal elements, your prompts should consider movement, pacing, and progression. Effective prompts are specific about motion (around 80–100 words), include camera instructions (pan, zoom, static), and describe temporal progression (beginning, middle, end states). Lucy handles cinematic language well—camera angles, lighting changes, and atmospheric effects. Example prompts:
  • “A majestic eagle soaring through mountain valleys, camera tracking its flight. Golden hour lighting with dramatic clouds. The eagle dives toward a lake, then rises with a fish. Cinematic aerial perspective.”
  • “Time-lapse of a busy city intersection from sunrise to sunset. Traffic flows like rivers of light, shadows grow and shrink, sky transitions from pink to blue to orange. Static wide angle shot.”
Bottom line: For text-to-video, write prompts as if directing a short film. For image animation, focus on specific motion instructions while respecting the source image.

Endpoints and Full API Reference

Next steps

Now that you know the basics of video generation, you might want to check out one of these resources next.
I