Skip to content

Commit d89dc0a

Browse files
Fix ReadTheDocs build: move LTX-2 docs to proper location and add to nav
Co-authored-by: hsliuustc0106 <222337142+hsliuustc0106@users.noreply.github.com>
1 parent 31187ff commit d89dc0a

4 files changed

Lines changed: 44 additions & 173 deletions

File tree

docs/.nav.yml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,7 @@ nav:
1212
- Offline Inference:
1313
- Image-To-Image: user_guide/examples/offline_inference/image_to_image.md
1414
- Image-To-Video: user_guide/examples/offline_inference/image_to_video.md
15+
- LTX-2: user_guide/examples/offline_inference/ltx2.md
1516
- Qwen2.5-Omni: user_guide/examples/offline_inference/qwen2_5_omni.md
1617
- Qwen3-Omni: user_guide/examples/offline_inference/qwen3_omni.md
1718
- Text-To-Image: user_guide/examples/offline_inference/text_to_image.md

docs/LTX2_INTEGRATION.md

Lines changed: 0 additions & 173 deletions
This file was deleted.

docs/models/supported_models.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -34,6 +34,7 @@ th {
3434
|`StableDiffusion3Pipeline` | Stable-Diffusion-3 | `stabilityai/stable-diffusion-3.5-medium` |
3535
|`Flux2KleinPipeline` | FLUX.2-klein | `black-forest-labs/FLUX.2-klein-4B`, `black-forest-labs/FLUX.2-klein-9B` |
3636
|`StableAudioPipeline` | Stable-Audio-Open | `stabilityai/stable-audio-open-1.0` |
37+
|`LTX2Pipeline` | LTX-2 | `Lightricks/LTX-2` |
3738

3839

3940
## List of Supported Models for NPU
Lines changed: 42 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,42 @@
1+
# LTX-2 Text-To-Video
2+
3+
Source <https://github.com/vllm-project/vllm-omni/tree/main/examples/offline_inference/ltx2>.
4+
5+
The `Lightricks/LTX-2` pipeline generates high-quality videos from text prompts using a 19B parameter DiT architecture. It supports up to 4K video generation with synchronized audio.
6+
7+
## Local CLI Usage
8+
9+
```bash
10+
python text_to_video.py \
11+
--prompt "A panda riding a bicycle through a forest, cinematic lighting" \
12+
--height 512 \
13+
--width 768 \
14+
--num_frames 121 \
15+
--num_inference_steps 40 \
16+
--guidance_scale 4.0 \
17+
--output ltx2_output.mp4
18+
```
19+
20+
Key arguments:
21+
22+
- `--prompt`: Text description of the video to generate (string).
23+
- `--height/--width`: Output resolution (defaults 512x768). Dimensions must be divisible by 32.
24+
- `--num_frames`: Number of frames (should be 8*n+1, e.g., 25, 81, 121). Default is 121.
25+
- `--guidance_scale`: Classifier-free guidance scale (default: 4.0). Range 3.0-5.0 recommended.
26+
- `--negative_prompt`: Optional text describing what to avoid in the video.
27+
- `--num_inference_steps`: Number of denoising steps (default: 40).
28+
- `--seed`: Random seed for reproducibility (default: 42).
29+
- `--fps`: Frames per second for the saved MP4 (default: 24).
30+
- `--output`: Path to save the generated video.
31+
32+
## Example materials
33+
34+
??? abstract "text_to_video.py"
35+
``````py
36+
--8<-- "examples/offline_inference/ltx2/text_to_video.py"
37+
``````
38+
39+
??? abstract "text_to_video.md"
40+
``````md
41+
--8<-- "examples/offline_inference/ltx2/text_to_video.md"
42+
``````

0 commit comments

Comments
 (0)