Key Features

Supports LoRA training and full fine-tuning for LTX-2.
Covers text-to-video, image-to-video, and video-extension training modes.
Covers text-to-audio, audio extension, and audio inpainting modes.
Supports video inpainting, video outpainting, and IC-LoRA reference workflows.
Supports audio-to-video and video-to-audio conditioning.
Includes detailed docs for datasets, configs, training, inference, and troubleshooting.
Provides low-VRAM configuration guidance with quantization optimizations.
Includes an agent-assisted training skill inside the repository.

The package covers text-to-video, text-to-audio, image-to-video, video extension, audio extension, video inpainting, audio inpainting, video outpainting, IC-LoRA references, audio-to-video, and video-to-audio tasks. Its documentation includes quick start, dataset preparation, training modes, configuration, training, inference, utilities, and troubleshooting guides.


LTX-2 Trainer is useful for teams that want to adapt an audio-video foundation model to custom datasets, styles, or references. It expects local LTX-2 checkpoints, a Gemma text encoder, Linux with CUDA, and substantial GPU memory for standard configurations.

Get more likes & reach the top of search results by adding this button on your site!

Embed button preview - Light theme
Embed button preview - Dark theme
TurboType Banner
Zero to AI Engineer Program

Zero to AI Engineer

Skip the degree. Learn real-world AI skills used by AI researchers and engineers. Get certified in 8 weeks or less. No experience required.

Subscribe to the AI Search Newsletter

Get top updates in AI to your inbox every weekend. It's free!