Experience the power of 6 billion parameters with sub-second inference latency. Z-Image Turbo delivers photorealistic quality in just 8 inference steps. Fits on 16GB VRAM consumer devices.






A futuristic cyberpunk street in Tokyo at night, neon signs, rain-soaked pavement...
Harness the power of Z-Image Turbo's 6B parameter model. Enter your prompt and generate photorealistic images in under a second.
Try a sample prompt:
Your generated image will appear here
Sub-second generation with Z-Image Turbo
Built on cutting-edge research from Tongyi-MAI, Z-Image Turbo combines unprecedented speed with exceptional quality—ranking #1 among open-source models on the AI Arena Leaderboard.
Generate images in under 1 second on H800 GPUs. Lightning-fast creation without compromising quality.
Massive 6 billion parameter model delivers state-of-the-art quality with exceptional detail and accuracy.
Highly optimized to achieve stunning results with just 8 Number of Function Evaluations (inference steps).
Excels at generating photorealistic images with exceptional aesthetic quality and fine details.
Accurate rendering of both English and Chinese text within images—a unique capability.
Runs comfortably on consumer GPUs with just 16GB VRAM. No expensive hardware required.
Z-Image Turbo is powered by innovative architectures and distillation techniques developed by the Tongyi-MAI research team.
Z-Image adopts a novel Scalable Single-Stream DiT architecture where text, visual semantic tokens, and image VAE tokens are concatenated at the sequence level as a unified input stream.
The core few-step distillation algorithm that empowers Z-Image's 8-step generation. It decouples two key mechanisms for optimized performance:
Read the research papers:
Create stunning AI images in three simple steps
Type a detailed description of the image you want. Z-Image Turbo excels at understanding complex prompts with excellent instruction adherence.
Choose dimensions (up to 1024px) and adjust quality settings. The default 8 inference steps is optimized for the best quality/speed balance.
Click generate and get your photorealistic image in under a second. Download in JPG, PNG, or WebP format.
Got questions? We've got answers.
Z-Image Turbo is a distilled version of Z-Image (造相), a 6B parameter image generation model developed by Tongyi-MAI. It delivers state-of-the-art quality with only 8 inference steps and sub-second latency.
Z-Image Turbo uses the innovative S3-DiT architecture and Decoupled-DMD distillation. It's ranked #1 among open-source models on the AI Arena Leaderboard while requiring only 8 NFEs (vs 20-50 for other models).
Z-Image Turbo excels at accurately rendering complex Chinese and English text within images—a unique capability that most other models struggle with. This makes it ideal for creating images with embedded text.
Z-Image Turbo fits comfortably within 16GB VRAM consumer devices. On enterprise-grade H800 GPUs, it achieves sub-second inference latency.
The recommended setting is 8 inference steps (NFEs), which provides the optimal balance between quality and speed. You can increase steps for potentially higher quality at the cost of longer generation time.
Z-Image Turbo is released under Apache-2.0 license. Images generated can be used for both personal and commercial projects. Please ensure your prompts don't infringe on copyrights or trademarks.
We support JPG, PNG, and WebP formats. The maximum resolution is 1024×1024 pixels, with presets for square, landscape, and portrait orientations.
Check out the research papers on arXiv: Z-Image (2511.22699), Decoupled-DMD (2511.22677), and DMDR (2511.13649). The model is also available on Hugging Face.