Browse Models
The simplest way to self-host Stable Diffusion 3.5 Large. Launch a dedicated cloud GPU server running Lab Station OS to download and serve the model using any compatible app or framework.
Download model weights for local inference. Must be used with a compatible app, notebook, or codebase. May run slowly, or not work at all, depending on your system resources, particularly GPU(s) and available VRAM.
Stable Diffusion 3.5 Large is an 8.1B parameter text-to-image model using three text encoders (OpenCLIP, CLIP, T5) and Query-Key normalization. It generates 1-megapixel images with strong typography handling. Optimal settings: 28 inference steps, 3.5 guidance scale. Part of SD3.5 family alongside Medium (2.5B) and Turbo variants.
Stable Diffusion 3.5 Large represents a significant advancement in text-to-image generation technology, introducing a sophisticated Multimodal Diffusion Transformer (MMDiT) architecture that leverages three pretrained text encoders: OpenCLIP-ViT/G, CLIP-ViT/L, and T5-xxl. With 8.1 billion parameters, it stands as the most powerful model in the Stable Diffusion family, designed to excel in both image quality and prompt adherence at 1-megapixel resolution.
The model employs Query-Key (QK) normalization in its transformer blocks, a technical innovation that enhances training stability and simplifies fine-tuning processes. This architectural choice not only improves the model's performance but also makes it highly customizable for specific use cases. The training dataset encompasses both synthetic and filtered publicly available data, contributing to the model's diverse capabilities.
Stable Diffusion 3.5 Large demonstrates remarkable improvements in several key areas:
The model has shown competitive performance against much larger models in terms of image quality, while leading the market in prompt adherence. For optimal results, recommended parameters include using 28 inference steps with a guidance scale of 3.5.
Within the Stable Diffusion 3.5 family, three distinct variants serve different use cases:
The model is released under the Stability Community License, which permits free use for:
Organizations exceeding the revenue threshold require an Enterprise License. The model emphasizes safety-by-design, incorporating data filtering and safeguards to mitigate potential harms.