Model Report
stabilityai / ControlNet SDXL Canny
ControlNet SDXL Canny is an image generation model that uses Canny edge maps to provide structural guidance to Stable Diffusion image synthesis. Part of the ControlNet 1.1 family, it processes edge-detected representations through a trainable encoder whose features are merged into the U-Net backbone, enabling constrained image generation that faithfully preserves the shapes and contours specified by the input edge map while maintaining creative flexibility in other visual aspects.
ControlNet SDXL Canny is a generative AI model in the ControlNet 1.1 family, designed to introduce precise structural guidance to image synthesis by pairing Canny edge maps with Stable Diffusion. By conditioning on edge-detected representations, the model allows constrained yet creative image generation, keeping outputs faithful to the shapes and contours specified by the provided edge image. Its implementation, architecture, and training data were developed to improve image quality, robustness, and adaptability across a range of artistic and technical applications.
Diagram illustrating the Standard ControlNet Naming Rules (SCNNRs), including the naming structure for ControlNet 1.1 Canny and related models.
ControlNet SDXL Canny maintains architectural parity with the original ControlNet 1.0 framework, as outlined in the official documentation. The model integrates with the Stable Diffusion image synthesis pipeline, introducing an additional pathway that accepts structural control data (in this instance, the output of the Canny edge detection algorithm) alongside standard text prompts. The edge map is processed through a trainable copy of the Stable Diffusion encoder, whose outputs are injected through zero-initialized convolutions into intermediate layers of the U-Net backbone.
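The core mechanism is easy to sketch. The following minimal PyTorch illustration (not the production implementation; the block choice, channel counts, and names are illustrative) shows the ControlNet pattern of a trainable encoder copy whose output enters the frozen backbone through a zero-initialized convolution, so the control branch contributes nothing until training updates it:

```python
import torch
import torch.nn as nn

def zero_conv(channels: int) -> nn.Conv2d:
    # 1x1 convolution initialized to zero, so the control branch starts as a no-op.
    conv = nn.Conv2d(channels, channels, kernel_size=1)
    nn.init.zeros_(conv.weight)
    nn.init.zeros_(conv.bias)
    return conv

class ToyControlBranch(nn.Module):
    """Illustrative sketch of the ControlNet pattern: a trainable copy of an
    encoder block whose output is added to the frozen backbone feature map
    through a zero-initialized convolution."""
    def __init__(self, block: nn.Module, channels: int):
        super().__init__()
        self.copy = block          # trainable copy of the backbone block
        self.zero = zero_conv(channels)

    def forward(self, backbone_feat, control_feat):
        return backbone_feat + self.zero(self.copy(control_feat))

# Before any training step, the zero conv makes the branch an exact identity:
branch = ToyControlBranch(nn.Conv2d(64, 64, 3, padding=1), channels=64)
x = torch.randn(1, 64, 32, 32)   # backbone feature map
c = torch.randn(1, 64, 32, 32)   # encoded control (e.g., edge map) features
assert torch.allclose(branch(x, c), x)
```

This zero initialization is what lets the control pathway be bolted onto a pretrained, frozen Stable Diffusion without degrading it at the start of training.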
The Canny model was refined from its 1.0 predecessor through continued training on high-performance hardware: 72 hours on eight NVIDIA A100 80GB GPUs at a batch size of 256. This computational investment aimed to improve robustness and visual quality. During training, Canny edge maps were generated with randomly varied thresholds and paired with high-quality, prompt-annotated images, and data augmentations such as random horizontal flipping were applied to improve generalization. Several known issues in the earlier dataset, including duplication of certain image modalities and corrupted data, were addressed, yielding a cleaner and more representative data distribution.
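The variable-threshold edge extraction and flip augmentation described above can be approximated with OpenCV. The threshold ranges below are assumptions for illustration; the exact values used in training are not published:

```python
import random

import cv2
import numpy as np

def random_canny(image: np.ndarray) -> np.ndarray:
    """Edge-map augmentation in the spirit of the training recipe described
    above; expects an 8-bit RGB image. Threshold ranges are illustrative
    guesses, not published values."""
    gray = cv2.cvtColor(image, cv2.COLOR_RGB2GRAY)
    low = random.randint(50, 150)
    high = random.randint(low + 50, 300)
    edges = cv2.Canny(gray, low, high)
    if random.random() < 0.5:
        # Random horizontal flip; in training, the paired RGB image
        # would be flipped together with its edge map.
        edges = np.ascontiguousarray(edges[:, ::-1])
    return edges
```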
Technical Features and Control Capabilities
The defining feature of ControlNet SDXL Canny is its ability to harness edge information for tightly constrained image synthesis. By conditioning the generative process on a Canny edge map, the model ensures that the resulting images adhere closely to the key contours and shapes defined by the user. This facilitates a variety of applications, such as reconstructing images from scanned line art, transforming technical blueprints into photorealistic renders, or enabling precise style transfer while preserving essential form.
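Assuming a diffusers-based workflow with the SD 1.5 checkpoint and Canny model named later in this report (the repository IDs below are common community hosts, not taken from this page), an edge-conditioned generation might look like this sketch:

```python
import cv2
import numpy as np
import torch
from PIL import Image
from diffusers import ControlNetModel, StableDiffusionControlNetPipeline

# Preprocess: detect edges and convert to a 3-channel conditioning image.
img = np.array(Image.open("input.png").convert("RGB"))
edges = cv2.Canny(img, 100, 200)
canny_image = Image.fromarray(np.stack([edges] * 3, axis=-1))

# Load the Canny ControlNet alongside a compatible SD 1.5 base model.
controlnet = ControlNetModel.from_pretrained(
    "lllyasviel/control_v11p_sd15_canny", torch_dtype=torch.float16
)
pipe = StableDiffusionControlNetPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", controlnet=controlnet, torch_dtype=torch.float16
).to("cuda")

result = pipe(
    "portrait of a man wearing a hat",  # text prompt
    image=canny_image,                  # structural guidance from the edge map
    num_inference_steps=20,
).images[0]
result.save("output.png")
```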
Demonstration of Canny edge-guided generation: The left shows a Canny edge map, while the right displays four stylized outputs synthesized by the model from the detected edge contours. (Prompt unavailable.)
The model’s structure enables multi-condition input, allowing for the use of several ControlNets simultaneously. This feature broadens the creative possibilities by supporting parallel guidance signals, such as blending edge constraints with segmentation maps, pose estimation, or other modalities. Operational details for using multi-ControlNet setups are elaborated in the ControlNet documentation.
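In diffusers, for example, multi-condition guidance can be expressed by passing a list of ControlNets with matching conditioning images; the OpenPose model used as the second condition here is an assumption for illustration, and canny_image and pose_image are presumed to be prepared PIL images:

```python
import torch
from diffusers import ControlNetModel, StableDiffusionControlNetPipeline

controlnets = [
    ControlNetModel.from_pretrained(
        "lllyasviel/control_v11p_sd15_canny", torch_dtype=torch.float16
    ),
    ControlNetModel.from_pretrained(
        "lllyasviel/control_v11p_sd15_openpose", torch_dtype=torch.float16
    ),
]
pipe = StableDiffusionControlNetPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", controlnet=controlnets, torch_dtype=torch.float16
).to("cuda")

result = pipe(
    "a dancer on a rooftop at dusk",
    image=[canny_image, pose_image],           # one conditioning image per ControlNet
    controlnet_conditioning_scale=[1.0, 0.8],  # per-model guidance strength
    num_inference_steps=20,
).images[0]
```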
Model Performance and Use Cases
ControlNet SDXL Canny was designed to improve robustness and subjective perceptual quality. Although no quantitative benchmarks have been formally published, the developers report improved fidelity and reliability relative to the earlier 1.0 Canny model. Practical applications span image reconstruction, stylization with strict geometric adherence, blueprint interpretation, and technical visualization.
Generated outputs showing the effect of combined Canny and Shuffle ControlNets on image realism and stylistic diversity in a photorealistic setting. (Prompt unavailable.)
In practice, the model suits tasks requiring high-fidelity adherence to structure, with outputs suitable for production use in visual design, digital artistry, and technical illustration.
A user interface screenshot: On the left, a grayscale Canny edge map and a text prompt ("portrait of a man wearing a hat, sitting on the time square"); on the right, the model’s colorized output, faithfully adhering to the edge-defined structure.
ControlNet SDXL Canny is provided as part of the broader ControlNet 1.1 model suite, each model following the Standard ControlNet Naming Rules (SCNNRs). The Canny model specifically requires the relevant model and configuration files—typically named control_v11p_sd15_canny.pth and control_v11p_sd15_canny.yaml—alongside a compatible version of Stable Diffusion (such as Stable Diffusion 1.5). Official integration and multi-model interoperability are supported through the sd-webui-controlnet extension, which permits the orchestration of multiple conditioning models within user workflows.
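As a quick sanity check, the file placement expected by the sd-webui-controlnet extension can be verified like this; the WebUI root path is an assumption based on a default AUTOMATIC1111 install and should be adjusted to your setup:

```python
from pathlib import Path

# Assumed default layout of a Stable Diffusion WebUI install with the
# sd-webui-controlnet extension; adjust webui_root to your environment.
webui_root = Path("stable-diffusion-webui")
model_dir = webui_root / "extensions" / "sd-webui-controlnet" / "models"

for name in ("control_v11p_sd15_canny.pth", "control_v11p_sd15_canny.yaml"):
    status = "found" if (model_dir / name).exists() else "missing"
    print(f"{name}: {status} in {model_dir}")
```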
Within the 1.1 release, a total of 14 models were introduced, addressing a wide range of guidance modalities. Alongside Canny, these include depth maps (ControlNet Depth), normal maps (ControlNet Normal), straight lines, scribbles, soft edges, semantic segmentation, pose estimation, line art, anime-specific line art, content shuffling, instruction-based editing (InstructPix2Pix), inpainting, and tile-based upscaling or enhancement. Each model is tailored to leverage a specific annotation or structured control input, while sharing core architectural consistency for ease of use and extensibility.
Limitations and Considerations
The developers note that ControlNet SDXL Canny, like other ControlNet models, is not directly provided as an extension for all Stable Diffusion user interfaces; users are advised to consult the sd-webui-controlnet repository for guidance on proper integration. Certain related models in the family (such as Shuffle, InstructPix2Pix, and Tile) are considered experimental and may require further curation or exhibit varying output stability.
The architecture’s reliance on precise edge map inputs makes it sensitive to the quality and relevance of those inputs. The model's ability to generalize beyond the contours defined by the edge map is deliberately constrained, which is desirable for some use cases but may limit flexibility in others. Requirements such as specific base model checkpoints or compatibility with certain prompt structures should be considered prior to deployment.