Back to blog
TutorialStable DiffusionSDXLControlNetMetalImage Generation

Stable Diffusion on Apple Silicon: SDXL and ControlNet with Metal GPU

February 22, 20267 min readby Macyou Team

Stable Diffusion is the most widely used open-source image generation model. Originally built for NVIDIA GPUs, it now runs natively on Apple Silicon through PyTorch's MPS (Metal Performance Shaders) backend. This means you can generate images with SDXL, apply ControlNet for guided generation, use LoRA adapters for style transfer, and run inpainting pipelines — all on a Mac, with no NVIDIA hardware anywhere in the stack.

Metal GPU Performance

The M4 Pro's 20-core GPU generates a 1024x1024 SDXL image in about 3–5 seconds with 20 sampling steps. That's fast enough for interactive work: type a prompt, see the result, adjust, regenerate. For batch generation — say, producing 500 product mockups overnight — the always-on nature of a Macyou server means you can queue jobs and collect results in the morning. The 48 GB unified memory holds SDXL base + refiner + ControlNet simultaneously without swapping.

Deploying on Macyou

The Macyou Catalog offers a Stable Diffusion stack with diffusers, PyTorch MPS, and popular model weights pre-downloaded. Deploy, SSH in, and start generating.

$ ssh root@YOUR_IP
$ python generate.py --prompt "a mountain lake at sunset, photorealistic" \
    --model stabilityai/stable-diffusion-xl-base-1.0 \
    --steps 20 --size 1024x1024
Generated image saved to output/img_001.png (3.4s on M4 Pro Metal)

ControlNet and LoRA Workflows

ControlNet adds structural guidance to image generation — feed in a depth map, edge detection, or pose skeleton and the model respects that structure. LoRA adapters let you fine-tune style without retraining the full model. Both work on Apple Silicon through the diffusers library. A practical workflow: use ControlNet depth for architectural renders, apply a LoRA for a specific art style, and batch-render variations. The Macyou server handles the compute while you focus on creative direction.

Pricing

Standard tier ($299/mo, 48 GB) is the sweet spot for Stable Diffusion work — enough memory for SDXL with ControlNet and multiple LoRAs. For simpler SD 1.5 workflows, Starter ($149/mo) works fine. See pricing details.

Start creating — deploy Stable Diffusion on Apple Silicon today.