Stable Diffusion 3.5 (SD 3.5) is the latest evolution of Stability AI’s groundbreaking open-source image generation model. Released in late 2024 and now widely adopted in 2026, it represents a significant leap in quality, prompt adherence, and versatility compared to its predecessors. Best of all — it’s free and can run locally on your own hardware.
Photo by Nong on Unsplash
What Is Stable Diffusion 3.5?
Stable Diffusion 3.5 is a text-to-image AI model developed by Stability AI. Unlike Midjourney or DALL-E 3, it’s open-source and can be run:
- Locally on your own GPU (free)
- Via API (Stability AI platform)
- Via third-party UIs (ComfyUI, Automatic1111, InvokeAI)
Key improvements over SD 3.0:
- Better text rendering in images
- Improved human anatomy and hands
- Higher resolution outputs (up to 2MP natively)
- Faster inference time
- Better instruction following for complex prompts
Model Variants
| Model | Parameters | Best For | VRAM Required |
|---|---|---|---|
| SD 3.5 Large | 8B | Highest quality | 24GB+ |
| SD 3.5 Large Turbo | 8B | Speed (4 steps) | 24GB+ |
| SD 3.5 Medium | 2.5B | Balance of quality/speed | 10GB+ |
For most users, SD 3.5 Medium is the sweet spot — it fits on consumer GPUs (RTX 3090, RTX 4080) and produces excellent results.
How to Run Stable Diffusion 3.5
Option 1: ComfyUI (Recommended)
ComfyUI is the most powerful and flexible UI for Stable Diffusion:
Installation:
git clone https://github.com/comfyanonymous/ComfyUI
cd ComfyUI
pip install -r requirements.txt
Download the model:
- Get the SD 3.5 weights from Hugging Face
- Place in
ComfyUI/models/checkpoints/ - Run
python main.pyand openhttp://localhost:8188
Option 2: Stability AI API (No GPU Needed)
import requests
url = "https://api.stability.ai/v2beta/stable-image/generate/sd3"
headers = {
"authorization": "Bearer YOUR_API_KEY",
"accept": "image/*"
}
data = {
"prompt": "A majestic mountain landscape at golden hour, hyperrealistic, 8K",
"model": "sd3.5-medium",
"output_format": "jpeg"
}
response = requests.post(url, headers=headers, files={"none": ""}, data=data)
with open("output.jpg", "wb") as f:
f.write(response.content)
Option 3: Automatic1111 WebUI
The most popular SD interface:
git clone https://github.com/AUTOMATIC1111/stable-diffusion-webui
cd stable-diffusion-webui
bash webui.sh
Mastering Prompts for SD 3.5
The Anatomy of a Great Prompt
SD 3.5 responds well to natural language, unlike older models that required special syntax. Structure:
[Subject] [Action/Pose] [Setting] [Lighting] [Style] [Quality modifiers]
Example:
A young woman reading a book in a sun-drenched Parisian café, golden afternoon light streaming through window, impressionist oil painting style, rich warm colors, highly detailed
Prompting Tips
1. Be descriptive about lighting:
- “dramatic chiaroscuro lighting”
- “soft diffused morning light”
- “neon-lit cyberpunk night”
- “golden hour sunlight”
2. Specify art style clearly:
- “digital art by Studio Ghibli”
- “watercolor painting”
- “photorealistic”
- “oil painting in the style of Monet”
- “flat vector illustration”
3. Use quality boosters:
- “highly detailed, 8K resolution”
- “masterpiece, best quality”
- “sharp focus, professional photography”
4. Text in images (SD 3.5 improvement):
A coffee shop sign reading "Open", vintage typography, warm lighting
SD 3.5 handles text in images far better than previous versions.
Negative Prompts
Tell the model what to avoid:
Negative: blurry, low quality, deformed, extra limbs, watermark, signature, text, bad anatomy
Photo by Milad Fakurian on Unsplash
Key Parameters Explained
| Parameter | Default | Effect |
|---|---|---|
| CFG Scale | 4.5 | How strictly to follow prompt (higher = more literal) |
| Steps | 28 | More steps = more refined (diminishing returns after 40) |
| Sampler | DPM++ 2M | Affects style and quality of output |
| Resolution | 1024×1024 | Native resolution |
| Seed | Random | Fix seed for reproducible results |
Recommended settings for quality:
- Steps: 25-35
- CFG Scale: 4.5-6.5
- Sampler: DPM++ 2M Karras or Euler a
Advanced Techniques
ControlNet
Guide the composition using reference images:
- Depth map — control 3D depth of scene
- Canny edges — maintain structural composition
- OpenPose — control character pose with skeleton
- Inpainting — edit specific parts of an image
Image-to-Image (img2img)
Start from an existing image and transform it:
Start image: rough sketch
Prompt: "Professional concept art of the sketch, detailed, polished"
Denoising strength: 0.65
LoRA Models
Fine-tuned small model additions for specific styles:
- Download from civitai.com
- Add to prompt:
<lora:model_name:0.8> - Common uses: specific art styles, character consistency, product photography
Upscaling
Get 4K+ output:
- Generate at base resolution (1024×1024)
- Use Ultimate SD Upscaler in A1111
- Scale 2x or 4x with minimal quality loss
SD 3.5 vs. Competitors
| SD 3.5 Medium | Midjourney v7 | DALL-E 3 | Flux 1.1 Pro | |
|---|---|---|---|---|
| Cost | Free (local) | $10/mo | Pay/image | Pay/image |
| Quality | ⭐⭐⭐⭐ | ⭐⭐⭐⭐⭐ | ⭐⭐⭐⭐ | ⭐⭐⭐⭐⭐ |
| Open source | ✅ | ❌ | ❌ | ❌ |
| Customizable | ✅✅✅ | ✅ | ❌ | Limited |
| Local run | ✅ | ❌ | ❌ | ❌ |
| Text in image | ⭐⭐⭐⭐ | ⭐⭐⭐⭐ | ⭐⭐⭐⭐⭐ | ⭐⭐⭐⭐⭐ |
Hardware Requirements
| GPU | VRAM | Can Run |
|---|---|---|
| RTX 4090 | 24GB | SD 3.5 Large ✅ |
| RTX 4080 / 3090 | 16-24GB | SD 3.5 Medium ✅ |
| RTX 4070 / 3080 | 10-12GB | SD 3.5 Medium ✅ |
| RTX 4060 / 3070 | 8GB | SD 3.5 Medium (limited) |
| M2/M3 Mac | 16GB+ unified | SD 3.5 Medium via MPS ✅ |
For users without a capable GPU, use the Stability AI API (~$0.04/image).
Final Verdict
Stable Diffusion 3.5 is the best choice for users who want full control, privacy, and zero ongoing costs for AI image generation. The open-source nature means unlimited customization, a massive community of extensions and LoRAs, and no censorship restrictions.
For casual users who prefer simplicity, Midjourney or Adobe Firefly might be easier — but for power users and developers, SD 3.5 is unmatched.
Rating: 9/10 ⭐⭐⭐⭐⭐
Best for: Power users, developers, digital artists, privacy-conscious users Standout feature: Free, open-source, fully customizable Hardware: RTX 3080+ or 16GB unified memory Mac recommended