SDXL 1.0: The Good, The Bad and The Ugly

Ng Wai Foong
5 min readJul 27, 2023

A new era of generative AI for everyone

Image by the author

On 26 July 2023, Stability.AI announced the public release of SDXL 1.0, another big update to its generative models with breaking changes. The initial responses from the community is largely positive and people are now experimenting the capabilities of the new architecture.

SDXL 1.0 consists of two different models:

  • SDXL-base-1.0: A base text-to-image model to generate 1024 x 1024 images. The base model uses OpenCLIP-ViT/G and CLIP-ViT/L for text encoding.
  • SDXL-refiner-1.0: A image-to-image model to refine the latent output of the base model for generating higher fidelity images. The refiner model only uses the OpenCLIP-ViT/G model.

SDXL 1.0 are licensed under the permissive CreativeML Open RAIL++-M license. Read the technical report here.

This article covers some of my personal opinions and facts related to SDXL 1.0

The Good

Image quality

By default, SDXL 1.0 can generate 1024 x 1024 images natively. This alone is a big improvement over its predecessors. SDXL 1.0 is particularly well-tuned for vibrant and accurate colors. In addition, you can now generate images with proper lighting, shadows and contrast without using…

--

--

Ng Wai Foong

Senior AI Engineer@Yoozoo | Content Writer #NLP #datascience #programming #machinelearning | Linkedin: https://www.linkedin.com/in/wai-foong-ng-694619185/