StabilityAI releases Stable Diffusion 3.5 — a step up in realism
Three models in one!
StabilityAI is today releasing its new Stable Diffusion 3.5 family of AI image models. It comes with improved realism, prompt adherence and text rendering compared to SD3.
Like the first incarnation of SD3, Stable Diffusion 3.5 comes in three sizes — Large (8B), Large Turbo (8B), and Medium (2.6B). These are all customizable and tuned to run on consumer hardware, while also being available to use under the Stability AI Community License.
In short, this is a big step towards any user being able to create more realistic AI images. In a press release, StabilityAI admitted its Stable Diffusion 3 Medium model that launched in June "didn't fully meet our standards or our communities’ expectations".
The company added that "After listening to the valuable community feedback, instead of a quick fix, we took the time to further develop a version that advances our mission to transform visual media."
Our AI Editor Ryan Morrison has been testing SD3.5 and says it is a significant upgrade, matching and potentially exceeding the capabilities of the recently released Flux 1.1 Pro.
What's new with Stable Diffusion 3.5?
Stability AI says the models included are focused on customizability, efficient performance, and diverse outputs. "Stable Diffusion 3.5, our most powerful models yet, reflects our commitment to empowering creators with tools that are both widely accessible and cutting-edge," a spokesperson explained.
That means images can be fine-tuned, the models run on consumer hardware "out of the box", and the images generated will feel more unique to one another.
Sign up now to get the best Black Friday deals!
Discover the hottest deals, best product picks and the latest tech news from our experts at Tom’s Guide.
There's also a focus on new stylistic choices, including photography, painting, and more. It takes in hashtag prompts to specify a style such as boho or fashion. It can also use highlighting within the prompt to steer the model in a specific direction.
"Additionally, our analysis shows that Stable Diffusion 3.5 Large leads the market in prompt adherence and rivals much larger models in image quality," a press release explains.
"Stable Diffusion 3.5 Turbo offers some of the fastest inference times for its size, while remaining highly competitive in both image quality and prompt adherence, even when compared to non-distilled models of similar size."
"Stable Diffusion 3.5 Medium outperforms other medium-sized models, offering a balance of prompt adherence and image quality, making it a top choice for efficient, high-quality performance."
The model is free for non-commercial use, including scientific research, and free for small to medium-sized businesses up to $1 million in revenue. Anything beyond that will require an Enterprise Licence.
Prompt: a scene of a celestial garden floating in space, with trees made of crystal and flowers that glow in different colours. Planets and stars can be seen in the sky above, and a river of light flows through the garden, surrounded by statues of ancient gods and glowing orbs. Ethereal and serene.
Prompt: photo of an enchanted spellbook floating above a stone pedestal, pages glowing with ancient runes and blue mystical light. Surrounding the book are floating magical artifacts: a crystal orb, a feathered quill, and a golden key. Dim candlelight and flickering shadows in an old library setting.
Prompt: a brave explorer standing on the edge of a massive waterfall, overlooking a vast, untouched jungle. The explorer is wearing a worn leather jacket and carrying an old map, as the mist from the waterfall rises around them. A glowing golden temple is barely visible in the distance, hidden among the trees.
Prompt:~dreamy vibes~ #boho #fashion, young woman sitting in a sunlit meadow, surrounded by wildflowers, casual and relaxed pose, gentle smile. Soft sunlight filtering through the scene.
Prompt: A couple walking on the beach with the words "Love is Forever" written in the sky in a sky writing style
More from Tom's Guide
A freelance writer from Essex, UK, Lloyd Coombes began writing for Tom's Guide in 2024 having worked on TechRadar, iMore, Live Science and more. A specialist in consumer tech, Lloyd is particularly knowledgeable on Apple products ever since he got his first iPod Mini. Aside from writing about the latest gadgets for Future, he's also a blogger and the Editor in Chief of GGRecon.com. On the rare occasion he’s not writing, you’ll find him spending time with his son, or working hard at the gym. You can find him on Twitter @lloydcoombes.
- Ryan MorrisonAI Editor