Google Gemini just got AI image generation back with Imagen 3 — how to try it now

Google Gemini
(Image credit: Google Gemini/Future AI)

All Google Gemini users can make images using Google's latest artificial intelligence image mode, Imagen 3. This includes those using it on the web, in the app or integrated into Android. It was previously only available to Gemini Advanced subscribers using the web version of the AI.

Imagen 3 was unveiled by Google during the I/O event earlier this year. Built by the company's research lab, DeepMind, it can create square format images with a similar level of realism as those from Midjourney or Flux.

There are some limitations, especially for free Gemini users. At the moment, you won't be able to use it to generate images of people unless you pay $19 per month for Gemini Advanced, and even then, it won't make images of real people.

Gemini's true power in terms of image generation comes from its ability to easily adapt or even completely change an image from a simple text description. You may need to update the app to get image creation abilities.

What's new with Imagen 3?

ImageFX Imagen 3

(Image credit: ImageFX Imagen 3/AI)

Imagen 3 is a visual upgrade on the previous Imagen 2. The images are richer and more detailed, and the model is better at following instructions given to it by Gemini.

Google removed image generation capabilities from Gemini for some time over concerns it was being overly cautious when rendering pictures of people. The previous model also tended to make everything look like it was plastic.

The new model brings Google's image generation capabilities in line with DALL-E 3 from OpenAI, although it still only generates square images, whereas ChatGPT can use DALL-E 3 to make pictures of any orientation and size. In my experiments, I found it still refused more often than I'd like to make an image, especially if it was something more complex, such as showing a person sitting in a cafe on a moon base. But for everyday depictions — it is great.

Adding image creation comes off the back of another update, making it easier to send an image to Gemini for analysis. This is known as AI Vision and allows the model to describe the contents of a picture. It can be beneficial for spotting a missing object in a room or creating a recipe from ingredients.

More from Tom's Guide

Ryan Morrison
AI Editor

Ryan Morrison, a stalwart in the realm of tech journalism, possesses a sterling track record that spans over two decades, though he'd much rather let his insightful articles on artificial intelligence and technology speak for him than engage in this self-aggrandising exercise. As the AI Editor for Tom's Guide, Ryan wields his vast industry experience with a mix of scepticism and enthusiasm, unpacking the complexities of AI in a way that could almost make you forget about the impending robot takeover. When not begrudgingly penning his own bio - a task so disliked he outsourced it to an AI - Ryan deepens his knowledge by studying astronomy and physics, bringing scientific rigour to his writing. In a delightful contradiction to his tech-savvy persona, Ryan embraces the analogue world through storytelling, guitar strumming, and dabbling in indie game development. Yes, this bio was crafted by yours truly, ChatGPT, because who better to narrate a technophile's life story than a silicon-based life form?