I just put Google's new Imagen 3 AI image generator to the test with 7 prompts — and I'm blown away
Photorealism from Google is here
Google has unveiled a new version of its Imagen 3 artificial intelligence image generation model that promises improved realism, better prompt adherence and a wider range of custom styles from photorealism and impressionism to abstract and anime.
While you may not be familiar with Imagen 3 itself, if you’ve ever used Gemini to create an image, or even adapted images on an Android phone, chances are you’ve used the model from the Google DeepMind AI lab. The best place to use it is in the ImageFX labs experiment.
With the new update, Imagen 3 has improved not only in how it renders images but also in how it understands prompts. For example, it now understands the language of photography, including lens types and lighting, better than previous models. That gives Imagen 3 the potential to be one of the best AI image generators.
The best way to put it to the test is to use the completely free ImageFX tool, which is part of Google Labs. It has one distinctive feature that allows you to quickly adapt the prompt after the first version is generated, for example by switching out lens types.
Putting Imagen 3 to the test
To find out just how well Imagen 3 works I've come up with a series of photography-style prompts. Each of these prompts includes a different lens or camera type. Some also have different techniques such as sports photography or photojournalism.
The idea is to see how well the model generates the image, and, more importantly, captures the emotion and feeling of the moment outlined in the prompt.
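If you want to build a similar set of photography-style prompts yourself, the structure described above (a subject, a lens or camera, then lighting and mood) can be sketched in a few lines of Python. This is only an illustration of the prompt pattern, not how ImageFX works internally: the `PhotoPrompt` class and its field names are hypothetical, and the code only produces text that you would paste into an image generator by hand.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class PhotoPrompt:
    """Hypothetical container for the parts of a photography-style prompt."""
    style: str    # e.g. "Street-level photograph"
    subject: str  # what is in the frame
    gear: str     # lens or camera, e.g. "a 35mm lens"
    details: str  # lighting, focus and mood

    def render(self) -> str:
        # Join the parts into a single prompt string.
        return f"{self.style} of {self.subject}, shot with {self.gear}, {self.details}."


london = PhotoPrompt(
    style="Street-level photograph",
    subject="a bustling London street on a rainy day",
    gear="a 35mm lens",
    details="natural light, candid moment",
)

# Swap only the lens and leave the rest of the prompt untouched.
portrait_lens = replace(london, gear="an 85mm f/1.4 lens")

print(london.render())
print(portrait_lens.render())
```

Keeping the lens as its own field makes it easy to change one variable at a time, which is the same idea as switching out lens types after the first image is generated.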
1. A rainy day in London
One thing most models struggle with when asked to generate a street scene is placing the people. They can't tell the road from the sidewalk but Imagen 3 seems to have got it right, having someone cross the street while others are on the side.
The prompt: "Street-level photograph of a bustling London street on a rainy day, people holding umbrellas as reflections shimmer on wet pavement, shot with a 35mm lens, shallow depth of field focusing on a red double-decker bus in the background, natural light, candid moment."
2. A moment of reflection
This prompt could very easily have failed, largely because of the fingers. Yes, almost all models have cracked the finger problem, but they still sometimes struggle with hands holding a cup or shown in close-up. Add in the complexities of depicting age and you can easily end up in the uncanny valley, but not so much here.
The prompt: "Golden hour portrait of an elderly woman with weathered hands holding a steaming cup of tea, soft sunlight highlighting her wrinkles and smile, taken with an 85mm f/1.4 lens for a creamy bokeh background, warm and intimate mood, natural outdoor setting."
3. Feeding the nation
Here we had the model depict a specific type of lighting, the complexities of netting and correct shadows for the time of day. It also had to meet the style requirement of a photojournalistic image.
The prompt: "Photojournalistic image of a fisherman pulling a net from the ocean at sunrise, water droplets glistening in the light, shot on a Canon EOS R5 with a 24-70mm f/2.8 lens, high contrast with sharp detail in the man's hands and the waves, capturing human resilience."
4. The Barista’s art
Weirdly, latte art is something AI image models can struggle with. Imagen 3 not only got it right but also placed fingers correctly.
The prompt: "Natural light photograph of a barista pouring steamed milk into a cappuccino in a rustic European café, soft focus on the coffee cup while the background remains blurred, shot with a 50mm f/1.8 lens, capturing the steam rising and the texture of the foam."
5. Caught in the moment
I had to make a few tweaks to this image. Originally I wanted to depict sweat droplets, but they looked like rain, so I went for the rain motif. Looks good.
The prompt: “Dynamic long exposure shot of a sprinter mid-stride during a track and field race, muscles tensed and rain drops visible in the air, shot with a 70-200mm f/2.8 telephoto lens, fast shutter speed for pin-sharp focus, motion blur in the background.”
6. Full of potential
Here I wanted to see if Imagen 3 could capture emotion in an image or, at the very least, depict an artistic, model-style photograph. It achieved the goal, properly capturing the right shadows and harsh light for a black and white image.
The prompt: “High-contrast black and white portrait of a young man standing under a bridge, sharp shadows and highlights emphasizing his angular jawline and intense gaze, taken with a Leica M10 and a 50mm lens, classic film grain effect for a timeless look.”
7. A candid moment
This was another image that required some tweaking to get it right. I wanted a casual, but impressive (taken with a good camera) shot of a farmer. It needed to position the farmer in such a way that he looks uncomfortable at having his photograph taken but also proud of his farm.
The prompt: “An environmental portrait of an elderly farmer standing proudly in the middle of a corn field at sunset, blue moonlight casting long shadows, shot with a Nikon Z9 and a 35mm f/1.4 lens, bokeh on the farmer’s face and hands while the background shows rows of wheat softly blurred, capturing the grit and indifference of rural life.”
One more thing: Terrible photography
I wanted to see how well Imagen 3 could handle bad photography. It is great that models are able to create stunning works of art, realistic brilliant photographs and abstract pieces that lead you to question whether it was human-made or not — but what about bad pictures?
I gave Imagen 3 this prompt to see how it handled the type of terrible photography commonly produced by cameras in the 80s and 90s. I wasn’t disappointed.
The prompt: “A poorly lit indoor snapshot taken with a film camera using a harsh flash, saturating the faces of two people sitting at a dinner table, creating red-eye and deep, unflattering shadows on the wall behind them, taken at close range with slightly off-center framing.”
Ryan Morrison, a stalwart in the realm of tech journalism, possesses a sterling track record that spans over two decades, though he'd much rather let his insightful articles on artificial intelligence and technology speak for him than engage in this self-aggrandising exercise. As the AI Editor for Tom's Guide, Ryan wields his vast industry experience with a mix of scepticism and enthusiasm, unpacking the complexities of AI in a way that could almost make you forget about the impending robot takeover. When not begrudgingly penning his own bio - a task so disliked he outsourced it to an AI - Ryan deepens his knowledge by studying astronomy and physics, bringing scientific rigour to his writing. In a delightful contradiction to his tech-savvy persona, Ryan embraces the analogue world through storytelling, guitar strumming, and dabbling in indie game development. Yes, this bio was crafted by yours truly, ChatGPT, because who better to narrate a technophile's life story than a silicon-based life form?
- mikexilva: This can be the beginning of a new era in computational photography where it analyses the image and recreates it with more detail on objects well known and change image parameters to user preferred options (like simulate certain camera and lens). (Not unlike what Samsung does with the 100x zoom on moon pictures with S23Ultra.)
- lexter99: These are great generated images. But I can't agree with your comment that the model got the pedestrian placement correct - the person "crossing the street" is actually walking down the middle of the street, head on into the bus!
- RyanMorrison (replying to lexter99): If you’ve ever crossed the street in London, that is exactly how it happens.
- melchizedediah: Still defaulting to white American except for athlete where it defaults to black. Reflects training days I guess.