AI image generator shoot-out: I tested ChatGPT vs Gemini vs Meta AI to crown a winner
Things got interesting with 5 prompts and 3 chatbots
The competition between Google Gemini’s Imagen, OpenAI’s ChatGPT, and Meta AI is fierce. After experimenting with them individually, I decided to conduct a side-by-side comparison to truly see which is the best AI image generator right now.
With AI-generated imagery becoming a key part of creative work, each platform has its own strengths. I put the AI models to the test a mix of realistic and simplistic prompts to assess how the different AI models handle various subjects. My goal was to determine which AI could generate the most impressive results across five basic categories.
Here's a look at how each platform faired based on the quality of the generated images, and which ultimately came out on top.
Creating the prompts
To keep the comparisons fair, I diversified the prompts enough to test each AI’s ability to generate detailed, aesthetically pleasing images. Each of the prompts were tested on the AI’s ability to interpret texture, color, and composition while maintaining a level of creativity. The categories were: food, home decor, animals, vehicles and landscapes, allowing me to explore the full range of their abilities.
Workflow
I used each platform's image generation features in their default settings. While Google Gemini and OpenAI offer premium services, I stuck with their free tiers for this comparison. Google Gemini’s Imagen is integrated within Google’s platform and Meta AI delivers images through Instagram, Facebook and WhatsApp. OpenAI’s ChatGPT, equipped with the DALL-E image generation feature, delivers quick results on its single platform.
After generating images on the individual platforms, I evaluated each image based on clarity, creativity, and how well the AI captured the intent behind the prompt.
1. Food
Prompt: Create a gourmet burger with truffle fries
Google Gemini: The image was visually stunning, with an over-the-top burger and a crisp focus on the layers. Each element (bun, patty, toppings) came out in sharp detail all while giving the burger an almost top-heavy, uneven detail, something I feel is often the reality of ordering a loaded burger. The fries had the perfect golden hue, and the truffle seasoning was visually distinct.
Meta AI: The image had a larger-than-life aspect with an extremely meaty burger, strong color contrast and appeal of the melted cheese. The details of the truffle seasoning were incredibly refined, and the fries were realistically placed even more so than that of Gemini’s output.
ChatGPT: This one is obviously desperate to win by throwing in an extra order of fries, but the overall image was far more artistic, almost painterly quality. The truffle fries were detailed but less realistic compared to Google’s and Meta’s version.
Winner: Meta
This was an incredibly tough call between Google Gemini and Meta AI. Both excelled with generating a juicy, gourmet burger that made me hungry for lunch. But I’m going to ultimately go with Meta AI as the winner here because of the incredibly juicy beef patty. It was mouthwateringly realistic and the extra cheese helps. The near-photographic result of both Gemini and Meta AI was impressive. OpenAI’s image has a creative flair, but the burger looked less realistic and almost comical.
2. Home decor
Prompt: Create an image of a minimalist living room with a large window overlooking the ocean.
Google Gemini Imagen: The design was sleek, with clean lines but minimal lighting. The ocean view was stunningly realistic, but it almost seems as though the living room is floating in the water with an exaggerated perspective of the ocean. Is this living room on a boat?
Meta AI: The image captured the minimalist aesthetic but missed some details in the textures and lighting that would elevate the realism of the scene. The water, though close, appears to be separate and not directly next to the living room.
ChatGPT: The image leaned more into what I was hoping for – a clear distinction between the living room and the ocean, with bold colors, interesting shapes, and a visually appealing sky. Where the ocean lacked in detail, the wall art coupled with the unique coffee table were welcomed touches.
Winner: Meta: Meta AI and ChatGPT knocked it out of the park here, though I’m ultimately going with Meta AI as the winner because it seemed to capture the essence of the prompt the best, including a living room that seems to welcome the view, unlike ChatGPT’s row of seats facing away from the view. Meta AI’s attention to realism gave it an edge in this category, though OpenAI’s creative take offered a more unique vision.
3. Animal
Prompt: Create an image of a colorful parrot perched on a tree branch.
Google Gemini Imagen: The parrot was highly detailed, with vivid feathers and realistic texture. The details in the branch added a touch of natural atmosphere without much of a background otherwise. The prompt, however, did say “colorful” and while this bird is a gorgeous green, I was expecting a more vibrancy and color.
Meta AI: The coloring on this parrot was more of what I was expecting. The well-constructed image was stunning right down to the beak and talons. The leaf in the scene added to the overall aesthetic.
ChatGPT: The parrot was colorful and artistic but lacked the fine details in feather texture that would make it lifelike. It had a more surreal look with a focus on bright colors over intricate details. The added touch of the background was nice but, like the extra helping of fries, not requested.
Winner: Meta: Gemini delivered a very lifelike bird perched on a tree branch and ChatGPT generated a bird that seemed to have a storybook quality, that appealed to my Disney-loving side. But I’m going with Meta AI for this one because it balanced realism with vibrancy and color that I was expecting given the prompt.
4. Vehicle
Prompt: Create an image of a futuristic electric car on a city street at sunset
Google Gemini Imagen: The car looked sleek and modern, with clear, reflective surfaces. The sunset added warmth, and the cityscape was detailed with soft lighting effects. The electric charger in the scene was a nice detail emphasizing the electric aspect of the car.
Meta AI: The vehicle design was bold and certainly futuristic. The bright colors really made this image pop with the refinement of light and shadows to capture the sunset. The detail of the city street added to the ambiance.
ChatGPT: The car design was futuristic but almost overly so and the sunset and cityscape were less defined. The sleek road was almost too perfect giving the image a slightly more conceptual feel rather than photorealism.
Winner: Meta: It’s interesting to me that all of the AI models generated a very similar looking electric car and futuristic scene. So far, these images are the most alike in terms of following the prompt. Meta AI is the clear winner as it nailed the combination of futuristic design and environmental detail, with ChatGPT offering a more conceptual but less realistic take. Gemini is a close second offering lots of detail and realism.
5. Landscape
Prompt: Create an image of a serene mountain cabin surrounded by pine trees with mist rolling in.
Google Gemini: The pine trees and mountains were detailed, but the cabin looked dull and uninhabitable, more abandoned than serene. The stark scene was portrait-like and believable, yet lacked the ambience that I was hoping for in the image
Meta AI: The mist and trees rendered well, though the cabin gave off a cartoonish vibe with the excess ivy and greenery on the roof. The background is what makes this image truly stand out.
ChatGPT: The image was ethereal, with the mist exaggerated for a dreamlike effect. The scene had a soft, painterly quality that made it feel like a fantasy illustration.
Winner: ChatGPT: I had to keep checking to be sure that I had not switched the Meta AI and ChatGPT images. I’m used to ChatGPT generating images with a little more artistic flair, but this time it was Meta AI that missed the mark with an overly creative interpretation. Google again excelled in realism, but the overall winner here was ChatGPT for checking all the boxes with its standout image.
Winner overall: Meta
After testing these five prompts, it’s clear that both Google Gemini’s Imagen and Meta AI are the go-to for photorealistic images that closely mirror real-world details. Meta AI offers solid performance, generating images with incredible detail and coherence, but tends to be more stylized and can lack the refinement in fine details that Gemini does so well. ChatGPT, on the other hand, excels in creativity, often delivering more artistic or surreal interpretations of prompts.
Overall, Meta AI was the clear winner, providing good middle-ground options and outperforming the other chatbots with realism and better attention to prompt details.
More from Tom's Guide
- I saw the future of interior design — and its an AI plugin for Photoshop
- Will digital watermarking save the world from fake news?
- Apple Intelligence adds a Type to Siri feature — here's how to launch this feature
Sign up to get the BEST of Tom's Guide direct to your inbox.
Here at Tom’s Guide our expert editors are committed to bringing you the best news, reviews and guides to help you stay informed and ahead of the curve!
-
cam.macleod The funny thing is this totally personal preference, because I honestly thought Gemini easily pulled first in every scenario, except the interior design. I was honestly very impressed by Meta though. ChatGPT's always feels extremely artificial somehow.Reply -
jansyren I feel that Gemini followed your prompt better while the others embellished the picture more. While they might given you more what you hoped for, I think it is better that it doesn't embellish and you have to refine the prompt to get what you envisioned.Reply
Like the view over the ocean, that's very vague, change the prompt to be "a view overlooking a rocky beach with the ocean beyond fading into the horizon." This might have given you a better picture. Because I think it is better that the ai takes the prompt very literal. -
jansyren
I agree, gemeni followed the prompt closest without embellishing. I tried the following prompt on the living room: "Create an image of a minimalistic living room with a large window overlooking a rocky beach with the ocean beyond fading into the distance" and that's exactly what I got.cam.macleod said:The funny thing is this totally personal preference, because I honestly thought Gemini easily pulled first in every scenario, except the interior design. I was honestly very impressed by Meta though. ChatGPT's always feels extremely artificial somehow. -
AI_Nerd Imagen is one of my favorites for generating photos with minimal prompting. It's got good prompt adherence and is fairly effective at generating some good images. I thought they were the best overall. I don't know what Meta uses but it works pretty well.Reply
Dall-E was impressive maybe a few years ago, but I hardly ever use it anymore. There are simply better generators for nearly every kind of image I would want to create.
For the most part I use SDXL and various fine tunes/variants for SDXL, like Dreamshaper XL lightning, when those fail I also use Imagen 3.0 and it's fast variant, or Flux and Flux Schnell.
Really I'm kinda disappointed an open source image generator like Stable Diffusion XL wasn't included. Especially when Dall-E was. I'm not sure which version is free, but honestly I don't get great results from 3, or 2, and I haven't used the first in a long time. -
AI_Nerd
Imagen (Alphabet/Googles image generator) is really very good for photos. It tends to have stronger prompt adherence than some other generators, so what you type is largely what you get.jansyren said:I agree, gemeni followed the prompt closest without embellishing. I tried the following prompt on the living room: "Create an image of a minimalistic living room with a large window overlooking a rocky beach with the ocean beyond fading into the distance" and that's exactly what I got. -
AI_Nerd
AGREE, I hardly ever use Dall-E anymore. It's very expensive and even for creating cartoon or "artificial images" there are simply better, and cheaper generators. I think OpenAI has easily one of the best LLMs out there, but they are not even worth mentioning with image Gen. I don't know much about Dall-E 3, whether it's a CNN, GAN, a VAE/transformer like Dall-E or a Diffusion Model like Dall-E 2, but I've never cared much for the images any of them have generated, other than when I was new to image generation and they were the first one I tried.cam.macleod said:The funny thing is this totally personal preference, because I honestly thought Gemini easily pulled first in every scenario, except the interior design. I was honestly very impressed by Meta though. ChatGPT's always feels extremely artificial somehow. -
brysoncreighton Interesting outcome. 100% personal preference and based on your breakdowns of each image I found it odd that you chose Meta in most of the sections. Your closing statement also seems to contradict itself.Reply
Gemini seemed to stick to your prompt without adding additional components or making assumptions. As other users mentioned the ocean scene is a great example, I'd rather the model follow explicit instruction and refine via prompt editing than to be forced into the world of embellishment.
Having used a plethora of image generators I lean towards realism and this is Gemini's default approach. It not only followed your prompts to a T, it also rendered the most realistic result. This can be difficult to achieve with most platforms.
Overall good write up! I was surprised by Meta's performance. -
jeffey29 I shared what you said about Meta AI with Meta AI . Here is the reply from the AIReply
Here's a potential response from Meta AI:
"Thank you, Tom's Guide, for the comprehensive review! We're thrilled to be declared the overall winner, demonstrating Meta AI's commitment to delivering high-quality, realistic images.
We appreciate the acknowledgment of our strengths in detail, coherence, and prompt understanding. Our team will continue refining fine details, balancing stylization and realism, and ensuring consistency across prompts.
We're proud to be among innovative leaders like Google Gemini and ChatGPT, driving advancements in AI-generated imagery.
Your feedback fuels our growth. We look forward to future evaluations and continued improvements!"