Forget ChatGPT — Google Gemini can now see the world with live video and screen-sharing

The new Gemini app home page vs the old
(Image credit: Future)

Google's AI assistant, Gemini, is set to introduce exciting features to give Android users new ways to interact more intuitively with their devices. Leveraging advanced capabilities, Gemini will soon allow users to ask questions about content on their screens, much like the screen sharing feature currently available in Gemini 2.0 on desktop.

In a recent announcement, Google unveiled these Gemini functionalities, which focus on real-time interaction and on-screen inquiries. These features are part of Google's Project Astra.

New functionalities

Google Gemini features

(Image credit: Google Gemini)

The screen-sharing function allows users to share their screens with Gemini and ask questions based on displayed content. For instance, while viewing an image of a jacket, a user might ask for shoe recommendations to complement the attire.

The live video interactions, which are undoubtedly a direct response to ChatGPT's Voice and Vision option, let users engage in real-time conversations about their surroundings by enabling the camera within the Gemini app.

This functionality allows Gemini to provide insights based on live video feeds, similar to a video call experience.

These enhancements position Gemini as a versatile AI assistant capable of understanding and interacting with visual content to deliver support that is more personalized and context-aware.

Integration with existing applications

Gemini sending details to contacts

(Image credit: Evan Blass)

Gemini's new features are designed to integrate seamlessly with various applications such as YouTube. Now, while watching a video, users can activate Gemini to ask questions about the content.

For example, a user could inquire about a specific muscle or fitness technique during an exercise tutorial.

In addition, when viewing a PDF, the "Ask about this PDF" option allows users to obtain summaries or clarifications, streamlining the research process without moving to a desktop.

These features aim to make on-the-go information retrieval more efficient, reducing the need for manual searches and enhancing user productivity.

Project Astra

Project Astra AI agent

(Image credit: Google)

The development of these features falls under Google's Project Astra, a multimodal AI assistant initiative. Project Astra seeks to create an assistant to perceive and understand its environment, facilitating more natural interactions.

By enabling Gemini to interpret visual inputs and engage in contextual conversations, Google is advancing toward a more immersive AI experience.

Availability

Google plans to roll out these features to Gemini Advanced subscribers on Android devices later this month.

Google's introduction of screen-aware capabilities in Gemini marks a pivotal moment in AI assistant development. By allowing users to ask questions about on-screen content, Gemini is moving beyond passive viewing into interactive experiences, enhancing AI's utility in everyday life.

As these features become widely available, they hold the potential to redefine user expectations and set new benchmarks for what AI assistants can achieve.

More from Tom's Guide

Category
Arrow
Arrow
Back to MacBook Air
Brand
Arrow
Processor
Arrow
RAM
Arrow
Storage Size
Arrow
Screen Size
Arrow
Colour
Arrow
Storage Type
Arrow
Condition
Arrow
Price
Arrow
Any Price
Showing 10 of 99 deals
Filters
Arrow
Show more
Amanda Caswell
AI Writer

You must confirm your public display name before commenting

Please logout and then login again, you will then be prompted to enter your display name.

Read more
Google Gemini logo on phone with Google logo in background
Google announces several new AI features for Android and Pixel phones — here’s what’s coming
Gemini Live
Gemini Live major upgrade just revealed by Google
Gemini logo
Forget Siri — Google just brought its Gemini widgets to the iPhone
Gemini AI on Google TV
I just tried Gemini AI on Google TV and you may never use your remote again
Gemini logo
Google's Gemini 2.0 update will supercharge your phone — 3 changes to try first
Gemini 2
Playing your favorite games on phones could include AI advice thanks to Gemini 2.0 — here’s how
Latest in AI
The new Gemini app home page vs the old
Forget ChatGPT — Google Gemini can now see the world with live video and screen-sharing
iOS 18 logo on iPhone in person's lap
iOS 18.5 is coming soon with huge Siri upgrades — here’s everything to expect
The DeepSeek logo seen on the silhouette of a smartphone
I have ChatGPT Plus — but here's 7 reasons why I use DeepSeek instead
Gemini logo
Forget Siri — Google just brought its Gemini widgets to the iPhone
Meta AI logo on a phone
Meta set to release a direct competitor to ChatGPT — here's what you need to know
Shutterstock Sora image
OpenAI just announced that its Sora AI video generator is coming to ChatGPT
Latest in News
NYTimes Connections
NYT Connections today hints and answers — Wednesday, March 5 (#633)
The new Gemini app home page vs the old
Forget ChatGPT — Google Gemini can now see the world with live video and screen-sharing
MacBook Air 15-inch M3
MacBook Air M4 biggest upgrades just tipped right before launch
James Marsden and Sterling K. Brown in Paradise
'Paradise' season finale ending explained — who killed President Bradford?
Android 12
Google March Android Security Update fixes two high severity vulnerabilities — update now
Kieran Culkin as Benjamin "Benji" Kaplan in "A Real Pain"
Hulu top 10 movies — here's the 3 worth watching now