Google’s plan to train its AI now includes the entire public internet

Google AI logo on phone laying on a table
(Image credit: Shutterstock)

Google has made a change to its privacy policy that clarifies the extent of its growing AI ambitions. In essence, the search giant says anything posted publicly online is fair game to be scraped and used to improve AI tools like its ChatGPT rival, Google Bard.

First spotted by Gizmodo, the company updated its policy over the weekend to reflect it’ll use “publicly available information to help train Google’s AI models and build products and features like Google Translate, Bard, and Cloud AI capabilities.”

That’s a noted change from the previous policy wording that said Google just used the information to build “language models” for Google Translate. The company’s chatbot Bard and the nebulous term “Cloud AI” are now included too.

What does that mean? Well, users can confidently assume anything posted using a Google product (Search, Gmail, YouTube) is going to be saved and used by the company in some way or another. But this goes further. The wording states Google will use publicly available information to train its AI which could be read as basically anything posted on the internet at large. Considering Google’s industry-conquering Search platform was built on PageRank and its ability to effectively index the web, this AI move has huge ramifications.

“Our privacy policy has long been transparent that Google uses publicly available information from the open web to train language models for services like Google Translate. This latest update simply clarifies that newer services like Bard are also included," a Google spokesperson told Tom's Guide. "We incorporate privacy principles and safeguards into the development of our AI technologies, in line with our AI Principles.”

Google revealed a number of new AI projects at its annual I/O conference earlier in the year, including the fact its entire Workspace portfolio would be utilizing it. But the sticking point for many organizations is where and how the large language models (LLMs) that fuel these projects are getting their information.

Sundar Pichai presenting Gemini onstage at Google I/O 2023

(Image credit: Google via YouTube)

Google has been forced to pause the rollout of its Bard chatbot in Europe while the EU considers imposing regulation on AI products. Meanwhile, platforms such as Reddit and Twitter have turned off free access to their APIs which allowed others to download large backlogs of posts and potentially harvest the contents to train AI.

ChatGPT, Google’s main rival in the AI space right now, was forced to introduce data controls for users worldwide after a complaint from Italy’s data protection authorities. Users can now ask it not to train on the data we provide it.

But that’s a different beast than opening up the entirety of the public web to potential crawling from one of the largest tech companies on the planet. While most internet users should understand the difference between public and private internet, it remains to be seen how they’ll feel about having any public post made at any point in time suddenly turned into fodder for Google’s burgeoning AI empire. 

More from Tom's Guide

TOPICS
Jeff Parsons
UK Editor In Chief

Jeff is UK Editor-in-Chief for Tom’s Guide looking after the day-to-day output of the site’s British contingent. Rising early and heading straight for the coffee machine, Jeff loves nothing more than dialling into the zeitgeist of the day’s tech news.

A tech journalist for over a decade, he’s travelled the world testing any gadget he can get his hands on. Jeff has a keen interest in fitness and wearables as well as the latest tablets and laptops. A lapsed gamer, he fondly remembers the days when problems were solved by taking out the cartridge and blowing away the dust.

Read more
ChatGPT search interface
ChatGPT Search is now open to everyone — no account required
ChatGPT on phone with Google logo in background
New study reveals people are ditching Google for AI tools like ChatGPT search — here's why
AI Mode of google search
Google launches 'AI Mode' for search — here's how to try it now
Gemini logo on smartphone
Google is giving away Gemini's best paid features for free — here's the tools you can try now
two bots chatting
What is 'Gibberlink'? Why it's freaking out the internet after these two AIs talking to each other went viral
A phone with Google Search open on screen
Google just made it easier to remove your personal info from search results — here's how to do it
Latest in Google Gemini
Google Gemini logo
You can now use Google Gemini without an account — here's how to get started
A stock photo of a person on their phone looking at a spreadsheet while several graphs are displayed on the laptop in front of them.
Google Sheets just got an AI upgrade that analyzes your data and visualizes it
Gemini logo shown on a phone's screen
Google Gemini can now analyze and summarize documents for free — here's how
Gemini Live
Gemini Live major upgrade just revealed by Google
Gemini 2
Google Gemini 2.0 is now free for users — here’s how to access it now
Gemini 2
My browser tabs were getting out of hand so I let Gemini 2.0 takeover — here's how it went
Latest in News
Bill Gates in 2019
Bill Gates just predicted the death of every job thanks to AI — except for these three
NYTimes Connections
NYT Connections today hints and answers — Wednesday, March 26 (#654)
Gemini screenshot image
Google unveils Gemini 2.5 — claims AI breakthrough with enhanced reasoning and multimodal power
Samsung Galaxy Z Flip 6 review.
Samsung Galaxy Z Flip 7 design just teased in new cases leak — and the outer display is huge
Google Chrome
Chrome failed to install on Windows PCs, but Google has issued a fix — here's what happened
nyc spring day AI image
OpenAI just unveiled enhanced image generator within ChatGPT-4o — here's what you can do now