AI video models try to mimic real-world physics — but they don't understand it

AI tiger walking in snow
(Image credit: Haiper)

AI video generators can’t understand the laws of physics solely by watching videos, scientists have found.

Coming hot on the heels of chatbots and image generators, AI video generators like Sora and Runway have already been delivering impressive results. But a team of scientists from Bytedance Research, Tsinghua University, and Technion were curious to learn if such models could discover physical laws from visual data without any additional human input.

In the real world, we understand physics through math. In the world of video generation, an AI model that understands physics should be able to watch a sequence of frames and then predict which ones come next — both for scenes resembling those it has seen before and for unfamiliar ones.

To find out whether this understanding exists, the scientists built a 2D simulation using simple shapes and movements and generated hundreds of thousands of mini videos for their model to train and be tested on. They found that the models could 'mimic' physics but not understand it.

Is SORA really a world model? - YouTube

The three fundamental physical laws they chose to simulate were the uniform linear motion of a ball, the perfectly elastic collision between two balls, and the parabolic motion of a ball.
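These three laws are simple enough to express in a few lines of code each. As a rough illustration (this is not the researchers' actual simulation code, just a minimal sketch of the underlying equations), each can be written as an update rule a simulator would apply frame by frame:

```python
# Minimal sketch of the three physical laws studied, written as
# per-timestep update rules. Hypothetical helper names, not from the paper.

def uniform_motion(x, v, dt):
    """Uniform linear motion: position advances at constant velocity."""
    return x + v * dt

def elastic_collision(m1, v1, m2, v2):
    """Perfectly elastic 1D collision: conserves momentum and kinetic
    energy; returns the post-collision velocities of both balls."""
    v1_new = ((m1 - m2) * v1 + 2 * m2 * v2) / (m1 + m2)
    v2_new = ((m2 - m1) * v2 + 2 * m1 * v1) / (m1 + m2)
    return v1_new, v2_new

def parabolic_motion(x, y, vx, vy, dt, g=9.81):
    """Parabolic (projectile) motion: constant horizontal velocity,
    constant downward gravitational acceleration on the vertical axis."""
    x_new = x + vx * dt
    y_new = y + vy * dt - 0.5 * g * dt**2
    return x_new, y_new, vx, vy - g * dt
```

The point of choosing such simple laws is that every frame of the training videos is fully determined by a handful of numbers, so any failure to predict the next frame reflects the model, not ambiguity in the data.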

According to the team's pre-print paper, the shapes behaved as they should in simulations resembling the data the models were trained on, but failed to behave properly in new, unseen scenarios. At best, the models tried to mimic the closest training example they could find.

During the course of their experiments, the scientists also observed that the video generator often changed one shape into another (e.g. a square randomly turns into a ball) or made other nonsensical adjustments. The model's priorities appeared to follow a clear hierarchy, with color holding the highest importance, followed by size, and then velocity. Shape received the least emphasis.

Have they found a solution?

“It is challenging to determine whether a video model has learned a law instead of merely memorizing the data,” the researchers said. They explained that since the model’s internal knowledge is inaccessible, they could only infer the model’s understanding by examining its predictions on unseen scenarios.

“Our in-depth analysis suggests that video model generalization relies more on referencing similar training examples rather than learning universal rules,” they said, highlighting this happens regardless of the amount of data a model trains on.

Have they found a solution? Not yet, lead author Bingyi Kang wrote on X. “Actually, this is probably the mission of the whole AI community,” he added.

Christoph Schwaiger

Christoph Schwaiger is a journalist who mainly covers technology, science, and current affairs. His stories have appeared in Tom's Guide, New Scientist, Live Science, and other established publications. Always up for joining a good discussion, Christoph enjoys speaking at events or to other journalists and has appeared on LBC and Times Radio among other outlets. He believes in giving back to the community and has served on different consultative councils. He was also a National President for Junior Chamber International (JCI), a global organization founded in the USA. You can follow him on Twitter @cschwaigermt.