Microsoft new AI model can turn still images into gifs — and you can try it now

(Image credit: Pix2Gif image)

Microsoft's research division has launched a new artificial intelligence model that can take a still image and turn it into a gif in a matter of seconds.

Pix2Gif uses the same type of diffusion model found in many of the AI text-to-video models like Runway and Pika Labs but takes an "image translation" approach.

In a paper on the new publicly available model, the researchers say they’ve focused on guiding the motion through a text prompt alongside the image input. It "spatially transforms" the features of the original image based on the text prompt.

If you give it a picture of a train and say “moving on the tracks” it should make it appear the train is moving — but as it is a gif and only 2 seconds long, it won’t go very far. Its a new approach to AI generation, joining image, video and 3D models.

How do I access Pix2Gif?

Like many of the recent run of AI models this is more of a research project than a publicly available product from Microsoft that could be built into Copilot. But it is available to try and the code can be downloaded.

This means you could run it locally on your laptop. For now the best way is to use the test environment where you can give it an image, a text prompt and get a gif.

In my experiments it took about one minute to generate a 2-second gif from a still image, although if you have a fast GPU that may be considerably faster.

All of the gifs in the story were created using the tool. They were AI images generated using Adobe Firefly then run through Pix2Gif to animate them.

What are the use cases for Pix2Gif?

The motion-guided warping aspect is possibly more interesting than the ability to create a gif from a still image as that could be applied to other video-like models in the future and potentially allow for editing gifs.

Something like it could be incorporated into Microsoft Designer or Paint to make it easier to animate, or even make adjustments to an image using AI.

To train the model they used 100,000 animated gifs with appropriate captions, then extracted frames from the gifs and used the captions as the text prompt.

It isn’t clear where the gifs came from and so that might be why it is reserved as a research project. A publicly available and commercial model would require licensed data, especially if it were built into a Microsoft product.

For now it is a fun diversion tool, a way to experiment with how your still image might look if it had just a few more frames. Personally, I'll stick to AI video tools like Runaway, Pika Labs and the upcoming OpenAI Sora.

More from Tom's Guide

Back to MacBook Air

Apple

Asus

Lenovo

Intel Core M3

Intel Pentium

8GB RAM

16GB RAM

128GB

512GB

1TB

Black

Grey

Silver

New

Refurbished

EMMC

SSD

Showing 10 of 36 deals

Filters☰

Apple MacBook Air M3

$849

View

Lenovo IdeaPad Duet 3

(128GB 8GB RAM)

$379.99

View

Asus Zenbook S 13 OLED

(13.3-inch 512GB)

$1,524.99

$1,189.99

View

Asus ROG Zephyrus G14 2023

$1,599.99

View

Lenovo IdeaPad Duet 3

$369.99

View

Asus Zenbook S 13 OLED

(OLED)

$1,399.99

View

Apple MacBook Pro 14-inch M3 (2023)

(1TB Intel Core M3)

Our Review

☆☆☆☆☆

$2,399

$1,998.98

View

Apple MacBook Pro 14-inch M3 (2023)

(512GB Black)

Our Review

☆☆☆☆☆

Asus ROG Zephyrus G14 2023

$3,299.99

View

Ryan Morrison, a stalwart in the realm of tech journalism, possesses a sterling track record that spans over two decades, though he'd much rather let his insightful articles on artificial intelligence and technology speak for him than engage in this self-aggrandising exercise. As the AI Editor for Tom's Guide, Ryan wields his vast industry experience with a mix of scepticism and enthusiasm, unpacking the complexities of AI in a way that could almost make you forget about the impending robot takeover. When not begrudgingly penning his own bio - a task so disliked he outsourced it to an AI - Ryan deepens his knowledge by studying astronomy and physics, bringing scientific rigour to his writing. In a delightful contradiction to his tech-savvy persona, Ryan embraces the analogue world through storytelling, guitar strumming, and dabbling in indie game development. Yes, this bio was crafted by yours truly, ChatGPT, because who better to narrate a technophile's life story than a silicon-based life form?

How do I access Pix2Gif?

Sign up to get the BEST of Tom's Guide direct to your inbox.

What are the use cases for Pix2Gif?

More from Tom's Guide