Google launches Lumiere, a ‘cute and fluffy’ text-to-video generator

Jan 25, 2024

Alex Baker

Alex Baker is a portrait- and lifestyle-driven photographer based in Valencia, Spain. She works on a range of projects from commercial to fine art, has had work featured in publications such as The Daily Mail, Conde Nast Traveller and El Mundo, and has exhibited work across Europe.

If you like all things cute and wish you could create videos of them, then Google has the answer for you. Lumiere, Google’s latest take on text-to-video generation, dropped on Tuesday, and it’s great… if you like pandas driving cars.

Google describes Lumiere as a “Space-Time Diffusion Model for Realistic Video Generation”. I’m not really certain what that means since I thought all video existed in space and time. However, let’s take a closer look.

Google says that the generator is capable of creating videos from both text prompts and still images. It can also create videos in a particular targeted style. This sounds like many of the other young AI video generators out there. But apparently, Lumiere is drastically different in how it works.

According to Google, Lumiere introduces a Space-Time U-Net architecture (ah, that’s the space-time connection then) that generates the entire temporal duration of the video at once, through a single pass in the model. This is in contrast to other AI video generators, which the paper says synthesize distant keyframes and then fill in the frames between them. That approach is why other video generators can lack consistency. Google hopes to eliminate the problem by making the whole video in one go.

Lumiere can also make effective cinemagraphs, where just one part of the image moves and the rest stays still. It also boasts some fairly impressive examples of wardrobe changes and owls wearing hats.

In the paper, the Google team states that the AI model outputs five-second-long 1024×1024 pixel videos, which they describe as “low-resolution.” I think that, given a little time and space (see what I did there?), this will become more and more powerful and probably offer higher resolutions.

Though I do have to agree with Ars Technica when they say that this could be “the most advanced text-to-animal AI video generator yet demonstrated”.

[via Ars Technica]
