Google might have solely just lately begun rolling out its Veo generative AI to enterprise clients, however the firm isn’t losing any time getting a brand new model of the video software out to early testers. On Monday, Google introduced a preview of Veo 2. In response to the corporate, Veo 2 “understands the language of cinematography.” In observe, meaning you possibly can reference a selected style of movie, cinematic impact or lens when prompting the mannequin.
Moreover, Google says the brand new mannequin has a greater understanding of real-world physics and human motion. Accurately modeling people in movement is one thing all generative fashions battle to do. So the corporate’s declare that Veo 2 is healthier in relation to each of these hassle factors is notable. In fact, the samples the corporate offered aren’t sufficient to know for positive; the true take a look at of Veo 2’s capabilities will come when somebody prompts it to generate a video of a gymnast's routine. Oh, and talking of issues video fashions battle with, Google says Veo will produce artifacts like further fingers “much less regularly.”
Individually, Google is rolling out enhancements to Imagen 3. Of its text-to-image mannequin, the corporate says the most recent model generates brighter and better-composed photographs. Moreover, it will probably render extra various artwork kinds with higher accuracy. On the identical time, it’s additionally higher at following prompts extra faithfully. Immediate adherence was a problem I highlighted when the corporate made Imagen 3 obtainable to Google Cloud clients earlier this month, so if nothing else, Google is conscious of the areas the place its AI fashions want work.
Veo 2 will steadily roll out to Google Labs customers within the US. For now, Google will restrict testers to producing as much as eight seconds of footage at 720p. For context, Sora can generate as much as 20 seconds of 1080p footage, although doing so requires a $200 per 30 days ChatGPT Professional subscription. As for the most recent enhancements to Imagen 3, these can be found to Google Labs customers in additional than 100 international locations by means of ImageFX.
This text initially appeared on Engadget at https://www.engadget.com/ai/googles-new-ai-video-model-sucks-less-at-physics-170041204.html?src=rss
