May 5, 2025
Text-to-video AI blossoms with new metamorphic video capabilities

While text-to-video artificial intelligence models like OpenAI's Sora are rapidly metamorphosing before our eyes, they have struggled to produce metamorphic videos. Simulating a tree sprouting or a flower blooming is harder for AI systems than generating other types of videos because it requires knowledge of the physical world and can vary widely.
But now, these models have taken an evolutionary step.
Computer scientists at the University of Rochester, Peking University, the University of California, Santa Cruz, and the National University of Singapore developed a new AI text-to-video model that learns real-world physics knowledge from time-lapse videos. The team describes their model, MagicTime, in a paper published in IEEE Transactions on Pattern Analysis and Machine Intelligence.
"Artificial intelligence has been developed to try to understand the real world and to simulate the activities and events that take place," says Jinfa Huang, a Ph.D. student supervised by Professor Jiebo Luo of Rochester's Department of Computer Science, both of whom are among the paper's authors. "MagicTime is a step toward AI that can better simulate the physical, chemical, biological, or social properties of the world around us."
Previous models generated videos that typically exhibit limited motion and poor variation. To train AI models to mimic metamorphic processes more effectively, the researchers built a high-quality dataset of more than 2,000 time-lapse videos with detailed captions.
Currently, the open-source U-Net version of MagicTime generates two-second, 512-by-512-pixel clips (at 8 frames per second), and an accompanying diffusion-transformer architecture extends this to 10-second clips. The model can be used to simulate not only biological metamorphosis but also buildings under construction or bread baking in the oven.
But while the generated videos are visually interesting and the demo can be fun to play with, the researchers view this work as an important step toward more sophisticated models that could provide valuable tools for scientists.
"Our hope is that someday, for example, biologists could use generative video to speed up preliminary exploration of ideas," says Huang. "While physical experiments remain indispensable for final verification, accurate simulations can shorten iteration cycles and reduce the number of live trials needed."
More information: Shenghai Yuan et al, MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators, IEEE Transactions on Pattern Analysis and Machine Intelligence (2025). DOI: 10.1109/TPAMI.2025.3558507. On arXiv: DOI: 10.48550/arxiv.2404.05014
Journal information: arXiv, IEEE Transactions on Pattern Analysis and Machine Intelligence
Provided by the University of Rochester
Citation: Text-to-video AI blossoms with new metamorphic video capabilities (2025, May 5), retrieved 7 May 2025 from https://techxplore.com/news/2025-05-text-video-ai-blossoms-metamorphic.html