OpenAI unveils sCM, a new model that generates video media 50 times faster than current diffusion models

October 24, 2024 report

Two researchers at OpenAI have developed a new kind of continuous-time consistency model (sCM) that they claim can generate media samples 50 times faster than the diffusion models currently in use. Cheng Lu and Yang Song have published a paper describing their new model on the arXiv preprint server. They have also posted an introductory article on the company's website.

In machine learning, diffusion models, sometimes called diffusion probabilistic models or score-based generative models, are a type of latent variable generative model.

Such models typically have three major components: a forward process, a reverse process and a sampling procedure. These models are the basis for generating visual media such as video or still images, though they have also been used in other applications, such as audio generation.

As with other machine-learning models, diffusion models are trained on large amounts of data. To generate an end product, most such models execute hundreds of sequential denoising steps, which is why most of them take a few moments to carry out their tasks.
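As a rough illustration of these three components, the sketch below runs a diffusion-style sampler on a toy one-dimensional problem. It is not the method from the paper: the data distribution (a Gaussian with mean MU and standard deviation S), the noise schedule and every name in the code are assumptions chosen so that the score has a closed form and no neural network is needed. A real diffusion model would replace score() with a trained network, so each loop iteration would cost one network evaluation.

```python
import math
import random

# All constants below are illustrative assumptions, not values from the paper.
MU, S = 2.0, 0.5      # toy "data": 1-D Gaussian with mean MU and std S
SIGMA_MAX = 80.0      # largest noise level, where samples are almost pure noise
SIGMA_MIN = 1e-3      # smallest noise level, where sampling stops


def forward(x0, sigma, rng):
    """Forward process: corrupt a clean sample with Gaussian noise of std sigma."""
    return x0 + sigma * rng.gauss(0.0, 1.0)


def score(x, sigma):
    """Exact score of the noised toy data. A real model learns this with a network."""
    return -(x - MU) / (S * S + sigma * sigma)


def sample(num_steps, rng):
    """Sampling: start from noise and follow the reverse process with Euler steps."""
    # Geometric grid of noise levels from SIGMA_MAX down to SIGMA_MIN.
    ratio = SIGMA_MIN / SIGMA_MAX
    sigmas = [SIGMA_MAX * ratio ** (i / num_steps) for i in range(num_steps + 1)]
    x = SIGMA_MAX * rng.gauss(0.0, 1.0)
    evals = 0
    for hi, lo in zip(sigmas, sigmas[1:]):
        # Reverse process written as an ODE in sigma: dx/dsigma = -sigma * score.
        dx_dsigma = -hi * score(x, hi)
        x += (lo - hi) * dx_dsigma
        evals += 1  # one (stand-in) model evaluation per step
    return x, evals


if __name__ == "__main__":
    rng = random.Random(0)
    x, evals = sample(200, rng)
    print(f"sample={x:.3f} after {evals} model evaluations")
```

With num_steps set to 200, sample() calls score() 200 times for every output. That per-sample cost is what few-step methods aim to cut.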

In sharp contrast, Lu and Song have developed a model that generates a sample in just two steps. That reduction in steps, they note, drastically reduces the time their model takes to generate a sample while keeping quality comparable to that of leading diffusion models.
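To show what two-step sampling can look like, here is a toy sketch of multistep consistency sampling as described in the earlier consistency-model literature. It illustrates the general recipe and should not be read as the sCM paper's exact formulation. The data is a one-dimensional Gaussian, so the consistency function (the map from a noisy point to the clean end of its trajectory) is known in closed form; in a real system it would be a trained network. MU, S, SIGMA_MAX, SIGMA_MID and the class and function names are all hypothetical.

```python
import math
import random

# All constants below are illustrative assumptions, not values from the paper.
MU, S = 2.0, 0.5      # toy "data": 1-D Gaussian with mean MU and std S
SIGMA_MAX = 80.0      # starting noise level
SIGMA_MID = 0.8       # intermediate noise level used by the second step


class ConsistencyFn:
    """Closed-form stand-in for a trained consistency model on the toy data.

    It maps a noisy point at noise level sigma directly to the clean end of
    its trajectory, and counts how many times it is called.
    """

    def __init__(self):
        self.calls = 0

    def __call__(self, x, sigma):
        self.calls += 1
        return MU + (x - MU) * S / math.sqrt(S * S + sigma * sigma)


def two_step_sample(f, rng):
    """Two-step consistency sampling: denoise, partially re-noise, denoise again."""
    # Step 1: jump from pure noise straight to a clean estimate.
    x = f(SIGMA_MAX * rng.gauss(0.0, 1.0), SIGMA_MAX)
    # Add noise back at an intermediate level.
    x_mid = x + SIGMA_MID * rng.gauss(0.0, 1.0)
    # Step 2: jump to a clean estimate again, which refines the first one.
    return f(x_mid, SIGMA_MID)


if __name__ == "__main__":
    rng = random.Random(0)
    f = ConsistencyFn()
    x = two_step_sample(f, rng)
    print(f"sample={x:.3f} after {f.calls} model evaluations")
```

Each output here costs two evaluations of the stand-in model, whatever the noise range, whereas an iterative sampler pays one evaluation per step. The speedup in a real system depends on the cost of each evaluation and on the sampler being compared, so this toy says nothing about the 50-times figure itself.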

The new model has 1.5 billion parameters and can produce a sample in a fraction of a second on a machine with a single A100 GPU. This is approximately 50 times faster than models currently in use.

The researchers note that their new model also requires far less computational power than other models, which addresses an ongoing concern as the use of AI applications skyrockets. They also note that they have benchmarked their approach against other models, both those in current use and those under development by other teams. They suggest their model should allow for real-time generative AI applications in the near future.

More information: Cheng Lu et al, Simplifying, Stabilizing and Scaling Continuous-Time Consistency Models, arXiv (2024). DOI: 10.48550/arxiv.2410.11081

OpenAI blog: openai.com/index/simplifying-s … -consistency-models/

Journal information: arXiv

Citation: OpenAI unveils sCM, a new model that generates video media 50 times faster than current diffusion models (2024, October 24) retrieved 24 October 2024 from https://techxplore.com/news/2024-10-openai-unveils-scm-generates-video.html This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

