Video generation models as world simulators

OpenAI

Time Stamp: February 15, 2024 3:00 AM

Republished By Plato

This technical report focuses on (1) our method for turning visual data of all types into a unified representation that enables large-scale training of generative models, and (2) qualitative evaluation of Sora’s capabilities and limitations. Model and implementation details are not included in this report.

Much prior work has studied generative modeling of video data using a variety of methods, including recurrent networks,[^1][^2] generative adversarial networks,[^4][^6] autoregressive transformers,[^8] and diffusion models.[^10][^12] These works often focus on a narrow category of visual data, on shorter videos, or on videos of a fixed size. Sora is a generalist model of visual data: it can generate videos and images spanning diverse durations, aspect ratios, and resolutions, up to a full minute of high-definition video.

Source: https://openai.com/research/video-generation-models-as-world-simulators
