Frontier risk and preparedness

OpenAI

Frontier risk and preparedness

AITime Stamp: October 26, 2023 3:00 AM

Source Node: 2350914

Republished By Plato

Followers: 0

To minimize these risks as AI models continue to improve, we are building a new team called Preparedness. Led by Aleksander Madry, the Preparedness team will tightly connect capability assessment, evaluations, and internal red teaming for frontier models, from the models we develop in the near future to those with AGI-level capabilities. The team will help track, evaluate, forecast and protect against catastrophic risks spanning multiple categories including:

Individualized persuasion
Cybersecurity
Chemical, biological, radiological, and nuclear (CBRN) threats
Autonomous replication and adaptation (ARA)

The Preparedness team mission also includes developing and maintaining a Risk-Informed Development Policy (RDP). Our RDP will detail our approach to developing rigorous frontier model capability evaluations and monitoring, creating a spectrum of protective actions, and establishing a governance structure for accountability and oversight across that development process. The RDP is meant to complement and extend our existing risk mitigation work, which contributes to the safety and alignment of new, highly capable systems, both before and after deployment.

SEO Powered Content & PR Distribution. Get Amplified Today.
PlatoData.Network Vertical Generative Ai. Empower Yourself. Access Here.
PlatoAiStream. Web3 Intelligence. Knowledge Amplified. Access Here.
PlatoESG. Carbon, CleanTech, Energy, Environment, Solar, Waste Management. Access Here.
PlatoHealth. Biotech and Clinical Trials Intelligence. Access Here.
Source: https://openai.com/blog/frontier-risk-and-preparedness

Time Stamp: October 26, 2023

More from OpenAI

Customizing GPT-3 for Your Application

Source Cluster:

Source Node: 1222078

Time Stamp: Dec 14, 2021

DALL·E 3 system card

DALL·E 3 system card

Source Cluster:

Source Node: 2307577

Time Stamp: Oct 3, 2023

March 20 ChatGPT outage: Here’s what happened

March 20 ChatGPT outage: Here’s what happened

Source Cluster:

Source Node: 2029062

Time Stamp: Mar 24, 2023

GPTs are GPTs: An early look at the labor market impact potential of large language models

GPTs are GPTs: An early look at the labor market impact potential of large language models

Source Cluster:

Source Node: 2022734

Time Stamp: Mar 17, 2023

Introducing Text and Code Embeddings in the OpenAI API

Source Cluster:

Source Node: 1210197

Time Stamp: Jan 25, 2022

Forecasting potential misuses of language models for disinformation campaigns and how to reduce risk

Forecasting potential misuses of language models for disinformation campaigns and how to reduce risk

Source Cluster:

Source Node: 2180781

Time Stamp: Jan 11, 2023

Procgen and MineRL Competitions

Source Cluster:

Source Node: 768080

Time Stamp: Jun 9, 2020

DALL·E API Now Available in Public Beta

Source Cluster:

Source Node: 1734687

Time Stamp: Nov 3, 2022

GPT-4V(ision) system card

GPT-4V(ision) system card

Source Cluster:

Source Node: 2291469

Time Stamp: Sep 25, 2023

New and Improved Content Moderation Tooling

New and Improved Content Moderation Tooling

Source Cluster:

Source Node: 1776447

Time Stamp: Aug 10, 2022

OpenAI acquires Global Illumination

OpenAI acquires Global Illumination

Source Cluster:

Source Node: 2291559

Time Stamp: Aug 16, 2023

Learning to play Minecraft with Video PreTraining

Learning to play Minecraft with Video PreTraining

Source Cluster:

Source Node: 2325559

Time Stamp: Jun 23, 2022