To minimize these risks as AI models continue to improve, we are building a new team called Preparedness. Led by Aleksander Madry, the Preparedness team will tightly connect capability assessment, evaluations, and internal red teaming for frontier models, from the models we develop in the near future to those with AGI-level capabilities. The team will help track, evaluate, forecast, and protect against catastrophic risks spanning multiple categories, including:
- Individualized persuasion
- Cybersecurity
- Chemical, biological, radiological, and nuclear (CBRN) threats
- Autonomous replication and adaptation (ARA)
The Preparedness team's mission also includes developing and maintaining a Risk-Informed Development Policy (RDP). Our RDP will detail our approach to developing rigorous frontier model capability evaluations and monitoring, creating a spectrum of protective actions, and establishing a governance structure for accountability and oversight across that development process. The RDP is meant to complement and extend our existing risk mitigation work, which contributes to the safety and alignment of new, highly capable systems, both before and after deployment.
- Source: https://openai.com/blog/frontier-risk-and-preparedness