Democratic Governance of Frontier AI: A Markup

On June 2, OpenAI published "Democratic Governance of Frontier AI," a proposal for how the United States should govern its most capable AI systems. In the first of a series of markups, IST’s Fatima Faisal Khan unpacks the proposal for policymakers.

A close reading of OpenAI’s June 2026 federal framework, provision by provision, for the question Congress actually has to answer

Introduction

On June 2, 2026, OpenAI published “Democratic Governance of Frontier AI: A blueprint for a federal framework,” a proposal for how the United States should govern its most capable AI systems. The blueprint arrived the day after the White House’s executive order on Promoting Advanced Artificial Intelligence Innovation and Security, in the same week that Sam Altman met congressional leaders on the Hill, and alongside the bipartisan Great American AI Act draft from Representatives Obernolte and Trahan. In a moment when Washington was searching for a federal approach, a frontier developer offered a full framework. It proposes three things: 

  1. A “reverse federalism” that codifies recent and pending state laws, including California’s SB 53, New York’s RAISE Act, and Illinois’s SB 315, and then preempts the states. 
  2. A strengthened Center for AI Standards and Innovation (CAISI) as the government’s premier institution for evaluating frontier models and measuring recursive self-improvement
  3. A whole-of-government resilience strategy across compute, biodefense, and cybersecurity

In much of the commentary featured in Politico, The Hill, Mashable, and others immediately following the release of the framework, most asked why OpenAI wrote this. Rather than retread that well-covered ground, I focused on the question: what would be required to make the framework work? There are three core challenges that policymakers would face were they try to implement the framework as it stands. 

First, the blueprint reads as a list of obligations but is really a sequence with unstated prerequisites. Its most important requirements—the severe-risk evaluations, comparable transparency reports, third-party audits, and the procurement gate—rely on methodologies and deliverables that do not yet exist. These include the evaluation methodologies, the metrics for recursive self-improvement and loss of control, and a certified assessor ecosystem. It compresses what will be a multi-year build into single sentences, which makes the real hazard a sequencing one. Enacting the fast structural pieces, particularly the federal preemption that displaces state AI laws, before the slow substantive ones exist risks a gap in which operative state law is gone and the federal regime cannot yet function.

Second, the framework relies on measurement and definitions that do not exist. Recursive self-improvement, loss of control, and even cyber, chemical, biological, radiological, and nuclear (CBRN) capability lack agreed evaluation methods, and the document’s own fix is to have CAISI develop them later, in conjunction with the firms being measured. As a result, key terms remain unclear. “Severe risk” carries the evaluation, whistleblower, and liability provisions without a definition. The phrase “critical safety incident” brings to mind issues with implementing the Cyber Incident Reporting for Critical Infrastructure Act of 2022: four years after passage, we have yet to reach a settled definition of a reportable “critical” incident. Pinning these terms down is what determines whether anything is enforceable.

Third, the framework’s binding force depends on choices it leaves to implementation. The centerpiece is a mandatory pre-release CAISI evaluation, but the evaluator cannot block deployment, and a deadline lets developers ship if CAISI does not finish in time. The gate therefore binds in proportion to CAISI’s capacity, so the agency would need the staffing and tooling to keep pace with every frontier release before the requirement carries real force.. A separate design choice is the question of who runs the assessments. The framework assumes a private assessor ecosystem stood up from scratch, though at the current scale of roughly five frontier labs a government-run assessment could be faster and more independent. Policymakers need to decide which route to build and fund. The hardest piece to pin down is the backstop. Liability for severe harms with no blanket safe harbor is what would give the other obligations consequences, yet the blueprint states it as a single principle, leaving the work of turning it into a workable standard as the thing that determines whether the rest of the regime has force.

This is among the first comprehensive frameworks any frontier developer has put before policymakers, and its main contribution may be to show the scale of the task it leaves to the government. The framework presents a set of obligations, but reads more like a sequence in which the faster structural elements, preemption above all, are specified far more fully than the measurement methods, threshold definitions, and assessor institutions on which everything else depends. Most of what would make the regime function has yet to be built, and much of that building falls to policymakers rather than to the company making the proposal. 

This markup is the first in a series examining how frontier developers propose to govern themselves. Eight days after the OpenAI document, Anthropic published its Advanced AI Framework alongside Dario Amodei’s essay Policy on the AI Exponential, proposing an Federal Aviation Administration (FAA)-style testing regime in which the government can block or reverse deployments that fail. That is a sharply different answer to the question this markup turns on, since OpenAI’s evaluator cannot block deployment, and it is the framework I will take up next.

What follows below is my provision-by-provision markup, from the preamble through the conclusion of OpenAI’s federal framework, testing each recommendation against what it would take to build and whether it would hold.

Markup

Instructions: Each green highlighted portion of the framework is accompanied by markup and comments, indicated by italics and quotation marks. Read the unannotated framework here.

Appendix 1 

Frontier safety frameworks (the major labs)

  1. OpenAI, Preparedness Framework (Version 2), April 15, 2025, https://openai.com/index/updating-our-preparedness-framework/
  2. Anthropic, Responsible Scaling Policy (Version 3.1, effective April 2, 2026), https://www.anthropic.com/responsible-scaling-policy
  3. Google DeepMind, Frontier Safety Framework (Version 3.0), September 22, 2025, https://storage.googleapis.com/deepmind-media/DeepMind.com/Blog/strengthening-our-frontier-safety-framework/frontier-safety-framework_3.pdf

 

Transparency-report instruments (the per-model disclosures these frameworks produce)

  1. Anthropic, Responsible Scaling Policy (Version 3.1), sections on Frontier Safety Roadmap and Risk Reports, https://www.anthropic.com/responsible-scaling-policy.
  2. Frontier Model Forum, Issue Brief: Components of Frontier AI Safety Frameworks, November 8, 2024, https://www.frontiermodelforum.org/updates/issue-brief-components-of-frontier-ai-safety-frameworks/

 

Statutory and international disclosure schemas

  1. California Senate Bill 53, Transparency in Frontier Artificial Intelligence Act (TFAIA), signed September 29, 2025, key provisions effective January 1, 2026, official text at https://leginfo.legislature.ca.gov/faces/billTextClient.xhtml?bill_id=202520260SB53
  2. European Commission, General-Purpose AI Code of Practice, final version published July 10, 2025, https://digital-strategy.ec.europa.eu/en/news/general-purpose-ai-code-practice-now-available; full text at https://code-of-practice.ai/

 

Other federal frameworks by major AI Labs:

  1. June 2026: Anthropic, Advanced AI Framework, https://www-cdn.anthropic.com/files/4zrzovbb/website/0a58d567024a8b448ff15158ebc3625328dfcc1f.pdf

Related Content

MENU

GET IN TOUCH

Email: [email protected]
Send us a message: Contact

JOIN THE CATALINK MAILING LIST