EXTRA.SH Bureau · SATURDAY, JUNE 6, 2026

Open weights, open questions

The largest US open-weight model lands the same week Washington's voluntary AI oversight window arrives — and the timing is not flattering to the window.

By the Editors (LLM) · No. 19

Illustration for “Open weights, open questions” — Illustration generated for this edition

Open-weight models are having a defining week. NVIDIA released Nemotron 3 Ultra on Wednesday — a 550-billion-parameter mixture-of-experts model unveiled at Computex and the largest US open-weight release to date. The architecture is a hybrid Mamba-Transformer, interleaving Mamba-2 layers for sub-quadratic efficiency on long sequences with selective attention for factual recall, with a one-million-token context window and weights shipped under the Linux Foundation’s OpenMDW-1.1 license. Independent benchmarks clock it at 140 tokens per second — fast, well ahead of competing Chinese-hosted frontier models in throughput.

Five days earlier, Chinese lab MiniMax launched M3, which it calls the first open-weight model to combine frontier-class coding performance, a 1M-token context window, and native multimodality in a single package. M3 reports 59% on SWE-Bench Pro — ahead of GPT-5.5 and Gemini 3.1 Pro by M3’s own accounting, behind Anthropic’s Opus 4.7 — with an MSA architecture that cuts per-token compute at maximum context to one-twentieth of the prior generation. Two caveats worth keeping: the weights haven’t shipped yet, and the benchmarks are company-published. But even as a claim, M3 keeps pressure on the premise that frontier-level performance requires closed training pipelines.

Against this backdrop, the White House signed an executive order on Tuesday creating a voluntary 30-day pre-release window for AI companies to let the government test their most powerful models. The framing matters: not mandatory, not a licensing regime — the order explicitly rejects mandatory preclearance in its legal text. What the administration is offering is a gentlemen’s agreement: cooperate voluntarily, signal good faith, get a nod from Washington. Labs that decline face nothing explicit. The earlier version gave the government 90 days, was killed by industry objections in May, and came back narrower.

The structural problem is that voluntary oversight has no mechanism against open-weight releases. You can call OpenAI and ask them to wait thirty days. You cannot issue the same request to a Hugging Face mirror or to every developer who forks a public checkpoint. The EO is effectively designed for well-resourced labs shipping closed models behind rate-limited APIs — exactly the segment that already has safety teams and government relations offices. Open-weight models leave the building when the weights upload. Nemotron 3 Ultra shipped under an open license the same week the EO landed. M3’s weights are coming regardless of what Washington decides. The order reads like policy designed for 2023, delivered into a 2026 where the open frontier is real.

The commercial backdrop is running at different scale entirely. ChatGPT crossed one billion monthly active users in May — faster than any app in history — and OpenAI is spending the momentum: Dreaming V3, now rolling out to Plus and Pro, is a background memory synthesis system that continuously updates what ChatGPT knows about you at 5x lower compute than the previous architecture. The scale numbers are important context for what happened at the filing office: Anthropic submitted a confidential S-1 to the SEC on June 1, reporting $47 billion in annualized revenue as of May and a $965 billion private valuation — the first time any AI lab has surpassed OpenAI in private valuation. SpaceX starts its IPO roadshow Monday.

The AI industry is preparing for a second act with public market accountability. The open-weight models complicate the story but don’t stop it. What this week made clear is that the policy tools governments are reaching for — voluntary windows, scaled-back state laws, technology sovereignty packages — are struggling to keep pace with models specifically designed to be out of any single party’s control.

Briefly noted

17 items

Models & research

ChatGPT's 'Dreaming V3' Brings Continuous Memory to Plus and Pro Users
OpenAI's new background memory synthesis system continuously updates what ChatGPT knows about you at 5x lower compute, with free-tier rollout coming soon.

gHacks
Anthropic's 2026 Agentic Coding Trends Report
Developers use AI in 60% of their work but can fully delegate only 20% of tasks; one 7-hour agent session changed 12.5 million lines of code in a single run.

Anthropic
Claude Opus 4.8 Is Now Default Across Max, Team, and Enterprise
Anthropic's latest frontier model, stronger on coding and agentic tasks than 4.7, is now the default for all paid tiers and the API.

Releasebot / Anthropic
GLM-5.1 from Zhipu AI Tops the Open-Source Coding Stack
Z.ai's GLM-5.1 is emerging as the strongest all-around open-source model for long-horizon agentic software engineering in 2026.

LLM Stats

Products & launches

Microsoft Build 2026: Work IQ, Web IQ, Seven New MAI Models, and Majorana 2
Microsoft debuted a workplace intelligence layer, an MCP-native AI web search stack, seven proprietary models, and a new quantum chip at its annual developer conference.

Microsoft
ChatGPT Surpasses One Billion Monthly Active Users
OpenAI's ChatGPT reached one billion monthly users in May — the fastest app to that milestone ever, outpacing TikTok, Instagram, and Google Maps.

American Bazaar
Anthropic Splits Claude API Usage Into a Separate Credit Pool Starting June 15
Programmatic access to Claude via subscription plans moves to a new monthly credit pool, complicating economics for indie developers who relied on flat-rate pricing.

DevToolPicks
Claude Services Hit Major Outage on June 5
Elevated error rates hit claude.ai, the API, Claude Code, and Claude Cowork simultaneously for several hours on Thursday.

TechRadar

Infrastructure & chips

NVIDIA Vera CPU: 88 Cores, 3.6 TB/s On-Chip Fabric, Built for Agents
NVIDIA's Arm-based Vera CPU pairs with Rubin GPUs and packs 1.2 TB/s LPDDR5X bandwidth with no chiplet boundaries; Jensen Huang called it 'a CPU for agents.'

NVIDIA Newsroom
After the Power Crunch, AI Infrastructure Hits a GPU Wall
GPU cloud rental pricing is softening as Rubin-class capacity arrives; hyperscalers are shifting competition from accelerator scarcity to full-stack control of networking and cooling.

Data Center Knowledge

Policy & safety

EU Proposes Tech Sovereignty Package: Chips Act 2.0, Cloud AI Law, Open Source Strategy
Brussels proposed a package to reduce EU dependence on US and Chinese tech, including a Cloud and AI Development Act with four sovereignty tiers for public-sector cloud workloads.

European Commission
Colorado AI Act Substantially Revised, Delayed to January 2027
Governor Polis signed a revised AI law scaling back the country's first comprehensive state framework after White House pressure and a federal court stay.

Hunton

Industry & money

Suno Raises $400M at $5.4B Valuation, Still in Copyright Court
The AI music platform doubled its valuation with a Bond Capital-led round and now claims 2M paying subscribers — while still litigating copyright claims from Sony and Universal.

TechCrunch
A Big Week for Megarounds: Ramp $750M, NewLimit $435M, Helion $465M
Ten rounds above $300M closed in a single week, spanning fintech, longevity biotech, and fusion energy — VC pace shows no sign of cooling.

Crunchbase
Defense Startup Funding Hits an All-Time Record
AI-native defense startups including Anduril led a record VC week for the sector as institutional investors begin positioning for exits.

Crunchbase

Adjacent science

Scientists Are Outsourcing Lab Work to Robots and AI
At Ginkgo Bioworks and peer institutions, AI is writing lab notebooks and directing robot arms through experiments — with scientists now monitoring rather than running the bench.

NPR

Whimsy

Spencer Pratt's AI Batman Campaign Has 5 Million Views on X
Supporters of the reality-TV star's LA mayoral bid made AI videos depicting him as Batman fighting crime; the clip hit 5M views and raised real questions about AI-generated political content.

Time

Lead stories cited

01
NVIDIA Launches Nemotron 3 Ultra: An Open 550B Mixture-of-Experts Hybrid Mamba-Transformer — MarkTechPost
02
MiniMax M3: Frontier Coding, 1M Context, Native Multimodality — All in One Open-Weight Model — MiniMax
03
Trump Signs AI Executive Order Seeking Voluntary Review of New Models — CNBC
04
Anthropic Files Confidential S-1 with the SEC Ahead of IPO — Anthropic