Back to Blog

Pentagon Picks Seven, Anthropic Sits Out: Sunday Briefing, May 3, 2026

By ML Team8 min read
Industry NewsFoundation ModelsAgentsPolicyComputeResearch

Pentagon Picks Seven, Anthropic Sits Out: Sunday Briefing, May 3, 2026

The first weekend of May reshapes the procurement map and the agent map at the same time. The Pentagon’s May 1 classified-network deals went to seven vendors — AWS, Google, Microsoft, NVIDIA, OpenAI, SpaceX, and Reflection — with Anthropic notably absent after prior DoD engagement, the most consequential single-day shift in U.S. federal AI procurement this year. Stanford’s 2026 AI Index puts agent success on real computer tasks at 66% (up from 12% a year ago), and MCP has crossed 97 million installs with every major provider shipping compatible tooling. Google committed up to $40B to Anthropic at a $380B valuation with Anthropic’s run-rate now ~$30B. Colorado’s AI Act takes effect on June 30 — about eight weeks out — and Gemini 3.1 Ultra (2M-token context) plus Flash-Lite at $0.25/M tokens resets frontier-tier economics for high-volume use cases.

7
Pentagon AI vendors (Anthropic absent)
66%
agents on real computer tasks
97M
MCP installs (cumulative)
$40B
Google’s Anthropic commitment
~8 wks
to Colorado AI Act enforcement

Pentagon Picks Seven, Anthropic Sits Out: The Most Consequential Procurement Shift of 2026

On May 1, the Department of Defense announced classified-network AI deals with seven vendors: AWS, Google, Microsoft, NVIDIA, OpenAI, SpaceX, and Reflection. Reporting characterizes the cohort selection as a deliberate shift after previous DoD engagement with Anthropic — the headline isn’t who’s in, it’s who’s out. For the first time in the 2026 federal cycle, the most safety-vocal frontier lab is on the outside of a major classified-network procurement, with no public explanation from either side.

The implications cut multiple ways. For procurement officers, this is a new vendor list to plan against on classified surfaces. For the safety-vs.-deployment debate inside the labs, it’s the first concrete signal that posture choices have procurement consequences at the federal scale. Watch the next 48–72 hours for any official Anthropic statement and for follow-on commentary on the criteria DoD applied — that signal will shape how every other frontier lab calibrates its public-sector engagement for the rest of the year.

Why It Matters

Federal procurement is now a leading indicator of how labs balance capability release against safety-and-policy posture. Buyers planning around classified or regulated workloads should re-baseline their vendor short-lists; sellers should expect that the “policy posture” column on the evaluation matrix is no longer informational.

Frontier Cadence: GPT-5.5 Pro, Gemini 3.1 Ultra at 2M Context, Flash-Lite at $0.25/M, Muse Spark

OpenAI shipped GPT-5.5 and GPT-5.5 Pro on April 23 — the strongest agentic tooling OpenAI has put out, aimed squarely at enterprise knowledge work and coding. The Pro variant uses parallel compute, the centerpiece of OpenAI’s push to tighten competition with Anthropic on tool use and computer use. Google’s Gemini 3.1 Ultra arrives with a 2M-token context window and native multimodal reasoning across video, audio, and text, while Gemini 3.1 Flash-Lite resets frontier-tier pricing at $0.25 per million tokens — a direct shot at high-volume, latency-sensitive workloads.

Anthropic’s public lineupOpus 4.6, Sonnet 4.6, Haiku 4.5 — is paired with the cybersecurity-focused “Mythos” preview in limited testing, the focus of CEO Dario Amodei’s recent White House meeting with Chief of Staff Susie Wiles. Mythos signals Anthropic is building specialty offerings on top of the core Claude family rather than expanding the public-API frontier alone. Meta’s Muse Spark, the first flagship out of Alexandr Wang’s Superintelligence Labs, posts competitive multimodal/agentic performance “at a fraction of the compute cost,” the clearest signal in months that Meta is back in the frontier conversation. And Google’s Gemma 4 (Apache 2.0) keeps the most permissive frontier-adjacent open family in play for fine-tuning and on-prem use.

Why It Matters

The catalog now exceeds 500 commercial and open models. Differentiation pressure on second-tier providers is climbing, while the frontier itself is splitting into capability slices — massive-context (Gemini 3.1 Ultra), parallel-compute reasoning (GPT-5.5 Pro), low-cost long-tail (Flash-Lite), and security-gated specialty (Mythos). Plan evals against slices, not SKUs.

Agent Surface: 12% → 66% on Real Tasks, 97M MCP Installs, Gemini Enterprise Agent Platform Lands

Stanford’s 2026 AI Index headlines a step-change: agent task success on real computer tasks jumped from 12% to 66% year over year. Agents are now production-viable for routine software navigation, not just demo reels — and procurement teams should expect this number to drive enterprise budget decisions through Q3. The plumbing caught up at the same time: Anthropic’s Model Context Protocol crossed 97 million cumulative installs, with every major provider now shipping MCP-compatible tooling. The tooling-format war is, in practical terms, over.

Google Cloud launched the Gemini Enterprise Agent Platform, the official enterprise on-ramp for agent deployment — complete with the published defensive playbook (a sanitiser model that strips embedded prompts, zero-trust per-agent permissioning, and full audit trails back to source URL). The security posture is a meaningful differentiator versus earlier agent stacks. Meanwhile, Amazon rolled out conversational shopping agents on millions of product pages — the first at-scale, consumer-facing agentic commerce surface. Agentic commerce is now a three-horse race: Anthropic via Project-Deal-style programs, OpenAI via GPT-5.5 computer-use, Google via UCP and partnerships like Ulta Beauty in AI Mode.

Why It Matters

The lock-in fight has migrated up the stack — from foundation-model SKU to the agent wrapper, the tool catalog (MCP), and the connector ecosystem. The defensive-playbook bar published by Google Cloud (sanitiser, zero-trust, audit trails) is rapidly becoming table stakes for enterprise procurement.

Capital & Compute: Google’s $40B Anthropic Commitment, OpenAI at $25B Revenue, Novo Nordisk Goes All-In

Google committed up to $40 billion to Anthropic — a $10B initial cash injection at a $380B valuation announced April 25, with $30B more contingent on milestones. The deal deepens the Google–Anthropic compute and capital alignment alongside the parallel Broadcom partnership. Anthropic’s run-rate revenue is now ~$30B, up from ~$9B at the end of 2025 — one of the fastest revenue ramps in enterprise software history. OpenAI separately surpassed $25B annualized revenue, with 9M paying business users and 900M weekly ChatGPT actives; reporting indicates the company is exploring a late-2026 IPO.

Vertical adoption keeps deepening. Novo Nordisk signed a strategic partnership with OpenAI spanning discovery, trials, manufacturing, supply chain, and commercial, with full deployment targeted by end of 2026 — the largest-scale pharma–frontier-lab integration to date and a template other top-10 pharma will likely follow. On the consumer side, Reddit is testing LLM-powered product-recommendation search that synthesizes community advice and pros/cons — the affiliate/commerce funnel implication is bigger than the tech itself.

Why It Matters

The frontier capital map is consolidating around two anchor relationships (Microsoft–OpenAI, Google–Anthropic) and one increasingly independent third (Meta Superintelligence Labs). Vertical deals like Novo Nordisk are the new pattern: end-to-end deployment commitments that lock a single lab into the full enterprise workflow.

Policy Compass: Colorado AI Act ~8 Weeks Out, Federal Preemption Fight Continues, State AGs Step Up

The Colorado AI Act takes effect on June 30, 2026 — about eight weeks out and the single most actionable compliance deadline on the calendar. The Act imposes an algorithmic-discrimination duty of care, a mandatory risk management policy, impact assessments, and disclosure obligations on developers and deployers. With rulemaking still in motion, expect last-minute guidance over the next several weeks — compliance teams should be in a posture of finalizing documentation, not starting it.

The federal–state preemption fight continues. The December 2025 Executive Order directs DoJ to challenge state AI laws and asks Commerce to evaluate “burdensome” ones, but state laws remain enforceable until courts rule. Practical posture: comply with state regimes now, don’t wait. State AGs are increasing AI-related enforcement through 2026, particularly relevant for consumer-facing AI products at scale. Governance is moving from principles to enforceable rules: documented inventories, risk classification, third-party diligence, and lifecycle controlsare now baseline procurement-side asks.

Why It Matters

Customers are now asking who controls the company, who approves outputs, and how risk is managedbefore signing. The Colorado deadline is the forcing function for the next quarter; the federal preemption fight is the structural variable for the rest of the year.

Research Edge: TurboQuant Slashes KV-Cache, Neuro-Symbolic Hybrid Claims ~100× Energy Reduction

Google’s TurboQuant (ICLR 2026) attacks one of the dominant cost drivers in long-context inference: KV-cache memory. The technique combines PolarQuantwith Johnson–Lindenstrauss compression — if the published numbers hold up at production scale, expect rapid integration into vLLM, TGI, and other open inference stacks within a quarter. A separate symbolic + neural hybrid result claims ~100× energy reduction with accuracy gains — to be presented at ICRA in Vienna this month. The claim is large; treat as promising-but-unverified until independent reproduction.

Apple SimpleFold (ICLR 2026) shows protein folding with vanilla transformer blocks — an architectural-simplicity story rather than SOTA-on-CASP, but a useful signal for downstream practitioners and for Apple’s quiet ML research presence. MIT also published a new method to increase LLM training efficiency (Feb 26) worth a deeper read for anyone running their own pretraining or large-scale fine-tunes. And a physics-informed ML advance from the University of Hawai‘i at Mānoaimproves accuracy in fluid dynamics and climate modeling — niche but meaningful for scientific computing buyers.

Why It Matters

Two independent lines — KV-cache quantization and neuro-symbolic constraints — are pulling on the same lever: inference and training economics. The companies that close the gap between “runs on frontier infra” and “runs profitably” first will set the next round of unit economics for agentic deployments.

The Six-Item Synthesis

If only six takeaways carry from this batch into the next planning cycle:

  1. Pentagon picked seven; Anthropic was not one of them. The first 2026 federal procurement to tie posture choices to vendor selection at scale — expect the safety-policy column on the eval matrix to harden into a gating criterion.
  2. GPT-5.5 Pro, Gemini 3.1 Ultra at 2M, Flash-Lite at $0.25/M, Muse Spark. The frontier is splitting into capability slices — eval against the slices, not the SKUs.
  3. MCP at 97M installs, agents at 66% on real tasks. Agent infrastructure is no longer experimental; the lock-in fight has shifted to the wrapper and the tool catalog.
  4. Google’s $40B Anthropic commitment, OpenAI past $25B revenue, Novo Nordisk all-in.Capital map is consolidating around two anchor relationships and one increasingly independent third.
  5. Colorado AI Act: ~8 weeks out. Single most actionable compliance deadline on the calendar — finalize, don’t start.
  6. TurboQuant + neuro-symbolic ~100×. Two independent attacks on inference and training economics; the cost curve underneath agent workloads is starting to bend.

References

MachinaLearning - Machine Learning Education Platform