Today in AI — 7 March 2026

Today's top AI news — curated links and commentary on the stories that matter for product builders.

·3 min read

Three separate teams shipped models this week that do the same work with roughly half the resources. The arms race is shifting from scale to efficiency, and the implications for cost, deployment, and what you can run locally are immediate.

Efficiency is the new frontier

GPT-5.4's tool search cuts agent token usage by 47% by loading only the definitions a model needs. OLMo Hybrid matches its predecessor's accuracy with 49% fewer training tokens. Phi-4-reasoning-vision skips chain-of-thought when it doesn't help. Same conclusion from three different directions: the next performance gains come from spending compute more carefully.

The Pentagon standoff escalates

The DoD formally designated Anthropic a supply chain risk — the first time an American company has received that label. Anthropic will challenge in court. Meanwhile, Claude is adding over a million users a day. The designation is meant to freeze Anthropic out of government contracts, but it's doing more for consumer loyalty than any marketing spend could.

Infrastructure economics

Oracle and OpenAI scrapped the Stargate data centre expansion after financing stalled, with Meta circling the site. Oracle is also preparing mass layoffs as AI infrastructure spending hits $50 billion for fiscal 2026. Foxconn, meanwhile, posted record revenue from building the servers everyone else is buying. The returns from AI infrastructure are concentrating in hardware suppliers, not the companies writing the cheques.

Developer ecosystem shifts

GitHub's Octoverse data shows AI coding tools creating convenience loops: languages that work well with AI attract more developers, generating more training data, which improves AI support further. TypeScript surged 66% to the top spot. OpenClaw passed React in GitHub stars as demand for self-hosted AI agents accelerates. And OpenAI shipped ChatGPT for Excel with live financial data feeds, a sign that AI copilots are moving beyond code into domain-specific tooling.

Google's mixed week

Google gave developers three days to migrate off the Gemini -latest alias, shorter than its own 14-day deprecation policy. Pin your model versions. Separately, DeepMind launched an equity-free robotics accelerator for European startups, with applications closing 25 March.

The cost of running agents is dropping fast; the stability of the platforms you run them on is not. Build for portability.


Stay up to date

Get notified when I publish something new, and unsubscribe at any time.

More news