AI Silicon
Anthropic in Talks to Use Microsoft Maia 200 Chips for Claude
Anthropic is in talks to adopt Microsoft's custom Maia 200 AI chip for its Claude models, adding yet another silicon partner to a portfolio that already includes NVIDIA, AWS Trainium, Google TPUs, and a newly revealed $1.25B/month SpaceX compute deal.
Another Chip Partner for the Compute‑Hungry AI Lab
Anthropic is in talks to adopt Microsoft's Maia 200 custom AI chip to power its Claude models, CNBC reported Thursday, citing a person familiar with the matter. No deal has been signed, but the discussions — first reported by The Information — underscore how aggressively Anthropic is diversifying its silicon supply chain.
The Maia 200, unveiled by Microsoft in January 2026, is designed specifically for AI inference — running existing models faster and cheaper, not training new ones from scratch. Microsoft CEO Satya Nadella told investors in April that the chip delivers "over 30% improved tokens per dollar, compared to the latest silicon in our fleet," CNBC reported. The chips are currently running in Microsoft data centers in Arizona and Iowa but aren't yet available to external Azure customers.
Anthropic's Silicon Shopping Spree
The Maia talks are the latest in a rapid‑fire series of chip and compute deals for Anthropic. In April, the company signed a 10‑year, $100 billion‑plus deal to use Amazon Web Services' custom Trainium chips. In October 2025, it announced plans to use Google's tensor processing units (TPUs). And on Wednesday, SpaceX's IPO filing revealed Anthropic is paying $1.25 billion per month for data center access through 2029. It still relies on NVIDIA GPUs as its historical backbone for training.
That's five separate compute vendors — NVIDIA, AWS Trainium, Google TPUs, SpaceX/xAI compute, and potentially Microsoft Maia. No other AI lab comes close to this level of multi‑vendor diversification.
Why Anthropic Needs Five Chip Suppliers
The answer is straightforward: Anthropic can't get enough compute from any single source. CEO Dario Amodei publicly acknowledged "difficulties with compute" in early May, CNBC reported, as demand for Claude and the Claude Code developer tool outstripped available capacity.
Anthropic's revenue is surging — projected to hit at least $10.9 billion this quarter, per fundraising materials reported by Reuters — and every dollar of that growth requires more inference capacity. The company is approaching its first profitable quarter, but profitability depends on keeping inference costs down. That's where chips like Maia 200, designed to improve tokens‑per‑dollar economics, become critical.
Microsoft's Big Silicon Moment
For Microsoft, landing Anthropic as a Maia customer would be a major competitive win. The company has trailed Amazon (Trainium) and Google (TPUs) in offering custom AI silicon to cloud clients, Quartz noted. Microsoft's custom chip development faced delays last year, and the Maia 200 is still not available externally — a deal with Anthropic would be its first marquee external silicon customer.
The relationship between the two companies is already deep. Microsoft invested $5 billion in Anthropic in November 2025, and Anthropic committed to spending $30 billion on Azure cloud services. But Anthropic also uses Amazon and Google cloud infrastructure, maintaining a deliberate multi‑cloud strategy.
Microsoft shares rose about 2% on the news, Investing.com reported, reflecting investor optimism that the Maia chip could finally compete in the AI silicon market.
Inference‑Only: What the Maia 200 Can and Can't Do
One important detail: the Maia 200 is designed to run existing models, not to train new ones, The Information reported. That means even if Anthropic signs a deal, it would still need NVIDIA GPUs or other training‑capable hardware for developing future Claude models.
For inference workloads — the day‑to‑day serving of Claude to millions of users and developers — Maia 200's cost advantage could be significant. Nadella's claim of 30% better tokens per dollar suggests that running Claude inference on Maia would be meaningfully cheaper than on NVIDIA hardware. At Anthropic's scale, the improved tokens‑per‑dollar efficiency could translate to billions of dollars in annual savings.
The Multi‑Vendor Future of AI Compute
Anthropic's silicon strategy is a blueprint for where the industry is heading. The days of single‑vendor NVIDIA dependence are giving way to a landscape where AI labs mix and match chips based on workload, cost, and availability. Inference goes to Maia or Trainium. Training stays on NVIDIA or TPUs. Overflow capacity gets rented from whoever has it — even competitors like xAI.
For builders, this diversification is good news. More chip vendors competing for AI workloads means downward pressure on inference costs, which eventually shows up in lower API prices. The Maia 200 talks are one more data point suggesting the NVIDIA monopoly on AI silicon is cracking — slowly, but in enough places to matter.
Sources
Related News
May 26, 2026
Chinese AI Models Hit 60% of OpenRouter Usage as Pricing War Threatens OpenAI, Anthropic IPOs
Chinese AI labs have surged from 1% to over 60% of OpenRouter token usage since 2024, with models from DeepSeek, MiniMax, and Zhipu matching frontier capability at one-ninth the cost. Enterprises are adopting 'advisor model' architectures that slash spending, threatening the $800B+ IPO valuations OpenAI and Anthropic are chasing.
May 26, 2026
Anthropic Co-Founder Tells Vatican AI Must Be Guided from Outside Big Tech
Anthropic co-founder Chris Olah, the sole Big Tech representative at the Vatican's presentation of Pope Leo XIV's first encyclical on AI, warned that frontier AI labs operate within “incentives and constraints that can conflict with doing the right thing” and called for religious communities, governments, and civil society to hold the industry accountable.
May 23, 2026
Anthropic and OpenAI Race to Embed AI Agents on Wall Street
Within a 72-hour window in May 2026, Anthropic and OpenAI each launched enterprise deployment arms, announced major financial-services partnerships, and shipped agent tooling targeting Wall Street's most critical workflows. The race to become the operating system for finance is accelerating — and the stakes have never been higher.