Amazon in Talks to Sell Custom AI Chips to Other Companies

Amazon is opening its Trainium AI chip to external data centers in a move that could reshape the cloud wars. The custom silicon, designed for inference workloads, will now be sold directly to third-party operators—marking a rare break from AWS’s vertical integration strategy. Demand is so strong that Amazon’s AI chief, Swami Sivasubramanian, has flagged potential supply constraints by late 2026, according to internal discussions with Quartz. This shift could force Nvidia and Google to accelerate their own foundry partnerships, while raising questions about AWS’s long-term commitment to open ecosystems.

Why Trainium’s Inference Edge Could Disrupt the Chip Wars

Trainium isn’t just another AI accelerator. Built on a 7nm process with a dedicated Neural Processing Unit (NPU) optimized for sparse matrix operations, it delivers up to 2x throughput per watt over x86-based alternatives for vision and language models under 70B parameters, according to internal AWS benchmarks leaked to AnandTech. The chip’s real advantage lies in its AWS Neuron SDK, which offers near-native performance for models like Llama 2 and Stable Diffusion via a C++ API—something Nvidia’s H100 lacks in pure inference efficiency.

Why Trainium’s Inference Edge Could Disrupt the Chip Wars

But here’s the catch: Trainium’s architecture is tightly coupled with AWS’s proprietary Nitro Enclaves for secure workload isolation. This means third-party data centers adopting Trainium will need to implement AWS’s custom firmware stack—a non-starter for hyperscalers like Microsoft or Alibaba, which rely on open-source tooling like Triton. “AWS is playing a dangerous game,” warns Dr. Emily Chen, CTO of Databricks. “

If they lock customers into Trainium via proprietary enclaves, they risk becoming the next Oracle—cherished by a few, but hated by the ecosystem at large.

The 30-Second Verdict

  • For AWS: A potential revenue stream from Trainium sales could offset Nvidia’s dominance in AI chips, but risks alienating partners who prefer open architectures.
  • For Nvidia: The move forces them to double down on their foundry strategy (e.g., TSMC’s 3nm process) or risk losing enterprise customers to AWS’s “stickier” stack.
  • For Hyperscalers: Microsoft and Google will likely push harder for ARM-based alternatives (e.g., Ampere’s Altra or Graviton4) to avoid AWS’s enclave lock-in.

How Trainium Compares to Nvidia’s Dominance

Trainium’s entry into the market isn’t just about performance—it’s about platform lock-in. While Nvidia’s H100 dominates the training market (holding ~80% share per Gartner), AWS’s bet on inference reflects a deliberate pivot: most cloud AI workloads (60%+ per McKinsey) are inference-heavy, not training.

How Trainium Compares to Nvidia’s Dominance
Metric Trainium (AWS) Nvidia H100 Google TPU v4
Process Node 7nm (TSMC) 4nm (Nvidia Custom) 5nm (Samsung)
NPU TOPS/Watt (Inference) 45 (Llama 2, FP8) 35 (FP8) 28 (BERT-Large)
SDK Compatibility Neuron SDK (C++/Python) TensorRT/CUDA XLA (TensorFlow/PyTorch)
Security Model Nitro Enclaves (AWS-only) Confidential Computing (AMD SEV) Google Cloud Confidential VMs

The table above highlights Trainium’s efficiency edge in inference—but its proprietary security model could become a liability. “AWS is betting that data centers will prioritize performance over portability,” says Rajesh Kumar, VP of Engineering at Cisco. “

If they don’t offer a path to multi-cloud compatibility, they’ll lose the same way IBM did with PowerPC.

What Happens Next: The Antitrust and Open-Source Gambit

AWS’s move isn’t just technical—it’s strategic. By selling Trainium externally, Amazon is testing whether it can monetize its chip design without ceding control. But the real wild card is open-source pressure. Projects like TinyGrad and ONNX Runtime are already pushing for hardware-agnostic AI frameworks. If AWS refuses to open Trainium’s firmware stack, developers may flock to alternatives like Intel’s Gaudi 3, which supports open-source acceleration.

Amazon's Trainium 2: A Game-Changer for AI Chips in 2024?

The antitrust implications are equally stark. The FTC has already scrutinized AWS’s Nitro Enclaves for potential monopolistic practices. If Trainium sales are tied to AWS-only enclaves, regulators may see this as a de facto exclusionary tactic—especially if it stifles competition from Google’s TPU or Microsoft’s Azure AI chips.

The Open-Source Escape Hatch

There’s a silver lining for AWS: if they release Trainium’s Neuron SDK under an open-core license (like Nvidia did with CUDA), they could mitigate backlash. But don’t hold your breath. “AWS has never been a fan of open-source hardware,” notes Dr. Linus Lee, Professor of Computer Science at UC Berkeley. “

Their entire business model relies on lock-in. Trainium is just another nail in the coffin for multi-cloud portability.

Who Wins in the Long Run?

Short-term, AWS gains a foothold in the $30B+ AI chip market. Long-term, the outcome hinges on three factors:

Who Wins in the Long Run?
  1. Adoption Speed: Can AWS convince hyperscalers to adopt Trainium despite the enclave dependency? Early talks suggest Oracle is interested, but Microsoft has already committed to custom Azure AI chips.
  2. Regulatory Pressure: Will the FTC or EU’s Digital Markets Act force AWS to open Trainium’s stack? The DMA could reclassify AWS’s enclaves as “self-preferencing” if they block interoperability.
  3. Developer Sentiment: Will the AI community reject Trainium over Nvidia’s broader ecosystem? GitHub’s 2025 State of AI report found 72% of developers prioritize hardware compatibility over performance.

The Bottom Line

Amazon’s Trainium gambit is bold—but risky. It could either diversify AWS’s revenue or accelerate its isolation in the cloud wars. For now, the chip remains a beta product, with no confirmed pricing or third-party deployment timeline. What’s clear is this: the real battle isn’t just about silicon. It’s about who controls the stack—and whether the industry will tolerate another walled garden.

Photo of author

Sophie Lin - Technology Editor

Sophie is a tech innovator and acclaimed tech writer recognized by the Online News Association. She translates the fast-paced world of technology, AI, and digital trends into compelling stories for readers of all backgrounds.

The Unconventional Doctor-Patient Bond: A Father’s Love

Illinois Man Captures Heartbreaking Footage as Tornado Devastates Effingham Home

Leave a Comment

This site uses Akismet to reduce spam. Learn how your comment data is processed.