AI News

Google Launches Two New AI Chips to Compete With Nvidia

Apr 23, 2026, 11:30 AM
4 min read
147 views
Google Launches Two New AI Chips to Compete With Nvidia

Table of Contents

Google has split its eighth-generation tensor processing unit into two separate chips for the first time — one optimized for training AI models and another for running them. The TPU 8t and TPU 8i, announced at Google Cloud Next in Las Vegas, represent Google's most aggressive move yet to reduce the AI industry's dependence on Nvidia while acknowledging that the chip giant remains indispensable.

Two Chips, Two Jobs

The TPU 8t is designed for model training the computationally intensive process of teaching AI models from massive datasets. The TPU 8i is built for inference — the ongoing work of processing user prompts and generating responses after a model has been trained. By separating these functions into dedicated chips, Google can optimize each for its specific workload rather than forcing a single chip to handle both.

Google claims the new TPUs deliver up to 3x faster AI model training, 80 percent better performance per dollar, and the ability to connect more than one million TPUs in a single cluster. The result should be significantly more compute for less energy and lower cost critical advantages as AI infrastructure costs continue to spiral upward.

Not Replacing Nvidia — Yet

Despite the impressive specs, Google is not positioning these chips as a direct replacement for Nvidia. Like Amazon with its Trainium chips and Microsoft with its Maia accelerators, Google is using custom silicon to supplement — not replace — the Nvidia-based systems it offers in its cloud.

Google confirmed that Nvidia's latest chip, Vera Rubin, will be available on Google Cloud later this year. The company has also agreed to work with Nvidia on engineering improved networking that allows Nvidia-based systems to perform more efficiently in Google's infrastructure, including collaboration on the open-source networking technology Falcon.

Chip market analyst Patrick Moorhead jokingly noted on social media that he had predicted Google's TPUs would be bad news for Nvidia back in 2016, when the first TPU launched. Nvidia is now a nearly $5 trillion company — a reminder that predictions of Nvidia's decline have consistently been premature.

Why Custom Chips Matter

The strategic logic behind custom AI chips is straightforward. Nvidia GPUs are powerful but expensive, and demand routinely exceeds supply. By building their own chips, cloud providers like Google can offer customers an alternative that is cheaper per unit of compute, optimized specifically for their cloud environment, and available without the supply constraints that affect Nvidia hardware.

For Google's customers — including Anthropic, which recently signed a major TPU capacity deal with Google and Broadcom, and Thinking Machines Lab, which just secured a multi-billion dollar Google Cloud agreement — custom chips provide cost savings that compound at scale. When you are training models that require millions of chip-hours, even small efficiency improvements translate into billions of dollars in savings.

The Hyperscaler Chip Race

Google is not alone in this push. Amazon's Trainium chips have already won over major customers including Anthropic, OpenAI, and Apple. Microsoft is developing its Maia AI accelerator. And all three hyperscalers are investing billions in custom silicon alongside their Nvidia purchases.

The long-term question is whether these custom chips will eventually reduce the industry's Nvidia dependence to the point where it materially affects the chip giant's business. For now, the answer appears to be no. As Google's AI cloud business grows, it is buying more Nvidia hardware alongside deploying more TPUs — a rising tide that lifts both boats.

But if enterprises increasingly port their AI workloads to cloud-native custom chips because the economics are better, the balance could eventually shift. Google's decision to split its TPU line into training and inference chips suggests it is getting more serious about making that case to customers.

The Bigger Picture

The TPU 8t and 8i announcements cap a massive Google Cloud Next conference that has included generative AI features for Google Maps, new enterprise partnerships, and expanded Gemini availability. Together, these moves position Google as a full-stack AI platform provider from custom chips to cloud infrastructure to consumer and enterprise applications.

For the AI industry, the proliferation of custom chips from all three major cloud providers means more compute options, lower costs, and reduced risk of Nvidia supply bottlenecks. Whether it also means the beginning of the end of Nvidia's dominance is a question the market has been asking for a decade — and one that Nvidia, now worth nearly $5 trillion, has answered decisively so far.

Muhammad Zeeshan

About Muhammad Zeeshan

Muhammad Zeeshan is a Tech Journalist and AI Specialist who decodes complex developments in artificial intelligence and audits the latest digital tools to help readers and professionals navigate the future of technology with clarity and insight. He publishes daily AI news, analysis, and blogs that keep his audience updated on the latest trends and innovations.

Comments (0)

Leave a Comment

No Comments Yet

Be the first to share your thoughts!

Relevant AI Tools

More AI News

Robinhood Now Lets AI Agents Trade Stocks for You
Robinhood Now Lets AI Agents Trade Stocks for You

Robinhood launched support for agentic trading and a new AI agent credit card, letting AI agents read portfolios, execute trades, and make payments using dedicated wallets with spending limits and approval controls. It is one of the boldest moves yet in agentic finance.

May 28, 2026, 3:00 PM

DuckDuckGo Installs Surge as Users Flee Google AI Search
DuckDuckGo Installs Surge as Users Flee Google AI Search

DuckDuckGo app installs spiked as much as 30% after Google's I/O 2026 Search overhaul replaced blue links with AI agents. The backlash reveals a growing segment of users who want control over how much AI they encounter — and an off switch Google never gave them.

May 28, 2026, 11:00 AM

Human Archive Pays India Gig Workers to Train Robots
Human Archive Pays India Gig Workers to Train Robots

Silicon Valley startup Human Archive raised $8.2 million to pay India's gig workers roughly $1 an hour to wear camera-equipped caps and sensors, collecting the real-world data that robotics labs need to train physical AI — and sparking a privacy debate.

May 28, 2026, 7:00 AM

What ClickUp's AI Layoff Means for the Future of Work
What ClickUp's AI Layoff Means for the Future of Work

ClickUp replaced hundreds of employees with 3,000 AI agents and is paying survivors million-dollar salaries. The move is a preview of how AI is reshaping the workforce — creating a small group of highly paid orchestrators while the middle disappears.

May 28, 2026, 3:00 AM

Grok Has Just 3 Federal AI Uses vs OpenAI's 234: Reuters
Grok Has Just 3 Federal AI Uses vs OpenAI's 234: Reuters

Reuters found Grok appears in just 3 of 400+ federal AI use cases compared to OpenAI's 234, undermining SpaceX's AI growth narrative ahead of its IPO.

May 26, 2026, 3:00 PM

Gartner Names OpenAI, GitHub, Cursor AI Coding Leaders
Gartner Names OpenAI, GitHub, Cursor AI Coding Leaders

Gartner published its first Magic Quadrant for AI Coding Agents, naming OpenAI Codex, GitHub Copilot, and Cursor as Leaders in the new enterprise category.

May 26, 2026, 11:00 AM