Dev Tools

Google Cloud Next 2026: New TPU Chips, Gemini Enterprise Agents, and a $750M Partner Bet

Google unveiled its 8th-generation TPU chips, a comprehensive Gemini Enterprise Agent Platform, and $750 million in partner funding at Cloud Next 2026, positioning itself as the full-stack infrastructure provider for the agentic AI era. With 75% of its cloud customers already using AI products and 330 customers processing over one trillion tokens monthly, Google is betting its enterprise future on owning the entire AI stack.

1h ago 5 min read

Google’s annual Cloud Next conference has long been a showcase for its enterprise ambitions, but the 2026 edition felt different in scale and urgency. Over two days in San Francisco, the company rolled out its most comprehensive hardware and software platform updates in years — new silicon to challenge Nvidia, a full-stack agent orchestration framework to compete with Microsoft and Anthropic, and a $750 million commitment to pull its 120,000-member partner network into the agentic AI era.

The central thesis: Google wants to own the entire AI stack, from custom chips to the no-code agent builder that a marketing manager uses on a Tuesday afternoon.

Two New Chips to Challenge Nvidia’s Inference Dominance

The headline hardware announcement was a split of Google’s Tensor Processing Unit (TPU) line into two distinct architectures for the first time.

The TPU 8t targets AI training, delivering 3x the raw processing power of last year’s Ironwood TPU with twice the performance per watt. Each TPU 8t superpod can scale to 9,600 chips sharing 2 petabytes of memory — a configuration designed to compress frontier model training cycles “from months to weeks,” according to Google.

The TPU 8i is a purpose-built inference chip, and it’s the more aggressive competitive move. Its Boardfly topology directly connects 1,152 TPUs in a single pod, with 3x more on-chip SRAM than the previous generation for larger key-value caches. Google claims 80% better performance-per-dollar over Ironwood at low-latency targets — specifically when serving the mixture-of-experts (MoE) models that now dominate the frontier. At those models’ scale, SRAM capacity directly determines whether you can avoid expensive memory fetches; the TPU 8i is engineered around that constraint.

Google also announced the Virgo Network, a purpose-built AI fabric connecting either NVIDIA Vera Rubin NVL72 racks or TPU 8t superpods into massive shared supercomputers — a direct acknowledgment that the AI infrastructure war is as much about interconnect as it is about raw compute. The company pointedly noted that its cloud infrastructure continues to support NVIDIA’s entire portfolio, including Vera Rubin NVL72 and Blackwell, alongside its own silicon.

Supporting infrastructure numbers are equally striking: Managed Lustre storage now delivers 10 TB/sec throughput via RDMA directly to TPU 8t, Rapid Storage has scaled to 15 TB/sec (up from 6 TB/sec), and Google Kubernetes Engine now provisions 300 secure sandboxes per second with sub-second time to first instruction.

The Gemini Enterprise Agent Platform

On the software side, Google unveiled the Gemini Enterprise Agent Platform — a four-pillar framework for building, scaling, governing, and optimizing AI agents at enterprise scale.

The build layer includes the Agent Developer Kit (ADK), a graph-based orchestration framework; Agent Studio, a low-code natural language interface; and an Agent Registry for discovery and governance. The Agent Marketplace ships with pre-built specialized agents from Atlassian, Box, ServiceNow, Workday, Oracle, and the no-code startup Lovable.

Crucially, Google natively supports the Model Context Protocol (MCP) — the open standard for connecting AI models to external tools — and has exposed its entire suite of Cloud services as MCP endpoints. This allows agents built on any MCP-compatible framework to access Google Cloud services directly.

The scale layer introduces long-running agents that execute multi-step autonomous workflows inside secure, isolated cloud sandboxes. A Memory Bank feature gives agents persistent context spanning months of interaction history — a capability that meaningfully separates enterprise deployments from consumer chatbots.

The govern layer addresses the operational reality that no enterprise will deploy autonomous agents without auditability. Agent Identity issues cryptographic IDs with auditable authorization policies; Agent Gateway enforces real-time policies and integrates Model Armor protections against prompt injection, tool poisoning, and data leakage; Agent Anomaly Detection proactively flags suspicious behavior.

Early enterprise deployments give a sense of the platform’s reach: KPMG reported 90% employee adoption with over 100 agents deployed in its first month; Mars is using the platform as its “primary AI operating system” for its global workforce; NASA is deploying agents for Artemis II flight readiness and astronaut safety protocols.

Workspace Gets an Intelligence Layer

Separately from the enterprise developer platform, Google announced Workspace Intelligence — a semantic layer designed to break down information silos across Gmail, Drive, Docs, Sheets, and Slides.

The flagship feature is AI Inbox in Gmail, which acts as a proactive personal assistant, prioritizing and summarizing messages based on personal context. Ask Gemini in Google Chat can synthesize information across an organization and execute multi-step actions — scheduling meetings, creating Docs, pulling data from connected services.

For content creation, Google has rebuilt the core Workspace apps around agentic workflows: Docs now generates first drafts in the user’s own writing voice; Sheets can transform raw data into interactive dashboards from a prompt; Slides builds complete, on-brand presentations from a single sentence.

The migration pitch was pointed: Google claims organizations can move from Microsoft 365 to Google Workspace “up to 5x faster” with the new Workspace Agent, clearly targeting enterprises that have historically been reluctant to switch productivity suites.

Workspace Intelligence also expands the model ecosystem — its Model Garden now includes over 200 models, with Anthropic’s Claude Opus 4.7 newly added alongside the existing Claude Sonnet and Haiku options.

$750 Million to Pull the Partner Ecosystem In

Perhaps the most strategically significant announcement was the quietest: a $750 million fund directed at Google Cloud’s 120,000-member partner ecosystem to accelerate joint customers’ transitions to agentic AI. The fund provides incentives, resources, and engineering support for systems integrators, ISVs, and consultancies to build and deploy agent-based solutions. Google noted it has 330,000 trained consultants worldwide in its partner network.

This matters because the enterprise software market has always been won through partners, not direct sales. By funding partners to build on Gemini Enterprise rather than on OpenAI or Anthropic APIs, Google is attempting to lock in the professional services layer that ultimately controls which AI platforms large organizations standardize on.

The Scale of the Adoption Curve

The business metrics Google disclosed painted a picture of rapid enterprise adoption that would have seemed implausible 18 months ago. Three hundred thirty customers now process more than one trillion tokens monthly on Google Cloud; 35 customers exceed ten trillion. The API is serving 16 billion tokens per minute — up from 10 billion last quarter. Gemini Enterprise paid monthly active users grew 40% quarter-over-quarter in Q1 2026.

Context: A Full-Stack Bet Against OpenAI and Microsoft

The overall picture from Cloud Next 2026 is of a company executing a coherent full-stack strategy: custom silicon tuned for its own models, an orchestration platform for agent deployment, a security layer to govern those agents, a data architecture for agent-scale retrieval, and a productivity suite that embeds agents into daily work.

The risk is obvious — it’s an enormous surface area to compete on simultaneously, against focused competitors who do one layer well. Microsoft owns the enterprise productivity relationship through Office. OpenAI leads on model quality at the frontier. Anthropic is the preferred choice for safety-sensitive enterprise deployments.

But Google’s argument is that the agentic era requires all those layers to work together, and that switching costs compound when your training infrastructure, inference chips, agent framework, data platform, and productivity suite are from the same vendor. Cloud Next 2026 was Google’s most complete articulation of that bet.

Sources

Google TPU Gemini AI agents cloud computing developer tools enterprise AI

Google Cloud Next 2026: New TPU Chips, Gemini Enterprise Agents, and a $750M Partner Bet

Two New Chips to Challenge Nvidia’s Inference Dominance

The Gemini Enterprise Agent Platform

Workspace Gets an Intelligence Layer

$750 Million to Pull the Partner Ecosystem In

The Scale of the Adoption Curve

Context: A Full-Stack Bet Against OpenAI and Microsoft

Sources

Related Stories

Google's Gemini 3.1 Flash Live Brings Real-Time Multimodal Voice AI to Developers

Microsoft Launches MAI-Image-2-Efficient: 22% Faster, 41% Cheaper AI Image Generation

Google Expands Gemini AI in Chrome to Seven Asia-Pacific Markets, Launches Reusable 'Skills'