Skip to content
FAQ

Alibaba Unveils Zhenwu M890 Chip and Qwen 3.7-Max: China's Most Ambitious AI Stack Yet

Alibaba's T-Head semiconductor unit launched the Zhenwu M890 AI accelerator, claiming triple the performance of its predecessor and targeting Nvidia's H20 in China's restricted chip market. Paired with the Qwen 3.7-Max model that ran uninterrupted for 35 hours calling over 1,000 tools, the announcements represent the most comprehensive vertical AI stack from any Chinese tech giant to date.

4 min read

At Alibaba’s annual Cloud Summit in Hangzhou on May 19–20, the company’s semiconductor division T-Head pulled the curtain off its most ambitious hardware-software pairing yet: the Zhenwu M890 AI accelerator and the Qwen 3.7-Max large language model. Together, the announcements underscore just how seriously China’s largest e-commerce and cloud company is pursuing end-to-end AI sovereignty — building its own silicon, its own frontier models, and its own cloud infrastructure to run them on.

The Zhenwu M890: Taking Aim at Nvidia’s H20

The Zhenwu M890 is built on T-Head’s in-house PPU (Parallel Processing Unit) architecture and features a dedicated Transformer core engine optimized for the matrix operations that dominate modern AI workloads. On paper, the specs are notable: 144GB of HBM3 GPU memory — a 50% jump over the Zhenwu 810E’s 96GB — and an interchip bandwidth of 800GB per second, critical for the massive parallelism required in distributed AI training.

Alibaba claims the M890 delivers triple the performance of its predecessor, the Zhenwu 810E, and positions it directly against Nvidia’s H20 — the chip Nvidia specifically engineered to comply with U.S. export control regulations while still serving Chinese customers. If the 3x claim holds up under independent testing, the M890 would represent a significant leap for domestically produced Chinese AI silicon.

The chip supports FP32, FP16, FP8, and FP4 numeric formats, giving it flexibility across training and inference workloads. A new server configuration called the Panjiu AL128 packages 128 M890 accelerators into a single rack — a density play that signals ambitions for hyperscale deployment.

T-Head also disclosed a forward-looking roadmap: the V900 is scheduled for mid-2027 with projected performance roughly triple that of the M890, and the J900 is slated for Q3 2028, giving enterprise customers a multi-year visibility horizon. For procurement teams weighing domestic versus imported options, that kind of roadmap clarity matters.

China’s Chip Race and Why Alibaba’s Timing Is Strategic

The context here is impossible to separate from geopolitics. Since 2022, successive rounds of U.S. export controls have progressively cut off Chinese AI developers from access to Nvidia’s most powerful GPUs — the A100, H100, and now the H200 are all restricted. Nvidia has repeatedly adapted its product lineup to produce variants like the A800 and H20 that comply with export rules, but even those face tightening scrutiny.

This has created a structural opportunity — and a structural necessity — for Chinese chipmakers. Huawei’s Ascend 910C has been positioned as the primary domestic alternative, but supply constraints have been significant. Cambricon, a state-backed semiconductor startup, has also gained traction. Alibaba’s M890 enters a competitive but undersupplied market.

What sets Alibaba apart from pure-play chipmakers is the integrated strategy. T-Head sells hardware to Alibaba Cloud, which runs it at scale for thousands of enterprise customers. That vertical integration means Alibaba can optimize the full stack — silicon, firmware, cloud orchestration, and model serving — in ways that independent chip vendors cannot easily replicate. By the time an external customer deploys M890s, Alibaba has already stress-tested them at production scale.

T-Head also disclosed that it has already shipped 560,000 Zhenwu units to more than 400 customers across 20 industries, providing a meaningful installed base against which to validate M890 improvements.

Qwen 3.7-Max: Built for the Agentic Era

While the M890 dominated hardware headlines, the companion software announcement — Qwen 3.7-Max — may prove equally significant in the longer run.

Alibaba is positioning Qwen 3.7-Max explicitly for “agentic AI”: systems capable of executing intricate, multi-step tasks with limited human supervision over extended time horizons. The company backed this positioning with a striking benchmark: in an operational longevity test, Qwen 3.7-Max ran uninterrupted for 35 hours on a new compute platform, autonomously calling over 1,000 different tools. The task it was set? Writing an optimized compute kernel — and the result ran 10 times faster than the chip manufacturer’s own official code.

That benchmark, if reproducible, is a genuinely meaningful demonstration. Long-running agentic tasks that maintain coherent state, adapt to tool outputs, and avoid compounding errors across thousands of steps remain one of the hardest unsolved challenges in applied AI. Claims of 35-hour coherent operation with substantive technical output set a bar that competing frontier labs have not yet publicly matched in this specific domain.

On the competitive leaderboard, Qwen 3.7-Max ranks 13th globally in general text capabilities and 7th in mathematical reasoning on the Chatbot Arena — placing Alibaba sixth among global AI labs by this measure. That’s a significant shift from just two years ago, when Chinese labs were widely regarded as trailing their Western counterparts by 12 to 18 months.

Alibaba’s open-source strategy mirrors the bifurcated model now common across the industry: the Qwen 3.7 Plus variant is being open-sourced, while Qwen 3.7-Max stays proprietary as a commercial offering. This lets Alibaba build ecosystem goodwill and attract developers through open weights, while monetizing the flagship capabilities through Alibaba Cloud’s Model Studio.

Implications for the Global AI Hardware Market

The M890 announcement is unlikely to disrupt Nvidia’s position in Western markets in the near term. But within China — a market that collectively spent tens of billions of dollars on AI infrastructure in 2025 — the M890 represents a credible domestic alternative arriving at a time when Nvidia supply is constrained and politically fraught.

The more interesting question is what the M890 signals for Alibaba’s international cloud ambitions. The company has been quietly expanding Alibaba Cloud’s footprint in Southeast Asia, the Middle East, and Europe, regions where U.S. export controls do not apply and where Alibaba could theoretically market M890-powered cloud services to international customers who prefer not to be exclusively dependent on American infrastructure. The Panjiu AL128 server’s rack-scale density suggests Alibaba is thinking about datacenters, not just Chinese enterprise deployments.

For the broader AI ecosystem, the coordinated chip-plus-model announcement is a template worth watching. Alibaba is not the first to pursue vertical integration — Google has done it with TPUs and Gemini, and AWS with Trainium and Bedrock — but it is the first Chinese company to execute it at this level of technical coherence and commercial scale. Whether the M890’s performance claims survive independent scrutiny will be the next milestone to watch.

Alibaba AI chips Zhenwu M890 Qwen 3.7-Max China AI T-Head semiconductor agentic AI
Share

Related Stories

TSMC's 2nm Chip Production Surges Toward 140,000 Wafers a Month as AI Demand Devours Supply

TSMC's N2 process node, which uses Gate-All-Around transistors and carries a $30,000-per-wafer price tag, is ramping toward 140,000 wafers per month by year-end 2026—yet orders are already booked through 2028. Apple has locked up over half the initial allocation for its A20 and M6 chips, while Nvidia's Rubin AI architecture depends on N2 for the next generation of data center GPUs.

5 min read