Skip to content
FAQ

DeepSeek V4 Goes Live on Huawei Chips as U.S. Escalates AI Theft Accusations

China's DeepSeek released its 1.6 trillion-parameter V4 model fully optimized for Huawei's Ascend processors, delivering frontier-adjacent performance at a fraction of Western prices. The launch arrives as U.S. officials allege Chinese firms are running industrial-scale campaigns to steal American AI intellectual property.

5 min read

DeepSeek V4 Goes Live on Huawei Chips as U.S. Escalates AI Theft Accusations

DeepSeek, the Hangzhou-based AI laboratory backed by quantitative hedge fund High-Flyer Capital, has released its most capable model to date—a 1.6 trillion-parameter mixture-of-experts architecture called V4-Pro—trained end-to-end on Huawei’s Ascend 910C processors rather than Nvidia hardware. The release, arriving weeks after a preview launch in late April, marks the most significant demonstration yet that Chinese AI development has found a credible path around U.S. semiconductor export controls.

The timing is no accident. Washington has ramped up pressure simultaneously, with senior officials now formally alleging that DeepSeek and other Chinese AI labs are running what one government document described as “industrial-scale campaigns” to extract intellectual property from American models through systematic distillation and data poisoning. Beijing’s foreign ministry dismissed the characterizations as “groundless.”

What the Model Actually Does

V4-Pro is enormous: 1.6 trillion total parameters across 256 experts, activating approximately 37 billion at inference time. In raw parameter count, that places it alongside GPT-5.4 and Gemini 3.1 Pro. DeepSeek’s own technical report acknowledges V4 “falls marginally short” of those frontier models, estimating a developmental lag of three to six months—a significant narrowing from the 12-to-18 month gap that characterized DeepSeek’s earlier releases.

A companion model, V4-Flash, delivers a 284 billion-parameter variant at dramatically lower cost, targeting latency-sensitive applications where the full flagship would be impractical.

The 1 million-token context window is V4’s other headline capability. At a moment when context length has become a primary competitive axis—Gemini 3.5 Pro recently expanded to 2 million tokens, and Anthropic’s Mythos models offer comparably long windows—V4’s context capability keeps it within striking range of leading Western alternatives.

The Price Weapon

DeepSeek’s most disruptive contribution has never been raw capability alone. It is price. V4-Pro enters at $3.48 per million output tokens. OpenAI’s comparable tier runs at $30 per million; Anthropic charges $25. That is roughly an 8-to-10x cost difference for models with substantially overlapping practical utility. V4-Flash at $0.28 per million output tokens undercuts even the leanest Western mini-models.

“Pricing could fall further once Huawei scales up Ascend 950 production in the second half of this year,” the company said in its release statement—suggesting DeepSeek’s competitive advantage is structural rather than a temporary promotional position.

Financial markets responded accordingly. SMIC, the Chinese foundry manufacturing Huawei’s Ascend chips, saw shares jump 10% on the announcement. Rival Chinese AI firms MiniMax and Knowledge Atlas each fell over 9%, as investors recalculated the competitive landscape.

The Huawei Silicon Story

The technically significant aspect of V4 is the chip stack. Until now, training large frontier models outside the Nvidia H100/H200/GB200 supply chain was considered largely theoretical. V4 appears to have changed that calculus.

A research group that includes Huawei Technologies demonstrated full-parameter post-training of V4-Pro using a cluster of at least 1,000 Huawei Ascend 910C chips—a significant engineering achievement in cluster coordination at that scale. Huawei confirmed “day-zero compatibility” across its full Ascend SuperNode product line, including the newer 950 series processors entering limited production.

The implication is geopolitically significant: China now has at least one domestic model-training stack capable of producing frontier-adjacent results without any Nvidia involvement. That was not assumed to be possible when the Biden administration imposed its first export controls in October 2022.

Huawei is projecting $12 billion in AI chip revenue for 2026 as domestic demand absorbs the orders that would otherwise have gone to Nvidia. The American company’s market share in China has effectively reached zero since the most recent export control tightening.

The Intellectual Property Dimension

U.S. officials have paired the model release news with escalating IP theft allegations. The specific claims center on model distillation—training a student model to mimic a more capable teacher model’s outputs—which government documents allege Chinese labs pursued at “industrial scale” using API access to American frontier models before export restrictions closed that pathway.

DeepSeek’s early V2 and V3 releases sparked similar accusations, particularly after analysis of V3’s outputs showed response patterns some researchers attributed to distillation from GPT-4 and Claude 3 family models. DeepSeek denied those characterizations.

V4 provides a partial answer to those allegations: because it was trained from scratch on Huawei hardware without RLHF feedback from American model APIs, the distillation attack surface simply did not exist in the same form. Whether that fully satisfies U.S. government concerns is unclear, given broader allegations about architectural and data theft that go beyond API-based distillation techniques.

Enterprise and Developer Implications

For enterprises and developers operating outside the U.S. export control regime, V4 represents a genuine competitive alternative. The combination of long context, competitive benchmark performance, and sub-$4 pricing for the flagship tier creates a compelling proposition for high-volume inference applications—document analysis, code review, RAG pipelines—where Western frontier model costs become prohibitive at scale.

V4’s Huawei-native architecture does create certain limitations. Deployments in cloud environments that depend on Nvidia-optimized toolchains—CUDA kernels, TensorRT, most mainstream MLOps platforms—require adaptation. DeepSeek has released Ascend-specific documentation and a compatibility layer, but developer tooling remains less mature than the Nvidia ecosystem it competes against.

For organizations in China or in markets less constrained by U.S. technology policy, those friction points are likely to recede quickly as the ecosystem develops. The question is how quickly this alternative technology stack will percolate into global enterprise procurement decisions.

A Geopolitical Inflection Point

The release lands in a context where U.S. chip controls are being stretched in both directions. The Trump administration tightened restrictions further in June 2026, expanding the ban to overseas Chinese-owned subsidiaries. At the same time, V4’s launch demonstrates that those controls have not frozen Chinese AI capability—they have redirected it toward indigenous hardware.

“We’re no longer in a world where you can assume that AI progress is contingent on access to Nvidia chips,” said one industry analyst briefed on the V4 architecture. “That assumption was the foundation of the export control strategy.”

Whether V4 represents a durable capability crossing or a high-water mark that Chinese labs will struggle to maintain without continued access to Western research outputs remains genuinely contested. What is no longer contested is that Huawei’s Ascend platform is a serious training substrate for frontier-scale models.

The next six months—as Ascend 950 production scales and V4 gets deployed across enterprise customers—will determine whether this moment represents a lasting strategic shift in the global AI balance or a demonstration that outpaces the underlying supply chain’s capacity to sustain it.

deepseek huawei china-ai llm export-controls
Share

Related Stories

DeepSeek Closes $7.4 Billion First-Ever Funding Round at $50B+ Valuation

DeepSeek, the Chinese AI lab that shook the world in early 2025 by training frontier models at a fraction of Western costs, has raised over $7.4 billion in its first external funding round, valuing the company at $52–59 billion. The deal, led by Tencent and CATL with backing from China's state AI fund, comes with zero voting rights for outside investors and a five-year lock-up — preserving ironclad founder control.

5 min read