Category: e

  • China’s AI Industry Is Exploding—From Chips to Applications, China Is Now Leading the World

    Let me start with a confession. I used to be one of those people who rolled their eyes at the phrase “Chinese innovation.” Like many Western observers, I assumed China was great at copying, great at scaling, great at manufacturing—but not at inventing. Real breakthroughs happened in Silicon Valley, in London, in Paris. China was the factory floor, not the design studio.

    I was wrong. Not just a little wrong. Embarrassingly, catastrophically wrong. And if you are still thinking the way I did, you need to wake up. Right now.

    What I witnessed over the past eighteen months is nothing short of a complete industrial transformation. China is not catching up in artificial intelligence. In more ways than most people in the West realize, China is already leading.

    The Chip Story Nobody Expected

    Let us talk about the elephant in the room first: chips. We all know the story. Washington imposed severe export controls on advanced semiconductors, cutting off China from Nvidia’s most powerful GPUs. The assumption was simple: choke off the hardware, and you choke off the AI industry. Game over.

    Except it was not game over. It was game on.

    Huawei recently laid out a three-year chip roadmap that would be ambitious for any company on earth. They are planning to release the Ascend 950 series in 2026, the 960 in 2027, and the 970 in 2028, with a rhythm of roughly doubling compute power every single year. Think about what that means. A company cut off from the world’s most advanced chip manufacturing is nevertheless promising annual performance leaps that would make Moore’s Law blush.

    And Huawei is not alone. The AI semiconductor ecosystem in China is expanding rapidly. Cambricon, often described as China’s answer to Nvidia, has secured major customers like Alibaba, Tencent, and DeepSeek for its cloud and data center chips. Some of its latest silicon reportedly reaches eighty percent of the Nvidia A100’s performance. Alibaba has developed its own AI inference chip, designed and manufactured entirely within China, that is compatible with Nvidia’s CUDA platform. The symbolism is hard to miss: a fully domestic chip that slots into the existing software ecosystem.

    The numbers tell the same story. China’s AI accelerator chip market was projected to hit roughly 240 billion yuan in 2025 and is expected to surge to over 380 billion yuan in 2026. That is not incremental growth. That is a market doubling in size year after year.

    None of this means China has caught Nvidia in raw single-chip performance. But they have done something arguably more important: they have built a complete, self-reliant stack, from chips to servers to interconnect technology to software. Huawei’s Atlas SuperPoD, which strings together over fifteen thousand Ascend chips, delivers total system compute and memory bandwidth that dramatically exceeds what Nvidia’s upcoming NVL144S can offer. When you cannot win on the single chip, you win on the system. That is not desperation. That is strategy.

    The Models Are Eating the World

    If chips are the engine, AI models are the car. And Chinese models are now among the fastest cars on the road.

    April 2026 was the month that reset everyone’s expectations. DeepSeek, the Hangzhou-based startup that first stunned the world in early 2025, released DeepSeek V4. The reviews from global developers were ecstatic. Its agent coding capabilities were ranked as the best among open-source models, with internal tests suggesting it outperformed Anthropic’s Sonnet 4.5 and approached Claude Opus 4.6 in non-thinking mode. Its context window stretched to a million tokens, and it achieved all this while slashing KV cache usage to just ten percent of its predecessor.

    Then came the Artificial Analysis intelligence index rankings for open-source models. The global top five were all Chinese: DeepSeek V4, Kimi K2.6, and three others. Not one American model cracked the top tier. Let that sink in. The open-source AI ecosystem, the breeding ground for the next generation of startups and applications, is now dominated by Chinese companies.

    Even more telling is what happened on the OpenRouter platform, the world’s largest model API aggregator. In early 2026, for the first time in history, Chinese models accounted for more than half of all token consumption on the platform in a single month. More Chinese tokens were being consumed than American ones. That is not a niche phenomenon. That is the global developer community voting with their API keys.

    Stanford’s 2026 AI Index Report delivered the final verdict. The performance gap between top Chinese and American AI models has narrowed to just 2.7 percent. In February 2025, DeepSeek-R1 had already briefly matched America’s best model. By March 2026, Anthropic’s Claude Opus 4.6 held only a razor-thin lead. The report called the gap “substantially eliminated.” The race is now a dead heat.

    Robots, Robotaxis, and the Real World

    Here is where things get truly surreal. In the West, we still talk about AI mostly in terms of chatbots and image generators. In China, AI has already left the screen and entered the physical world.

    In 2025, Chinese companies released more than three hundred humanoid robot models, accounting for over half of all such products globally. Unitree Robotics shipped over five thousand five hundred humanoid robots in a single year and is now targeting ten to twenty thousand units for 2026. Their revenue tripled. Their profit margins reached sixty percent. This is not a science project anymore. This is a business.

    UBTech, another Chinese robotics company, delivered over a thousand humanoid robots in 2025 and is scaling toward five thousand in 2026, with the ultimate goal of ten thousand units by 2027. Their Walker S1 humanoid robots are already deployed in factories, including a landmark deployment at a Zeekr 5G smart factory where multiple robots work collaboratively, something that had never been done before anywhere in the world.

    On the roads, the story is equally dramatic. China’s robotaxi fleet is racing toward ten thousand vehicles. WeRide expects to deliver two thousand robotaxis in 2026 alone. Pony.ai has driven its per-vehicle cost down from five hundred thousand yuan to two hundred seventy thousand yuan by switching to domestic chips. Baidu’s Apollo Go continues to expand across multiple cities. And these companies are not just operating in China: Pony.ai has signed deals in Dubai and Saudi Arabia, WeRide has secured the first fully driverless commercial license in Abu Dhabi. Chinese autonomous driving technology is going global.

    Meanwhile, AI-powered smart glasses priced at under three thousand yuan can order food, hail a taxi, and book a hotel with a single voice command. AI has penetrated more than seventy percent of business processes in China’s most advanced factories, with over six thousand vertical AI models deployed across manufacturing. The phrase “AI Plus Manufacturing” is now official government policy, backed by a coordinated national strategy.

    The Numbers Are Staggering

    Sometimes you just need to look at the scale. China’s AI core industry surpassed 1.2 trillion yuan in 2025. Over sixty-two hundred AI companies now operate in the country. China’s open-source large model downloads rank first in the world. AI adoption among large-scale manufacturing enterprises has passed thirty percent. The government has announced plans to grow AI-related industries to over ten trillion yuan by the end of the country’s fifteenth five-year plan period.

    The policy support is equally formidable. The State Council has formally elevated “AI Plus” to the level of national strategy. The 2026 government work report introduced the concept of building a “new intelligent economy.” Local governments from Shenzhen to Shanghai are rolling out ambitious AI action plans. An 8.2 billion dollar national AI industry investment fund was launched. When the Chinese state puts its full weight behind something, the results tend to follow.

    What This Means for the Rest of Us

    I know what some of you are thinking. America still leads in frontier proprietary models. The US still dominates in private AI investment, with over 285 billion dollars deployed in 2025 compared to China’s officially disclosed figures. American companies still produce more top-tier models. The West still has the talent concentration and the software ecosystem advantages.

    All of that is true. But it misses the point.

    What China has achieved is something structurally different. They have built a complete, integrated AI industry that spans from domestic chips to foundational models to real-world deployment at a scale no other country can match. They did it under sanctions. They did it with less investment. They did it by turning constraints into catalysts.

    The Washington Post put it bluntly in late 2025: China now leads the United States in the key area of open AI technology. Chinese companies are releasing free AI models that are more powerful and more popular than their American counterparts. The Stanford report confirmed that the gap has all but vanished. Multiple international analysts have noted that China’s AI lead is not based on cheap labor or scale alone; it is structural, built on coordinated strategy, massive investment in energy infrastructure, and a domestic market of 1.4 billion people.

    I travel between continents for work, and the shift in sentiment over the last year has been palpable. European investors who used to dismiss Chinese AI as derivative now study it obsessively. Southeast Asian governments that once looked exclusively to Silicon Valley for technology partnerships are now signing deals with Chinese AI firms. The global conversation has changed, and it is not changing back.

    Here is my advice: stop thinking about this as a race that someone wins or loses. Start thinking about it as a new reality you need to understand. China’s AI industry is not coming. It is already here. And if you are still waiting for it to stumble, you are going to be waiting a very long time.

    The chip-to-application stack has been built. The models are world-class. The robots are shipping. The cars are driving themselves. The factories are thinking. And all of this happened faster than almost anyone in the West predicted.

    I was wrong about Chinese innovation. Maybe it is time to ask yourself whether you are wrong too.

  • Kunlunxin P800 Completes Large-Scale Validation — A New Breakthrough for China’s AI Chips

    Let me tell you about something that happened quietly in China’s computing world over the past year, and why it matters for everyone — not just the Chinese.

    There is a chip called the Kunlunxin P800. It is not made by NVIDIA. It is not made by AMD. It is not made by Intel. It was designed by a team inside Baidu, the Chinese search giant, and manufactured using a 7-nanometer process. If you follow the semiconductor industry, you know what that means: this is not bleeding-edge 4-nanometer or 3-nanometer territory. On paper, it is a generation or two behind what TSMC churns out for Apple and NVIDIA.

    And yet, this chip just passed a milestone that should make anyone paying attention sit up and take notice.

    In February of last year, Baidu’s cloud division lit up a cluster of ten thousand P800 cards — the first domestically designed ten-thousand-card AI cluster in China. Two months later, in April, they lit up a thirty-thousand-card cluster. Not a lab experiment. Not a proof of concept. A fully operational, production-grade computing beast sitting in a data center in Ningxia, running real workloads for real customers. China’s information and communications research academy put it through its paces and awarded it the highest possible stability rating: five stars. Over a thousand customers were simultaneously fine-tuning hundred-billion-parameter models on that cluster, while multiple trillion-parameter models trained in parallel.

    That is the headline. But the story behind it is far more interesting.

    What the P800 Actually Is

    The Kunlunxin P800 is a third-generation AI accelerator built on Baidu’s proprietary XPU architecture. Each card packs over fifty billion transistors and uses a three-dimensional memory architecture that stacks HBM3 and DDR5 together, delivering 1.2 terabytes per second of memory bandwidth. In real terms, that means a single card pumps out 128 teraflops in FP16 precision — or 256 trillion operations per second in INT8, the format most useful for inference.

    These numbers put the P800 roughly in the same league as NVIDIA’s A100 and Huawei’s Ascend 910B, according to a detailed die analysis published by TechInsights in May. Independent benchmarking by Morgan Stanley showed the P800 hitting 1,521 tokens per second on DeepSeek R1 inference — sitting comfortably between NVIDIA’s H20 at the high end and the A100 as a baseline. In some configurations, a single machine with eight P800 cards can deploy the full 671-billion-parameter DeepSeek V3 model and pump out over 4,800 tokens per second.

    But what really separates the P800 from a generic GPU is its efficiency. Each card draws between 150 and 160 watts — a fraction of what a comparable NVIDIA card consumes. In a thirty-thousand-card cluster, that power advantage compounds into millions of dollars in annual electricity savings. The architecture physically separates compute units from communication units, meaning data can flow between cards while calculations run simultaneously, rather than forcing one to wait for the other.

    The cluster itself achieved something remarkable: a ninety-nine-point-five percent effective training uptime across tens of thousands of cards. Anyone who has managed GPU clusters knows that keeping a system of this scale stable is not a hardware problem — it is a software and networking nightmare. Baidu’s engineers had to build custom high-performance networking, liquid cooling, and a scheduling layer that could handle node failures without bringing the whole system down.

    Why This Happened

    You cannot understand the P800 story without understanding the sanctions story. Since October 2022, the United States has progressively tightened export controls on advanced AI chips to China. The H100 was banned. The H200 was effectively banned, then conditionally allowed in exchange for a twenty-five percent fee paid directly to the US government — a policy that, whatever you think of its merits, fundamentally treats chips as a geopolitical weapon rather than a commercial product.

    By 2025, the restrictions had expanded beyond chips themselves. The US Commerce Department notified the three dominant EDA software vendors — Cadence, Synopsys, and Siemens — to stop serving mainland Chinese customers. Together, these three control roughly three-quarters of the global chip design software market. The message from Washington was unambiguous: China should not be able to design or manufacture advanced AI chips at scale.

    This was not a subtle nudge. It was an attempt to sever the supply chain entirely.

    The P800 exists because Baidu saw this coming years ago. The company’s chip team was founded in 2011 — long before AI chips were a geopolitical flashpoint. They started with FPGAs, moved to custom silicon with the first-generation Kunlun chip in 2018, and kept iterating. By the time the sanctions hammer came down, the team already had a third-generation chip in the pipeline and a decade of institutional knowledge about what large-scale AI infrastructure actually requires.

    Baidu’s founder Robin Li put it bluntly at the company’s World Conference in late 2025: the industry structure where chip companies capture the bulk of AI value while application builders scrape for crumbs is “extremely unhealthy and unsustainable.” The only way to fix it, he argued, is to own the chip layer yourself. Amazon, Microsoft, Google, and OpenAI have all reached the same conclusion. The difference is that Baidu had no choice.

    The Scariest Chart You Have Not Seen

    Here is where things get genuinely alarming if you are sitting in Santa Clara or Washington.

    At that same November conference, Baidu unveiled a five-year chip roadmap. The M100, an inference-optimized chip, is launching right now in 2026. The M300, designed for training trillion-parameter multimodal models, arrives in 2027. Then comes the N-series in 2029. And the endpoint, penciled in for 2030, is a single cluster of one million Kunlun cards.

    Alongside the chips, Baidu announced “supernodes” — dense configurations that pack hundreds of cards into a single logical unit. The Tianchi 256, shipping in the first half of 2026, integrates 256 P800 cards into one node with four times the interconnect bandwidth of previous clusters and more than triple the per-card inference throughput. The Tianchi 512, coming in the second half of 2026, doubles the bandwidth again and can train a trillion-parameter model within a single node. By 2028, they plan thousand-card and four-thousand-card supernodes.

    Whether every milestone on this roadmap gets hit on schedule is an open question. Roadmaps are aspirational by nature. But the direction of travel is unmistakable: this is not a company trying to scrape by with sanctions-compliant scraps. This is a company building a full-stack alternative to NVIDIA’s ecosystem, from silicon to server to software.

    The financial machinery behind all this is also worth noting. Kunlunxin spun out from Baidu as an independent company in 2021, immediately valued at roughly eighteen billion dollars. By mid-2025, after multiple funding rounds including investments from China Mobile and BYD, the valuation had climbed to about twenty-nine billion dollars. In January 2026, the company filed confidentially for a Hong Kong IPO. In May, it kicked off the process for a Shanghai STAR Market listing as well — a dual A-share and H-share play. Goldman Sachs projects the company’s revenue could jump from about five billion dollars in 2025 to nine billion dollars in 2026.

    Investors are betting that the sanctions have created something more valuable than they destroyed: a captive market of Chinese enterprises that need AI compute and cannot reliably source it from overseas.

    What This Means

    I want to be careful here, because the temptation in tech journalism is always to declare that everything has changed forever. It has not. NVIDIA still dominates AI training. CUDA is still the default software ecosystem. TSMC’s latest nodes are still years ahead of what Chinese fabs can produce. The P800 is built on 7-nanometer technology; NVIDIA’s H200 uses 4-nanometer. That gap is real, and it will not close overnight.

    The P800 also has limitations that matter in practice. Its software ecosystem — while compatible with PyTorch, TensorFlow, and PaddlePaddle — does not offer the seamless developer experience of CUDA. Migrating complex training pipelines is still painful. And while the P800 excels at inference, where it can go toe-to-toe with NVIDIA’s H20, training performance on the largest models still lags. One analysis from Morningstar in late 2025 concluded that Baidu’s chips remain one to two generations behind NVIDIA and that their viability as a long-term solution was still unproven.

    But here is the thing: the P800 is not designed to beat NVIDIA on every dimension. It is designed to be good enough at the things that matter most — inference throughput, energy efficiency, cluster stability — and cheap enough to deploy at enormous scale. A P800 card costs roughly half what a comparable NVIDIA card costs. When you are buying tens of thousands of cards, that math changes procurement decisions.

    And the validation has been real. Baidu is now running the vast majority of its internal AI inference on P800 clusters. The company trained its Qianfan-VL multimodal model — a seventy-billion-parameter vision-language system that scored in the ninety-eighth percentile on science reasoning benchmarks — entirely on a five-thousand-card P800 cluster. External customers including China Merchants Bank, State Grid, and China Steel Research have deployed P800 systems at scale. China Mobile, the country’s largest telecom operator, awarded Kunlunxin a billion-dollar procurement contract in 2025.

    This is the part that should give policymakers pause. The theory behind export controls is straightforward: deny China access to advanced chips, and you slow its AI progress. The reality, as the P800 demonstrates, is more complicated. Denying access creates a guaranteed market for domestic alternatives. That market attracts investment. That investment funds R&D. That R&D produces better chips. Rinse and repeat.

    The Information Technology and Innovation Foundation, a Washington think tank, published a report in mid-2025 arguing that US export controls were not slowing China’s ascent but accelerating it, pointing to Huawei’s Ascend series as a case study. The P800 story fits the same pattern. Before the sanctions, a Chinese cloud provider could simply order H100s from NVIDIA and focus on building applications. Now, the compute layer itself has become a strategic priority, with resources flowing into chip design, manufacturing, and software ecosystems that would have been considered too risky or expensive in a world of free trade.

    Whether this ultimately produces a genuine competitor to NVIDIA’s flagship products remains to be seen. The P800 is a strong inference chip with respectable training capabilities, not a direct H200 replacement. The upcoming M300, with its 3D packaging and HBM3e memory, will be a more serious test of whether China’s domestic chip industry can close the training gap.

    But here is what we know today: a thirty-thousand-card cluster of Chinese-designed AI accelerators is running production workloads for a thousand enterprise customers. It passed a rigorous stability certification with a top rating. It trains frontier models. It deploys DeepSeek. It costs less to buy and less to run than the sanctioned alternative. And the company behind it just filed for a dual-listing IPO with a valuation approaching thirty billion dollars.

    If you had predicted any of this in 2022, most semiconductor analysts would have called you naive. Today, it is simply the state of play. The sanctions did not prevent China from building AI infrastructure. They redirected the money that would have gone to NVIDIA into a domestic ecosystem that now has every incentive to close the remaining gaps as quickly as engineering allows.

    The Kunlunxin P800 is not the end of that story. It is the beginning of the next chapter. And it is being written in a language that policymakers in Washington should probably learn to read.