People look at a video clip of Xpeng's next-gen Iron humanoid robot during AI Day at its headquarter in Guangzhou, in southern China's Guangdong province on November 5, 2025.
Source: Getty
paper

Embodied AI: China’s Big Bet on Smart Robots

Beijing believes that true AI dominance will come from systems capable of autonomous operation in the physical world—AI-powered robotics, or embodied AI.

Published on November 24, 2025

On what at first glance appeared to be an ordinary workday in March this year at the factory of China’s leading EV manufacturer, Zeekr, a small team of workers went about their usual tasks: lifting boxes, assembling car parts, and performing quality checks. But unlike any typical shift, none of them paused to rest or even stopped for a drink of water. The reason—they were not human. These UBTech robots, powered by a multimodal reasoning model based on DeepSeek R1, were the first publicly known group of humanoid robots deployed as a coordinated team to carry out a wide range of tasks in a complex, real-world industrial setting.1 Just a few months later, UBTech unveiled the Walker S2, the world’s first humanoid robot capable of autonomously changing its own batteries—potentially enabling uninterrupted, twenty-four-hour operation on the factory floor without any human assistance.2 

These demonstrations offer a glimpse into leading Chinese companies’ ability to translate frontier AI capabilities into useful real-world industrial applications. More significantly, these private sector achievements have become central to Beijing’s evolving national AI strategy, which seeks to leverage China’s combined strengths in AI software and robotics hardware to achieve a distinctive form of technological leadership. While Washington and most of Silicon Valley focus primarily on scaling large language models (LLMs) like ChatGPT and digital AI applications, Beijing has placed a fundamentally different bet. It believes that true AI dominance will come from systems capable of autonomous operation in the physical world—AI-powered robotics, commonly known as embodied AI.

To achieve its ambitions in embodied AI, the Chinese government is rolling out a familiar technological development playbook: encouraging local governments to experiment with different approaches to develop and scale the technology. In response to Beijing’s call, provinces are channeling investment into local companies working on AI systems that can learn from human workers on factory floors, navigate complex physical environments, and bridge the gap between digital reasoning and real-world action.

If China can achieve its vision, embodied AI could potentially solve some of China’s most pressing domestic challenges, including turbocharging its sluggish economy and addressing social challenges like its aging population. It could also yield a broad-based set of strategic advantages. Building a mature embodied AI ecosystem and securing durable global leadership in this area could, in theory, form a key pillar of China’s future economic and military power, and place it ahead in the global race toward artificial general intelligence (AGI). But how does China plan to do this, and what obstacles stand in the way?

This paper addresses that question by examining how the Chinese government conceives of embodied AI, the steps it is taking to advance it, and the ecosystem’s strengths and weaknesses. The paper also sheds light on how progress in embodied AI could reshape China’s economy, strengthen its national competitiveness, and impact the world. 

What Is Embodied AI?

In both the West and China, embodied AI is generally understood as AI-powered hardware systems that can perceive and interpret multimodal inputs from the physical environment, including by sight, sound, and touch. They make autonomous decisions, carry out physical actions in the real world, and learn from those interactions.3  

Embodied AI is generally understood as AI-powered hardware systems that can perceive and interpret multimodal inputs from the physical environment, including by sight, sound, and touch.

Importantly, while embodied AI is often associated with humanoid robots, which mimic the appearance and functions of humans, the two are not synonymous. Humanoids are machines defined by their human-like physical appearance, but their functional capabilities vary greatly depending on their software, sensing, and control systems. In practice, many humanoid robots remain limited-purpose systems, only capable of performing narrowly defined tasks, such as sitting at a reception desk and waving at visitors. The defining feature of embodied AI systems, on the other hand, does not lie in their physical form (see figure 1). These systems can take multiple physical forms, with humanoid robots being only one example; others include quadrupeds, drones, autonomous vehicles, industrial manipulators, and microbots. Industrial robots, for example, have long automated repetitive work on assembly lines even without generative AI.

What defines embodied AI systems instead are their capabilities. They integrate advanced software, sophisticated sensors, and intricate control systems that enable a high degree of autonomous decisionmaking, adaptability to dynamic environments, and the ability to perform diverse tasks—for instance, completing all stages of a car production line traditionally handled by human workers. 

Figure 1: Relationship between humanoid robots and embodied AI systems

Embodied AI systems are typically equipped with a rich set of sensors and are powered by multimodal AI models that support perception, reasoning, and action. These models typically draw on the capabilities of large language models (LLMs), using them as the linguistic and reasoning backbone to which vision- and action-modules are added. Embodied AI systems stand apart for their ability not only to interpret data but also to interact with and adapt to diverse physical environments. This sets them apart from conventional applications, in which AI enhances mostly single-purpose tools and services, such as CT diagnostic assistants or weather prediction models. It also distinguishes them from software-based “agents,” which act purely in digital environments.4

In practice, embodied AI has fuzzy boundaries and is more of a spectrum than a singular, easily recognized threshold. The lower end of this spectrum, comprising systems capable of performing a limited range of real-world tasks, such as UBTech’s Walker S2, has already been developed. However, neither China nor the West have achieved the fully autonomous systems grounded in continuously updated world models that some imagine as the foundation for embodied AGI.

Western and Chinese research communities differ in the amount of attention and resources they are allocating to studying embodied AI. In the United States, most interest in embodied AI appears to be confined to the private sector. Big tech players such as Tesla, with its autonomous vehicles and smart robot Optimus, are making substantial investments in embodied AI.5 Meanwhile, emerging companies such as Figure AI, which produces general-purpose humanoid robots, are also gaining traction, securing substantial external funding.6

By contrast, in China, enthusiasm for embodied AI extends well beyond the private sector. Since the beginning of this year, the country’s expert community has witnessed a rapid surge of interest in embodied AI, with the topic rapidly gaining prominence in both policy and academic discussions.7

Embodied AI: A Physical World Extension into the Real Economy 

The rise of embodied AI in Chinese policy discourse has not emerged in a vacuum. China’s embodied AI push reflects a deliberate effort by the government to translate advances in generative AI from the purely digital realm into what Beijing frequently refers to as the real economy: sectors encompassing the production of tangible goods and essential services that underpin economic activity in the physical world. In so doing, China hopes to revitalize its economy amid recent slow economic growth.8 

China’s embodied AI push reflects a deliberate effort by the government to translate advances in generative AI from the purely digital realm into what Beijing frequently refers to as the real economy.

The idea that the real economy constitutes the foundation of China’s economic strength and must therefore be the primary focus of domestic AI adoption is deeply rooted in President Xi Jinping’s economic thought. Xi’s views on the economy were shaped in part by lessons from the 2008 global financial crisis. According to an analysis by the Chinese Communist Party’s Institute of Party History and Literature, the crisis exposed the risks of excessive reliance on the virtual economy—nonphysical forms of economic activity such as financial services and digital platforms.9 Since becoming party general secretary in 2012, Xi has repeatedly called for the deep integration (深度融合) of the digital economy with the real economy,10 which he regards as the key to China’s long-term growth.11

Therefore, when the release of DeepSeek’s R1 model in January 2025 made it clear that China’s capabilities in AI were becoming increasingly mature,12 the party likely concluded that the time had come to transition AI into the real economy. This conclusion was likely reinforced by the pressing need to revive economic growth amid China’s property-market crisis, sluggish domestic consumption, and the beginnings of population decline.13 Building on China’s earlier advances in hardware systems such as robotics, autonomous vehicles, and drones, Beijing sought to leverage embodied AI as one of the key tools to facilitate this transition.

The party’s emerging focus on embodied AI has crystallized across several important reports and speeches after the DeepSeek moment. The March 2025 annual Government Work Report identified embodied AI alongside biomanufacturing, quantum technologies, and 6G as core tools for building the industries of the future.14 While embodied AI has emerged as a new focus, the party has made clear that LLMs remain an important, complementary element of Beijing’s AI agenda. For example, Premier Li Qiang’s July 2025 keynote speech at the World AI Conference highlighted embodied AI alongside LLMs as areas of AI currently experiencing major breakthroughs.15 Most recently, the CCP Central Committee recommended incorporating embodied AI as one of the new drivers of economic growth in China’s forthcoming 15th Five-Year Plan for Economic and Social Development.16 The Central Committee’s decision offers the clearest indication to date that the CCP leadership intends to place a long-term strategic bet on embodied AI, expecting it to play a defining role in shaping China’s future.

What Beijing Hopes to Achieve with Embodied AI 

Embodied AI could help Chinese leadership achieve five concrete goals across three key dimensions: domestic, strategic, and comprehensive (see figure 2). Domestically, embodied AI could help address structural economic and demographic challenges. Strategically, it could strengthen China’s position amid global technological and geopolitical competition. Its third objective is crosscutting: achieving AGI in its truest form by creating general AI fully capable of autonomously interacting in the physical world.

Figure 2: Unpacking China's Embodied AI Ambitions

Goal 1: Catalyze Economic Growth

Domestically, Beijing seeks to use the development and deployment of embodied AI to significantly boost productivity in key industries, such as manufacturing and logistics, that are critical to revitalizing China’s slowing economic growth. This goal is articulated in the 2024 “Report on the Development of Embodied AI” co-authored by the China Academy of Information and Communications Technology (CAICT),17 a research body under the Ministry of Industry and Information Technology. The report’s authors note that embodied AI could be leveraged to revolutionize human-machine collaboration, enabling autonomous hardware systems to learn from human coworkers through natural language interfacing, gestures, and motion. CAICT imagines potentially transformative economic impact and claims that over time these systems could replace humans as the most flexible units on industrial production lines, significantly boosting productivity.

Goal 2: Address Aging Population

Additionally, the CAICT report identifies embodied AI as a means of addressing China’s rapidly aging population that is expected to pose a serious challenge to the country’s economic growth. Embodied AI systems could substitute for human labor in roles typically affected by worker shortages due to the undesirable or hazardous nature of the tasks, such as those that might expose people to extreme temperatures. They could also provide eldercare services, thereby easing the burden on working-age individuals who, according to Chinese cultural norms, would otherwise be responsible for caring for aging family members. 

Goal 3: Strengthen the Military

In addition to its role in advancing the AI frontier, Beijing also regards embodied AI as a technology capable of significantly enhancing the country’s military capabilities. The official news outlet of China’s People’s Liberation Army, PLA Daily, assesses that embodied AI could help China overcome human physiological limitations, redefine the boundaries of unmanned warfare, and give rise to a more agile and efficient combat system.18 It envisions embodied AI systems that integrate multimodal sensors—such as vision, sound, touch, and even smell—with large multimodal models to autonomously observe their environment, make decisions, and act on the battlefield. Unlike current unmanned systems, which generally still require varying levels of human involvement, these embodied AI systems would offer a much higher degree of real-time tactical autonomy, making them significantly more resilient to electronic warfare interference. They would no longer be mere order executors, but commander-fighters capable of making adjustments in response to changing battlefield conditions. 

Goal 4: Gain Geoeconomic Advantage

In addition to addressing economic and societal challenges, embodied AI could provide the Chinese government with substantial geoeconomic advantages. If Beijing effectively leverages its vast manufacturing capacity, it could emerge as the world’s leading supplier of embodied AI systems—creating a level of dependence on Chinese technology that might surpass earlier global reliance on emerging technologies such as 5G networks or solar panels. Unlike those technologies, embodied AI has the potential to become a foundational tool for automating a wide spectrum of physical tasks across the global economy, meaning that access to such systems could ultimately determine a country’s future economic competitiveness.

Recognizing these stakes, Beijing has already begun laying the institutional groundwork for the global diffusion of embodied AI. Through its recent Global AI Governance Action Plan and AI+ International Cooperation Initiative,19 China has sought to position itself as the partner best equipped to deliver practical AI applications tailored to the development needs of other countries. It has also announced the creation of a new platform for sharing these applications, the World AI Cooperation Organization (WAICO).20 While China is currently concentrating on exporting more general AI applications, once embodied AI matures, it could use these established diffusion channels to spread the technology globally—even if it faces real economic constraints.

Goal 5: Achieve AGI

In addition to addressing domestic challenges, some key Chinese thought leaders see embodied AI as a means of accelerating the country’s frontier AI capabilities. For example, the CAICT report argues that embodied AI could help advance the ambition of achieving AGI. The view that embodied AI offers the most promising path toward AGI is echoed by influential Chinese scholars such as Zhang Bo, a member of the Chinese Academy of Sciences and professor at Tsinghua University.21 Zhang argues that while LLMs have laid a crucial foundation for AGI by enabling machines to understand and generate human language, it is embodied AI that will ultimately allow AI to replicate the full spectrum of human capabilities. He attributes this to embodied AI’s ability to interact with and learn from the physical world, which he sees as essential for achieving true general intelligence.22 

What the Chinese Government is Doing to Make It Happen

By listing embodied AI among its key priorities in this year’s Government Work Report, the Chinese leadership has indicated that it is determined to accelerate its development. The government hopes to encourage academia and local authorities to explore and pilot potential development strategies that could later be scaled up nationally. This “pilot first, scale later” approach is not new—it has long been an important tool in Beijing’s innovation policy implementation. It was first employed in the 1980s by Deng Xiaoping during the Reform and Opening Up period, when he established special economic zones in cities like Shenzhen, allowing local governments to experiment with market-oriented reforms before scaling the most successful policies nationwide.23 More recently, the People’s Bank of China adopted this approach for digital currency deployment, launching pilot programs in cities such as Hangzhou and Shenzhen to test the use of the digital renminbi (RMB) with a view toward a nationwide rollout in the future.24 

Following the government’s encouragement, embodied AI has become the subject of an increasing number of scholarly discussions, exhibitions, and conferences across China. One notable example is the annual China Embodied AI Conference, co-organized by the Chinese Association for Artificial Intelligence and major Chinese universities active in AI and robotics research and talent development, such as Tsinghua and Tongji.25 Its second annual meeting, held in 2025, culminated in the publication “15 Key Research Directions in Embodied Intelligence,” which highlighted priorities such as multimodal embodied perception, embodied world model construction, and embodied AI safety.26 This document, reportedly the first systematic roadmap for embodied AI development in China, is poised to provide the government with a clearer sense of priorities in future funding allocation. 

Various Chinese provincial and municipal governments have sought to position themselves as pioneers for future nationwide policy.

At the same time, various Chinese provincial and municipal governments have sought to position themselves as pioneers for future nationwide policy. Many have launched their own initiatives, but a few stand out due to the scale, ambition, and visibility of their efforts. In particular, the authorities of Beijing, Shanghai, Guangdong, Zhejiang, and Hubei—who are traditionally among the wealthiest in China and have a strong track record of spearheading development of emerging industries through ambitious policy initiatives—have so far taken the lead.27 These regions have adopted diverse approaches to shaping the development of embodied AI, varying both in their emphasis on different segments of the production chain and in the strategies used to incentivize and support local industry. They have also sought to allocate resources more strategically, focusing on areas of embodied AI development where local companies already hold a relative advantage.

For example, Beijing, home to one of China’s leading AI chip developers, Cambricon,28 has prioritized elements such as high-performance chips tailored for embodied AI applications.29 Shanghai—headquarters of Hesai Technology,30 a national leader in sensor technologies for autonomous vehicles—has concentrated on core hardware components.31 At the same time, governments in Guangdong and Zhejiang have focused on complete embodied AI platforms, specifically multipurpose humanoid robotics. These regions already hold a competitive edge with companies such as Shenzhen-based UBTech Robotics and Hangzhou-based Unitree Robotics leading private sector developments in the embodied AI field. Meanwhile, the Hubei provincial government has focused its efforts on another niche of embodied AI platforms: smart vehicles. It has leveraged the resources of local Dongfeng Motor—one of China’s largest car manufacturers—and established the Laboratory for Embodied Intelligence Technology in Automobiles.32 To support these efforts, the provincial governments have pledged varying levels of funding. For instance, Beijing has launched a 100 billion RMB (approximately $14.3 billion) investment fund with a fifteen-year lifespan to support AI and robotics.33 Meanwhile, Shanghai has established an embodied AI fund, with an initial closing of 560 million RMB (approximately $77 million).34

However, this largely devolved, local government-driven strategy is not without downsides. While provinces appear to be attempting to specialize in different segments of embodied AI, the lack of detailed guidance from Beijing risks leading to duplication of efforts and misallocation of resources. A parallel can be drawn with the electric vehicle sector. Fierce inter-provincial competition and overinvestment in local manufacturers produced staggering levels of economic waste and industrial overcapacity.35 Yet the evolution of China’s world-leading EV industry also demonstrates that this seemingly wasteful competition can, over time, generate genuine technological breakthroughs. As with EVs, local government efforts in embodied AI could foster the rise of national champions and provide Beijing with clearer models of how to scale emerging technologies that can later be replicated at the national level. 

Ultimately, whether the approach of devolving the initial development of embodied AI to local governments succeeds will depend on several factors. First, national success likely hinges on local governments effectively dividing responsibility across different segments of the embodied AI supply chain and avoiding excessive duplication. Second, success will also depend on whether sufficient and sustainable financing will be available to support the long-term growth of local embodied AI companies. The latter may prove difficult given the substantial debt many local governments have accumulated over the past decade. This debt was driven by provincial governments responding to Beijing’s national growth targets, which encouraged heavy borrowing for investment in infrastructure and manufacturing such as electric vehicles, regardless of returns.36 Third, technological advances could plateau, and fundamental breakthroughs could be limited. Whether embodied AI systems will ultimately match the dexterity, adaptability, and efficiency of human workers remains uncertain and could take far longer than many current industrial plans assume.37

Where The Industry Stands and Where It Is Headed 

While the role of academia and local governments is crucial, Beijing’s bet on embodied AI ultimately hinges on the industry’s ability to deliver breakthroughs that will make the technology cost-effective and scalable. That effort is already well underway.

National champions are already emerging across key sectors in China’s embodied AI industry: Unitree and UBTech in humanoid robotics,38 DJI in drones,39 and Baidu Apollo and XPeng in autonomous vehicles.40 In each of these cases, industry-driven capabilities in robotics, manufacturing, and EVs formed the foundation for integrating AI into existing company infrastructures. 

China’s advantage in embodied AI lies in its robust hardware manufacturing base and supply chain, which could position it to rapidly scale production once the technology matures.

Compared to the rest of the world, China’s advantage in embodied AI lies in its robust hardware manufacturing base and supply chain, which could position it to rapidly scale production once the technology matures.41 This includes cost-effective manufacturing of embodied AI hardware platforms such as robots and drones, as well as some key components of embodied AI systems, such as LiDAR, the sensor technology essential for 3D mapping and environmental navigation.42 As the world’s largest manufacturer, China also holds a competitive edge in the diversity and abundance of real-world application scenarios, which generate rich datasets critical for training embodied AI systems—many of which are intended for use in production lines.43 However, Chinese developers often struggle to access this data, prompting some industry leaders to call on the government to grant companies access to datasets held by government agencies and state-owned enterprises.44 

Much like the rest of the world, China’s embodied AI technology remains at a relatively early stage, with the industry still lacking a complete commercial cycle. It largely remains confined to R&D, small-batch production, and pilot applications in specific scenarios, with key components such as balance and motion control software still under development.45 However, major industry players are increasingly pushing to accelerate large-scale production and deployment. UBTech, for instance, recently secured a 250 million RMB (approximately $35 million) order for its Walker S2 industrial humanoid robots from an undisclosed domestic company.46

China’s comparative weaknesses, on the other hand, include limited access to advanced AI chips and compute for both training and inference of models used in embodied AI systems. It still trails the West, though it is making steady progress.47 Another key limitation lies in multidimensional sensors, such as torque sensors and six-axis force sensors for high-precision force measurement and perception in embodied AI systems, where China still relies heavily on Western imports.48 

Despite these limitations, most Chinese experts and industry leaders believe that the country’s existing advantages, combined with steady progress in lagging areas and sustained government support, could enable steady technological progress in embodied AI within the next five to ten years.49 This progress would culminate in the development and deployment of AI systems capable of performing physical work in real-economy settings, advancing Xi Jinping’s conceptual objective for the use of AI in the real economy.50 Meanwhile, some Chinese experts, such as Huang Tiejun, Director of the Beijing Academy of Artificial Intelligence—one of China’s leading AI labs working on embodied AI—have made longer-term predictions. Huang has suggested that embodied AI could fully surpass human capabilities by 2045.51 

Conclusion 

Beijing is making a long-term strategic bet on embodied AI, viewing it as a potential solution to many of China’s economic, social, and geopolitical challenges. Yet, whether this gambit succeeds, and in what ways, remains uncertain. Technological breakthroughs may prove elusive, and high levels of local debt could constrain sustained investment. Nevertheless, Chinese leadership appears determined to push forward, committing substantial political will and financial resources to make embodied AI a viable and scalable technology.

Should Beijing develop fully commercially applicable embodied AI systems ahead of other countries, it would likely leverage its comparative advantages—a robust manufacturing base, comprehensive supply chains, and hundreds of thousands of factories where robots are likely to be deployed in the future—to rapidly scale up production and position itself as a dominant player in the global embodied AI systems market. In such a scenario, the world could depend on Chinese embodied AI technologies across critical economic sectors, including manufacturing, logistics, healthcare, and service industries. It would also greatly strengthen China’s capabilities in autonomous warfare, potentially influencing the balance of power in the Indo-Pacific and beyond—especially if China chooses to export military applications of embodied AI to countries such as Russia. And if embodied AI ultimately holds the key to unlocking true AGI, Beijing could hold a decisive edge in its frontier AI rivalry with Washington. How the United States responds to this challenge will not just determine the trajectory of AI development but also shape the balance of economic and military power in the decades ahead.

Acknowledgments

The authors thank Jon Bateman, Matt Sheehan, Irene Solaiman, Noah Tan, Teddy Tawil, and Sam Winter-Levy for their helpful feedback on previous drafts. The writers acknowledge the use of LLM tools for preliminary desk research and clarificatory editing.

Notes

Carnegie does not take institutional positions on public policy issues; the views represented herein are those of the author(s) and do not necessarily reflect the views of Carnegie, its staff, or its trustees.