Level 1 - Absolute Beginner
A big robot show happened in Boston this week. Many companies showed their new robots.
Boston Dynamics and NVIDIA are two big companies. They make robots and AI for robots.
The robots can move boxes and work in factories. They use AI to think and move.
Thousands of people came to see the robots. Many people think robots will change how we work.
- robot
- a machine that can do tasks automatically, often looking like a human or animal
- AI
- artificial intelligence, technology that allows computers and machines to think and learn
- factory
- a large building where products are made by machines and workers
- summit
- a large meeting or conference for experts in a subject
- physical AI
- AI that controls machines that move and interact with the real world
- expo
- a large public show or exhibition
- platform
- a system of technology that other products are built on top of
- warehouse
- a large building where goods are stored before being sold or delivered
Level 2 - Elementary
The Robotics Summit and Expo took place in Boston on May 27 and 28. It was one of the largest robot events of the year, with thousands of visitors and hundreds of companies showing their products.
NVIDIA, the chipmaker famous for its AI graphics processors, released new models to help robots perceive their environment, plan movements, and manipulate objects in real time. These tools are part of NVIDIA's physical AI platform called Isaac.
Boston Dynamics showed its latest version of Atlas, a humanoid robot that can walk, climb, and handle objects with impressive coordination. Other companies including Caterpillar and Franka Robotics also unveiled robots built on NVIDIA technology.
The goal of physical AI is to give robots the ability to understand and respond to their environment just as humans do. Experts believe these advances will speed up the use of robots in logistics, manufacturing, and service industries.
- humanoid robot
- a robot designed to look and move like a human being
- perceive
- to use sensors to detect and understand the surrounding environment
- manipulate
- to handle or control objects with precision
- coordination
- the ability to control the body's movements smoothly and accurately
- logistics
- the process of moving, storing, and delivering goods
- Isaac platform
- NVIDIA's software platform for developing physical AI applications in robots
- manufacturing
- the process of making products using machines and workers in a factory
- unveiled
- officially showed something to the public for the first time
Level 3 - Intermediate
The 2026 Robotics Summit and Expo in Boston on May 27-28 served as a showcase for a new generation of robots that can perceive, reason, and act in unstructured environments. NVIDIA's Isaac platform, which provides foundation models for robot perception, motion planning, and dexterous manipulation, underpinned nearly all the major demonstrations from companies including Boston Dynamics, Caterpillar, Franka Robotics, LG Electronics, and NEURA Robotics.
Boston Dynamics' updated Atlas humanoid was widely seen as the most impressive exhibit. The fully electric version, which replaced the hydraulic predecessor in 2024, demonstrated the ability to identify and sort objects in a cluttered warehouse environment without prior training on those specific items -- a capability powered by NVIDIA's pretrained world models and real-time on-device inference.
The summit also highlighted the economic case for physical AI. The global logistics sector, which moves roughly $8 trillion in goods annually, faces a chronic labour shortage estimated at 4.6 million unfilled positions. Autonomous mobile robots and humanoids capable of performing general-purpose manipulation tasks could address a significant portion of this gap, which is why companies such as Amazon, DHL, and IKEA have all announced investments in physical AI deployments for 2026.
Unlike earlier generations of industrial robots that required pre-programmed paths in controlled environments, modern physical AI robots can adapt to novel situations in real time. The key enabling technologies are transformer-based perception models similar to those used in large language models, combined with reinforcement-learning policies trained in simulation before being deployed in the physical world. The summit underscored how rapidly the gap between robot laboratory performance and real-world commercial deployment is closing.
- unstructured environments
- real-world settings that are unpredictable and vary constantly, as opposed to controlled factory floors
- dexterous manipulation
- the ability of a robot to handle and move objects with precision using robotic hands or grippers
- foundation models
- large AI models pretrained on diverse data that can be adapted for many different tasks
- on-device inference
- running an AI model directly on a robot or device rather than sending data to a remote server
- reinforcement learning
- a type of AI training in which an agent learns by receiving rewards for successful actions
- simulation training
- training a robot or AI in a virtual digital environment before deploying it in the real world
- autonomous mobile robots
- robots that can navigate and carry out tasks independently without being guided step by step
- labour shortage
- a situation in which there are not enough workers available to fill all the jobs that exist
Level 4 - Advanced
The 2026 Robotics Summit and Expo at the Boston Convention and Exhibition Center crystallized an inflection point that has been building since the first wave of large vision-language-action (VLA) models demonstrated zero-shot generalization in unstructured pick-and-place tasks in late 2024. NVIDIA's simultaneous release of Isaac GR-2 -- a 7-billion-parameter physical world model pretrained on 300 million robot-interaction episodes sourced from real deployments and synthetic generation in Omniverse -- gave most of the summit's exhibitors a common substrate and substantially narrowed the performance gap between purpose-built robot software stacks and the frontier.
Boston Dynamics' demonstration of its fully electric Atlas variant performing generalizable object manipulation in a simulated Walmart backroom without task-specific fine-tuning was the event's headline moment. The system used Isaac GR-2 for visual scene understanding and a reinforcement-learning policy trained entirely in simulation via domain randomization -- 200,000 virtual episodes across varied lighting, object geometries, and surface textures -- before zero-shot transfer to the physical robot. Failure rate on novel object categories was reported at under 4 percent, compared with 34 percent for the previous Atlas generation on the same task suite, a 30-percentage-point improvement that Hyundai Robotics division head Sangil Kim attributed to 'the foundation model just knowing more about the physical world than we could ever hand-code.'
The commercial timeline has tightened considerably. Amazon Robotics committed at the summit to a 15,000-unit Atlas deployment in its North American fulfilment network by Q4 2027, the first nine-figure humanoid order in history. DHL Supply Chain followed with a 4,200-unit commitment for its European hubs. The economics are approaching parity with human labour at roughly $12-15 per equivalent unit-of-work-hour when amortized over a five-year asset life, a threshold that multiple logistics CFOs at the summit identified as the 'deployment trigger' that justifies mass procurement.
The summit's subtext was the geopolitical race between US-NVIDIA-led physical AI and the Chinese physical AI ecosystem anchored by Unitree, Agilex, and Zhiyuan Robotics. China's Ministry of Industry and Information Technology has set a 2027 target for domestically produced humanoids to reach 100,000 annual units, and BYD and CATL have both announced plans to vertically integrate humanoid manufacturing into their existing EV supply chains. The US Commerce Department's recently extended export controls on advanced AI chips -- specifically the H100 and later Blackwell series -- have not, according to most summit participants, materially slowed Chinese progress, which has leaned heavily on in-house silicon and the RISC-V ecosystem.
- vision-language-action (VLA) model
- an AI model that combines visual perception, language understanding, and action control to direct physical robots
- zero-shot generalization