Back to News
Market Impact: 0.65

Gemini Robotics 1.5 brings AI agents into the physical world

GOOGLGOOG
Artificial IntelligenceTechnology & InnovationProduct LaunchesRegulation & Legislation
Gemini Robotics 1.5 brings AI agents into the physical world

Google has introduced Gemini Robotics 1.5 and Gemini Robotics-ER 1.5, two advanced AI models designed to enable intelligent, general-purpose robots to perceive, plan, think, use tools, and execute complex, multi-step physical tasks. Gemini Robotics-ER 1.5, a vision-language model, excels at embodied reasoning, planning, and integrates digital tools like Google Search, achieving state-of-the-art spatial understanding. Gemini Robotics 1.5, a vision-language-action model, translates instructions into motor commands and can 'think' before acting, explaining its processes and learning across different robot embodiments. These models represent a significant step towards Artificial General Intelligence (AGI) in the physical world, with Gemini Robotics-ER 1.5 now available to developers via the Gemini API.

Analysis

Alphabet (GOOGL) has announced a significant advancement in its embodied AI capabilities with the introduction of Gemini Robotics 1.5 and Gemini Robotics-ER 1.5. This launch represents a strategic push into general-purpose robotics, creating a two-part agentic framework where one model (ER 1.5) handles high-level reasoning and planning, while the other (1.5) executes the physical actions. The reasoning model, Gemini Robotics-ER 1.5, has achieved state-of-the-art performance on spatial understanding benchmarks and can natively call digital tools like Google Search, a key differentiator for complex problem-solving. A critical technical breakthrough highlighted is the ability for the action model to learn across different robot embodiments, allowing skills learned on one robot to transfer to another, which could drastically reduce development friction and accelerate scalability in the robotics industry. By making the Gemini Robotics-ER 1.5 model available to developers via its API, Google is positioning itself to build a foundational platform for robotic applications, aiming to solve what it terms "AGI in the physical world." The company's proactive discussion of safety, including an upgraded ASIMOV benchmark, addresses potential regulatory and ethical concerns, which is critical for a technology with such profound real-world implications.