Friday, July 4, 2025
HomeAIHow Google built its Gemini robotics models

How Google built its Gemini robotics models

Published on

spot_img


“We’d trained models to help robots with specific tasks and to understand natural language before, but this was a step change,” Carolina says. “The robot had never seen anything related to basketball, or this specific toy. Yet it understood something complex — ‘slam dunk the ball’ — and performed the action smoothly. On its first try.

This all-rounder robot was powered by a Gemini Robotics model that is part of a new family of multimodal models for robotics. The models build upon Gemini 2.0 through fine-tuning with robot-specific data, adding physical action to Gemini’s multimodal outputs like text, video and audio. “This milestone lays the foundation for the next generation of robotics that can be helpful across a range of applications,” said Google CEO Sundar Pichai when announcing the new models on X.

The Gemini Robotics models are highly dextrous, interactive and general, meaning they can drive robots to react to new objects, environments and instructions without further training. Helpful, given the team’s ambitions.

“Our mission is to build embodied AI to power robots that help you with everyday tasks in the real world,” says Carolina, whose fascination with robotics began with childhood sci-fi cartoons, fueled by dreams of automated chores. “Eventually, robots will be just another surface on which we interact with AI, like our phones or computers — agents in the physical world.”



Source link

Latest articles

Soham Parekh joins Darwin Studios after Silicon Valley backlash for moonlighting

Soham Parekh announced he is joining AI startup Darwin Studios exclusively, after facing...

Dust hits $6M ARR helping enterprises build AI agents that actually do stuff instead of just talking

Want smarter insights in your inbox? Sign up for our weekly newsletters to...

ChatGPT, Claude and Gemini not helping? Here’s how to fix your prompts for better output

AI chatbots like OpenAI’s ChatGPT, Google’s Gemini, and Anthropic’s Claude are increasingly woven...

A Leading Indicator Has Emerged Suggesting That the AI Industry Is Cooked

In the world of cultural commentary, there are plenty of benchmarks used to...

More like this

Soham Parekh joins Darwin Studios after Silicon Valley backlash for moonlighting

Soham Parekh announced he is joining AI startup Darwin Studios exclusively, after facing...

Dust hits $6M ARR helping enterprises build AI agents that actually do stuff instead of just talking

Want smarter insights in your inbox? Sign up for our weekly newsletters to...

ChatGPT, Claude and Gemini not helping? Here’s how to fix your prompts for better output

AI chatbots like OpenAI’s ChatGPT, Google’s Gemini, and Anthropic’s Claude are increasingly woven...