Google's Gemini Robotics: A Slam Dunk on First Try

2025-04-02

Google has showcased Gemini Robotics, a new model that lets robots perform complex tasks such as slam dunking a basketball on the first try, with no prior training on that specific object or action. Built on Gemini 2.0 and fine-tuned with robot-specific data, the model extends Gemini's multimodal capabilities (text, video, audio) by adding physical actions as an output. Google describes it as highly dexterous, interactive, and general: it adapts to new objects, environments, and instructions without further training. Google's ambition is to build embodied AI that powers robots assisting with everyday tasks, with robots eventually becoming as commonplace an interface for AI as phones or computers.