Gemini Robotics: Google DeepMind’s New AI Models for Robots

Generative AI models are getting closer to taking action in the real world. Already, the big AI companies are introducing AI agents that can take care of web-based busywork for you, ordering your groceries or making your dinner reservation. Today, Google DeepMind announced two generative AI models designed to power tomorrow's robots.
Both models are built on Google Gemini, a multimodal foundation model that can process text, voice, and image data to answer questions, give advice, and generally help out. DeepMind calls the first of the new models, Gemini Robotics, an "advanced vision-language-action model," meaning that it can take all those same inputs and then output instructions for a robot's physical actions. The models are designed to work with any hardware system, but were mostly tested on the two-armed Aloha 2 system that DeepMind introduced last year.
In a demonstration video, a voice says: "Pick up the basketball and slam dunk it" (at 2:27 in the video below). Then a robot arm carefully picks up a miniature basketball and drops it into a miniature net. While it wasn't an NBA-level dunk, it was enough to get the DeepMind researchers excited.
https://www.youtube.com/watch?
This demo video shows the capabilities of the Gemini Robotics foundation model to control robots. Credit: Google DeepMind
"This basketball example is one of my favorites," said Kanishka Rao, the principal software engineer for the project, in a press briefing. He explains that the robot had never seen anything related to basketball, but that its underlying foundation model had a general understanding of the game, knew what a basketball net looks like, and understood what the term "slam dunk" means. The robot was therefore "able to connect those [concepts] to actually accomplish the task in the physical world," says Rao.
What are the advances of Gemini Robotics?
Carolina Parada, head of robotics at Google DeepMind, said in the briefing that the new models improve on the company's prior robots in three dimensions: generalization, adaptability, and dexterity. All of these advances are necessary, she said, to create "a new generation of helpful robots."
Generalization means that a robot can apply a concept that it has learned in one context to another situation. The researchers looked at visual generalization (for example, does the robot get confused if the color of an object or background changes), instruction generalization (can it interpret commands that are worded in different ways), and action generalization (can it perform an action it has never done before).
Parada also says that robots powered by Gemini can better adapt to changing instructions and circumstances. To demonstrate that point in a video, a researcher told a robot arm to put a bunch of plastic grapes into the clear Tupperware container, then proceeded to shift three containers around on the table in an approximation of a shyster's shell game. The robot arm followed the clear container around until it could fulfill its directive.
https://www.youtube.com/watch?
Google DeepMind says that Gemini Robotics is better than previous models at adapting to changing instructions and circumstances. Credit: Google DeepMind
As for dexterity, demo videos showed the robotic arms folding a piece of paper into an origami fox and performing other delicate tasks. However, it's important to note that the impressive performance here comes in the context of a narrow set of high-quality data that the robot was trained on for these specific tasks, so the level of dexterity that these tasks represent is not being generalized.
What is embodied reasoning?
The second model introduced today is Gemini Robotics-ER, with the ER standing for "embodied reasoning," the sort of intuitive understanding of the physical world that humans develop with experience over time. We're able to do clever things like look at an object we've never seen before and make an educated guess about the best way to interact with it, and this is what DeepMind seeks to emulate with Gemini Robotics-ER.
Parada gave an example of Gemini Robotics-ER's ability to identify an appropriate grasping point for picking up a coffee cup. The model correctly identifies the handle, because that's where humans tend to grasp coffee mugs. However, this illustrates a potential weakness of relying on human-centric training data: For a robot, especially one that might be able to comfortably handle a mug of hot coffee, a thin handle might be a less reliable grasping point than a more enveloping grasp of the mug itself.
DeepMind's approach to robotic safety
Vikas Sindhwani, DeepMind's head of robotic safety for the project, says the team took a layered approach to safety. It starts with classic physical safety controls that manage things like collision avoidance and stability, but also includes "semantic safety" systems that evaluate both the robot's instructions and the consequences of following them. These systems are most sophisticated in the Gemini Robotics-ER model, says Sindhwani, which is "trained to evaluate whether or not a potential action is safe to perform in a given scenario."
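To make the layered idea concrete, here is a minimal Python sketch of how a physical check and a semantic check might be chained, with an action rejected as soon as any layer objects. This is an illustration only and not DeepMind's implementation: the Action fields, the 5-centimeter clearance threshold, and the keyword list standing in for a trained semantic model are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass
class Action:
    """A proposed robot action (hypothetical structure for illustration)."""
    description: str                # natural-language summary of the action
    min_obstacle_distance_m: float  # assumed output of a motion planner


# Each layer returns None if it has no objection, else a reason string.
Check = Callable[[Action], Optional[str]]


def physical_check(action: Action, min_clearance_m: float = 0.05) -> Optional[str]:
    """Classic physical layer: reject motions that pass too close to obstacles."""
    if action.min_obstacle_distance_m < min_clearance_m:
        return "physical: clearance below threshold"
    return None


# Stand-in for a learned semantic-safety model; a real system would not
# rely on keyword matching. The phrases echo examples from the article.
HAZARDS = ("bleach and vinegar", "hot stove")


def semantic_check(action: Action) -> Optional[str]:
    """Semantic layer: flag actions whose consequences look unsafe."""
    text = action.description.lower()
    for hazard in HAZARDS:
        if hazard in text:
            return f"semantic: involves '{hazard}'"
    return None


def evaluate(action: Action, layers: List[Check]) -> Tuple[bool, str]:
    """Run the layers in order and stop at the first objection."""
    for layer in layers:
        reason = layer(action)
        if reason is not None:
            return False, reason
    return True, "ok"


LAYERS: List[Check] = [physical_check, semantic_check]
```

In this sketch, calling evaluate(Action("put soft toy on hot stove", 0.2), LAYERS) is rejected by the semantic layer even though the motion itself clears the physical check, which is the kind of case the semantic layer exists to catch.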
And because "safety is not a competitive endeavor," Sindhwani says, DeepMind is releasing a new data set and what it calls the Asimov benchmark, which is intended to measure a model's ability to understand common-sense rules of life. The benchmark contains both questions about visual scenes and text scenarios, asking for models' opinions on things like the desirability of mixing bleach and vinegar (a combination that produces chlorine gas) and putting a soft toy on a hot stove. In the press briefing, Sindhwani said that the Gemini models had "strong performance" on that benchmark, and the technical report showed that the models got more than 80 percent of questions correct.
DeepMind's robotic partnerships
Back in December, DeepMind and the humanoid robotics company Apptronik announced a partnership, and Parada says that the two companies are working together "to build the next generation of humanoid robots with Gemini at its core." DeepMind is also making its models available to a group of "trusted testers": Agile Robots, Agility Robotics, Boston Dynamics, and Enchanted Tools.