Google's RT-2 Robot Learns Tasks from Internet Videos and Text

Google DeepMind

Jan 5, 2024 00:00

Google Robotics Team

1 views

roboticsgooglert-2vision-languagegeneralizationmanipulation

Summary

Google's RT-2 enables robots to learn from internet content, improving task generalization.

Google DeepMind has introduced RT-2 (Robotics Transformer 2), a vision-language-action model that enables robots to learn new tasks by observing internet videos and reading text instructions. Unlike traditional robotics systems that require extensive task-specific training, RT-2 can generalize from web-scale data to perform novel manipulation tasks. The system demonstrates the ability to understand abstract concepts and apply them to physical actions, such as "pick up the extinct animal" when presented with toy dinosaurs among other objects. This approach represents a significant step toward more adaptable and intelligent robotic systems.

Read Full Article More News

Generative AI to quantify uncertainty in weather forecasting

Google AI BlogMar 29

Posted by Lizao (Larry) Li, Software Engineer, and Rob Carver, Research Scientist, Google Research Accurate weather forecasts can have a direct impact on people’s lives, from helping make routine deci...

AutoBNN: Probabilistic time series forecasting with compositional bayesian neural networks

Google AI BlogMar 28

Posted by Urs Köster, Software Engineer, Google Research Time series problems are ubiquitous, from forecasting weather and traffic patterns to understanding economic trends. Bayesian approaches start ...

Computer-aided diagnosis for lung cancer screening

Google AI BlogMar 20

Posted by Atilla Kiraly, Software Engineer, and Rory Pilgrim, Product Manager, Google Research Lung cancer is the leading cause of cancer-related deaths globally with 1.8 million deaths reported in 20...

Google's RT-2 Robot Learns Tasks from Internet Videos and Text

Summary

Related Articles

Generative AI to quantify uncertainty in weather forecasting

AutoBNN: Probabilistic time series forecasting with compositional bayesian neural networks

Computer-aided diagnosis for lung cancer screening