Google DeepMind's new RT-2 system enables robots to perform novel tasks


As artificial intelligence advances, we look to a future with more robots and automation than ever before. Robots already surround us -- the robot vacuum that expertly navigates your home, the robot pet companion that entertains your furry friends, and the robot lawnmower that takes over weekend chores. We appear to be inching toward living out The Jetsons in real life. But as smart as they appear, these robots have their limitations.

Google DeepMind unveiled RT-2, the first vision-language-action (VLA) model for robot control, which takes robotics several levels up. The system was trained on text and images from the internet, much as the large language models behind AI chatbots such as ChatGPT and Bing are.


Our robots at home can perform the simple tasks they are programmed to do. A robot vacuum, for example, is told to vacuum the floors and, if its left-side sensor detects a wall, to try to go around it. But traditional robotic control systems aren't programmed to handle new situations and unexpected changes -- often, they can't perform more than one task at a time.
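To make the limitation concrete, here is a minimal sketch of what such a hand-programmed controller might look like. Everything in it is hypothetical and for illustration only: the `SensorReading` type, the `rule_based_step` function, and the command names are assumptions, not code from any real robot vacuum or from Google DeepMind.

```python
from dataclasses import dataclass


@dataclass
class SensorReading:
    """Hypothetical bump-sensor state for a robot vacuum."""
    left_blocked: bool
    right_blocked: bool


def rule_based_step(reading: SensorReading) -> str:
    """Pick a motor command from fixed, hand-written rules."""
    if reading.left_blocked and reading.right_blocked:
        return "reverse"      # boxed in on both sides: back out
    if reading.left_blocked:
        return "turn_right"   # wall on the left: steer away from it
    if reading.right_blocked:
        return "turn_left"    # wall on the right: steer away from it
    return "forward"          # nothing detected: keep vacuuming


if __name__ == "__main__":
    print(rule_based_step(SensorReading(left_blocked=True, right_blocked=False)))
```

Every behavior here is a branch someone wrote in advance. A situation with no matching rule, such as being asked to pick something up, simply has no code path, which is the gap a learned model like RT-2 is meant to address.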

RT-2 is designed to adapt to new situations over time, learn from multiple data sources, such as web and robotics data, to understand both language and visual input, and perform tasks it has never encountered or been trained on.

"A visual-language model (VLM) pre-trained on web-scale data is learning from RT-1 robotics data to become RT-2, a visual-language-action (VLA) model that can control a robot," according to Google DeepMind.

A traditional robot can be trained to pick up a ball yet stumble when asked to pick up a cube. With RT-2's more flexible approach, a robot trained to pick up a ball can figure out how to adjust its extremities to pick up a cube or another toy it has never seen before.

Traditional robots require time-consuming, real-world training on billions of data points, in which they have to physically recognize an object and learn how to pick it up. RT-2 is instead trained on a large amount of data and can transfer that knowledge into action, performing tasks it has never experienced before.


"RT-2's ability to transfer information to actions shows promise for robots to more rapidly adapt to novel situations and environments," said Vincent Vanhoucke, Google DeepMind's head of robotics. "In testing RT-2 models in more than 6,000 robotic trials, the team found that RT-2 functioned as well as our previous model, RT-1, on tasks in its training data, or 'seen' tasks. And it almost doubled its performance on novel, unseen scenarios to 62% from RT-1's 32%."

Some of the examples of RT-2 at work that were published by Google DeepMind.


The DeepMind team adapted two existing models, Pathways Language and Image Model (PaLI-X) and Pathways Language Model Embodied (PaLM-E), to train RT-2. PaLI-X, which was trained on massive amounts of online images and visual information paired with corresponding descriptions and labels, helps the model process visual data. With PaLI-X, RT-2 can recognize different objects, understand the surrounding scene for context, and relate visual data to semantic descriptions.

PaLM-E helps RT-2 interpret language, so it can easily understand instructions and relate them to what is around it and what it's currently doing.


As the DeepMind team adapted these two models to work as the backbone for RT-2, it created the new VLA model, enabling a robot to understand language and visual data and subsequently generate the appropriate actions it needs.

RT-2 is not a robot in itself -- it's a model that can control robots more efficiently than ever before. An RT-2-enabled robot can use visual and language data to perform tasks of varying complexity, such as organizing files alphabetically: reading the labels on the documents, sorting them, and then putting them away in the correct places.

It could also handle more complex, multi-step requests. For instance, if you said, "I need to mail this package, but I'm out of stamps," RT-2 could identify what needs to be done first, such as finding a nearby post office or merchant that sells stamps, then take the package and handle the logistics from there.


"Not only does RT-2 show how advances in AI are cascading rapidly into robotics, it shows enormous promise for more general-purpose robots," Vanhoucke added.

Let's hope that 'promise' leans more towards living out The Jetsons' plot than The Terminator's.

