Three reasons why robots are about to enter the "ChatGPT moment."
Since the inception of robotics, practitioners in the field have always aspired to create robots capable of performing various household chores. However, for a long time, this remained an elusive dream.
Although roboticists have been able to make robots perform some impressive feats in laboratories, such as parkour, these tasks typically require meticulous planning in a strictly controlled environment.
This makes it difficult for robots to work reliably at home, especially in households with children and pets. Moreover, each house is constructed differently, and there are various chaotic situations that can arise.
There is a famous observation in the field of robotics known as Moravec's Paradox: what is difficult for humans is easy for machines, and what is easy for humans is hard for robots.
Now, with artificial intelligence, this situation is changing. Robots are beginning to be able to perform tasks such as folding clothes and cooking, which not long ago were considered almost impossible to accomplish.In the cover story of the latest issue of "MIT Technology Review," I explored how the field of robotics is reaching its turning point.
The field of robotics research has seen an incredibly exciting convergence of technologies, which may (just may) allow robots to step out of the laboratory and into our homes.
Here are three reasons why robotics is about to experience its "ChatGPT moment."
---
Affordable hardware makes research easier to carry out
Robots are expensive. Highly sophisticated robots start at hundreds of thousands of dollars, making them unaffordable for most researchers. For instance, the first batch of home robots, PR2, weighing 440 pounds, was priced at $400,000.But new, cheaper robots allow more researchers to do some cool things. A startup called Hello Robot has developed and launched a new robot named Stretch, priced at about $18,000, weighing about 22.6 kilograms.
It has a small mobile base, a pole with a camera hanging on it, an adjustable arm, and at the end, there is a suction cup that can be controlled with a controller.
Meanwhile, a team from Stanford University in the United States has built a system called Mobile ALOHA (an acronym for "Affordable Low-cost Open-source Hardware for Autonomous Robotic Operations"), which has learned to cook shrimp relying only on data from 20 human demonstrations and other tasks.
They have cobbled together a cheaper robot using off-the-shelf components, priced in tens of thousands of dollars, instead of hundreds of thousands.Artificial Intelligence is helping us build a "Robot Brain"
The software of these new robots is different from that of the past. Due to the rapid development of artificial intelligence, the current research focus is shifting from making expensive robots more flexible to building a "universal robot brain" in the form of neural networks.
Roboticists have already begun to use deep learning and neural networks to create systems that practice and learn in the environment, adjusting their behavior accordingly, rather than traditional planning and training.
In the summer of 2023, Google introduced a visual language action model called RT-2. This model gains a general understanding of the world through online text and images, as well as its own interactions. It translates these data into robot actions.
Researchers from the Toyota Research Institute, Columbia University, and the Massachusetts Institute of Technology have been able to quickly teach robots to perform many new tasks with the help of artificial intelligence learning techniques and generative artificial intelligence, known as imitation learning.They believe they have found a method that will propel generative artificial intelligence technology from the realms of text, images, and videos to the field of robotic motion.
Many are attempting to harness generative artificial intelligence. Covariant is a robotics startup spun off from OpenAI's now-defunct robotics research division, and it has developed a multimodal model called the RFM-1.
It can accept prompts in the form of text, images, videos, robotic instructions, or measurements (data). Generative artificial intelligence enables robots to both understand instructions and generate images or videos related to these tasks.
More data, more skills.The formidable capabilities of large artificial intelligence models like GPT-4 stem from the vast amount of data collected from the internet. However, this does not apply to robots, as they require data specifically collected for robots.
They need demonstration data on how to operate washing machines and refrigerators, as well as how to pick up plates, how to fold clothes, and so on. Currently, such data is very scarce, and it takes humans a long time to collect it.
Google DeepMind has initiated a new initiative called "Open X Avatar Collaboration" aimed at changing this situation.
In 2023, the company collaborated with 34 research laboratories and approximately 150 researchers to collect data from 22 different robots, including Hello Robot's Stretch robot.
The resulting dataset, released in October 2023, showcases 527 skills of the robots, such as picking up objects, pushing, and moving.Early indications suggest that more data is giving rise to smarter robots. Researchers have constructed two versions of a model for robots, called RT-X, which can run locally on computers in various laboratories or be accessed via the internet.
The larger, internet-accessible model is pre-trained with internet data to develop "visual common sense," or a basic understanding of the world, from large language and image models.
When researchers ran the RT-X model on many different robots, they found that the success rate of these robots learning skills was 50% higher than the systems being developed in each laboratory.
Comments
Share your experience
Related Articles
The most comprehensive solid-state drive purchase strategy, how to start more co
Nowadays, the demand for storage media is increasing among more and more people, and solid-state drives (SSDs) are no lo...
After 11 Years Without Children, Wife's Pregnancy Raises Questions For Her Infertile Husband-1
10,000-Year-Old "Airplane Rock Art" Found in Hidden Cave: Proof of Alien Civilization? The Truth Revealed!
Fudan team reveals a new mechanism of plasmonics, which can be extended to other
In light of the current energy crisis and environmental pollution, the storage, conversion, and utilization of clean ene...
Academician Qiao Shizhang's team develops a bimetallic catalyst, achieving high
Recently, the team of Academician Qiao Shi Zhang from the University of Adelaide in Australia has successfully enabled l...
What does CPU mean | The working principle of CPU.
CPU, fully known as the Central Processing Unit, is the core component of a computer's hardware system, responsible for ...
My Husband Installed A Camera In The Bedroom. The Truth Is Beyond My Belief-9
Breaking: The Top 10 Grossest Foods on Earth – Why Do People Actually Eat These?
My Husband Installed A Camera In The Bedroom. The Truth Is Beyond My Belief-2
Researchers propose a new concept of artificial intelligence that allows large l
Recently, a team led by Xu Hatao, a doctoral student at Nanyang Technological University in Singapore and a research ass...
The Northeast is blowing the "national car" wind! Hongqi EH7, what's the big dif
In the winter of the 23rd year, the "Northeast" ice and snow were very popular. In the spring of the 24th year, the "Nor...
After 11 Years Without Children, Wife's Pregnancy Raises Questions For Her Infertile Husband-12
My Husband Installed A Camera In The Bedroom. The Truth Is Beyond My Belief-13
Scientists create an organic semiconductor glass thin film that can be used to m
Recently, Luo Peng and his team, who are engaged in postdoctoral research at the University of Pennsylvania in the Unite...
Fudan team develops a jailbreak attack framework, revealing new patterns of the
Recently, Wang Xiao, a Ph.D. student at Fudan University, and his team developed the first unified jailbreak attack fram...
How to read memory parameters.
We can usually see the model number and parameters on the label of a memory module (SDRAM). The model number is defined ...
My Husband Installed A Camera In The Bedroom. The Truth Is Beyond My Belief-8
After 11 Years Without Children, Wife's Pregnancy Raises Questions For Her Infertile Husband-11
Only after watching the development history of kitchen appliances can we underst
Did everyone watch the popular TV series "Fan Hua" recently?In addition to the bustling traffic on the Yellow River Road...
What is a semiconductor?
Recently, I took the opportunity during my free time to go through some course materials on semiconductors, and found so...
After 11 Years Without Children, Wife's Pregnancy Raises Questions For Her Infertile Husband-3
My Husband Installed A Camera In The Bedroom. The Truth Is Beyond My Belief-12
From space lighting to thousands of households, how does Opple Lighting roll a b
April 24th marks the 9th "Chinese Space Day" in China. At this very moment, at an orbital altitude of 390 kilometers abo...
My Husband Installed A Camera In The Bedroom. The Truth Is Beyond My Belief-4
1 Death Every 10 Minutes! Top 10 Fatal Diseases Rising in the West – No.1 Is Untreatable
2023 monitor recommendation + purchase guide! IPS, VA, TN panels, which one to c
This content is sourced from @What's Worth Buying APP, and the views expressed are solely those of the author.Before del...
Three reasons why robots are about to enter the "ChatGPT moment."
Since the inception of robotics, practitioners in the field have always aspired to create robots capable of performing v...
Xi'an Jiaotong University proposes a new paradigm for the synthesis of polysacch
Recently, considering the global supply and demand tension of essential goods such as alternative proteins and biopolyme...
How does photovoltaic generate electricity?
Photovoltaic power generation, as a green and renewable source of energy, is being increasingly applied to our daily liv...
A comprehensive look at LLM alignment techniques: RLHF, RLAIF, PPO, DPO...
To align Large Language Models (LLMs), researchers have come up with numerous ingenious solutions.LLMs are powerful, but...