Saturday, May 24, 2025
News PouroverAI
Visit PourOver.AI
No Result
View All Result
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing
News PouroverAI
No Result
View All Result

Researchers from Stanford Propose ‘EquivAct’: A Breakthrough in Robot Learning for Generalizing Tasks Across Different Scales and Orientations

November 2, 2023
in AI Technology
Reading Time: 4 mins read
0 0
A A
0
Share on FacebookShare on Twitter


Humans can extrapolate and learn to solve variations of a manipulation task if the objects involved have varied visual or physical attributes, given just a few examples of how to complete the task with standard objects. To make the learnt policies universal to different object scales, orientations, and visual appearances, existing studies in robot learning still need considerable data augmentation. Despite these enhancements, however, generalization to undiscovered variations is not guaranteed.

A new paper by Stanford University investigates the challenge of zero-shot learning of a visuomotor policy that may take as input a small number of sample trajectories from a single source manipulation scenario and generalize to scenarios with unseen object visual appearances, sizes, and poses. In particular, it was important to learn policies to deal with deformable and articulated objects, like clothes or boxes, in addition to rigid ones, like pick-and-place. To ensure that the learnt policy is robust across different object placements, orientations, and scales, the proposal was to incorporate equivariance into the visual object representation and policy architecture.

They present EquivAct, a novel visuomotor policy learning approach that can learn closed-loop policies for 3D robot manipulation tasks from demonstrations in a single source manipulation scenario and generalize zero-shot to unseen scenarios. The learnt policy takes as input the robot’s end-effector postures and a partial point cloud of the environment and as output the robot’s actions, such as end-effector velocity and gripper commands. In contrast to most previous work, the researchers used SIM(3)-equivariant network architectures for their neural networks. This means that the output end-effector velocities will adjust in kind when the input point cloud and end-effector positions are translated and rotated. Since their policy architecture is equivariant, it can learn from demonstrations of smaller-scale tabletop activities and then zero-shot generalize to mobile manipulation tasks involving larger variations of the demonstrated objects with distinct visual and physical appearances.

This approach is split into two parts: learning the representation and the policy. To train the agent’s representations, the team first provides it with a set of synthetic point clouds that were captured using the same camera and settings as the target task’s objects but with a different random nonuniform scale. They supplemented the training data in this way to accommodate for nonuniform scaling, even if the suggested architecture is equivariant to uniform scaling. The simulated data doesn’t have to show robot activities or even demonstrate the actual task. To extract global and local features from the scene point cloud, they employ the simulated data to train a SIM(3)-equivariant encoder-decoder architecture. During training, a contrastive learning loss was used on paired point cloud inputs to combine local features for related object sections of objects in similar positions. During the policy-learning phase, it was presumed that access to a sample of previously-verified task trajectories is limited.

The researchers use data to train a closed-loop policy that, given a partial point cloud of the scene as input, uses a previously learned encoder to extract global and local features from the point cloud and then feeds those features into a SIM(3)-equivariant action prediction network to predict end effector movements. Beyond the standard rigid object manipulation tasks of previous work, the proposed method is evaluated on the more complex tasks of comforter folding, container covering, and box sealing.

The team presents many human examples in which a person manipulates a tabletop object for each activity. After demonstrating the method, they assessed it on a mobile manipulation platform, where the robots will have to solve the same problem on a much grander scale. Findings show that this method is capable of learning a closed-loop robot manipulation policy from the source manipulation demos and executing the target job in a single run without any need for fine-tuning. It is further demonstrated that the approach is more efficient than that and relies on significant augmentations for generalization to out-of-distribution object poses and scales. It also outperforms works that do not exploit equivariance.

Check out the Paper and Project. All Credit For This Research Goes To the Researchers on This Project. Also, don’t forget to join our 32k+ ML SubReddit, 40k+ Facebook Community, Discord Channel, and Email Newsletter, where we share the latest AI research news, cool AI projects, and more.

If you like our work, you will love our newsletter..

We are also on Telegram and WhatsApp.

Dhanshree Shenwai is a Computer Science Engineer and has a good experience in FinTech companies covering Financial, Cards & Payments and Banking domain with keen interest in applications of AI. She is enthusiastic about exploring new technologies and advancements in today’s evolving world making everyone’s life easy.

🔥 Meet Retouch4me: A Family of Artificial Intelligence-Powered Plug-Ins for Photography Retouching



Source link

Tags: BreakthroughEquivActGeneralizingLearningOrientationsProposeResearchersrobotScalesStanfordtasks
Previous Post

Automation Techniques in C++ Reverse Engineering

Next Post

SAP Cloud Application Programming Model resources with Iwona Hahn

Related Posts

How insurance companies can use synthetic data to fight bias
AI Technology

How insurance companies can use synthetic data to fight bias

June 10, 2024
From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset
AI Technology

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

June 10, 2024
Decoding Decoder-Only Transformers: Insights from Google DeepMind’s Paper
AI Technology

Decoding Decoder-Only Transformers: Insights from Google DeepMind’s Paper

June 9, 2024
How Game Theory Can Make AI More Reliable
AI Technology

How Game Theory Can Make AI More Reliable

June 9, 2024
Buffer of Thoughts (BoT): A Novel Thought-Augmented Reasoning AI Approach for Enhancing Accuracy, Efficiency, and Robustness of LLMs
AI Technology

Buffer of Thoughts (BoT): A Novel Thought-Augmented Reasoning AI Approach for Enhancing Accuracy, Efficiency, and Robustness of LLMs

June 9, 2024
Deciphering Doubt: Navigating Uncertainty in LLM Responses
AI Technology

Deciphering Doubt: Navigating Uncertainty in LLM Responses

June 9, 2024
Next Post
SAP Cloud Application Programming Model resources with Iwona Hahn

SAP Cloud Application Programming Model resources with Iwona Hahn

1 Min Business News (Daily) | OpenAI CEO on A.I Chat #breakingnews #business #news #trending #shorts

1 Min Business News (Daily) | OpenAI CEO on A.I Chat #breakingnews #business #news #trending #shorts

Why React.js is taking a new direction

Why React.js is taking a new direction

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

  • Trending
  • Comments
  • Latest
Is C.AI Down? Here Is What To Do Now

Is C.AI Down? Here Is What To Do Now

January 10, 2024
23 Plagiarism Facts and Statistics to Analyze Latest Trends

23 Plagiarism Facts and Statistics to Analyze Latest Trends

June 4, 2024
A faster, better way to prevent an AI chatbot from giving toxic responses | MIT News

A faster, better way to prevent an AI chatbot from giving toxic responses | MIT News

April 10, 2024
Porfo: Revolutionizing the Crypto Wallet Landscape

Porfo: Revolutionizing the Crypto Wallet Landscape

October 9, 2023
Implementing User Authentication in React Apps with Appwrite — SitePoint

Implementing User Authentication in React Apps with Appwrite — SitePoint

January 30, 2024
NousResearch Released Nous-Hermes-2-Mixtral-8x7B: An Open-Source LLM with SFT and DPO Versions

NousResearch Released Nous-Hermes-2-Mixtral-8x7B: An Open-Source LLM with SFT and DPO Versions

January 25, 2024
Can You Guess What Percentage Of Their Wealth The Rich Keep In Cash?

Can You Guess What Percentage Of Their Wealth The Rich Keep In Cash?

June 10, 2024
AI Compared: Which Assistant Is the Best?

AI Compared: Which Assistant Is the Best?

June 10, 2024
How insurance companies can use synthetic data to fight bias

How insurance companies can use synthetic data to fight bias

June 10, 2024
5 SLA metrics you should be monitoring

5 SLA metrics you should be monitoring

June 10, 2024
From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

June 10, 2024
UGRO Capital: Targeting to hit milestone of Rs 20,000 cr loan book in 8-10 quarters: Shachindra Nath

UGRO Capital: Targeting to hit milestone of Rs 20,000 cr loan book in 8-10 quarters: Shachindra Nath

June 10, 2024
Facebook Twitter LinkedIn Pinterest RSS
News PouroverAI

The latest news and updates about the AI Technology and Latest Tech Updates around the world... PouroverAI keeps you in the loop.

CATEGORIES

  • AI Technology
  • Automation
  • Blockchain
  • Business
  • Cloud & Programming
  • Data Science & ML
  • Digital Marketing
  • Front-Tech
  • Uncategorized

SITEMAP

  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us

Copyright © 2023 PouroverAI News.
PouroverAI News

No Result
View All Result
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing

Copyright © 2023 PouroverAI News.
PouroverAI News

Welcome Back!

Login to your account below

Forgotten Password? Sign Up

Create New Account!

Fill the forms bellow to register

All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In