News PouroverAI

Microsoft AI Research Proposes HMD-NeMo: A New Approach to Plausible and Accurate Full-Body Motion Generation Even When the Hands Are Only Partially Visible

November 14, 2023
in AI Technology

Generating accurate and plausible full-body avatar motion has been a persistent challenge for immersive mixed-reality experiences. Existing solutions that rely on Head-Mounted Devices (HMDs) typically work from limited input signals, such as the 6-DoF (six degrees of freedom) poses of the head and hands. While recent work has demonstrated impressive performance in generating full-body motion from head and hand signals, these methods share a common limitation: they assume the hands are always fully visible. This assumption holds when motion controllers are used, but it breaks down in many mixed-reality experiences where hand tracking relies on egocentric sensors, because the restricted field of view of the HMD means the hands are often only partially visible.

Researchers from the Microsoft Mixed Reality & AI Lab in Cambridge, UK, have introduced HMD-NeMo (HMD Neural Motion Model), a unified neural network that generates plausible and accurate full-body motion even when the hands are only partially visible. HMD-NeMo runs online and in real time, making it suitable for dynamic mixed-reality scenarios.

At the core of HMD-NeMo lies a spatiotemporal encoder featuring novel temporally adaptable mask tokens (TAMT). These tokens play a crucial role in encouraging plausible motion in the absence of hand observations. The approach incorporates recurrent neural networks to capture temporal information efficiently and a transformer to model complex relations between different input signal components.
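To make the idea of a mask token that adapts over time more concrete, here is a toy sketch in plain Python. It is not the paper's mechanism: in HMD-NeMo the tokens are part of a trained network, whereas the update rule below (hold the last observed hand feature, then decay toward a default token while the hand stays missing) is an assumption chosen only for illustration. The function name `fill_missing_with_adaptive_token` and the `decay` parameter are hypothetical.

```python
from typing import List, Optional, Sequence

Vector = List[float]


def fill_missing_with_adaptive_token(
    observations: Sequence[Optional[Vector]],
    initial_token: Vector,
    decay: float = 0.5,
) -> List[Vector]:
    """Replace missing hand features (None) with a token that changes over time.

    Hypothetical rule, for illustration only:
    - when the hand is observed, the token is reset to the observed feature;
    - when the hand is missing, the token decays toward `initial_token`.
    """
    token = list(initial_token)
    filled: List[Vector] = []
    for obs in observations:
        if obs is not None:
            # Hand visible: pass the observation through and remember it.
            token = list(obs)
        else:
            # Hand missing: blend the remembered token with the default one.
            token = [
                decay * t + (1.0 - decay) * t0
                for t, t0 in zip(token, initial_token)
            ]
        filled.append(list(token))
    return filled


if __name__ == "__main__":
    sequence = [[1.0, 2.0], None, None, [3.0, 4.0]]
    print(fill_missing_with_adaptive_token(sequence, [0.0, 0.0]))
```

The point of the sketch is only that the stand-in for a missing hand depends on what was seen recently, rather than being a single fixed placeholder for every frame.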

The paper considers two evaluation scenarios: Motion Controllers (MC), where the hands are tracked with motion controllers, and Hand Tracking (HT), where the hands are tracked via egocentric hand-tracking sensors. According to the paper, HMD-NeMo is the first approach capable of handling both scenarios within a unified framework. In the HT scenario, where the hands may be partially or entirely out of the field of view, the temporally adaptable mask tokens help maintain temporal coherence.
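The difference between the two scenarios can be sketched as a visibility test on the hand position. The snippet below is a simplified illustration, not the paper's procedure: it assumes a cone-shaped field of view, a head frame whose +z axis points forward, and a hypothetical half-angle of 45 degrees. In the MC scenario the hand signal would always be available; in the HT scenario it is dropped whenever the hand leaves the cone.

```python
import math
from typing import Optional, Tuple

Vec3 = Tuple[float, float, float]


def hand_in_view(hand_in_head_frame: Vec3, half_fov_deg: float = 45.0) -> bool:
    """Return True if the hand lies inside a forward-facing viewing cone.

    Assumptions (illustrative only): the position is expressed in the head
    frame, +z is the viewing direction, and the field of view is a cone.
    """
    x, y, z = hand_in_head_frame
    dist = math.sqrt(x * x + y * y + z * z)
    if dist == 0.0 or z <= 0.0:
        # Degenerate position, or hand behind the viewing plane.
        return False
    # Angle between the forward axis and the direction to the hand.
    angle_deg = math.degrees(math.acos(min(1.0, z / dist)))
    return angle_deg <= half_fov_deg


def mask_hand(hand_in_head_frame: Vec3, half_fov_deg: float = 45.0) -> Optional[Vec3]:
    """Simulate hand tracking: keep the hand if visible, otherwise return None."""
    if hand_in_view(hand_in_head_frame, half_fov_deg):
        return hand_in_head_frame
    return None


if __name__ == "__main__":
    print(mask_hand((0.2, -0.1, 0.5)))  # in front of the head, inside the cone
    print(mask_hand((1.0, 0.0, 0.1)))   # far off to the side, outside the cone
```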

The proposed method is trained with a loss function that combines terms for data accuracy and smoothness with auxiliary tasks for human pose reconstruction in SE(3). The experiments include extensive evaluations on the AMASS dataset, a large collection of human motion sequences converted into 3D human meshes. Mean per-joint position error (MPJPE) and mean per-joint velocity error (MPJVE) are used to assess the performance of HMD-NeMo.
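As a rough guide to what the two metrics measure, the following sketch computes them in plain Python under their usual definitions: MPJPE averages the Euclidean distance between predicted and ground-truth joint positions over all joints and frames, and MPJVE does the same for joint velocities. Approximating velocity by finite differences between consecutive frames, and the `fps` scaling parameter, are assumptions here; the paper's exact units and conventions are not given in this article.

```python
import math
from typing import List, Sequence

Joint = Sequence[float]   # one 3D joint position (x, y, z)
Frame = Sequence[Joint]   # all joints at one time step
Motion = Sequence[Frame]  # a sequence of frames


def _distance(a: Joint, b: Joint) -> float:
    """Euclidean distance between two points."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


def mpjpe(pred: Motion, gt: Motion) -> float:
    """Mean per-joint position error over all frames and joints."""
    errors = [
        _distance(p, g)
        for pred_frame, gt_frame in zip(pred, gt)
        for p, g in zip(pred_frame, gt_frame)
    ]
    if not errors:
        raise ValueError("need at least one frame with one joint")
    return sum(errors) / len(errors)


def _velocities(motion: Motion, fps: float) -> List[List[List[float]]]:
    """Finite-difference joint velocities between consecutive frames."""
    return [
        [[(b - a) * fps for a, b in zip(j0, j1)] for j0, j1 in zip(f0, f1)]
        for f0, f1 in zip(motion, motion[1:])
    ]


def mpjve(pred: Motion, gt: Motion, fps: float = 1.0) -> float:
    """Mean per-joint velocity error (needs at least two frames)."""
    return mpjpe(_velocities(pred, fps), _velocities(gt, fps))


if __name__ == "__main__":
    gt = [[[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]], [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]]]
    pred = [[[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]], [[3.0, 4.0, 0.0], [1.0, 0.0, 0.0]]]
    print(mpjpe(pred, gt), mpjve(pred, gt))
```

A low MPJPE with a high MPJVE would indicate poses that are close on average but jittery from frame to frame, which is why both metrics are typically reported together.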

Comparisons with state-of-the-art approaches in the motion controller scenario reveal that HMD-NeMo achieves superior accuracy and smoother motion generation. Furthermore, the model’s generalizability is demonstrated through cross-dataset evaluations, outperforming existing methods on multiple datasets.

Ablation studies delve into the impact of different components, including the effectiveness of the TAMT module in handling missing hand observations. The study shows that HMD-NeMo’s design choices, such as the spatiotemporal encoder, contribute significantly to its success.

In conclusion, HMD-NeMo represents a significant step forward in addressing the challenges of generating full-body avatar motion in mixed-reality scenarios. Its versatility in handling both motion controller and hand tracking scenarios, coupled with its impressive performance metrics, positions it as a pioneering solution in the field.

Check out the Paper. All credit for this research goes to the researchers of this project.

Pragati Jhunjhunwala is a consulting intern at MarktechPost. She is currently pursuing her B.Tech at the Indian Institute of Technology (IIT), Kharagpur. She is a tech enthusiast with a keen interest in software and data science applications, and she follows developments across different fields of AI and ML.
