Friday, May 9, 2025
News PouroverAI

Microsoft AI Research Introduces SIGMA: An Open-Source Research Platform to Enable Research and Innovation at the Intersection of Mixed Reality and AI

May 7, 2024
in AI Technology
Reading Time: 4 mins read


Recent breakthroughs in generative AI and in large language, vision, and multimodal models can serve as a foundation for open-domain knowledge, inference, and generation capabilities, enabling open-ended task assistance scenarios. The capacity to produce pertinent instructions and content is just the beginning of what is needed to build AI systems that work with humans in the real world, such as mixed-reality task assistants, interactive robots, smart manufacturing floors, and autonomous vehicles.

To work seamlessly with humans in the real world, AI systems must continuously perceive their environment and reason about it multimodally over streaming data. This requirement extends beyond object detection and tracking. For physical collaboration to succeed, a system must understand what objects can be used for, how they relate to one another, which spatial constraints apply, and how all of these factors change over time.

These systems must be able to reason not only about the physical world but also about humans. In addition to lower-level judgments about body pose, speech, and actions, this reasoning should include higher-level inferences about cognitive states and the social norms of real-time collaborative behavior.

Using a combination of mixed-reality and AI technologies, including large language and vision models, Microsoft Research introduces SIGMA, an interactive application that uses HoloLens 2 to guide users through procedural tasks. Tasks can come from a task library of manually defined steps or be generated dynamically by a large language model such as GPT-4. When a user asks SIGMA an open-ended question during the interaction, the system can use its large language model to provide an answer. In addition, SIGMA can locate and highlight task-relevant objects in the user’s field of view using vision models such as Detic and SEEM.
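The two task sources described above (a library of manually defined steps, or generation by a language model) can be sketched as a lookup with a fallback. This is only an illustration of the idea, not SIGMA’s code: the task library contents, the function names, and the `fake_llm` stand-in are all hypothetical, and no real model is called.

```python
from typing import Callable, Dict, List

# Hypothetical task library: task name -> manually defined steps.
TASK_LIBRARY: Dict[str, List[str]] = {
    "make pour-over coffee": [
        "Boil water",
        "Place the filter in the dripper",
        "Add ground coffee",
        "Pour water slowly over the grounds",
    ],
}


def get_task_steps(
    task_name: str, generate_steps: Callable[[str], List[str]]
) -> List[str]:
    """Return predefined steps if the task is in the library,
    otherwise delegate to a step generator (e.g. a language model)."""
    key = task_name.strip().lower()
    if key in TASK_LIBRARY:
        return list(TASK_LIBRARY[key])  # copy so callers cannot mutate the library
    return generate_steps(task_name)


def fake_llm(task_name: str) -> List[str]:
    # Stand-in for a large language model call; returns a canned answer.
    return [f"Step 1 for {task_name}", f"Step 2 for {task_name}"]


if __name__ == "__main__":
    print(get_task_steps("Make pour-over coffee", fake_llm))
    print(get_task_steps("fix a bike", fake_llm))
```

Passing the generator in as a parameter keeps the lookup logic testable without access to a model.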

Several design choices support these research goals. For example, the system is implemented as a client-server architecture. The HoloLens 2 device runs a lightweight client application that transmits multiple multimodal data streams to a more powerful desktop server. These streams include RGB (red, green, and blue), depth, audio, and head, hand, and gaze tracking information. The desktop server implements the application’s core functionality and sends data and instructions back to the client application about what to display on the device. This design lets researchers work around the headset’s current compute limits and opens the door to extending the application to other mixed-reality devices.
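The division of labor in this architecture can be illustrated with a small in-process sketch: a thin client that only captures sensor data and renders what it is told, and a server that holds the application logic. All class and field names here are hypothetical, the sensor payloads are empty placeholders, and the "next" voice command is an invented trigger; SIGMA’s actual networking and data formats are not shown.

```python
from dataclasses import dataclass, field
from typing import Dict, List

# Stream names taken from the article; payloads below are placeholders.
STREAMS = ("rgb", "depth", "audio", "head", "hand", "gaze")


@dataclass
class SensorFrame:
    timestamp: float
    streams: Dict[str, bytes]


@dataclass
class RenderCommand:
    timestamp: float
    text: str


@dataclass
class Client:
    """Lightweight client: captures sensor data and displays server output."""
    displayed: List[str] = field(default_factory=list)

    def capture(self, timestamp: float, audio: bytes = b"") -> SensorFrame:
        streams = {name: b"" for name in STREAMS}
        streams["audio"] = audio
        return SensorFrame(timestamp, streams)

    def render(self, command: RenderCommand) -> None:
        self.displayed.append(command.text)


class Server:
    """Desktop server: owns the task state and decides what to display."""

    def __init__(self, steps: List[str]) -> None:
        self.steps = steps
        self.index = 0

    def process(self, frame: SensorFrame) -> RenderCommand:
        # Placeholder for the heavy models; here a spoken "next" advances the task.
        if frame.streams["audio"] == b"next" and self.index < len(self.steps) - 1:
            self.index += 1
        return RenderCommand(frame.timestamp, self.steps[self.index])


def run_demo() -> List[str]:
    client, server = Client(), Server(["Step A", "Step B"])
    for timestamp, audio in [(0.0, b""), (0.1, b"next")]:
        client.render(server.process(client.capture(timestamp, audio)))
    return client.displayed


if __name__ == "__main__":
    print(run_demo())
```

Because the client knows nothing about tasks or models, the same server logic could in principle drive a different headset that implements `capture` and `render`.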

SIGMA is built on Platform for Situated Intelligence (\psi), an open-source framework for developing and researching multimodal integrative AI systems. The underlying \psi framework provides performant streaming and logging infrastructure and supports fast prototyping. Its data replay infrastructure enables data-driven development and tuning at the application level. Finally, Platform for Situated Intelligence Studio offers extensive support for visualization, debugging, tuning, and maintenance.
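The value of logging plus replay for tuning can be shown with a generic sketch: timestamped messages are written to a file once, then replayed in time order as often as needed while a parameter is adjusted offline. This does not use the \psi API; the JSON-lines log format, the function names, and the threshold example are assumptions made for illustration.

```python
import json
from typing import Iterable, Iterator, Tuple

Message = Tuple[float, str, float]  # (timestamp, stream name, payload)


def write_log(path: str, messages: Iterable[Message]) -> None:
    """Persist messages as one JSON object per line."""
    with open(path, "w", encoding="utf-8") as f:
        for timestamp, stream, payload in messages:
            record = {"t": timestamp, "stream": stream, "data": payload}
            f.write(json.dumps(record) + "\n")


def replay(path: str) -> Iterator[Message]:
    """Yield logged messages in timestamp order, regardless of write order."""
    with open(path, encoding="utf-8") as f:
        records = [json.loads(line) for line in f if line.strip()]
    for record in sorted(records, key=lambda r: r["t"]):
        yield record["t"], record["stream"], record["data"]


def count_above(path: str, stream: str, threshold: float) -> int:
    """Count replayed messages on one stream whose payload exceeds a threshold."""
    return sum(
        1 for _, name, value in replay(path) if name == stream and value > threshold
    )


if __name__ == "__main__":
    import os
    import tempfile

    log_path = os.path.join(tempfile.mkdtemp(), "session.jsonl")
    write_log(log_path, [(0.2, "gaze", 0.9), (0.1, "gaze", 0.4), (0.3, "audio", 0.7)])
    # Try several thresholds against the same recorded session.
    for threshold in (0.3, 0.5):
        print(threshold, count_above(log_path, "gaze", threshold))
```

Replaying a fixed recording means each threshold is evaluated on identical input, which is hard to guarantee in live sessions.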

While SIGMA’s current functionality is relatively simple, it serves as a foundation for future research at the intersection of mixed reality and artificial intelligence. Many research problems, particularly in perception, can be and have been explored using collected datasets. These problems range from computer vision to speech recognition.

SIGMA is a research platform, and one example of Microsoft’s ongoing efforts to explore new AI and mixed-reality technologies. Microsoft also offers Dynamics 365 Guides, an enterprise-ready mixed-reality solution for frontline workers who perform complex operations. Copilot in Dynamics 365 Guides, which customers are currently using in private preview, combines AI and mixed reality to give frontline workers step-by-step procedural guidance and relevant information in the flow of work.

By making the system publicly available, the researchers hope to spare other researchers the foundational engineering work of building a full-stack interactive application, so they can move straight to the open research questions in their field.

Check out the Details and Project. All credit for this research goes to the researchers of this project.

Dhanshree Shenwai is a computer science engineer with experience at FinTech companies in the financial, cards and payments, and banking domains, and a keen interest in applications of AI. She is enthusiastic about exploring new technologies and advancements that make everyone’s life easier.


Tags: Enable, Innovation, Intersection, Introduces, Microsoft, mixed, OpenSource, Platform, Reality, Research, Sigma