Sunday, June 29, 2025
News PouroverAI
Visit PourOver.AI
No Result
View All Result
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing
News PouroverAI
No Result
View All Result

Microsoft Releases Orca 2: Pioneering Advanced Reasoning in Smaller Language Models with Tailored Training Strategies

November 27, 2023
in AI Technology
Reading Time: 4 mins read
0 0
A A
0
Share on FacebookShare on Twitter


LLMs (Large Language Models) are trained on vast volumes of textual data to comprehend and produce language similar to that of humans. The GPT-3, GPT-4, and PaLM-2 are few examples. These models perform complex language tasks, including text generation, conversational interaction, and question answering. They have been used in various domains, enhancing user experiences in chatbots, coding, web search, customer support, and content production.

However, as the AI community delves into the vast landscape of smaller models, Microsoft has introduced the next version of Orca called Orca 2, designed to amplify the capacities of compact AI models. Orca 1, through the integration of detailed explanation, traces, surpasses traditional instruction-tuned models in performance on challenging benchmarks like BigBench Hard and AGIEval. Orca 2 further delves into the potential of enhanced training signals to boost the reasoning capabilities of smaller language models

Imitation learning has been a prevalent approach in refining small language models. These smaller models often need to catch up in reasoning and comprehension skills, even though they can produce content in a manner akin to that of their teachers. Although imitation learning has some benefits, it has drawbacks that may limit smaller models’ ability to reach their full potential and prevent them from using the best possible solutions given the particular problem and the model’s capabilities. They often need help matching their larger counterparts’ reasoning and comprehension skills, hindering their full potential.

Instead of simply imitating, Orca instructs the model in various reasoning techniques. These include step-by-step processing, recall then generate, recall-reason-generate, and direct answers. The objective is to guide the model in acquiring the ability to discern the most effective solution strategy tailored to the nuances of each specific task.

Orca 2’s zero-shot reasoning ability highlights the possibility of improving smaller neural networks. Microsoft continues to believe that specialized training methods, like the one used for Orca 2, may reveal new useful applications. This method seeks to improve the effectiveness of these neural network deployments.

Most importantly, Orca 2 is protected from the initial cues that elicited particular behaviors during the training phase. Orca 2 transforms into a Cautious Reasoner through the innovative Prompt Erasure technique. Unlike blind imitation, this method uses larger models as a source of behaviors from which the best ones are chosen for the given task.

The researchers tested Orca 2 on comprehensive benchmarks. They showed that it outperforms other equivalent models related to language understanding, common sense reasoning, multi-step math problems, reading comprehension, summarization, and more. For instance, on zero-shot reasoning tasks, Orca 2-13B achieves over 25% higher accuracy than comparable 13B models and is on par with a 70B model.

Orca 2 marks a significant stride in the evolution of small language models. Its departure from conventional imitation learning, coupled with a focus on teaching diverse reasoning techniques, showcases a new approach to unleashing the potential of compact AI models.

Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to join our 33k+ ML SubReddit, 41k+ Facebook Community, Discord Channel, and Email Newsletter, where we share the latest AI research news, cool AI projects, and more.

If you like our work, you will love our newsletter..

Rachit Ranjan is a consulting intern at MarktechPost . He is currently pursuing his B.Tech from Indian Institute of Technology(IIT) Patna . He is actively shaping his career in the field of Artificial Intelligence and Data Science and is passionate and dedicated for exploring these fields.

↗ Step by Step Tutorial on ‘How to Build LLM Apps that can See Hear Speak’



Source link

Tags: advancedlanguageMicrosoftmodelsOrcaPioneeringReasoningReleasessmallerstrategiesTailoredtraining
Previous Post

Part 5 – Cloud Application Programming Model (Custom handlers, bcrypt)

Next Post

Crypto Revolution: Mass Adoption Is Coming! (Crypto News 2023)

Related Posts

How insurance companies can use synthetic data to fight bias
AI Technology

How insurance companies can use synthetic data to fight bias

June 10, 2024
From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset
AI Technology

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

June 10, 2024
Decoding Decoder-Only Transformers: Insights from Google DeepMind’s Paper
AI Technology

Decoding Decoder-Only Transformers: Insights from Google DeepMind’s Paper

June 9, 2024
How Game Theory Can Make AI More Reliable
AI Technology

How Game Theory Can Make AI More Reliable

June 9, 2024
Buffer of Thoughts (BoT): A Novel Thought-Augmented Reasoning AI Approach for Enhancing Accuracy, Efficiency, and Robustness of LLMs
AI Technology

Buffer of Thoughts (BoT): A Novel Thought-Augmented Reasoning AI Approach for Enhancing Accuracy, Efficiency, and Robustness of LLMs

June 9, 2024
Deciphering Doubt: Navigating Uncertainty in LLM Responses
AI Technology

Deciphering Doubt: Navigating Uncertainty in LLM Responses

June 9, 2024
Next Post
Crypto Revolution: Mass Adoption Is Coming! (Crypto News 2023)

Crypto Revolution: Mass Adoption Is Coming! (Crypto News 2023)

Israel enlists drones, AI and big data to farm for the future | AFP

Israel enlists drones, AI and big data to farm for the future | AFP

6 Benefits of Data-Driven Project Portfolio Management (PPM) Software

6 Benefits of Data-Driven Project Portfolio Management (PPM) Software

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

  • Trending
  • Comments
  • Latest
23 Plagiarism Facts and Statistics to Analyze Latest Trends

23 Plagiarism Facts and Statistics to Analyze Latest Trends

June 4, 2024
How ‘Chain of Thought’ Makes Transformers Smarter

How ‘Chain of Thought’ Makes Transformers Smarter

May 13, 2024
Amazon’s Bedrock and Titan Generative AI Services Enter General Availability

Amazon’s Bedrock and Titan Generative AI Services Enter General Availability

October 2, 2023
The Importance of Choosing a Reliable Affiliate Network and Why Olavivo is Your Ideal Partner

The Importance of Choosing a Reliable Affiliate Network and Why Olavivo is Your Ideal Partner

October 30, 2023
Is C.AI Down? Here Is What To Do Now

Is C.AI Down? Here Is What To Do Now

January 10, 2024
Managing PDFs in Node.js with pdf-lib

Managing PDFs in Node.js with pdf-lib

November 16, 2023
Can You Guess What Percentage Of Their Wealth The Rich Keep In Cash?

Can You Guess What Percentage Of Their Wealth The Rich Keep In Cash?

June 10, 2024
AI Compared: Which Assistant Is the Best?

AI Compared: Which Assistant Is the Best?

June 10, 2024
How insurance companies can use synthetic data to fight bias

How insurance companies can use synthetic data to fight bias

June 10, 2024
5 SLA metrics you should be monitoring

5 SLA metrics you should be monitoring

June 10, 2024
From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

June 10, 2024
UGRO Capital: Targeting to hit milestone of Rs 20,000 cr loan book in 8-10 quarters: Shachindra Nath

UGRO Capital: Targeting to hit milestone of Rs 20,000 cr loan book in 8-10 quarters: Shachindra Nath

June 10, 2024
Facebook Twitter LinkedIn Pinterest RSS
News PouroverAI

The latest news and updates about the AI Technology and Latest Tech Updates around the world... PouroverAI keeps you in the loop.

CATEGORIES

  • AI Technology
  • Automation
  • Blockchain
  • Business
  • Cloud & Programming
  • Data Science & ML
  • Digital Marketing
  • Front-Tech
  • Uncategorized

SITEMAP

  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us

Copyright © 2023 PouroverAI News.
PouroverAI News

No Result
View All Result
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing

Copyright © 2023 PouroverAI News.
PouroverAI News

Welcome Back!

Login to your account below

Forgotten Password? Sign Up

Create New Account!

Fill the forms bellow to register

All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In