Sunday, June 8, 2025
News PouroverAI
Visit PourOver.AI
No Result
View All Result
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing
News PouroverAI
No Result
View All Result

Outperforming and boosting large multi-task language models with a small scorer – Google Research Blog

March 14, 2024
in AI Technology
Reading Time: 2 mins read
0 0
A A
0
Share on FacebookShare on Twitter



In a blog post by Yun Zhu and Lijuan Liu, both Software Engineers at Google Research, they discuss the advancements in Large Language Models (LLMs) and how they have led to a new paradigm that unifies various natural language processing (NLP) tasks within an instruction-following framework. This new paradigm is exemplified by recent multi-task LLMs such as T0, FLAN, and OPT-IML.

The process begins with the gathering of multi-task data, where each task follows a task-specific template. Each labeled example is converted into an instruction paired with a corresponding response. These instruction-response pairs are used to train the LLM, resulting in a conditional generation model that takes an instruction as input and generates a response.

Multi-task LLMs have shown remarkable task-wise generalization capabilities, allowing them to address unseen tasks by understanding and solving brand-new instructions. The demonstration of instruction-following pre-training of multi-task LLMs, like FLAN, has shown improved performance for unseen tasks.

Due to the complexity of understanding and solving various tasks solely using instructions, multi-task LLMs typically have a large number of parameters, ranging from several billion to hundreds of billions. Operating such sizable models poses challenges as they require significant computational power and memory capacities, making training and inference expensive and inefficient.

To address these challenges, the engineers propose a novel approach called Cappy. Cappy is a lightweight pre-trained scorer with only 360 million parameters. It takes an instruction and a candidate response as input and produces a score between 0 and 1, indicating the estimated correctness of the response with respect to the instruction. Cappy can function independently on classification tasks or serve as an auxiliary component for LLMs, boosting their performance.

Cappy enables downstream supervision without requiring fine-tuning, reducing memory requirements and avoiding the need for back-propagation through LLM parameters. It can be adapted with closed-source multi-task LLMs and is compatible with WebAPIs.

The engineers conducted pre-training on Cappy using a diverse dataset collection and a pre-training data instance that includes an instruction-response pair with a correctness annotation. They used Rouge-L as a metric for measuring similarity between responses for weak supervision. The continuous pre-training of Cappy on top of the RoBERTa model was conducted on Google’s TPU-v4.

Cappy can be applied to solve practical tasks within a candidate-selection mechanism, providing scores for candidate responses based on the given instruction. It can be fine-tuned to integrate downstream task information into LLM predictions, boosting performance on downstream tasks.

Overall, Cappy with its scoring-based pre-training strategy has shown to outperform existing multi-task LLMs in terms of performance and parameter efficiency. It provides a new approach to adapting LLMs with downstream supervision, reducing memory requirements and improving overall performance on complex tasks.



Source link

Tags: BlogBoostingGooglelanguageLargemodelsMultiTaskOutperformingResearchScorerSmall
Previous Post

How to Record a Video Presentation With Google Slides

Next Post

The journey of PGA TOUR’s generative AI virtual assistant, from concept to development to prototype

Related Posts

How insurance companies can use synthetic data to fight bias
AI Technology

How insurance companies can use synthetic data to fight bias

June 10, 2024
From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset
AI Technology

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

June 10, 2024
How Game Theory Can Make AI More Reliable
AI Technology

How Game Theory Can Make AI More Reliable

June 9, 2024
Decoding Decoder-Only Transformers: Insights from Google DeepMind’s Paper
AI Technology

Decoding Decoder-Only Transformers: Insights from Google DeepMind’s Paper

June 9, 2024
Buffer of Thoughts (BoT): A Novel Thought-Augmented Reasoning AI Approach for Enhancing Accuracy, Efficiency, and Robustness of LLMs
AI Technology

Buffer of Thoughts (BoT): A Novel Thought-Augmented Reasoning AI Approach for Enhancing Accuracy, Efficiency, and Robustness of LLMs

June 9, 2024
Deciphering Doubt: Navigating Uncertainty in LLM Responses
AI Technology

Deciphering Doubt: Navigating Uncertainty in LLM Responses

June 9, 2024
Next Post
The journey of PGA TOUR’s generative AI virtual assistant, from concept to development to prototype

The journey of PGA TOUR’s generative AI virtual assistant, from concept to development to prototype

New York says first U.S. utility-scale offshore wind farm starts operations (NYSE:ES)

New York says first U.S. utility-scale offshore wind farm starts operations (NYSE:ES)

Designing RAGs. A guide to Retrieval-Augmented… | by Michał Oleszak | Mar, 2024

Designing RAGs. A guide to Retrieval-Augmented… | by Michał Oleszak | Mar, 2024

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

  • Trending
  • Comments
  • Latest
23 Plagiarism Facts and Statistics to Analyze Latest Trends

23 Plagiarism Facts and Statistics to Analyze Latest Trends

June 4, 2024
Accenture creates a regulatory document authoring solution using AWS generative AI services

Accenture creates a regulatory document authoring solution using AWS generative AI services

February 6, 2024
Managing PDFs in Node.js with pdf-lib

Managing PDFs in Node.js with pdf-lib

November 16, 2023
Graph neural networks in TensorFlow – Google Research Blog

Graph neural networks in TensorFlow – Google Research Blog

February 6, 2024
13 Best Books, Courses and Communities for Learning React — SitePoint

13 Best Books, Courses and Communities for Learning React — SitePoint

February 4, 2024
From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

June 10, 2024
Can You Guess What Percentage Of Their Wealth The Rich Keep In Cash?

Can You Guess What Percentage Of Their Wealth The Rich Keep In Cash?

June 10, 2024
AI Compared: Which Assistant Is the Best?

AI Compared: Which Assistant Is the Best?

June 10, 2024
How insurance companies can use synthetic data to fight bias

How insurance companies can use synthetic data to fight bias

June 10, 2024
5 SLA metrics you should be monitoring

5 SLA metrics you should be monitoring

June 10, 2024
From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

June 10, 2024
UGRO Capital: Targeting to hit milestone of Rs 20,000 cr loan book in 8-10 quarters: Shachindra Nath

UGRO Capital: Targeting to hit milestone of Rs 20,000 cr loan book in 8-10 quarters: Shachindra Nath

June 10, 2024
Facebook Twitter LinkedIn Pinterest RSS
News PouroverAI

The latest news and updates about the AI Technology and Latest Tech Updates around the world... PouroverAI keeps you in the loop.

CATEGORIES

  • AI Technology
  • Automation
  • Blockchain
  • Business
  • Cloud & Programming
  • Data Science & ML
  • Digital Marketing
  • Front-Tech
  • Uncategorized

SITEMAP

  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us

Copyright © 2023 PouroverAI News.
PouroverAI News

No Result
View All Result
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing

Copyright © 2023 PouroverAI News.
PouroverAI News

Welcome Back!

Login to your account below

Forgotten Password? Sign Up

Create New Account!

Fill the forms bellow to register

All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In