News PouroverAI

CMU AI Researchers Unveil TOFU: A Groundbreaking Machine Learning Benchmark for Data Unlearning in Large Language Models

January 15, 2024
in AI Technology

Large language models (LLMs) are trained on vast amounts of web data, which can lead them to memorize and reproduce sensitive or private information unintentionally. This raises significant legal and ethical concerns, particularly when a model discloses personal details and so violates individual privacy. The concept of unlearning has emerged in response: modifying a model after training so that it deliberately ‘forgets’ certain elements of its training data.

The central problem is how to remove sensitive information from an LLM without retraining it from scratch, since full retraining is both costly and impractical. Unlearning aims to make a model forget specific data, thereby protecting private information. Evaluating whether unlearning has worked is difficult, however, because generative models are complex and it is hard to define what it means for information to be forgotten.

Recent studies have focused on unlearning in classification models. Generative models such as LLMs need the same attention, because they are more prevalent in real-world applications and pose a greater threat to individual privacy. To address this need, researchers from Carnegie Mellon University introduced the TOFU (Task of Fictitious Unlearning) benchmark. It consists of a dataset of 200 synthetic author profiles, each with 20 question-answer pairs, and a subset of these profiles, known as the ‘forget set’, that is targeted for unlearning. Because the dataset was designed specifically for this purpose, TOFU allows a controlled evaluation of unlearning at various levels of task severity.
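As a rough illustration of the dataset layout described above, the sketch below builds 200 placeholder profiles with 20 question-answer pairs each and sets aside whole authors as the forget set. The record fields, the function names, and the 5% forget fraction are assumptions made for illustration; the article does not specify how the forget set is sized or stored.

```python
import random

NUM_AUTHORS = 200     # from the article
QA_PER_AUTHOR = 20    # from the article


def build_profiles(num_authors=NUM_AUTHORS, qa_per_author=QA_PER_AUTHOR):
    """Create placeholder QA records keyed by author id (content is dummy text)."""
    return {
        author_id: [
            {"question": f"Q{i} about author {author_id}", "answer": f"A{i}"}
            for i in range(qa_per_author)
        ]
        for author_id in range(num_authors)
    }


def split_forget_retain(profiles, forget_fraction, seed=0):
    """Pick whole authors for the forget set; all other authors are retained.

    forget_fraction is a hypothetical parameter, not a value from the article.
    """
    rng = random.Random(seed)
    author_ids = sorted(profiles)
    num_forget = round(len(author_ids) * forget_fraction)
    forget_ids = set(rng.sample(author_ids, num_forget))
    forget = {a: profiles[a] for a in author_ids if a in forget_ids}
    retain = {a: profiles[a] for a in author_ids if a not in forget_ids}
    return forget, retain


profiles = build_profiles()
forget, retain = split_forget_retain(profiles, forget_fraction=0.05)
print(len(forget), len(retain))                     # authors in each split
print(sum(len(qa) for qa in forget.values()))       # QA pairs to be unlearned
```

Selecting entire authors, rather than individual question-answer pairs, mirrors the entity-level forgetting that the benchmark targets.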

https://arxiv.org/abs/2401.06121

Unlearning in TOFU is evaluated across two axes:

Forget quality: A metric compares the probability the model assigns to true answers with the probability it assigns to false answers on the forget set. A statistical test then compares unlearned models against the gold-standard retain models, which were never trained on the sensitive data.

Model utility: Several performance metrics are used, and new evaluation datasets have been created for this purpose. These datasets range in relevance to the forgotten data, allowing a comprehensive assessment of how much of the model's capability survives the unlearning process.
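The forget-quality comparison can be sketched in a few lines. The code below is a simplified stand-in, not the benchmark's implementation: `truth_ratio` is an assumed, reduced form of the true-versus-false probability metric, the article does not name the statistical test (a two-sample Kolmogorov-Smirnov statistic is assumed here), and all probabilities are made-up numbers.

```python
from bisect import bisect_right


def truth_ratio(p_true, p_false_list):
    """Mean probability of the false answers divided by that of the true answer."""
    return (sum(p_false_list) / len(p_false_list)) / p_true


def ks_statistic(sample_a, sample_b):
    """Largest gap between the two empirical CDFs (two-sample KS statistic)."""
    a, b = sorted(sample_a), sorted(sample_b)
    gap = 0.0
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        gap = max(gap, abs(cdf_a - cdf_b))
    return gap


# Hypothetical per-question values on the forget set: (p_true, [p_false, ...]).
retain_model = [(0.20, [0.18, 0.22]), (0.25, [0.20, 0.30]), (0.30, [0.28, 0.26])]
unlearned_model = [(0.60, [0.10, 0.14]), (0.70, [0.12, 0.09]), (0.55, [0.20, 0.13])]

retain_ratios = [truth_ratio(p, f) for p, f in retain_model]
unlearned_ratios = [truth_ratio(p, f) for p, f in unlearned_model]

# A statistic near 0 means the two models look alike on the forget set;
# a statistic near 1 means their ratio distributions barely overlap.
print(ks_statistic(retain_ratios, unlearned_ratios))
```

In this toy data the candidate model still strongly prefers the true answers, so its ratios sit far below those of the retain model and the statistic reaches its maximum of 1.0, which would indicate poor forget quality.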

Four baseline methods were assessed on TOFU, and none of them achieved effective unlearning. This points to a need for continued work on unlearning approaches that tune models to behave as if they had never learned the forgotten data.

The TOFU framework is significant for several reasons:

  • It introduces a new benchmark for unlearning in the context of LLMs, addressing the need for controlled and measurable unlearning techniques.
  • The framework includes a dataset of fictitious author profiles, ensuring that the only source of information to be unlearned is known and can be robustly evaluated.
  • TOFU provides a comprehensive evaluation scheme, considering forget quality and model utility to measure unlearning efficacy.
  • The benchmark challenges existing unlearning algorithms, highlighting their limitations and the need for more effective solutions.

However, TOFU also has limitations. It focuses on entity-level forgetting and leaves out instance-level and behavior-level unlearning, which are also important aspects of this domain. Nor does the framework address alignment with human values, which could itself be framed as a type of unlearning.

In conclusion, the TOFU benchmark is a significant step forward in understanding the challenges and limitations of unlearning in LLMs. The researchers’ comprehensive approach to defining, measuring, and evaluating unlearning sheds light on the complexities of ensuring privacy and security in AI systems. The study’s findings highlight the need for continued innovation in unlearning methods that can remove sensitive information while preserving the overall utility and performance of the model.

Check out the Paper and Project. All credit for this research goes to the researchers of this project.
