Monday, June 2, 2025
News PouroverAI
Visit PourOver.AI
No Result
View All Result
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing
News PouroverAI
No Result
View All Result

FusOn-pLM: Advancing Precision Therapy for Fusion Oncoproteins through Enhanced Protein Language Modeling

June 8, 2024
in AI Technology
Reading Time: 5 mins read
0 0
A A
0
Share on FacebookShare on Twitter


Fusion oncoproteins, formed by chromosomal translocations, are key drivers in many cancers, especially pediatric ones. These chimeric proteins are difficult to target with drugs due to their large, disordered structures and lack of distinct binding pockets. Traditional drug design methods, like small molecules, often fail because they need more specificity or bind crucial cellular proteins. Protein language models (pLMs) have emerged as a promising tool, offering detailed sequence-based insights into protein function. Despite success with various proteins, current plans still need to include training on fusion oncoproteins, limiting their application in designing therapeutics for these challenging targets.

Researchers at Duke University have developed FusOn-pLM, a new protein language model tailored for fusion oncoproteins. This model fine-tunes the advanced ESM-2 pLM, specifically on fusion oncoprotein sequences from large databases. They introduced an innovative masked language modeling approach focusing on key residues likely involved in protein interactions. This method improves the representation of fusion oncoproteins, outperforming the base ESM-2 model and other embeddings in various benchmarks. The enhanced embeddings are designed to aid in the therapeutic targeting these challenging proteins. FusOn-pLM is publicly accessible for further research and applications.

The training dataset for FusOn-pLM was meticulously curated from FusionPDB and FOdb databases, gathering 41,420 sequences from FusionPDB and 4,536 from FOdb. Only sequences containing the 20 natural amino acids and under 2000 amino acids in length were selected, ensuring they fit within GPU memory constraints. After removing duplicates from overlapping entries, 177 FOdb sequences were set aside for benchmarking. The remaining sequences underwent clustering using the MMSeqs2 tool, applying a minimum sequence identity threshold of 30% and an 80% coverage threshold. The clusters were divided into training, validation, and testing sets in an 80/10/10 ratio. For benchmarking, specific datasets from FOdb were curated for tasks like predicting the tendency of fusion oncoproteins to form condensates and their cellular localization. Additional datasets were used to predict cancer-related disease outcomes and analyze the properties of intrinsically disordered regions (IDRs).

FusOn-pLM’s effectiveness was evaluated using several benchmark tasks, including predicting phase separation of fusion oncoproteins, their localization in the cell, and their associations with specific cancers like breast invasive carcinoma and stomach adenocarcinoma. A targeted probabilistic masking strategy was employed to improve model comprehension, focusing on amino acids likely to participate in protein-protein interactions identified through SaLT&PepPr predictions. This masking strategy, applied to 15% of each sequence, enhances the model’s ability to recognize the interaction points within fusion oncoproteins. During training, the model fine-tuned the advanced ESM-2-650M model by unfreezing the weights and biases of its final layers. FusOn-pLM’s embeddings were benchmarked against other types, including those from ESM-2-650M, and manually curated FOdb embeddings, demonstrating superior performance in disorder prediction and capturing key physicochemical properties.

FusOn-pLM’s embeddings were enhanced using probabilistic masking, particularly the SaLT&PepPr-based approach, leading to optimal performance. The model was evaluated on various tasks, demonstrating superior performance in predicting the behavior and properties of fusion oncoproteins, such as their propensity to form puncta and their cellular localization. Additionally, FusOn-pLM excelled in identifying intrinsically disordered regions and their physicochemical properties, outperforming other embedding methods. Visualization techniques showed that FusOn-pLM embeddings distinctly separate fusion oncoproteins from their components, reflecting their unique characteristics and biological relevance.

In conclusion, FusOn-pLM, an ESM-2-based protein language model, is specifically tuned to capture the unique properties of fusion oncoproteins, which are typically disordered and contribute to cancer development. Unlike traditional models and embeddings like FOdb, FusOn-pLM excels in tasks related to fusion oncoproteins and effectively distinguishes these proteins from their components. Future work aims to utilize FusOn-pLM to design targeted protein degraders and integrate post-translational modifications for more precise therapeutic interventions. This model represents a significant advancement in biologics for treating fusion protein cancers.

Check out the Paper and Model. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter. Join our Telegram Channel, Discord Channel, and LinkedIn Group.

If you like our work, you will love our newsletter..

Don’t Forget to join our 44k+ ML SubReddit

Super excited to share our new protein language model: FusOn-pLM, developed by my wonderful first-year PhD student @SophieVincoff and her team of amazing undergrads!! 🎉

Sure, we know AlphaFold3/RFDiffusion and other structure prediction/design methods are great — but what… pic.twitter.com/uuZFxeNbmd

— Pranam Chatterjee (@pranamanam) June 4, 2024

Sana Hassan, a consulting intern at Marktechpost and dual-degree student at IIT Madras, is passionate about applying technology and AI to address real-world challenges. With a keen interest in solving practical problems, he brings a fresh perspective to the intersection of AI and real-life solutions.

🐝 Join the Fastest Growing AI Research Newsletter Read by Researchers from Google + NVIDIA + Meta + Stanford + MIT + Microsoft and many others…





Source link

Tags: AdvancingEnhancedFusionFusOnpLMlanguageModelingOncoproteinsprecisionProteintherapy
Previous Post

Gold tumbles on solid US nonfarm payroll report, China slowing down gold buying

Next Post

Propagandists are using AI too

Related Posts

How insurance companies can use synthetic data to fight bias
AI Technology

How insurance companies can use synthetic data to fight bias

June 10, 2024
From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset
AI Technology

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

June 10, 2024
How Game Theory Can Make AI More Reliable
AI Technology

How Game Theory Can Make AI More Reliable

June 9, 2024
Decoding Decoder-Only Transformers: Insights from Google DeepMind’s Paper
AI Technology

Decoding Decoder-Only Transformers: Insights from Google DeepMind’s Paper

June 9, 2024
Buffer of Thoughts (BoT): A Novel Thought-Augmented Reasoning AI Approach for Enhancing Accuracy, Efficiency, and Robustness of LLMs
AI Technology

Buffer of Thoughts (BoT): A Novel Thought-Augmented Reasoning AI Approach for Enhancing Accuracy, Efficiency, and Robustness of LLMs

June 9, 2024
Deciphering Doubt: Navigating Uncertainty in LLM Responses
AI Technology

Deciphering Doubt: Navigating Uncertainty in LLM Responses

June 9, 2024
Next Post
Propagandists are using AI too

Propagandists are using AI too

Bajaj Housing Finance files papers with SEBI for Rs 7,000 crore IPO; move in line with RBI norms for ‘upper layer’ NBFCs

Bajaj Housing Finance files papers with SEBI for Rs 7,000 crore IPO; move in line with RBI norms for 'upper layer' NBFCs

Breaking the Language Barrier for All: Sparsely Gated MoE Models Bridge the Gap in Neural Machine Translation

Breaking the Language Barrier for All: Sparsely Gated MoE Models Bridge the Gap in Neural Machine Translation

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

  • Trending
  • Comments
  • Latest
Is C.AI Down? Here Is What To Do Now

Is C.AI Down? Here Is What To Do Now

January 10, 2024
Accenture creates a regulatory document authoring solution using AWS generative AI services

Accenture creates a regulatory document authoring solution using AWS generative AI services

February 6, 2024
23 Plagiarism Facts and Statistics to Analyze Latest Trends

23 Plagiarism Facts and Statistics to Analyze Latest Trends

June 4, 2024
Managing PDFs in Node.js with pdf-lib

Managing PDFs in Node.js with pdf-lib

November 16, 2023
NVIDIA’s Marketing Strategy Case Study

NVIDIA’s Marketing Strategy Case Study

October 25, 2023
25+ CSS Pricing Tables

25+ CSS Pricing Tables

October 28, 2023
Can You Guess What Percentage Of Their Wealth The Rich Keep In Cash?

Can You Guess What Percentage Of Their Wealth The Rich Keep In Cash?

June 10, 2024
AI Compared: Which Assistant Is the Best?

AI Compared: Which Assistant Is the Best?

June 10, 2024
How insurance companies can use synthetic data to fight bias

How insurance companies can use synthetic data to fight bias

June 10, 2024
5 SLA metrics you should be monitoring

5 SLA metrics you should be monitoring

June 10, 2024
From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

June 10, 2024
UGRO Capital: Targeting to hit milestone of Rs 20,000 cr loan book in 8-10 quarters: Shachindra Nath

UGRO Capital: Targeting to hit milestone of Rs 20,000 cr loan book in 8-10 quarters: Shachindra Nath

June 10, 2024
Facebook Twitter LinkedIn Pinterest RSS
News PouroverAI

The latest news and updates about the AI Technology and Latest Tech Updates around the world... PouroverAI keeps you in the loop.

CATEGORIES

  • AI Technology
  • Automation
  • Blockchain
  • Business
  • Cloud & Programming
  • Data Science & ML
  • Digital Marketing
  • Front-Tech
  • Uncategorized

SITEMAP

  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us

Copyright © 2023 PouroverAI News.
PouroverAI News

No Result
View All Result
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing

Copyright © 2023 PouroverAI News.
PouroverAI News

Welcome Back!

Login to your account below

Forgotten Password? Sign Up

Create New Account!

Fill the forms bellow to register

All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In