Saturday, May 17, 2025
News PouroverAI

A research AI system for diagnostic medical reasoning and conversations – Google Research Blog

January 12, 2024
in AI Technology

Posted by Alan Karthikesalingam and Vivek Natarajan, Research Leads, Google Research

The physician-patient conversation is crucial in medicine, as effective communication drives diagnosis, management, empathy, and trust. AI systems that can engage in diagnostic dialogues have the potential to improve the availability, accessibility, quality, and consistency of care by serving as conversational partners for clinicians and patients. However, replicating the expertise of clinicians is a significant challenge. While large language models (LLMs) have shown promise in planning, reasoning, and holding rich conversations in other domains, there are unique aspects of diagnostic dialogue in the medical field that require attention.

To address this challenge, we have developed Articulate Medical Intelligence Explorer (AMIE), a research AI system based on LLMs and optimized for diagnostic reasoning and conversations. We have trained and evaluated AMIE along many dimensions that reflect the quality of real-world clinical consultations from the perspectives of both clinicians and patients. To scale AMIE across different disease conditions, specialties, and scenarios, we have created a novel simulated diagnostic dialogue environment and employed automated feedback mechanisms to enhance its learning process. We have also implemented an inference-time chain-of-reasoning strategy to improve AMIE’s diagnostic accuracy and conversation quality. Finally, we have tested AMIE prospectively on multi-turn dialogue by simulating consultations with trained actors.

In addition to developing and optimizing AI systems for diagnostic conversations, we have also explored how to assess the performance of such systems. Inspired by established tools used to measure consultation quality and clinical communication skills in real-world settings, we have developed an evaluation rubric to assess diagnostic conversations in terms of history-taking, diagnostic accuracy, clinical management, clinical communication skills, relationship fostering, and empathy. We have conducted a randomized, double-blind crossover study in which text-based consultations were performed with validated patient actors interacting with either board-certified primary care physicians or the AMIE system optimized for diagnostic dialogue. The consultations were designed in the style of an objective structured clinical examination (OSCE), a practical assessment commonly used to evaluate clinicians’ skills in a standardized and objective way.
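To make the study design above more concrete, here is a minimal sketch of how a crossover assignment and per-axis rubric averages could be computed. It is an illustration under stated assumptions, not the analysis code used in the study: the function names, the arm labels, and the 1-5 rating scale are all hypothetical, and only the six axis names come from the text.

```python
import random
from statistics import mean

# Axes named in the text. The 1-5 rating scale used below is an assumption.
AXES = [
    "history_taking",
    "diagnostic_accuracy",
    "clinical_management",
    "communication_skills",
    "relationship_fostering",
    "empathy",
]


def crossover_order(scenario_ids, seed=0):
    """Give every scenario both arms, in a randomized order.

    In a crossover design each scenario is seen by both arms; only the
    order in which they are seen is randomized.
    """
    rng = random.Random(seed)
    orders = {}
    for sid in scenario_ids:
        arms = ["physician", "ai_system"]  # hypothetical arm labels
        rng.shuffle(arms)
        orders[sid] = tuple(arms)
    return orders


def mean_by_axis(ratings):
    """Average the ratings for each (arm, axis) pair.

    `ratings` is a list of dicts shaped like
    {"arm": "physician", "scores": {"empathy": 4, ...}}.
    """
    result = {}
    for arm in sorted({r["arm"] for r in ratings}):
        rows = [r for r in ratings if r["arm"] == arm]
        result[arm] = {axis: mean(r["scores"][axis] for r in rows) for axis in AXES}
    return result
```

Keeping the arm label out of the rater's view until after scoring is what would make such a comparison blinded; the sketch only covers assignment and aggregation.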

To train AMIE, we have used real-world datasets comprising medical reasoning, medical summarization, and clinical conversations. However, training LLMs for medical conversations using existing real-world data has two limitations: the data often fails to cover the full range of medical conditions and scenarios, and transcripts of real conversations tend to be noisy. Therefore, we have designed a self-play-based simulated learning environment with automated feedback mechanisms to simulate diagnostic medical dialogues in a virtual care setting. This has allowed us to scale AMIE’s knowledge and capabilities across various medical conditions and contexts. We have employed an iterative process of self-play loops to refine AMIE’s behavior and progressively improve its diagnostic responses. Additionally, we have implemented an inference-time chain-of-reasoning strategy to enable AMIE to provide informed and grounded replies.
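The general shape of a self-play loop with automated feedback can be sketched in a few lines. This is a toy illustration only and not AMIE's training code: the doctor, patient, critic, and revise functions are hypothetical stand-ins for what would be LLM calls in a real system, and the stopping rule is an assumption.

```python
from typing import Callable, List, Tuple

Dialogue = List[Tuple[str, str]]  # (speaker, utterance) pairs


def simulate_dialogue(doctor: Callable[[Dialogue], str],
                      patient: Callable[[Dialogue], str],
                      max_turns: int = 3) -> Dialogue:
    """Alternate doctor and patient turns for a fixed number of rounds."""
    dialogue: Dialogue = []
    for _ in range(max_turns):
        dialogue.append(("doctor", doctor(dialogue)))
        dialogue.append(("patient", patient(dialogue)))
    return dialogue


def self_play_round(doctor, patient, critic, revise, n_refinements: int = 2) -> Dialogue:
    """Generate one simulated dialogue, then refine it with automated feedback."""
    dialogue = simulate_dialogue(doctor, patient)
    for _ in range(n_refinements):
        feedback = critic(dialogue)
        if not feedback:  # the critic has nothing left to flag
            break
        dialogue = revise(dialogue, feedback)
    return dialogue


# Toy stand-ins (hypothetical); a real system would call a model here.
def toy_doctor(dialogue: Dialogue) -> str:
    return "What brings you in today?" if not dialogue else "Can you tell me more?"


def toy_patient(dialogue: Dialogue) -> str:
    return "I have had a cough for a week."


def toy_critic(dialogue: Dialogue) -> str:
    """Return feedback text, or an empty string if nothing is flagged."""
    empathic = any("sorry" in u.lower() for s, u in dialogue if s == "doctor")
    return "" if empathic else "Acknowledge the patient's discomfort."


def toy_revise(dialogue: Dialogue, feedback: str) -> Dialogue:
    """Prefix the doctor's last turn with an empathic phrase."""
    revised = list(dialogue)
    for i in range(len(revised) - 1, -1, -1):
        speaker, utterance = revised[i]
        if speaker == "doctor":
            revised[i] = (speaker, "I'm sorry to hear that. " + utterance)
            break
    return revised


refined = self_play_round(toy_doctor, toy_patient, toy_critic, toy_revise)
```

In a fuller version, the refined dialogues from many such rounds would be collected and used as additional fine-tuning data, which is what makes the process iterative.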

In our evaluation, AMIE performed simulated diagnostic conversations at least as well as primary care physicians when both were evaluated along multiple clinically meaningful axes of consultation quality. AMIE demonstrated greater diagnostic accuracy and superior performance on 28 of 32 axes from the perspective of specialist physicians and on 24 of 26 axes from the perspective of patient actors.

Our research has several limitations, and further work is needed before AMIE could become a safe and robust tool for real-world clinical practice. Our evaluation relied on a text-chat interface, which may not fully capture the value of human conversations in real-world settings. Additionally, important considerations such as health equity, fairness, privacy, and robustness need to be addressed to ensure the safety and reliability of AI technology in healthcare.

In conclusion, AMIE shows promise as an AI system for diagnostic conversations, and our research is a first exploratory step towards developing a tool that can assist clinicians in providing care.


