AI Chatbots: From Exams to Real-World Challenges
A study highlights the limitations of AI chatbots in real-world medical interactions, despite their success in medical exam scenarios. Researchers from Harvard and Stanford tested large-language models using a framework called 'CRAFT-MD' and found issues in symptom gathering, patient history collection, and diagnostic accuracy.

- Country:
- India
AI chatbots have become increasingly relied upon for making sense of symptoms or test results. However, recent research indicates that while these AI tools excel in medical exam-like settings, they face challenges in real-world conversations.
Published in Nature Medicine, the study suggests that large-language models (LLMs), which power chatbots like ChatGPT, should be rigorously evaluated before their clinical use. Developed by researchers at Harvard Medical School and Stanford University, the framework 'CRAFT-MD' analyzed LLMs such as GPT-4 for their performance in realistic patient interactions.
The study reveals limitations in chatbots' abilities to conduct coherent clinical conversations, gather complete patient histories, and accurately diagnose conditions. Senior author Pranav Rajpurkar of Harvard Medical School noted that despite their prowess in exams, these models struggle significantly with dynamic, real-world medical dialogues.
(With inputs from agencies.)
- READ MORE ON:
- AI
- chatbots
- healthcare
- diagnostic
- study
- Harvard
- Stanford
- limitations
- LLM
- clinical
ALSO READ
Red Wine's Health Halo Debunked: New Study Challenges Cancer Prevention Claims
Goa's Groundbreaking Health Study Aims for Early NCD Detection
Tweets, likes, and needles: Study finds social media messaging spurs vaccine intentions
Shaping Ethiopia’s Preschool Educators: A Study on Pedagogical Competence Growth
GSK's Massive Study: Could Shingles Vaccine Cut Dementia Risk?