July 26, 2024

Capability of ChatGPT Reduced in Complex Vitreoretinal Scenarios

Across 40 clinical scenarios, ChatGPT failed to provide a comprehensive response to about 50% of clinical questions, and nearly 30% of the sources it cited were hallucinated.

ChatGPT answered correctly in more than 80% of complex open-ended vitreoretinal clinical scenarios but demonstrated a reduced capability to offer a comprehensive response, according to data presented at the American Society of Retina Specialists (ASRS) 42nd Annual Meeting.¹

Across the 40 open-ended clinical scenarios, the artificial intelligence (AI) chatbot failed to give a comprehensive response to approximately 50% of clinical questions, and nearly 30% of the sources it generated were hallucinated. Hallucinations occur when a large language model (LLM) produces nonsensical or inaccurate output and presents it as factual.²

“This demonstrates that while ChatGPT is rapidly growing more accurate, it is not yet suitable as an information source for patients,” wrote the investigative team, led by Michael J. Maywood, MD, department of ophthalmology, Corewell Health William Beaumont University Hospital.¹

AI chatbots continue to evolve as medical tools, particularly in ophthalmology, making it critical to evaluate their strengths and limitations. In this retrospective, cross-sectional study, Maywood and colleagues assessed the performance of ChatGPT by determining the accuracy of the chatbot’s responses to complex open-ended vitreoretinal clinical scenarios, as well as the sources it used in answering the clinical prompts.

Investigators designed 40 open-ended clinical scenarios across 4 primary topics in vitreoretinal disease, with responses graded on correctness and comprehensiveness by 3 blinded retina specialists. The primary outcome of the analysis was the number of clinical scenarios answered correctly and comprehensively by the chatbot.

Secondary outcomes included the theoretical harm to patients from an incorrect response, the distribution of reference types used by ChatGPT, and the occurrence of hallucinated references.

Upon analysis in June 2023, ChatGPT answered 83% (n = 33 of 40) of clinical scenarios correctly but provided a comprehensive answer in only 52.5% (n = 21 of 40) of cases. Subgroup analysis showed an average correct response rate of 86.7% for neovascular age-related macular degeneration (nAMD), 100% for diabetic retinopathy (DR), 76.7% for retinal vascular disease, and 70% for the surgical domain.

An assessment of the references showed that 70% of those generated by ChatGPT were real and 30% were hallucinated. Overall, there were 6 incorrect responses: 1 (16.7%) case of no harm, 3 (50%) cases of possible harm, and 2 (33.3%) cases of definitive harm.

“It was unable to provide a comprehensive response in ~50% of clinical questions and generated 30% hallucinated sources,” Maywood added.

References

1. Maywood MJ. Performance Assessment of an Artificial Intelligence Chatbot in Clinical Vitreoretinal Scenarios. Poster presented at the American Society of Retina Specialists (ASRS) 42nd Annual Meeting. Stockholm, Sweden. July 17-20, 2024.
2. What are AI hallucinations? IBM. September 1, 2023. Accessed July 25, 2024. https://www.ibm.com/topics/ai-hallucinations
