Close Menu
Healthtost
  • News
  • Mental Health
  • Men’s Health
  • Women’s Health
  • Skin Care
  • Sexual Health
  • Pregnancy
  • Nutrition
  • Fitness
  • Recommended Essentials
What's Hot

9 Easy Chia Pudding Recipes (+ The Perfect Pudding Ratio) • Kath Eats

May 4, 2026

Randomized controlled trial validates total hip arthroplasty to improve functional capacity

May 4, 2026

Dr. William O. Brant on male sexual health and the risks and benefits of supplements

May 4, 2026
Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
Healthtost
SUBSCRIBE
  • News

    Randomized controlled trial validates total hip arthroplasty to improve functional capacity

    May 4, 2026

    New genetic risk report reveals hidden risk of heart disease before symptoms appear

    May 3, 2026

    Five-target drug beats GLP-1/GIP therapy in obese diabetic mice

    May 3, 2026

    How fast your face ages can predict cancer survival outcomes

    May 2, 2026

    AI scribes save doctors time, but fail to reduce overtime

    May 2, 2026
  • Mental Health

    Every mental health journey starts with being seen

    May 2, 2026

    What animal studies teach us about toxic work environments

    April 27, 2026

    I hate hope: How to manage hope when you have treatment-resistant bipolar disorder

    April 19, 2026

    Rose Byrne is raw, magnetic and unfiltered as a woman in crisis

    April 18, 2026

    Can a single mother change her child’s surname in India?

    April 16, 2026
  • Men’s Health

    Dr. William O. Brant on male sexual health and the risks and benefits of supplements

    May 4, 2026

    3 Day Home Workout Plan: Build Muscle and Burn Fat

    April 30, 2026

    GLP-1 drugs promise broader health benefits, but experts advise caution on use

    April 28, 2026

    Trauma patients recover faster when medical teams know each other well, new study finds

    April 28, 2026

    I did red light therapy for 3 months so I shouldn’t have

    April 27, 2026
  • Women’s Health

    How to do a breast self-exam and spot lumps

    May 4, 2026

    Finding the best lupus treatments

    May 3, 2026

    What is the difference between UVA and UVB rays?

    May 1, 2026

    Are you a fungus fanatic? We unpack the nutritional trend of mushroom mania

    April 29, 2026

    What the Patients’ Bill of Rights Could Mean for Black Women

    April 29, 2026
  • Skin Care

    How I Did It: Fading Hormonal Hyperpigmentation Without Lasers

    May 3, 2026

    The truth about waterless care: What your skin really needs

    May 2, 2026

    What happens to your skin while you sleep? (the science of “Beauty Sle

    May 1, 2026

    Face Peeling Mask Guide: Shine Without Irritation

    April 28, 2026

    Is your moisturizing face mist really drying out your skin?

    April 28, 2026
  • Sexual Health

    Early signs of Peyronie’s disease and when to seek help

    May 3, 2026

    Boost erectile health and confidence

    May 1, 2026

    Judicial Restrictions on Abortion COVID-19 < SRHM

    April 30, 2026

    Can herpes affect fertility?

    April 29, 2026

    The Importance of Personalized Care in Medication Assisted Therapy (MAT) Programs I Novus

    April 28, 2026
  • Pregnancy

    Why is anemia during pregnancy high in Indian women?

    May 2, 2026

    5 things you need for the third trimester

    May 1, 2026

    Eating disorders in pregnancy and breastfeeding: Why “healthy eating” is not always easy

    May 1, 2026

    Comprehensive yoga for pregnancy, birth and beyond

    April 29, 2026

    Midwifery and Life – The postnatal health check New mums don’t know they can ask for

    April 28, 2026
  • Nutrition

    9 Easy Chia Pudding Recipes (+ The Perfect Pudding Ratio) • Kath Eats

    May 4, 2026

    A cancer-causing contaminant in drugs and meat

    May 3, 2026

    How Nutrition Supports Mood, Energy and Gut Health

    May 2, 2026

    How to create a self-care plan when you’re stressed

    May 1, 2026

    I answer the most HOT Questions about Fatty Liver

    April 29, 2026
  • Fitness

    The most underrated skill I wish everyone learned

    May 3, 2026

    Landmine Training and Why I Love It – Tony Gentilcore

    May 3, 2026

    9 Powerful Fitness Tips for Pear Shaped Bodies

    May 2, 2026

    If you can still do these 7 things at 60, your body is aging better than most

    May 2, 2026

    A Hike Leader’s Must-Have Kit

    April 30, 2026
  • Recommended Essentials
Healthtost
Home»News»ChatGPT Health fails critical emergency and suicide safety tests
News

ChatGPT Health fails critical emergency and suicide safety tests

healthtostBy healthtostFebruary 24, 2026No Comments6 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Reddit WhatsApp Email
Chatgpt Health Fails Critical Emergency And Suicide Safety Tests
Share
Facebook Twitter LinkedIn Pinterest WhatsApp Email

ChatGPT Health, a widely used artificial intelligence (AI) tool that provides health guidance directly to the public — including advice on how to seek emergency medical care — may fail to properly direct users to emergency care in a significant number of serious cases, according to researchers at the Icahn School of Medicine at Mount Sinai.

The study, fast-tracked in the February 23, 2026 online issue Nature Medicine [https://doi.org/10.1038/s41591-026-04297-7]is the first independent safety assessment of the LLM-based tool since its launch in January 2026. It also identified serious concerns about the tool’s safeguards against suicidal ideation.

“LLMs have become patients’ first port of call for medical advice—but in 2026 they are less secure on the clinical fringes, where judgment separates missed emergencies from unnecessary alarm,” says Isaac S. Kohane, MD, PhD, Chair, Department of Biomedical Informatics at Harvard Medical School, who was not involved in the research.. “When millions of people use an AI system to decide if they need emergency care, the stakes are extremely high. Independent evaluation should be routine, not optional.”

Within weeks of its launch, ChatGPT Health’s maker, OpenAI, reported that about 40 million people use the tool daily to seek health information and guidance, including advice on whether to seek emergency or urgent care. At the same time, the researchers say, there was little independent evidence about how safe or reliable his advice actually was.

This gap prompted our study. We wanted to answer a very basic but critical question: if someone is experiencing a real medical emergency and reaches out to ChatGPT Health for help, will it clearly tell them to go to the emergency room?”


Ashwin Ramaswamy, MD, lead author, Instructor in Urology, Icahn School of Medicine, Mount Sinai

Regarding suicide risk alerts, ChatGPT Health was designed to direct users to the 988 Suicide and Crisis Lifeline in high-risk situations. However, the researchers found that these alerts appeared inconsistently, sometimes triggering lower-risk scenarios, while – alarmingly – failing to appear when users described specific plans to self-harm.

“This was a particularly surprising and disturbing finding,” says senior and co-corresponding author of the study Girish N. Nadkarni, MD, MPH, Barbara T. Murphy Chair of Windreich’s Department of Artificial Intelligence and Human Health, Director of the Hasso Plattner Institute for Digital Health, and Irene and Dr. Sinai, and Chief AI Officer of Mount Sinai Health System. “While we expected some variability, what we observed exceeded the inconsistency. The system’s alerts were inversely related to clinical risk, appearing more reliable for lower-risk scenarios than for cases where someone shared how they intended to harm themselves. In real life, when someone talks about exactly how they will harm themselves, that is a sign of more immediate, not less serious, risk.”

As part of the evaluation, the research team created 60 structured clinical scenarios covering 21 medical specialties. Cases ranged from minor conditions suitable for home care to true medical emergencies. Three independent physicians determined the correct level of urgency for each case using guidelines from 56 medical societies.

Each scenario was tested under 16 different contextual conditions, including variations in race, gender, social dynamics (such as someone minimizing symptoms), and barriers to care, such as lack of insurance or transportation. In total, the team conducted 960 interactions with ChatGPT Health and compared its recommendations to the consensus of doctors.

When testing 60 realistic patient scenarios developed by doctors, the researchers found that while the tool generally handled clear emergencies correctly, it underplayed more than half of the cases that doctors judged to require urgent care.

The researchers were also impressed by how the system failed in medical emergencies. The tool often proved to recognize dangerous findings in its own explanations, yet reassured the patient.

“ChatGPT Health has performed well in textbook emergencies like stroke or severe allergic reactions,” says Dr. Ramaswamy. “But it struggled in more nuanced situations where the risk is not immediately obvious, and these are often the situations where clinical judgment matters most. In an asthma scenario, for example, the system identified early warning signs of respiratory failure in its explanation, but recommended waiting rather than seeking emergency treatment.”

The study’s authors advise that for worsening or worrying symptoms, including chest pain, shortness of breath, severe allergic reactions or changes in mental status, people should seek medical attention directly rather than relying solely on the chatbot’s guidance. In cases involving thoughts of self-harm, people should contact 988 Suicide and Crisis Lifeline or go to an emergency department.

However, the researchers stress that the findings do not suggest that consumers should abandon AI health tools altogether.

“As a medical student in training at a time when AI health tools are already in the hands of millions, I see them as technologies that we must learn to carefully integrate into care, not substitutes for clinical judgment,” says Alvira Tyagi, a first-year medical student at the Icahn School of Medicine at Mount Sinai and second author of the study. “These systems are changing rapidly, so part of our training now must consider learning how to critically understand their results, identify where they fall short, and use them in ways that protect patients.”

The study evaluated the system at a single time point. Because AI models are updated frequently, performance can change over time, underscoring the need for independent evaluation, the researchers say.

“The start of medical education alongside tools evolving in real time makes it clear that today’s results are not static,” says Ms Tyagi. “This reality requires ongoing review to ensure that improvements in technology translate into safer care.”

The team plans to continue evaluating updates to ChatGPT Health and other consumer-facing AI tools, expanding future research into areas such as pediatric care, drug safety, and non-English language use.

The paper is titled “The performance of ChatGPT Health in a structured trial of triage recommendations.”

The authors of the study, as reported in the journal, are Ashwin Ramaswamy, MD, MPP. Alvira Tyagi, BA; Hannah Hugo, MD; Joy Jiang, PhD; Pushkala Jayaraman, PhD; Mateen Jangda, MSc; Alexis E. Te, MD; Steven A. Kaplan, MD; Joshua Lampert, MD; Robert Freeman, MSN, MS; Nicholas Gavin, MD, MBA; Ashutosh K. Tewari, MBBS, MCh; Ankit Sakhuja, MBBS MS; Bilal Naved, PhD; Alexander W. Charney, MD, PhD; Mahmoud Omar, MD; Michael A. Gorin, MD; Eyal Klang, MD; Girish N. Nadkarni, MD, MPH.

Source:

Mount Sinai Health System

Journal Reference:

Ramaswamy, A., et al. (2026). ChatGPT Health performance in a structured test of screening proposals. Nature Medicine. DOI: 10.1038/s41591-026-04297-7.

ChatGPT critical emergency fails health safety Suicide Tests
bhanuprakash.cg
healthtost
  • Website

Related Posts

Randomized controlled trial validates total hip arthroplasty to improve functional capacity

May 4, 2026

Dr. William O. Brant on male sexual health and the risks and benefits of supplements

May 4, 2026

New genetic risk report reveals hidden risk of heart disease before symptoms appear

May 3, 2026

Leave A Reply Cancel Reply

Don't Miss
Nutrition

9 Easy Chia Pudding Recipes (+ The Perfect Pudding Ratio) • Kath Eats

By healthtostMay 4, 20260

Looking for easy chia pudding recipes that you can make overnight? These healthy chia pudding…

Randomized controlled trial validates total hip arthroplasty to improve functional capacity

May 4, 2026

Dr. William O. Brant on male sexual health and the risks and benefits of supplements

May 4, 2026

How to do a breast self-exam and spot lumps

May 4, 2026
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo
TAGS
Baby benefits body brain cancer care Day Diet disease exercise finds Fitness food Guide health healthy heart Improve Life Loss Men mental Natural Nutrition Patients Pregnancy protein research reveals risk routine sex sexual Skin Skincare study Therapy Tips Top Training Treatment ways weight women Workout
About Us
About Us

Welcome to HealthTost, your trusted source for breaking health news, expert insights, and wellness inspiration. At HealthTost, we are committed to delivering accurate, timely, and empowering information to help you make informed decisions about your health and well-being.

Latest Articles

9 Easy Chia Pudding Recipes (+ The Perfect Pudding Ratio) • Kath Eats

May 4, 2026

Randomized controlled trial validates total hip arthroplasty to improve functional capacity

May 4, 2026

Dr. William O. Brant on male sexual health and the risks and benefits of supplements

May 4, 2026
New Comments
    Facebook X (Twitter) Instagram Pinterest
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms and Conditions
    • Disclaimer
    © 2026 HealthTost. All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.