Close Menu
Healthtost
  • News
  • Mental Health
  • Men’s Health
  • Women’s Health
  • Skin Care
  • Sexual Health
  • Pregnancy
  • Nutrition
  • Fitness
  • Recommended Essentials
What's Hot

Traveling by plane with BPH

April 9, 2026

Virica Biotech and FUJIFILM Biosciences Collaborate on Canada-Japan Co-Innovation Program to Advance AAV Production Enhancers

April 9, 2026

30 Minute Kettlebell Full Body Workout for Over 50

April 9, 2026
Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
Healthtost
SUBSCRIBE
  • News

    Virica Biotech and FUJIFILM Biosciences Collaborate on Canada-Japan Co-Innovation Program to Advance AAV Production Enhancers

    April 9, 2026

    Long-term overweight is a stronger predictor of cardiovascular risk

    April 8, 2026

    Sugar intake can reduce the effectiveness of relaxation exercises

    April 8, 2026

    AI tool predicts Barrett’s esophagus recurrence with high accuracy

    April 7, 2026

    Salaera™ is launched to advance the future of breathing and gas technologies

    April 7, 2026
  • Mental Health

    the surprisingly common condition with a scary name

    April 6, 2026

    How yoga helps heal emotional wounds

    April 4, 2026

    Will medicinal cannabis help my mental health? Here are the facts and the risks

    April 1, 2026

    Does World Bipolar Day have an impact?

    March 29, 2026

    Worried about your preschooler’s anxiety? See how you can help

    March 28, 2026
  • Men’s Health

    Traveling by plane with BPH

    April 9, 2026

    30 Minute Kettlebell Full Body Workout for Over 50

    April 9, 2026

    The study shows that male depression is not just a pattern of men’s mental health

    April 7, 2026

    Dr. Jason Snibbe: Men’s health from a doctor who does it the right way

    April 6, 2026

    Coping with sexual health and erectile dysfunction as a couple

    April 3, 2026
  • Women’s Health

    Midlife Weight Gain Isn’t Just Willpower: Understanding Your Second Adolescence With WONDERBIOTICS

    April 8, 2026

    8 Things to Do When Attraction Dies in Your Marriage

    April 8, 2026

    I was finally diagnosed with Addison’s disease

    April 7, 2026

    I lost 60 pounds and got my life back

    April 7, 2026

    4.3 Friday Faves – The Fitnessista

    April 6, 2026
  • Skin Care

    What happens when you stop using hyaluronic acid – UMERE

    April 7, 2026

    The truth about "Pure Beauty" — What it means, what it doesn’t and what sensitive skin really needs

    April 6, 2026

    Backed by Science. Built for results. – Lifeline Skin Care

    April 4, 2026

    Best Facials | What to book for real results

    April 4, 2026

    Don’t Sabotage Your Laser Treatment Aftercare: 7 Mistakes

    April 3, 2026
  • Sexual Health

    Endometriosis procedures are reimbursed at lower rates, doctors say

    April 8, 2026

    Reflections two years later in a global context < SRHM

    April 8, 2026

    Can exercise improve HIV symptoms?

    April 7, 2026

    An Introduction to the Kink Literature Database — Sexual Health Alliance

    April 6, 2026

    No, abortion pills do not poison your drinking water

    April 1, 2026
  • Pregnancy

    How your partner can support a happier pregnancy

    April 9, 2026

    Exposure to plastic during pregnancy may be linked to more premature births than expected

    April 4, 2026

    How to relieve numbness and tingling in the legs in the third trimester?

    April 3, 2026

    The best stroller accessories for every type of stroller

    March 29, 2026

    A new study says pre-pregnancy health is a conversation between two parents

    March 29, 2026
  • Nutrition

    The Weekly Reset That Saves My Sanity (Lily’s Guacamole Recipe)

    April 7, 2026

    Double Chocolate Veggie Muffins (Kids and Lunchtime)

    April 7, 2026

    Nut Nutrition Comparison: Understanding Nutrient Content

    April 4, 2026

    Is Berberine ‘Nature’s Metformin’? | HUM Nutrition Blog

    April 3, 2026

    12 Healthy Egg Dishes • Kath Eats

    April 3, 2026
  • Fitness

    Best Health & Fitness Certifications (My Favorites After 17+ Years in the Industry)

    April 6, 2026

    Dose 1 – Tony Gentilcore

    April 6, 2026

    How to take care of your internal organs

    April 5, 2026

    Doctors say these 5 daily habits can improve heart health naturally

    April 5, 2026

    Magnesium Oxide vs. Glycinate: Which is Better?

    April 4, 2026
  • Recommended Essentials
Healthtost
Home»News»The study evaluates safety and accuracy in emergency medicine
News

The study evaluates safety and accuracy in emergency medicine

healthtostBy healthtostDecember 7, 2024No Comments5 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Reddit WhatsApp Email
The Study Evaluates Safety And Accuracy In Emergency Medicine
Share
Facebook Twitter LinkedIn Pinterest WhatsApp Email

Study evaluates large language model for emergency medicine handover notes, finding high utility and safety comparable to physicians

Study: Development and Evaluation of Emergency Medical Emergency Management Notes Generated by the Large Language Model. Image credit: Kamon_wongnon / Shutterstock.com

In a recent study published in JAMA Network Openresearchers developed and evaluated the accuracy, safety, and utility of Emergency Medicine (EM)-generated Long Language Model (LLM) handoff notes to reduce physician documentation burden without compromising patient safety.

The critical role of transfers in health care

Handles are critical points of contact in healthcare and a known source of medical errors. As a result, many organizations, such as The Joint Commission and the Accreditation Council for Graduate Medical Education (ACGME), have advocated standardized procedures to improve safety.

EM to inpatient (IP) transfers are associated with unique challenges, such as medical complexity, time constraints, and diagnostic uncertainty. However, they remain poorly standardized and inconsistently implemented. Electronic health record (EHR)-based tools have attempted to overcome these limitations. However, they remain unexplored in emergency situations.

LLMs have emerged as potential solutions for streamlining clinical documentation. However, concerns about factual inconsistencies require further research to ensure safety and reliability in critical workflows.

About the study

The present study was conducted in an 840-bed urban tertiary care academic hospital in New York City. EHR data from 1,600 EM patient encounters resulting in acute hospital admissions between April and September 2023 were analyzed. Only encounters after April 2023 were included due to the implementation of an updated EM-to-IP handover system.

Retrospective data were used with a waiver of informed consent to ensure minimal risk to patients. Handoff notes were created using a combination of LLM detail and rule-based heuristics while adhering to standard reference guidelines.

The delivery note template closely resembled the current structure of the manual, incorporating rule-based elements such as laboratory tests and vital signs and LLM-generated elements such as history of present illness and differential diagnoses. IT experts and EM physicians curated data to refine the LLM to improve their quality, while excluding race-based characteristics to avoid bias.

Two LLMs, robust Optimized Bidirectional Encoder Representations by Transformers Approach (RoBERTa) and Large Language Model Meta AI (Llama-2), were used for meaningful content selection and abstract summarization, respectively. Data processing included heuristic prioritization and saliency modeling to address potential limitations of the models.

The researchers evaluated automated metrics, such as the Recall-Oriented Understudy for Gisting Evaluation (ROUGE) and the Bidirectional Encoder Representations from Transformers Score (BERTScore), alongside a new framework focused on patient safety. A clinical review of 50 delivery notes assessed their completeness, readability and safety to ensure their rigorous validation.

Study findings

Among the 1,600 patient cases included in the analysis, the mean age was 59.8 years with a standard deviation of 18.9 years, and 52% of patients were female. Automated evaluation metrics revealed that LLM-generated summaries outperformed those written by physicians in many aspects.

ROUGE-2 scores were significantly higher for LLM-generated summaries compared to physician summaries at 0.322 and 0.088, respectively. Similarly, BERT accuracy scores were higher at 0.859 compared to 0.796 for physician summaries. In contrast, the source segmentation approach for large-scale inconsistency assessment (SCALE) produced a score of 0.691 compared to 0.456. These results indicate that LLM-generated summaries demonstrated greater lexical similarities, higher fidelity to source notes, and provided more detailed content than their human-generated counterparts.

In clinical evaluations, the quality of LLM-generated summaries was comparable to physician-written summaries, but slightly inferior on several dimensions. On a Likert scale of one to five, LLM-generated summaries scored lower on usefulness, completeness, curation, readability, correctness, and patient safety. Despite these differences, the automated summaries were generally considered acceptable for clinical use, with none of the identified issues identified as life-threatening for patient safety.

When assessing worst-case scenarios, clinicians identified potential second-level safety risks, which included incomplete and flawed logic in 8.7% and 7.3%, respectively, for LLM-generated summaries compared to written summaries by doctors, which were not associated with these risks. Hallucinations were rare in LLM-generated summaries, with five identified cases all receiving safety scores between four and five, thus indicating mild to negligible safety risks. Overall, LLM-generated notes had a higher inaccuracy rate at 9.6% compared to written physician notes at 2%, although these inaccuracies rarely involved significant safety implications.

Interrater reliability was calculated using intraclass correlation coefficients (ICC). The ICCs showed good agreement between the three expert raters for completeness, diligence, correctness, and utility at 0.79, 0.70, 0.76, and 0.74, respectively. Readability achieved fair reliability with an ICC of 0.59.

conclusions

The current study successfully generated EM-to-IP handoff notes using a refined LLM and rule-based approach within a user-developed template.

Traditional automated assessments were associated with superior LLM performance. However, manual clinical assessments revealed that although most LLM-generated notes achieved promising quality scores between four and five, they were generally inferior to physician written notes. Detected errors, including incompleteness and faulty logic, occasionally pose moderate security risks, with less than 10% causing significant problems compared to doctor’s notes.

Journal Reference:

  • Hartman, V., Zhang, X., Poddar, R., et al. (2024). Development and Evaluation of Emergency Medical Emergency Management Notes Generated by the Large Language Model. JAMA Network Open. doi:10.1001/jamanetworkopen.2024.48723
accuracy emergency evaluates Medicine safety study
bhanuprakash.cg
healthtost
  • Website

Related Posts

Virica Biotech and FUJIFILM Biosciences Collaborate on Canada-Japan Co-Innovation Program to Advance AAV Production Enhancers

April 9, 2026

Long-term overweight is a stronger predictor of cardiovascular risk

April 8, 2026

Sugar intake can reduce the effectiveness of relaxation exercises

April 8, 2026

Leave A Reply Cancel Reply

Don't Miss
Men's Health

Traveling by plane with BPH

By healthtostApril 9, 20260

As we reach middle age and with airfare as low as it is, air travel…

Virica Biotech and FUJIFILM Biosciences Collaborate on Canada-Japan Co-Innovation Program to Advance AAV Production Enhancers

April 9, 2026

30 Minute Kettlebell Full Body Workout for Over 50

April 9, 2026

How your partner can support a happier pregnancy

April 9, 2026
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo
TAGS
Baby benefits body brain cancer care Day Diet disease exercise finds Fitness food Guide health healthy heart Improve Life Loss Men mental Natural Nutrition Patients People Pregnancy research reveals risk routine sex sexual Skin Skincare study Therapy Tips Top Training Treatment ways weight women Workout
About Us
About Us

Welcome to HealthTost, your trusted source for breaking health news, expert insights, and wellness inspiration. At HealthTost, we are committed to delivering accurate, timely, and empowering information to help you make informed decisions about your health and well-being.

Latest Articles

Traveling by plane with BPH

April 9, 2026

Virica Biotech and FUJIFILM Biosciences Collaborate on Canada-Japan Co-Innovation Program to Advance AAV Production Enhancers

April 9, 2026

30 Minute Kettlebell Full Body Workout for Over 50

April 9, 2026
New Comments
    Facebook X (Twitter) Instagram Pinterest
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms and Conditions
    • Disclaimer
    © 2026 HealthTost. All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.