Resources


Database Contributor Review Federated

ER-REASON: A Benchmark Dataset for LLM-Based Clinical Reasoning in the Emergency Room

Source: Physionet

The ER-Reason dataset is a benchmark designed to evaluate LLM-based clinical reasoning and decision-making in the emergency room (ER), a high-stakes setting where clinicians make rapid, consequential decisions across diverse patient presentations an…

Published: Oct. 23, 2025. Version: 1.0.0 | DOI: 10.13026/55s7-3c27


Database Credentialed Federated

CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays

Source: Physionet

Recent progress in Large Vision-Language Models (LVLMs) has enabled promising applications in medical tasks such as report generation and visual question answering. However, existing benchmarks focus mainly on the final diagnostic answer, offering l…

structured chest x-ray qa, intermediate reasoning steps, evaluation, structured reasoning, chest x-ray, benchmark, grounded reasoning, diagnostic reasoning, structured diagnostic pipeline

Published: Oct. 23, 2025. Version: 1.0.1 | DOI: 10.13026/d14j-1b45


Database Credentialed Federated

PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions

Source: Physionet

Doctor-patient consultations require multi-turn, context-aware communication tailored to diverse patient personas. Training or evaluating doctor LLMs in such settings requires realistic patient interaction systems. However, existing simulators often…

multi-turn dialogue llm simulation electronic health records doctor-patient consultation

Published: Oct. 18, 2025. Version: 1.0.0 | DOI: 10.13026/vq0d-v871


Database Credentialed Federated

CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays

Source: Physionet

Recent progress in Large Vision-Language Models (LVLMs) has enabled promising applications in medical tasks such as report generation and visual question answering. However, existing benchmarks focus mainly on the final diagnostic answer, offering l…

structured diagnostic pipeline, structured chest x-ray qa, chest x-ray, evaluation, diagnostic reasoning, benchmark, intermediate reasoning steps, grounded reasoning, structured reasoning

Published: Oct. 16, 2025. Version: 1.0.0 | DOI: 10.13026/z3dn-nh22


Model Credentialed Federated

RadVLM model

Source: Physionet

We present RadVLM, a compact (7B) multitask conversational foundation model designed for CXR interpretation. Its development relies on the curation of a large-scale instruction dataset comprising over 1 million image-instruction pairs containing bot…

Published: Oct. 8, 2025. Version: 1.0.0 | DOI: 10.13026/50kn-p490


Database Credentialed Federated

MIMIC-IV-Ext clinical decision support for referral, triage and diagnosis

Source: Physionet

Accurate medical decision-making is critical for both patients and clinicians. Patients often find it difficult to interpret their symptoms, determine their severity, and select the right specialist to see. At the same time, clinicians face challeng…

Published: Oct. 8, 2025. Version: 1.0.2 | DOI: 10.13026/stnm-qx35


Database Credentialed Federated

MIMIC-IV-ECHO-Ext-MIMICEchoQA: A Benchmark Dataset for Echocardiogram-Based Visual Question Answering

Source: Physionet

We present MIMICEchoQA, a benchmark dataset for echocardiogram-based question answering, built from the publicly available MIMIC-IV-ECHO database. Each echocardiographic study was paired with the closest discharge summary within a 7-day window, and …

Published: Oct. 7, 2025. Version: 1.0.0 | DOI: 10.13026/rndk-4s36


Database Restricted Federated

TN-Mammo: A Multi-view Mammography Dataset for Breast Density Classification

Source: Physionet

Breast cancer is one of the most common types of cancer among women, leading to a growing and essential need for early and precise detection. A variety of machine learning techniques have been demonstrating great promise in improving diagnostic accu…

Published: Oct. 4, 2025. Version: 1.0.0 | DOI: 10.13026/1kx0-xc60


Database Open Federated

MIMIC-IV demo data in the Medical Event Data Standard (MEDS)

Source: Physionet

This dataset is an automated ETL conversion of the MIMIC-IV Clinical Database Demo into the Medical Event Data Standard (MEDS). MEDS is a data schema for storing streams of medical events such as those sourced from Electronic Health Records or …

electronic health record, mimic, meds, machine learning, critical care, medical event data standard, ehr

Published: Sept. 30, 2025. Version: 0.0.1 | DOI: 10.13026/t2y8-ea41


Database Restricted Federated

Organ Retrieval and Collection of Health Information for Donation (ORCHID)

Source: Physionet

There are well-documented inefficiencies and inequities in the current system of deceased donor organ transplantation. While much prior research has focused on designing better allocation systems to distribute donated organs, more can be done to stu…

organ procurement organizations, organ transplantation

Published: Sept. 30, 2025. Version: 2.1.1 | DOI: 10.13026/rfeq-j318