Resources
Database Contributor Review Federated
ER-REASON: A Benchmark Dataset for LLM-Based Clinical Reasoning in the Emergency Room
The ER-Reason dataset is a benchmark designed to evaluate LLM-based clinical reasoning and decision-making in the emergency room (ER), a high-stakes setting where clinicians make rapid, consequential decisions across diverse patient presentations an…
Published: Oct. 23, 2025. Version: 1.0.0 | DOI: 10.13026/55s7-3c27
Database Credentialed Federated
CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays
Recent progress in Large Vision-Language Models (LVLMs) has enabled promising applications in medical tasks such as report generation and visual question answering. However, existing benchmarks focus mainly on the final diagnostic answer, offering l…
structured chest x-ray qa intermediate reasoning steps evaluation structured reasoning chest x-ray benchmark grounded reasoning diagnostic reasoning structured diagnostic pipeline
Published: Oct. 23, 2025. Version: 1.0.1 | DOI: 10.13026/d14j-1b45
Database Credentialed Federated
PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions
Doctor-patient consultations require multi-turn, context-aware communication tailored to diverse patient personas. Training or evaluating doctor LLMs in such settings requires realistic patient interaction systems. However, existing simulators often…
multi-turn dialogue llm simulation electronic health records doctor-patient consultation
Published: Oct. 18, 2025. Version: 1.0.0 | DOI: 10.13026/vq0d-v871
Database Credentialed Federated
CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays
Recent progress in Large Vision-Language Models (LVLMs) has enabled promising applications in medical tasks such as report generation and visual question answering. However, existing benchmarks focus mainly on the final diagnostic answer, offering l…
structured diagnostic pipeline structured chest x-ray qa chest x-ray evaluation diagnostic reasoning benchmark intermediate reasoning steps grounded reasoning structured reasoning
Published: Oct. 16, 2025. Version: 1.0.0 | DOI: 10.13026/z3dn-nh22
Model Credentialed Federated
RadVLM model
We present RadVLM, a compact (7B) multitask conversational foundation model designed for CXR interpretation. Its development relies on the curation of a large-scale instruction dataset comprising over 1 million image-instruction pairs containing bot…
Published: Oct. 8, 2025. Version: 1.0.0 | DOI: 10.13026/50kn-p490
Database Credentialed Federated
MIMIC-IV-Ext clinical decision support for referral, triage and diagnosis
Accurate medical decision-making is critical for both patients and clinicians. Patients often find it difficult to interpret their symptoms, determine their severity, and select the right specialist to see. At the same time, clinicians face challeng…
Published: Oct. 8, 2025. Version: 1.0.2 | DOI: 10.13026/stnm-qx35
Database Credentialed Federated
MIMIC-IV-ECHO-Ext-MIMICEchoQA: A Benchmark Dataset for Echocardiogram-Based Visual Question Answering
We present MIMICEchoQA, a benchmark dataset for echocardiogram-based question answering, built from the publicly available MIMIC-IV-ECHO database. Each echocardiographic study was paired with the closest discharge summary within a 7-day window, and …
Published: Oct. 7, 2025. Version: 1.0.0 | DOI: 10.13026/rndk-4s36
Database Restricted Federated
TN-Mammo: A Multi-view Mammography Dataset for Breast Density Classification
Breast cancer is one of the most common types of cancer among women, leading to a growing and essential need for early and precise detection. A variety of machine learning techniques have been demonstrating great promise in improving diagnostic accu…
Published: Oct. 4, 2025. Version: 1.0.0 | DOI: 10.13026/1kx0-xc60
Database Open Federated
MIMIC-IV demo data in the Medical Event Data Standard (MEDS)
This dataset is an automated ETL conversion of the MIMIC-IV Clinical Database Demo into the Medical Event Data Standard (MEDS). MEDS is a data schema for storing streams of medical events such as those sourced from Electronic Health Records or …
electronic health record mimic meds machine learning critical care medical event data standard ehr
Published: Sept. 30, 2025. Version: 0.0.1 | DOI: 10.13026/t2y8-ea41
Database Restricted Federated
Organ Retrieval and Collection of Health Information for Donation (ORCHID)
There are well-documented inefficiencies and inequities in the current system of deceased donor organ transplantation. While much prior research has focused on designing better allocation systems to distribute donated organs, more can be done to stu…
organ procurement organizations organ transplantation
Published: Sept. 30, 2025. Version: 2.1.1 | DOI: 10.13026/rfeq-j318