Resources


Database Credentialed Federated

MedVAL-Bench: Expert-Annotated Medical Text Validation Benchmark

Source: PhysioNet

MedVAL-Bench is a dataset containing physician evaluations of errors in language model (LM)-generated medical text. The dataset spans 6 diverse medical text generation tasks and includes annotations from 12 physicians on clinically significant error…

Published: Nov. 14, 2025. Version: 1.0.1 | DOI: 10.13026/653w-3038


Database Credentialed Federated

Predictors of Hospital Onset Infection: A Matched Retrospective Cohort Dataset

Source: PhysioNet

This repository contains a de-identified and curated patient-level dataset for modeling the impact of fine-grained environmental and patient-level factors on nosocomial acquisition of a wide range of drug-susceptible and drug-resistant pathogens. Th…

infection control clinical machine learning infectious diseases electronic health records hospital onset infection colonization pressure

Published: Nov. 4, 2025. Version: 1.0.0 | DOI: 10.13026/k70x-0m81


Database Credentialed Federated

MedVAL-Bench: Expert-Annotated Medical Text Validation Benchmark

Source: PhysioNet

MedVAL-Bench is a dataset containing physician evaluations of errors in language model (LM)-generated medical text. The dataset spans 6 diverse medical text generation tasks and includes annotations from 12 physicians on clinically significant error…

Published: Nov. 4, 2025. Version: 1.0.0 | DOI: 10.13026/8ga5-6661


Database Open Federated

HeartCycle: A comprehensive dataset of synchronized impedance cardiography and echocardiography for accurate hemodynamic predictions

Source: PhysioNet

The "HeartCycle" dataset offers a comprehensive collection of synchronized impedance cardiography (ICG) and echocardiography (ECHO) signals, supplemented with finger photoplethysmography (PPG), heart sounds, and electrocardiography (ECG) data from 1…

cardiovascular physiology electrophysiological study echocardiography machine learning impedance cardiography

Published: Nov. 3, 2025. Version: 1.0.0 | DOI: 10.13026/z865-eb23


Database Credentialed Federated

CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays

Source: PhysioNet

Recent progress in Large Vision-Language Models (LVLMs) has enabled promising applications in medical tasks such as report generation and visual question answering. However, existing benchmarks focus mainly on the final diagnostic answer, offering l…

structured chest x-ray qa, intermediate reasoning steps, evaluation, structured reasoning, chest x-ray, benchmark, grounded reasoning, diagnostic reasoning, structured diagnostic pipeline

Published: Oct. 23, 2025. Version: 1.0.1 | DOI: 10.13026/d14j-1b45


Database Contributor Review Federated

ER-REASON: A Benchmark Dataset for LLM-Based Clinical Reasoning in the Emergency Room

Source: PhysioNet

The ER-Reason dataset is a benchmark designed to evaluate LLM-based clinical reasoning and decision-making in the emergency room (ER), a high-stakes setting where clinicians make rapid, consequential decisions across diverse patient presentations an…

Published: Oct. 23, 2025. Version: 1.0.0 | DOI: 10.13026/55s7-3c27


Database Credentialed Federated

PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions

Source: PhysioNet

Doctor-patient consultations require multi-turn, context-aware communication tailored to diverse patient personas. Training or evaluating doctor LLMs in such settings requires realistic patient interaction systems. However, existing simulators often…

multi-turn dialogue llm simulation electronic health records doctor-patient consultation

Published: Oct. 18, 2025. Version: 1.0.0 | DOI: 10.13026/vq0d-v871


Database Credentialed Federated

CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays

Source: PhysioNet

Recent progress in Large Vision-Language Models (LVLMs) has enabled promising applications in medical tasks such as report generation and visual question answering. However, existing benchmarks focus mainly on the final diagnostic answer, offering l…

structured diagnostic pipeline, structured chest x-ray qa, chest x-ray, evaluation, diagnostic reasoning, benchmark, intermediate reasoning steps, grounded reasoning, structured reasoning

Published: Oct. 16, 2025. Version: 1.0.0 | DOI: 10.13026/z3dn-nh22


Database Credentialed Federated

MIMIC-IV-Ext clinical decision support for referral, triage and diagnosis

Source: PhysioNet

Accurate medical decision-making is critical for both patients and clinicians. Patients often find it difficult to interpret their symptoms, determine their severity, and select the right specialist to see. At the same time, clinicians face challeng…

Published: Oct. 8, 2025. Version: 1.0.2 | DOI: 10.13026/stnm-qx35


Database Credentialed Federated

MIMIC-IV-ECHO-Ext-MIMICEchoQA: A Benchmark Dataset for Echocardiogram-Based Visual Question Answering

Source: PhysioNet

We present MIMICEchoQA, a benchmark dataset for echocardiogram-based question answering, built from the publicly available MIMIC-IV-ECHO database. Each echocardiographic study was paired with the closest discharge summary within a 7-day window, and …

Published: Oct. 7, 2025. Version: 1.0.0 | DOI: 10.13026/rndk-4s36