Resources
Database Credentialed Federated
MedVAL-Bench: Expert-Annotated Medical Text Validation Benchmark
MedVAL-Bench is a dataset containing physician evaluations of errors in language model (LM)-generated medical text. The dataset spans 6 diverse medical text generation tasks and includes annotations from 12 physicians on clinically significant error…
Published: Nov. 14, 2025. Version: 1.0.1 | DOI: 10.13026/653w-3038
Database Credentialed Federated
Predictors of Hospital Onset Infection: A Matched Retrospective Cohort Dataset
This repository contains a de-identified and curated patient-level dataset for modeling the impact of fine-grained environmental and patient-level factors on nosocomial acquisition of a wide range of drug-susceptible and drug-resistant pathogens. Th…
infection control clinical machine learning infectious diseases electronic health records hospital onset infection colonization pressure
Published: Nov. 4, 2025. Version: 1.0.0 | DOI: 10.13026/k70x-0m81
Database Credentialed Federated
MedVAL-Bench: Expert-Annotated Medical Text Validation Benchmark
MedVAL-Bench is a dataset containing physician evaluations of errors in language model (LM)-generated medical text. The dataset spans 6 diverse medical text generation tasks and includes annotations from 12 physicians on clinically significant error…
Published: Nov. 4, 2025. Version: 1.0.0 | DOI: 10.13026/8ga5-6661
Database Open Federated
HeartCycle: A comprehensive dataset of synchronized impedance cardiography and echocardiography for accurate hemodynamic predictions
The "HeartCycle" dataset offers a comprehensive collection of synchronized impedance cardiography (ICG) and echocardiography (ECHO) signals, supplemented with finger photoplethysmography (PPG), heart sounds, and electrocardiography (ECG) data from 1…
cardiovascular physiology electrophysiological study echocardiography machine learning impedance cardiography
Published: Nov. 3, 2025. Version: 1.0.0 | DOI: 10.13026/z865-eb23
Database Credentialed Federated
CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays
Recent progress in Large Vision-Language Models (LVLMs) has enabled promising applications in medical tasks such as report generation and visual question answering. However, existing benchmarks focus mainly on the final diagnostic answer, offering l…
structured chest x-ray qa intermediate reasoning steps evaluation structured reasoning chest x-ray benchmark grounded reasoning diagnostic reasoning structured diagnostic pipeline
Published: Oct. 23, 2025. Version: 1.0.1 | DOI: 10.13026/d14j-1b45
Database Contributor Review Federated
ER-REASON: A Benchmark Dataset for LLM-Based Clinical Reasoning in the Emergency Room
The ER-Reason dataset is a benchmark designed to evaluate LLM-based clinical reasoning and decision-making in the emergency room (ER), a high-stakes setting where clinicians make rapid, consequential decisions across diverse patient presentations an…
Published: Oct. 23, 2025. Version: 1.0.0 | DOI: 10.13026/55s7-3c27
Database Credentialed Federated
PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions
Doctor-patient consultations require multi-turn, context-aware communication tailored to diverse patient personas. Training or evaluating doctor LLMs in such settings requires realistic patient interaction systems. However, existing simulators often…
multi-turn dialogue llm simulation electronic health records doctor-patient consultation
Published: Oct. 18, 2025. Version: 1.0.0 | DOI: 10.13026/vq0d-v871
Database Credentialed Federated
CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays
Recent progress in Large Vision-Language Models (LVLMs) has enabled promising applications in medical tasks such as report generation and visual question answering. However, existing benchmarks focus mainly on the final diagnostic answer, offering l…
structured diagnostic pipeline structured chest x-ray qa chest x-ray evaluation diagnostic reasoning benchmark intermediate reasoning steps grounded reasoning structured reasoning
Published: Oct. 16, 2025. Version: 1.0.0 | DOI: 10.13026/z3dn-nh22
Database Credentialed Federated
MIMIC-IV-Ext clinical decision support for referral, triage and diagnosis
Accurate medical decision-making is critical for both patients and clinicians. Patients often find it difficult to interpret their symptoms, determine their severity, and select the right specialist to see. At the same time, clinicians face challeng…
Published: Oct. 8, 2025. Version: 1.0.2 | DOI: 10.13026/stnm-qx35
Database Credentialed Federated
MIMIC-IV-ECHO-Ext-MIMICEchoQA: A Benchmark Dataset for Echocardiogram-Based Visual Question Answering
We present MIMICEchoQA, a benchmark dataset for echocardiogram-based question answering, built from the publicly available MIMIC-IV-ECHO database. Each echocardiographic study was paired with the closest discharge summary within a 7-day window, and …
Published: Oct. 7, 2025. Version: 1.0.0 | DOI: 10.13026/rndk-4s36