Resources


Database Credentialed Federated

MIMIC-III-Ext-VeriFact-BHC: Labeled Propositions From Brief Hospital Course Summaries for Long-form Clinical Text Evaluation

Source: Physionet

The VeriFact-BHC dataset is designed to verify the factuality of long-form text written about a patient against their own electronic health record. There is increasing interest in using large language models (LLMs) to generate clinical text in patie…

long-form text chart review artificial intelligence clinical notes text reranking brief hospital course natural language processing atomic claim hybrid retrieval clinical informatics clinical medicine fact verification retrieval-augmented generation logical atomism text embedding formal logic large language models llm-as-a-judge electronic health records llm evaluation

Published: April 9, 2025. Version: 1.0.0 | DOI: 10.13026/abat-g475


Database Credentialed Federated

MIMIC-III-Ext-tPatchGNN

Source: Physionet

This dataset is a curated subset of MIMIC-III (v1.4), specifically formatted to facilitate reproducibility of the experiments in the work t-PatchGNN. It serves as part of a benchmark designed for forecasting irregular multivariate clinical time seri…

Published: April 9, 2025. Version: 1.0.0 | DOI: 10.13026/ckn0-3868


Database Credentialed Federated

MIMIC-IV-Ext-CEKG: A Process-Oriented Dataset Derived from MIMIC-IV for Enhanced Clinical Insights

Source: Physionet

Maintaining a healthy population is essential for improving quality of life and overall societal well-being. One approach to achieving a healthy population is by improving patients' care pathways. This is particularly vital for patients with multipl…

process mining multi entity process mining mimic object centric event log clinical event knowledge graph

Published: April 9, 2025. Version: 1.0.0 | DOI: 10.13026/qr9d-6t52


Database Open Federated

ReXErr-v1: Clinically Meaningful Chest X-Ray Report Errors Derived from MIMIC-CXR

Source: Physionet

Interpreting medical images and writing radiology reports is a critical yet challenging task in healthcare. Despite their importance, both human-written and AI-generated reports are liable to errors, leaving a need for robust and representative data…

Published: March 19, 2025. Version: 1.0.0 | DOI: 10.13026/9dns-vd94


Database Restricted Federated

ALarms, Outcomes Telemetry with Timing (ALOTT): a Bedside-EMR Database

Source: Physionet

ALOTT is a pilot project that gathered telemetry data from 270 beds and over 15,000 hospital admissions from September 2018 through November 2020 at The James Cancer Hospital and Ross Heart Hospital. ALOTT contains telemetry waveforms, such as elect…

Published: March 19, 2025. Version: 1.0.0 | DOI: 10.13026/sbq5-dy17


Challenge Credentialed Federated

CXR-LT: Multi-Label Long-Tailed Classification on Chest X-Rays

Source: Physionet

Chest radiography presents a "long-tailed" distribution of findings, where a few diseases are common, but most are rare. Diagnosis is further complicated by its multi-label nature, as patients often exhibit multiple co-occurring findings. While rece…

disease classification computer-aided diagnosis artificial intelligence chest x-ray deep learning long-tailed learning cardiopulmonary disease zero-shot learning

Published: March 19, 2025. Version: 2.0.0 | DOI: 10.13026/ryj9-x506


Database Open Federated

Leipzig Heart Center ECG-Database: Arrhythmias in Children and Patients with Congenital Heart Disease

Source: Physionet

Interpretation of Electrocardiograms (ECG) is increasingly complemented by algorithms. These algorithms are based on large datasets. This ECG database consists of children and adults with congenital heart defects (CHD) including many arrhythmia anno…

arrhythmias artificial intelligence chd 12-lead intracardiac recordings annotated congenital heart disease ecg

Published: March 19, 2025. Version: 1.0.0 | DOI: 10.13026/7a4j-vn37


Database Credentialed Federated

EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records

Source: Physionet

Electronic Health Records (EHRs) are integral for storing comprehensive patient medical records, combining structured data (e.g., medications) with detailed clinical notes (e.g., physician notes). These elements are essential for straightforward dat…

Published: March 19, 2025. Version: 1.0.1 | DOI: 10.13026/m4vd-y789


Database Credentialed Federated

MedVH: Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context

Source: Physionet

Large Vision Language Models (LVLMs) have recently achieved superior performance in various tasks on natural image and text data, which inspires a large amount of studies for LVLMs fine-tuning and training. Despite their advancements, there has been…

Published: March 11, 2025. Version: 1.0.0 | DOI: 10.13026/8ymd-c338


Database Restricted Federated

Microbiological, Immunological and Biochemical Characteristics of the Development of Ventilator Associated Pneumonia

Source: Physionet

The respiratory microbiome plays a critical role in metabolism, immune system maturation, and protection against pathogens. Traditionally, respiratory microbiology in pneumonia focused on identifying a specific pathogen, often disregarding normal or…

Published: March 10, 2025. Version: 1.0.0 | DOI: 10.13026/nx6s-hf22