Resources
Database Credentialed Federated
MIMIC-III-Ext-VeriFact-BHC: Labeled Propositions From Brief Hospital Course Summaries for Long-form Clinical Text Evaluation
The VeriFact-BHC dataset is designed to verify the factuality of long-form text written about a patient against their own electronic health record. There is increasing interest in using large language models (LLMs) to generate clinical text in patie…
long-form text, chart review, artificial intelligence, clinical notes, text reranking, brief hospital course, natural language processing, atomic claim, hybrid retrieval, clinical informatics, clinical medicine, fact verification, retrieval-augmented generation, logical atomism, text embedding, formal logic, large language models, llm-as-a-judge, electronic health records, llm evaluation
Published: April 9, 2025. Version: 1.0.0 | DOI: 10.13026/abat-g475
Database Credentialed Federated
MIMIC-III-Ext-tPatchGNN
This dataset is a curated subset of MIMIC-III (v1.4), specifically formatted to facilitate reproducibility of the experiments in the work t-PatchGNN. It serves as part of a benchmark designed for forecasting irregular multivariate clinical time seri…
Published: April 9, 2025. Version: 1.0.0 | DOI: 10.13026/ckn0-3868
Database Credentialed Federated
MIMIC-IV-Ext-CEKG: A Process-Oriented Dataset Derived from MIMIC-IV for Enhanced Clinical Insights
Maintaining a healthy population is essential for improving quality of life and overall societal well-being. One approach to achieving a healthy population is by improving patients' care pathways. This is particularly vital for patients with multipl…
process mining, multi entity process mining, mimic, object centric event log, clinical event knowledge graph
Published: April 9, 2025. Version: 1.0.0 | DOI: 10.13026/qr9d-6t52
Database Open Federated
ReXErr-v1: Clinically Meaningful Chest X-Ray Report Errors Derived from MIMIC-CXR
Interpreting medical images and writing radiology reports is a critical yet challenging task in healthcare. Despite their importance, both human-written and AI-generated reports are liable to errors, leaving a need for robust and representative data…
Published: March 19, 2025. Version: 1.0.0 | DOI: 10.13026/9dns-vd94
Database Restricted Federated
ALarms, Outcomes Telemetry with Timing (ALOTT): a Bedside-EMR Database
ALOTT is a pilot project that gathered telemetry data from 270 beds and over 15,000 hospital admissions from September 2018 through November 2020 at The James Cancer Hospital and Ross Heart Hospital. ALOTT contains telemetry waveforms, such as elect…
Published: March 19, 2025. Version: 1.0.0 | DOI: 10.13026/sbq5-dy17
Challenge Credentialed Federated
CXR-LT: Multi-Label Long-Tailed Classification on Chest X-Rays
Chest radiography presents a "long-tailed" distribution of findings, where a few diseases are common, but most are rare. Diagnosis is further complicated by its multi-label nature, as patients often exhibit multiple co-occurring findings. While rece…
disease classification, computer-aided diagnosis, artificial intelligence, chest x-ray, deep learning, long-tailed learning, cardiopulmonary disease, zero-shot learning
Published: March 19, 2025. Version: 2.0.0 | DOI: 10.13026/ryj9-x506
Database Open Federated
Leipzig Heart Center ECG-Database: Arrhythmias in Children and Patients with Congenital Heart Disease
Interpretation of electrocardiograms (ECG) is increasingly complemented by algorithms that are based on large datasets. This ECG database contains recordings from children and adults with congenital heart defects (CHD), including many arrhythmia anno…
arrhythmias, artificial intelligence, chd, 12-lead, intracardiac recordings, annotated, congenital heart disease, ecg
Published: March 19, 2025. Version: 1.0.0 | DOI: 10.13026/7a4j-vn37
Database Credentialed Federated
EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records
Electronic Health Records (EHRs) are integral for storing comprehensive patient medical records, combining structured data (e.g., medications) with detailed clinical notes (e.g., physician notes). These elements are essential for straightforward dat…
Published: March 19, 2025. Version: 1.0.1 | DOI: 10.13026/m4vd-y789
Database Credentialed Federated
MedVH: Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context
Large Vision Language Models (LVLMs) have recently achieved superior performance in various tasks on natural image and text data, which has inspired a large number of studies on LVLM fine-tuning and training. Despite their advancements, there has been…
Published: March 11, 2025. Version: 1.0.0 | DOI: 10.13026/8ymd-c338
Database Restricted Federated
Microbiological, Immunological and Biochemical Characteristics of the Development of Ventilator Associated Pneumonia
The respiratory microbiome plays a critical role in metabolism, immune system maturation, and protection against pathogens. Traditionally, respiratory microbiology in pneumonia focused on identifying a specific pathogen, often disregarding normal or…
Published: March 10, 2025. Version: 1.0.0 | DOI: 10.13026/nx6s-hf22