Resources


Database Open Federated

Myocardial perfusion scintigraphy image database

Source: PhysioNet

This database provides a collection of myocardial perfusion scintigraphy images in DICOM format with all metadata and segmentations (masks) in NIfTI format. The images were obtained from patients undergoing scintigraphy examinations to investigate c…

myocardial perfusion systems modeling myocardial perfusion scintigraphy dicom metadata artificial intelligence ventricular walls coronary artery disease convolutional neural networks automated segmentation clinical diagnosis anonymization nifti

Published: Sept. 10, 2025. Version: 1.0.0 | DOI: 10.13026/ce2z-dw74


Database Credentialed Federated

MIMIC-IV-Ext-Instr: A Dataset of 450K+ EHR-Grounded Instruction-Following Examples

Source: PhysioNet

Large language models (LLMs) have shown impressive capabilities in solving a wide range of tasks based on human instructions. However, developing a conversational AI assistant for electronic health record (EHR) data remains challenging due to the la…

medical question answering large language models instruction tuning

Published: Sept. 9, 2025. Version: 1.0.0 | DOI: 10.13026/e5bq-pr14


Database Restricted Federated

HYAMD High-Resolution Fundus Image Dataset for age related macular degeneration (AMD) Diagnosis

Source: PhysioNet

The Hillel Yaffe Age Related Macular Degeneration (HYAMD) longitudinal dataset comprises 1,560 Digital Fundus Images (DFIs) of 325 patients examined at the Hillel Yaffe Medical Center (Hadera, Israel, Helsinki approval number 0048-24-HYMC) provid…

Published: Sept. 9, 2025. Version: 1.0.0 | DOI: 10.13026/ydf1-z238


Database Open Federated

MIMIC-IV Clinical Database Demo on FHIR

Source: PhysioNet

Interoperability of healthcare data has become increasingly important given the growing deployment of data-driven algorithms in clinical settings. The Fast Healthcare Interoperability Resources (FHIR) standard has emerged as a promising mechanis…

electronic health records fhir mimic

Published: Aug. 27, 2025. Version: 2.1.0


Database Restricted Federated

Community-Acquired Pneumonia, Endotypes and Phenotypes (NACef): Prospective, observational cohort study of Translational Medicine

Source: PhysioNet

Community-Acquired Pneumonia (CAP) remains a prominent infectious process associated with elevated in-hospital morbidity and mortality rates. Through the exploration of phenotypes, endotypes, and biomarkers, it becomes feasible to identify individua…

Published: Aug. 22, 2025. Version: 2.0.1 | DOI: 10.13026/4y3t-pq44


Database Restricted Federated

Organ Retrieval and Collection of Health Information for Donation (ORCHID)

Source: PhysioNet

There are well-documented inefficiencies and inequities in the current system of deceased donor organ transplantation. While much prior research has focused on designing better allocation systems to distribute donated organs, more can be done to stu…

organ procurement organizations organ transplantation

Published: Aug. 21, 2025. Version: 2.1.0 | DOI: 10.13026/637d-3e59


Database Credentialed Federated

CXR-Align: A Benchmark for CXR-Report Alignment with Negations

Source: PhysioNet

CXR-Align is a benchmark dataset designed to evaluate vision-language processing (VLP) models' ability to accurately interpret negations in chest X-ray (CXR) reports. Negations are prevalent in medical documentation and pose significant challenges f…

Published: Aug. 21, 2025. Version: 1.0.0 | DOI: 10.13026/7ebc-s018


Database Credentialed Federated

Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information

Source: PhysioNet

The human voice contains complex acoustic markers which have been linked to important health conditions including dementia, mood disorders, and cancer. When viewed as a biomarker, voice is a promising characteristic to measure as it is simple to col…

bridge2ai voice

Published: Aug. 18, 2025. Version: 2.0.1 | DOI: 10.13026/gzjs-0535


Database Restricted Federated

EchoNext: A Dataset for Detecting Echocardiogram-Confirmed Structural Heart Disease from ECGs

Source: PhysioNet

This dataset contains a de-identified collection of 100,000 12-lead electrocardiograms (ECGs) with paired structural heart disease (SHD) labels derived from echocardiography, collected at Columbia University Irving Medical Center. Each ECG is provid…

ai model deployment population health ecg transthoracic echocardiogram machine learning left ventricular dysfunction electrocardiogram structural heart disease aortic stenosis cardiovascular screening digital health clinical decision support artificial intelligence ai in healthcare health equity deep learning valvular heart disease heart failure

Published: Aug. 5, 2025. Version: 1.0.0 | DOI: 10.13026/r9pp-3y42


Database Credentialed Federated

Immunosuppressive Condition and Medication Annotations for Admission Notes in the MIMIC-III Database

Source: PhysioNet

Immunosuppression due to underlying conditions or immunosuppressive medication use increases the risk of morbidity and mortality in the context of infectious disease. Identifying patients with immunosuppression is important for better studying and u…

Published: Aug. 5, 2025. Version: 1.0.0 | DOI: 10.13026/etd0-dq69