Resources


Database Credentialed Federated

CXR-Align: A Benchmark for CXR-Report Alignment with Negations

Source: Physionet

CXR-Align is a benchmark dataset designed to evaluate vision-language processing (VLP) models' ability to accurately interpret negations in chest X-ray (CXR) reports. Negations are prevalent in medical documentation and pose significant challenges f…

Published: Aug. 21, 2025. Version: 1.0.0 | DOI: 10.13026/7ebc-s018


Database Restricted Federated

Organ Retrieval and Collection of Health Information for Donation (ORCHID)

Source: Physionet

There are well-documented inefficiencies and inequities in the current system of deceased donor organ transplantation. While much prior research has focused on designing better allocation systems to distribute donated organs, more can be done to stu…

organ procurement organizations, organ transplantation

Published: Aug. 21, 2025. Version: 2.1.0 | DOI: 10.13026/637d-3e59


Database Credentialed Federated

Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information

Source: Physionet

The human voice contains complex acoustic markers which have been linked to important health conditions including dementia, mood disorders, and cancer. When viewed as a biomarker, voice is a promising characteristic to measure as it is simple to col…

bridge2ai voice

Published: Aug. 18, 2025. Version: 2.0.1 | DOI: 10.13026/gzjs-0535


Database Credentialed Federated

Immunosuppressive Condition and Medication Annotations for Admission Notes in the MIMIC-III Database

Source: Physionet

Immunosuppression due to underlying conditions or immunosuppressive medication use increases the risk of morbidity and mortality in the context of infectious disease. Identifying patients with immunosuppression is important for better studying and u…

Published: Aug. 5, 2025. Version: 1.0.0 | DOI: 10.13026/etd0-dq69


Database Restricted Federated

EchoNext: A Dataset for Detecting Echocardiogram-Confirmed Structural Heart Disease from ECGs

Source: Physionet

This dataset contains a de-identified collection of 100,000 12-lead electrocardiograms (ECGs) with paired structural heart disease (SHD) labels derived from echocardiography, collected at Columbia University Irving Medical Center. Each ECG is provid…

ai model deployment population health ecg transthoracic echocardiogram machine learning left ventricular dysfunction electrocardiogram structural heart disease aortic stenosis cardiovascular screening digital health clinical decision support artificial intelligence ai in healthcare health equity deep learning valvular heart disease heart failure

Published: Aug. 5, 2025. Version: 1.0.0 | DOI: 10.13026/r9pp-3y42


Database Credentialed Federated

SCRIPT CarpeDiem Dataset: demographics, outcomes, and per-day clinical parameters for critically ill patients with suspected pneumonia

Source: Physionet

Traditional approaches to analyzing episodes of patient care in the ICU examine features on presentation and outcomes on discharge, collapsing the numerous events that happen during a patient’s stay and ignoring intercurrent ICU complications …

Published: Aug. 5, 2025. Version: 1.8.0 | DOI: 10.13026/w7kd-qj96


Database Credentialed Federated

Annotated Social Determinants of Health Dataset for Adverse Pregnancy Outcomes

Source: Physionet

This project presents an annotated dataset derived from MIMIC-III and MIMIC-IV discharge summaries, focusing on key Social Determinants of Health (SDoH) factors—social support, occupation, and substance use—and their association with adv…

Published: Aug. 5, 2025. Version: 1.0.0 | DOI: 10.13026/qk2y-wx30


Database Credentialed Federated

MIMIC-Ext-CXR-QBA: A Structured, Tagged, and Localized Visual Question Answering Dataset with Question-Box-Answer Triplets and Scene Graphs for Chest X-ray Images

Source: Physionet

Visual Question Answering (VQA) enables flexible and context-dependent analysis of medical images, such as chest X-rays (CXRs), by allowing users to pose specific questions and receive nuanced answers. However, existing CXR VQA datasets are typicall…

localization, chest x-rays, scene graphs, vqa

Published: July 23, 2025. Version: 1.0.0 | DOI: 10.13026/8qmz-da41


Challenge Credentialed Federated

SNOMED CT Entity Linking Challenge

Source: Physionet

This challenge, sponsored by SNOMED International, seeks to advance the development of Entity Linking models that operate on unstructured clinical texts. Participants in the challenge will train entity linking models using a subset of MIMIC-IV-Note …

entity linking clinical annotation snomed

Published: July 22, 2025. Version: 1.1.0 | DOI: 10.13026/qn8t-6e19


Database Open Federated

tOLIet: Single-lead Thigh-based Electrocardiography Using Polimeric Dry Electrodes

Source: Physionet

Our team previously introduced an innovative concept for an "invisible" Electrocardiography (ECG) system, incorporating electrodes and sensors into a toilet seat design to enable signal acquisition from the thighs. Building upon that work, we now pr…

Published: June 24, 2025. Version: 1.0.0 | DOI: 10.13026/v66k-sk82