Resources
Database Credentialed Federated
CXR-Align: A Benchmark for CXR-Report Alignment with Negations
CXR-Align is a benchmark dataset designed to evaluate vision-language processing (VLP) models' ability to accurately interpret negations in chest X-ray (CXR) reports. Negations are prevalent in medical documentation and pose significant challenges f…
Published: Aug. 21, 2025. Version: 1.0.0 | DOI: 10.13026/7ebc-s018
Database Restricted Federated
Organ Retrieval and Collection of Health Information for Donation (ORCHID)
There are well-documented inefficiencies and inequities in the current system of deceased donor organ transplantation. While much prior research has focused on designing better allocation systems to distribute donated organs, more can be done to stu…
organ procurement organizations organ transplantation
Published: Aug. 21, 2025. Version: 2.1.0 | DOI: 10.13026/637d-3e59
Database Credentialed Federated
Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information
The human voice contains complex acoustic markers which have been linked to important health conditions including dementia, mood disorders, and cancer. When viewed as a biomarker, voice is a promising characteristic to measure as it is simple to col…
bridge2ai voice
Published: Aug. 18, 2025. Version: 2.0.1 | DOI: 10.13026/gzjs-0535
Database Credentialed Federated
Immunosuppressive Condition and Medication Annotations for Admission Notes in the MIMIC-III Database
Immunosuppression due to underlying conditions or immunosuppressive medication use increases the risk of morbidity and mortality in the context of infectious disease. Identifying patients with immunosuppression is important for better studying and u…
Published: Aug. 5, 2025. Version: 1.0.0 | DOI: 10.13026/etd0-dq69
Database Restricted Federated
EchoNext: A Dataset for Detecting Echocardiogram-Confirmed Structural Heart Disease from ECGs
This dataset contains a de-identified collection of 100,000 12-lead electrocardiograms (ECGs) with paired structural heart disease (SHD) labels derived from echocardiography, collected at Columbia University Irving Medical Center. Each ECG is provid…
ai model deployment population health ecg transthoracic echocardiogram machine learning left ventricular dysfunction electrocardiogram structural heart disease aortic stenosis cardiovascular screening digital health clinical decision support artificial intelligence ai in healthcare health equity deep learning valvular heart disease heart failure
Published: Aug. 5, 2025. Version: 1.0.0 | DOI: 10.13026/r9pp-3y42
Database Credentialed Federated
SCRIPT CarpeDiem Dataset: demographics, outcomes, and per-day clinical parameters for critically ill patients with suspected pneumonia
Traditional approaches to analyzing episodes of patient care in the ICU examine features on presentation and outcomes on discharge, collapsing the numerous events that happen during a patient’s stay and ignoring intercurrent ICU complications …
Published: Aug. 5, 2025. Version: 1.8.0 | DOI: 10.13026/w7kd-qj96
Database Credentialed Federated
Annotated Social Determinants of Health Dataset for Adverse Pregnancy Outcomes
This project presents an annotated dataset derived from MIMIC-III and MIMIC-IV discharge summaries, focusing on key Social Determinants of Health (SDoH) factors—social support, occupation, and substance use—and their association with adv…
Published: Aug. 5, 2025. Version: 1.0.0 | DOI: 10.13026/qk2y-wx30
Database Credentialed Federated
MIMIC-Ext-CXR-QBA: A Structured, Tagged, and Localized Visual Question Answering Dataset with Question-Box-Answer Triplets and Scene Graphs for Chest X-ray Images
Visual Question Answering (VQA) enables flexible and context-dependent analysis of medical images, such as chest X-rays (CXRs), by allowing users to pose specific questions and receive nuanced answers. However, existing CXR VQA datasets are typicall…
localization chest x-rays scene graphs vqa
Published: July 23, 2025. Version: 1.0.0 | DOI: 10.13026/8qmz-da41
Challenge Credentialed Federated
SNOMED CT Entity Linking Challenge
This challenge, sponsored by SNOMED International, seeks to advance the development of Entity Linking models that operate on unstructured clinical texts. Participants in the challenge will train entity linking models using a subset of MIMIC-IV-Note …
entity linking clinical annotation snomed
Published: July 22, 2025. Version: 1.1.0 | DOI: 10.13026/qn8t-6e19
Database Open Federated
tOLIet: Single-lead Thigh-based Electrocardiography Using Polimeric Dry Electrodes
Our team previously introduced an innovative concept for an "invisible" Electrocardiography (ECG) system, incorporating electrodes and sensors into a toilet seat design to enable signal acquisition from the thighs. Building upon that work, we now pr…
Published: June 24, 2025. Version: 1.0.0 | DOI: 10.13026/v66k-sk82